Reddit reviews Data Science from Scratch: First Principles with Python

We found 5 Reddit comments about Data Science from Scratch: First Principles with Python. Here are the top ones, ranked by their Reddit score.

u/commentsrus · 4 points · r/badeconomics

I work with econ/stat people who are great at running and interpreting models and thinking about causality issues, but don't know much about programming. They've specialized, I get it, but in the future teams would benefit from everyone knowing some basics. It'll also make stats people more productive and help prevent errors. Also also, econ, other sciences, and the policy world really should embrace open source, open science, open access, etc.

But anyway, here's how to do it.

Below are a bunch of random resources. If you're looking for free courses, Software Carpentry has a bunch on the topics listed below and more: the terminal and Bash, Python, R, Matlab, Git, SQL, GNU Make, continuous integration, and data visualization. Data Carpentry has lessons for some of these topics, geared more toward social scientists. Apparently they're developing a course for doing econ with Bash(?). If you're into macro or computational stuff and want to learn Python, you can't go wrong with QuantEcon.

I'll echo what the other guy said. If you have a Mac, cool. If not, consider dual booting with Linux. It has a reputation for being difficult to use, but Ubuntu, Mint, and ElementaryOS are all very simple and work just like what you're used to in Proprietary World. It's possible to do the following with Windows, but it requires more setup work.

Learn to use the terminal (this is the point of using Mac or Linux, they come with a terminal and unix tools). Here's a decent book on the basics. Learn to navigate around your filesystem, run programs from the terminal, and use a bit of Bash. You can probably skip the chapters on actually programming with Bash. Bash as a programming language is cool, but not super necessary, and kinda quirky. It wouldn't be a waste of time though, since you can do certain things in Bash very quickly and easily. And you'll be a master haxxer.

Check out Data Science at the Command Line for a decent overview of stats programming in a Linux environment. It goes over basic Python and R, and other tools to make life simple. There's also The Plain Person's Guide to Plain Text Social Science, geared toward people who do science but may not do programming atm. It covers more useful tools.

Learn Python or R or both. If Python, here. If R, here. If you're into ML, here for Python and possibly here for R but the code may be dated. Still, that book is The intro book for ML.

Learn Git. You should be in the habit of tracking changes you make to your code and the data/results it produces, especially if your data is being shared with anyone. If you use R, here's a great intro to Git and RStudio's fantastic Git integration.

Learn SQL. This one's harder to pick up on your own, at home, since you need a database set up to query. Look at the software/data carpentry courses.
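One low-setup way to practice queries at home, sketched here under the assumption that Python is already installed: the standard library's sqlite3 module can create a throwaway in-memory database, so there's no server to configure. The wages table and its rows are made up for illustration only.

```python
import sqlite3

# In-memory database: nothing to install or configure; it disappears when the script ends.
conn = sqlite3.connect(":memory:")

# Hypothetical table and rows, invented purely for practice.
conn.execute("CREATE TABLE wages (worker TEXT, sector TEXT, hourly_wage REAL)")
conn.executemany(
    "INSERT INTO wages VALUES (?, ?, ?)",
    [
        ("a", "retail", 12.0),
        ("b", "retail", 14.0),
        ("c", "tech", 40.0),
        ("d", "tech", 50.0),
    ],
)

# A typical aggregate query: average hourly wage by sector.
rows = conn.execute(
    "SELECT sector, AVG(hourly_wage) FROM wages GROUP BY sector ORDER BY sector"
).fetchall()
print(rows)
conn.close()
```

This won't teach you how to administer a real database, but it's enough to get comfortable with SELECT, GROUP BY, and joins before moving on to the Carpentry lessons.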

Learn Docker. It makes your analyses/projects more shareable and--gasp--more reproducible (though I've gotten shit in the past for this, so let's compromise and say it helps but doesn't GUARANTEE reproducibility). This one is more optional than the others.

Once you have the basics down, you can do what interests you and learn best practices. Perhaps you want to know about Efficient R Programming (and general best practices). Or best practices in Python and more comprehensive coverage. Or how to make reports and papers with RMarkdown (want to make a paper that looks like it's published in AER? there's a template for that in Rmd).

u/monkeyunited · 3 points · r/datascience

Data Science from Scratch

Python Machine Learning

DSFS covers the basics of Python. If you're comfortable with that and want to dive into implementing algorithms (using TensorFlow 2, for example), then PML is a great book for that.

u/Aidtor · 1 point · r/datascience

If you want to be valuable to companies post graduation you should learn more about programming (design templates, how to write tests, how to go from a paper to code). I recommend this book as a good starting place. Once you're comfortable with how the different methods work, pick up this book.

u/ttelbarto · 1 point · r/datascience

Hi, there are so many resources out there I don't know where to start! I would work through some kind of beginner Python book (recommendation below). Then maybe try Andrew Ng's Machine Learning Coursera course to get a taste of machine learning. Once you have completed both of those, I would reassess what you would like to focus on. I will include some other books I would recommend below.

Beginner Python - https://www.amazon.co.uk/Python-Crash-Course-Hands-Project-Based/dp/1593276036/ref=sr_1_3?keywords=python+books&qid=1565035502&s=books&sr=1-3

Machine Learning Coursera - https://www.coursera.org/learn/machine-learning

Python Machine Learning - https://www.amazon.co.uk/Hands-Machine-Learning-Scikit-Learn-TensorFlow/dp/1491962291/ref=sr_1_7?crid=2QF98N9Q9GCJ9&keywords=hands+on+data+science&qid=1565035593&s=books&sprefix=hands+on+data+sc%2Cstripbooks%2C183&sr=1-7

Data Science from Scratch - https://www.amazon.co.uk/Data-Science-Scratch-Joel-Grus/dp/1492041130/ref=sr_1_1?crid=PJEJNNUBNQ8N&keywords=data+science+from+scratch&qid=1565035617&s=books&sprefix=data+science+from+s%2Cstripbooks%2C140&sr=1-1

Statistics (intro) - https://www.amazon.co.uk/Naked-Statistics-Stripping-Dread-Data/dp/039334777X/ref=sr_1_1?keywords=naked+statistics&qid=1565035650&s=books&sr=1-1

More stats (I haven't read this but gets recommended) - https://www.amazon.co.uk/Think-Stats-Allen-B-Downey/dp/1491907339/ref=sr_1_1?keywords=think+stats&qid=1565035674&s=books&sr=1-1