Reddit Reddit reviews Introduction to Machine Learning with Python: A Guide for Data Scientists

We found 5 Reddit comments about Introduction to Machine Learning with Python: A Guide for Data Scientists. Here are the top ones, ranked by their Reddit score.

Computers & Technology
Books
Computer Science
AI & Machine Learning
Natural Language Processing
Introduction to Machine Learning with Python: A Guide for Data Scientists
Check price on Amazon

5 Reddit comments about Introduction to Machine Learning with Python: A Guide for Data Scientists:

u/AchillesDev · 12 pointsr/cscareerquestions

Hey I have no CS degree and started out knowing nothing about ML! I do a mix of data science and data engineering, my internal clients are data scientists (and biologists!), and my boss is a data scientist. I build applications around models, build some models and applications around them, etc.

A few things that have been helpful:

  • Google's ML Crash Course
  • AWS Machine Learning Full Course
  • Introduction to Machine Learning with Python

    I have a ton of other books (and a lot of deep learning, but most of the frameworks make it easy peasy), but really after doing the Google crash course, the most helpful thing has been building and training models. You'll need to get familiar with tools like Jupyter notebooks, packages like Pandas, frameworks like TensorFlow, scikit-learn, etc. and maybe some data visualization stuff. I like to use AWS SageMaker, since you can spin up whatever resources you need for training and deployment, and has easy access to other AWS resources like S3.
u/Milleuros · 3 pointsr/trendingsubreddits

I can suggest having a look at r/DataScience , they seem to be focused on how to become an actual data scientist and get a job with that.

Machine learning is a tool by the way, which you generally learn while doing other things. I'm personally using ML in the framework of my PhD in Physics. I'll most probably be eligible for jobs as data scientists afterwards. I do know a lot of maths, which is useful to understand deep down what is going on.

Of course there are self-taught data scientists and analysts. I know some people started by e.g. reading around on the web (there are a lot of blogs, open source code, ...) and then participating to competitions on Kagle.

I will make some advertisement for MachineLearningMastery.com, because that blog was very helpful when I started. It's a blog that proposes to learn ML in a top-down approach: start by coding and practising, understand later. And also this book, which you might be able to find on the internet. For people more into theory and who want to see the maths behind it, a 800 pages book on deep learning

 

(At that point I'm just throwing infos and links in case anyone is interested)

u/breakz · 2 pointsr/MachineLearning

Sarah Guido from Bitly and NYC Python is working on exactly this book:

http://www.amazon.com/Introduction-Machine-Learning-Python-Sarah/dp/1449369413

u/admiralwaffles · 1 pointr/bigdata

Glad it's useful! I'll copy some of a reply I gave to somebody who PM'd me about advice for a data science career, because it's pertinent to you:

You need to understand where you want to go -- more science-y or more business-y. See, science-y type of analytics are heavy on the stats, applying really advanced methods to glean some counterintiutive and/or non-obvious insight. Business-y type stuff is digging through the data to understand what it's telling you and to build a bit of a story to figure out what the business is doing, and then measuring success after something changes. Both have their value. Essentially: science side tells you about the data, but the business side tells you how to make decisions based on the data. You'll fall somewhere on that spectrum, so just play to your strengths.

Once you've determined this, you need to learn a few things:

  1. Excel. Excel is the greatest data tool ever invented and I'll fight anybody who says different. Learn all about formulas, pivot tables, and whatnot. Excel is so deep, but really understand that Data tab. There is no better tool to connect to your data and just play around with it to figure out what you're looking at.
  2. Python, specifically NumPy and Pandas. Those are the two modules that will let you play with data very quickly. Pandas puts data in tables and allows you to operate on them. NumPy handles very complex calculations very quickly. Also, learn about Jupyter notebooks -- they're wonderful when doing analysis.
  3. Business! You may already know this, but you need to understand what the analytics are looking for. Data's only value is to use it to make better decisions. That's it. Data has no inherent value, and you need to understand how it can be leveraged. Even if you're more interested in the science of it, you still should have some grounding in how data is used.
  4. Bonus: GIS stuff. QGIS is a good tool and it's free. You can also do a ton of GIS stuff in Python with Shapely and Matplotlib (honestly, like 85% of my GIS work is in Python). This is especially helpful if you have some really interesting things geographically. Just be cognizant that every geographic insight isn't useful.

    As for some resources, here are some courses I think would be good from MIT CourseWare (full disclosure: I haven't sat through these specific courses, but these are the topics that are important):

  5. Statistical Thinking and Data Analysis
  6. Data, Models, and Decisions
  7. Communicating with Data (Little dated, but still valuable)

    You may also want to read up on machine learning. I like the O'Reilly book on it, but there are tons of books out there about it now.

    Hope that helps!
u/onepraveen · 1 pointr/learnpython

Thanks for the suggestion. Is below book good for beginning....


https://www.amazon.com/Introduction-Machine-Learning-Python-Scientists/dp/1449369413#