Need to turn your skills at programming into effective data science skills? Principles of Data Science is created to help you join the dots between mathematics, programming, and business analysis. With this book, you'll feel confident about asking―and answering―complex and sophisticated questions of your data to move from abstract and raw statistics to actionable ideas. With a unique approach that bridges the gap between mathematics and computer science, this books takes you through the entire data science pipeline. Beginning with cleaning and preparing data, and effective data mining strategies and techniques, you'll move on to build a comprehensive picture of how every piece of the data science puzzle fits together. Learn the fundamentals of computational mathematics and statistics, as well as some pseudocode being used today by data scientists and analysts. You'll get to grips with machine learning, discover the statistical models that help you take control and navigate even the densest datasets, and find out how to create powerful visualizations that communicate what your data means. Sinan Ozdemir is a data scientist, startup founder, and educator living in the San Francisco Bay Area with his dog, Charlie; cat, Euclid; and bearded dragon, Fiero. He spent his academic career studying pure mathematics at Johns Hopkins University before transitioning to education. He spent several years conducting lectures on data science at Johns Hopkins University and at the General Assembly before founding his own start-up, Legion Analytics, which uses artificial intelligence and data science to power enterprise sales teams. After completing the Fellowship at the Y Combinator accelerator, Sinan has spent most of his days working on his fast-growing company, while creating educational material for data science.
Great introduction book to cover the important concepts and algorithms in Data Science and Machine Learning at the entry level, with the hands-on examples and practices. The descriptions and explanations are easy to understand and follow. Not too much "mathematic", but of course it requires the basic ideas of possibility and statistics. But in general the book is suitable and friendly for the beginners.
The only problem is the example code was written by Python2. It requires the reader/learner has a lot of Python knowledge and programming skills to fix the problems made by the differences between Python2 and Python3 before running the code.
Excellent introduction - not perfect for sure, but explainations are simple, clear, engaging and often funny. Covers types of data and some very basics of: mathematics, probability, statistics, data visualization, machine learning and prediction methods.
A great book! The only downside is the code which is written in Python 2 and sometimes needs a lot of modifications in order to run. Additionally, some libraries used in the book are outdated. A must read if you want a great introduction to the data science field…