Data Science from Scratch: First Principles with Python
by
Data science libraries, frameworks, modules, and toolkits are great for doing data science, but they’re also a good way to dive into the discipline without actually understanding data science. In this book, you’ll learn how many of the most fundamental data science tools and algorithms work by implementing them from scratch.
If you have an aptitude for mathematics and som
Kindle Edition, 330 pages
Published April 14th 2015 by O'Reilly Media
Community Reviews
I'm still struggling to find the book I want around data science. I've learned that there are two levels:
1. KNOWING data science
2. DOING data science
This book is about the second one. Make no mistake, this is a "statistical computation" manual. This shows you how to find statistical answers using Python. Fully half this book is code samples. If you do not plan to actually attempt to find statistical answers to known questions by writing Python code, then this isn't the book for you.
I would look ...more
I worked through all of the examples in this book. Rather than have you import numpy and pandas and scikit-learn, he walks you through how to build up these tools yourself. What you build will be terribly inefficient and you should never use it in real life, but you will get a great feel for how these tools work under the hood.
(I also learned that my linear algebra is very rusty and I need a brush up ...)
I disagree with some of the reviews that say he doesn't do a good job explaining the computation ...more
The idea of the book is nice, and I still think it is a useful book, but:
1. you won't learn the math behind this, nor will the methods be explained (it's good for programming, though)
2. regarding the programming part, I think that people would benefit more if there were some actual exercises for them to do, not just a "type in this code" attitude
3. it would be nice if all of the data sets were actually generated in the book, not just "there is some data set with 2000 points, that I just pulled out of my ass"
4. more u ...more
A great book for diving into the world of machine learning. You won't come away knowing a lot, but you will know a little about a lot of ML. The author's approach is to present an implementation in Python code instead of describing the algorithms in detail in words. Good, clear, compact code (having used Python on and off for several years, I didn't think this language could be so elegant). First, this saves space: well-written code is better than a description of an alg
...more
Great book for a general overview of the concepts, and understanding what 'data science' actually means. Lots of code to drive the points home, and it taught me quite a few Python tricks.
I can foresee using this as a reference for the main concepts, or when looking for a straightforward implementation of the algorithms discussed. The information is very solid.
If you want to power straight through, it's a tough read at times--but Joel's a very good writer, and I enjoyed the dry humor intersp ...more
Fundamental concepts revealed, libraries for the win
Joel does a great job walking through the tasks a data scientist would take to solve hypothetical problems, and explaining the models most popularly implemented in machine learning. An overwhelming majority of the code examples are useless, which is intentional as Joel notes how to build things from scratch. Libraries (like pandas, scikit-learn, etc) provide APIs to accomplish many of these tasks without writing from scratch, but without the un ...more
There is no point in knowing data science without doing data science. Starting from that premise, this book provides the essentials for getting your hands dirty and torturing some data. The most interesting thing about this book is that it starts from absolute zero on the algorithms. Because it does not rely on any analysis library, it demonstrates all of the technical construction behind regressions, neural networks, decision trees, Bayesian classifiers, etc.
Recommended reading for a solid understanding of the practice ...more
This was a fun survey of popular topics in contemporary data science. It was well written for a textbook, and easy to read. I suppose it was light on formal proofs, but it made up for that by having you build toy models of all the major ideas. Well worth the read for me, as I am very new to data science but well versed in Python and math. I would like to see a follow-up book that covers the same topics, but using the real libraries people use in industry to solve these same problems.
Not terribly impressed with this one. The way I see it, readers of this book either will already know how to do data science, or they won't. If they do (and here I'm ignoring the question of why they would read it, since the title of the book is "data science from scratch"), then they will find the explanations of concepts too basic, and the Python code implementation examples mostly useless (they, after all, are not using the libraries specifically designed to do data science, but rather implementing a n
...more
An excellent tool for aspiring data scientists like myself.
There's no shortage of information on the topic, but it's hard to find it all in one place. You could spend weeks combing through forums, blog posts, and video tutorials only to find half as much useful information. Data Science from Scratch covers the foundations of many basic Machine Learning algorithms in a succinct and humorous way.
As fair warning, the math is a little much to take in for a single book. The author provides introducti ...more
Practical book which covers what's essential for data analysts getting into statistical analysis, machine learning and related topics. Good book for those starting out, but didn't have much to offer on the statistical learning side, principles and concepts wise. You're better off looking at books such as IPSUR (G. Jay Kerns) and ISLR (Hastie & Tibshirani) for such content. However, this is a practical book because it introduces many relevant ideas. Some qualms: MapReduce treatment is probabl
...more
Aside from the author's enthusiasm and breadth of knowledge, I did not get much out of this book. For me there are not enough details on the statistical concepts and too much detail in the 'from scratch' code samples. The code samples are also never to be used again, as the author admits at the end, because there are many Python packages that do an infinitely more efficient and scalable job of analysing data. The modelling concepts are not differentiated clearly enough so it's not understood why
...more
I read this prior to beginning an MSc in Data Science and found it to be a great introduction to data science, starting out with the very basics before moving into more general ML techniques and finishing up with some of the more complex topics such as MapReduce. Not an in-depth textbook by any means, but I do not think that is the purpose of this book, which is rather to give the reader a well-rounded idea of the field.
It is a wonderful book for understanding the details of how some machine learning methods are implemented. It is also good practice in basic Python. As the title suggests, every function is constructed from scratch. I really enjoyed the book; however, I would not recommend it for learning ML and going directly to developing ML applications.
I rate it 4 because some examples shown in the book do not provide data to test them.
This book is good for improving your understanding of some details underlying data science algorithms, but it falls short on depth. Some concepts feel rushed and incomplete; the explanation sometimes isn't clear.
Even though the book is shallow, I would recommend it; here and there you can get a valuable piece of information from it.
The book covers the vast range of topics required to get started in data science. It introduces theory, frameworks, and libraries. As a result, none of the topics is hands-on with example problem solving, though the book has working code examples for all the concepts. To get a decent grip on data science, problem solving is crucial.
A good book for anyone who wants to start learning about statistics, data science principles, and machine learning through a practical approach using the Python language. It reminded me a lot of Collective Intelligence. For an introductory course I still prefer the latter, because it offers some more interesting activities and exercises.
Data Science from Scratch is a good Data Science overview. It covers the breadth of the "field" targeting (aspiring) practitioners (for example, I couldn't find a "definition" of data science beyond the "it's a Venn diagram thing - data, math, hacking"). For practitioners, the "from scratch" approach is very useful. Some topics will be a quick skim, others are a close analysis of the code (Python) to understand specific implementation of "cartoon" examples. The from-scratch approach builds up th
...more
“This means that, where appropriate, we will dive into mathematical equations, mathematical intuition, mathematical axioms, and cartoon versions of big mathematical ideas.”
“Just run: pip install ipython and then search the Internet for solutions to whatever cryptic error messages that causes.”