What do you think?
Rate this book
274 pages, Kindle Edition
First published May 1, 2015
The big projects where unfortunately not the strong parts of the book. Where all the parts should have come together is now another mix from the high level data science process with low level inner workings of Twitter and StackOverflow. It’s hard to keep the levels apart when there is so much going on. If Twitter and StackOverflow would have been introduced in a chapter on their own, then the last two chapters could focus on the data cleaning part. As it is now, you may need to do that work for yourself, what harms the idea of the book.
However, this book is the best I read from Packt Publishing in a long time. If you are interested in working with data from different sources, you still should buy the book. The technical part is done very well and will help you a lot.