Goodreads helps you keep track of books you want to read.
Start by marking “Agile Data Science 2.0: Building Full-Stack Data Analytics Applications with Spark” as Want to Read:
Agile Data Science 2.0: Building Full-Stack Data Analytics Applications with Spark
Enlarge cover
Rate this book
Clear rating
Open Preview

Agile Data Science 2.0: Building Full-Stack Data Analytics Applications with Spark

3.59  ·  Rating details ·  39 ratings  ·  7 reviews
Data science teams looking to turn research into useful analytics applications require not only the right tools, but also the right approach if they're to succeed. With the revised second edition of this hands-on guide, up-and-coming data scientists will learn how to use the Agile Data Science development methodology to build data applications with Python, Apache Spark, Ka ...more
Paperback, 352 pages
Published June 23rd 2017 by O'Reilly Media
More Details... Edit Details

Friend Reviews

To see what your friends thought of this book, please sign up.

Reader Q&A

To ask other readers questions about Agile Data Science 2.0, please sign up.

Be the first to ask a question about Agile Data Science 2.0

This book is not yet featured on Listopia. Add this book to your favorite list »

Community Reviews

Showing 1-30
Average rating 3.59  · 
Rating details
 ·  39 ratings  ·  7 reviews

More filters
Sort order
Start your review of Agile Data Science 2.0: Building Full-Stack Data Analytics Applications with Spark
Kristian Edlund
May 19, 2018 rated it it was ok
I really wanted to love this book. The concept of walking through a quite elaborate example is excellent. I think it shows many of the pit falls and iterations you need to do from start to finished data product.

However, as one of the other reviews also points out, the code example in there are unfinished and the instructions are hard to follow. For me it started in chapter 4, where you are asked to run the first piece of code to convert data. However, I wasn't sure where to run it and it took m
May 08, 2018 rated it it was ok
It's a good book for programmer or data science beginner level to know the data science concept and popular tool-set in the industry.
However the book is not up-to-date to keep with the latest software version. Thus some of the codes in the book are not working as expected(Too bad..). So it means the quality of the book is under the standard.
For example, it's using pyElasticsearch package however it's only support ElasticSearch (<2.0) version.
It's an ok book and needs more polished in my opinion
Rebecca Bilbro
Mar 07, 2018 rated it really liked it
Favorite quotes:
- "In data science, by contrast to software engineering, code shouldn't always be good; it should be eventually good."
- "In Agile Data Science, we value generalists over specialists...Examples of good Agile Data Science team members include: Designers who deliver working CSS; Web developers who build entire applications and understand the user interface and user experience; Data scientists capable of both research and building web services and applications; Researchers who check
Alex Galea
Apr 09, 2019 rated it really liked it
This book describes Russell's perspectives on good data science workflow using an agile methodology. He walks through a project about airline flight data in great detail and shows off some really neat tricks for building web apps and doing predictive analytics at scale. I would describe the material at intermediate level, where the reader should already be familiar with the data science ecosystem.

I loved chapter 2, which introduces the technology stack. It's awesome to see minimal working snippe
Jose Manuel
Dec 20, 2017 rated it it was amazing
impresionante. Pese a ser R mi opción principal y este libro usar Python, su enfoque , centrándose en la parte "científica" de la labor del Data Scientist es claramente acertada. Los primeros capítulos describen mi día a día de forma tan acertada que me ha llegado a emocionar. Su enfoque de mantener las cosas tan sencillas y escalables como sea posible centrándonos en las personas más que en los procesos, liberando resultados de manera rápida y continuada a lo largo del proceso, son consejos que ...more
Oct 02, 2019 rated it it was ok
Not bad at illustrating the concepts, but a bit too specific for the technology stack that was mentioned in the book. I thought this was helpful for data scientists to understand different steps in the process that they don't always see(DevOps, etc.).

The author's definition of "data science" (page 4) is more similar to "big data" than "statistics," so beware that you're not going to get a lot of stats out of this book.
Aug 24, 2018 rated it it was amazing
Interesting ideas and quite detail explanation of implementation.
I read this mainly for the description of the process and for hints how one might actually go about implementing all the steps. Book is clear on these points and therefore 5 stars. If one actually went on and ran the code probably something might not really work - but that's software :)
I am actually into these ideas and will do my best to get DS process at my current employer as close as is practical to this.
Dan Ryan
rated it it was amazing
Apr 02, 2018
rated it it was ok
Jun 20, 2019
rated it it was amazing
Jan 14, 2020
rated it really liked it
May 18, 2019
Glen Ritschel
rated it really liked it
Jul 09, 2018
Chris Garnett
rated it really liked it
Sep 16, 2020
rated it liked it
May 26, 2018
Daan Tor
rated it it was ok
Apr 26, 2020
Igor Vieira
rated it really liked it
Aug 01, 2020
rated it liked it
Apr 17, 2019
rated it really liked it
Jun 27, 2019
rated it really liked it
Jan 15, 2018
rated it liked it
Dec 30, 2018
Antje Fitzner
rated it really liked it
Jul 07, 2019
rated it really liked it
Nov 16, 2017
rated it really liked it
Dec 08, 2017
Gustav Lindqvist
rated it really liked it
Aug 15, 2018
Peter Roelants
rated it it was ok
Jan 11, 2018
Syafiq  Firdaus
rated it liked it
Jan 20, 2017
rated it really liked it
Jul 18, 2018
rated it really liked it
Jul 29, 2019
Arthur Mello
rated it it was ok
Sep 24, 2019
rated it really liked it
Jan 20, 2019
« previous 1 next »
There are no discussion topics on this book yet. Be the first to start one »

Readers also enjoyed

  • Range: Why Generalists Triumph in a Specialized World
  • Hands-On Machine Learning with Scikit-Learn and TensorFlow
  • Sprint: How to Solve Big Problems and Test New Ideas in Just Five Days
  • The Kremlin School of Negotiation
  • Metropolis Sourcebook
  • Legions Of Darkness: A Sourcebook For Kult
  • Storytelling with Data: A Data Visualization Guide for Business Professionals
  • Survival of the Prettiest: The Science of Beauty
  • Scar Tissue
  • Designing Data-Intensive Applications
  • Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy
  • Python Data Science Handbook: Tools and Techniques for Developers
  • Data Science from Scratch: First Principles with Python
  • Python for Data Analysis
  • Soumission
  • Forecasting: Methods and Applications
  • The Inconvenient Indian: A Curious Account of Native People in North America
  • The Vinland Sagas: The Norse Discovery of America
See similar books…

Related Articles

There's something great about a paperback book: They're perfect book club choices, you can throw them in your bag and go, and they've been out in...
57 likes · 17 comments