Goodreads helps you keep track of books you want to read.
Start by marking “Hadoop: The Definitive Guide” as Want to Read:
Hadoop: The Definitive Guide
Enlarge cover
Rate this book
Clear rating
Open Preview

Hadoop: The Definitive Guide

3.94  ·  Rating Details ·  558 Ratings  ·  44 Reviews
Hadoop: The Definitive Guide helps you harness the power of your data. Ideal for processing large datasets, the Apache Hadoop framework is an open source implementation of the MapReduce algorithm on which Google built its empire. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems: programmers will find details for a
Kindle Edition, 528 pages
Published (first published May 1st 2009)
More Details... edit details

Friend Reviews

To see what your friends thought of this book, please sign up.

Reader Q&A

To ask other readers questions about Hadoop, please sign up.

Be the first to ask a question about Hadoop

This book is not yet featured on Listopia. Add this book to your favorite list »

Community Reviews

(showing 1-30)
filter  |  sort: default (?)  |  Rating Details
Ahmed Attyah
Jan 10, 2012 Ahmed Attyah marked it as to-read  ·  review of another edition
Shelves: programming
i got really interested about Hadoop, that is why i started reading this book :), there are only 3 books about Hadoop, and from reviews i read looks like this one is the best.
Todd N
Feb 19, 2012 Todd N rated it it was amazing  ·  review of another edition
Shelves: kindle, big-data
This is the single best reference guide to Hadoop and related projects, and it's the only O'Reilly book I have read cover to cover.

Here is the way I recommend reading it: Read through the first two chapters including the tutorial walk through with the weather examples, then jump ahead and read the introduction for each of the related projects Pig (chapter 11), Hive (12), HBase (13), Zookeeper (14), Sqoop (15). Then read the case studies in the last chapter. Then go back and read about Hadoop in
Veselin Nikolov
May 22, 2010 Veselin Nikolov rated it it was amazing  ·  review of another edition
Обяснява някои концепции на NoSQL, както и идеологията на Hadoop, като на места навлиза в детайли отвъд моите интереси.

Ако я сравня с "Hadoop Pro", която въобще не ми беше полезна, тази трябва да има 5 звездички. Въпреки това има известни пропуски, примерно няма информация за Hive, а тази за HBase e ограничена.

За целите на дипломната ми работа и първоначално запознаване с технологията, книгата е повече от достатъчна, а и още няма алтернатива.
Alex Ott
Very good book, that allows to get high level overview of Hadoop, and related projects, together with description of other Hadoop-related projects - Pig, HBase, and other.
I'll recommend this book for all developers, who want to learn about Hadoop, it's usage and programming for it
Ha Truong
Dec 10, 2014 Ha Truong rated it it was amazing  ·  review of another edition
Recommends it for: software engineer, software architect, developer
Shelves: technology
The book opens the door to Hadoop world and guides you to major places such as HDFS, Map Reduce, Hive, Pig, ZooKeeper, HBase, Sqoop. Not only gives a first impression of what Hadoop, it also gives a deeper knowledge about each component and related technologies. Thus, if you just want a book to rule them all, pick this one.

However, because the ambition of the author is to put all into one book, you might feel overwhelmed with many details under the hood. It should be better you just read the int
Saul Cruz
Definitely a good way to start, I'd recommend the latest version as many blocks are not being used anymore, however if you really want to understand the underlying engine, this is the book to start with, Map Reduce is a complex Model that probably you'll never tweak, however, it is very important to completely understand how this model works so that you can optimize a cluster, and if you want probably come up with a new data processing technology (i.e. there are some tools that work on top of ma ...more
Sep 15, 2014 Sam rated it really liked it  ·  review of another edition
Shelves: tech, data, safari
This is a great overview of the various tools/technologies that make up the Hadoop ecosystem. Each chapter that covers a different tool/technology is a good overview of each. Each area is quickly finding a slew of of books on each individually, but I still find this is a good place to start. With a fourth edition coming soon (available in pre-release online), it's nice to see that they're trying to keep this up to date as the technology changes.
May 29, 2015 Amit rated it really liked it  ·  review of another edition
Shelves: computers
This is best Hadoop book. Brief introduction of all related tools e.g. Hive/Pig/HBase/ZooKeeper/Sqoop
1. Initial 10 chapters are devoted for Hadoop.
2. Writing Map/Reduce programs using the given online reference is enough; this books is just good to understand the internals of these operations.
3. Best is to start referring Apache Hadoop developer reference along with Hadoop stand alone setup.
4. Book is helpful to get more deeper into the Hadoop Logic.
Sidhartha Ray
I've already read the following chapters:

2nd Chapter - MapReduce:
>A good point to start different components of MapReduce program, Mapper, Reducer and all...
>Got a good dataset(weather dataset from NCDC) to play around...
>We can use Cloudera's distribution CDH4 for practicing the programs

7th Chapter - MapReduce Types and Formats:
Apr 10, 2016 Manzur rated it it was amazing  ·  review of another edition
Shelves: programming
This book is really fantastic! It's a complete reference on Hadoop ecosystem, and should be first point of contact for the person playing with Hadoop. Content and writing style is really approachable -- I wish that other technical authors are able to write on the same level as Tom White does.
Alex Ott
Good book on basics of Hadoop (HDFS, MapReduce & other related technologies). This book provides all necessary details to start work with Hadoop, program using it, administer, etc.

I actually read 1st edition as well, but I found many new & useful additions in new edition
Dec 31, 2015 Dariusz rated it liked it  ·  review of another edition
Shelves: owned, informatyka
Świetna jako przegląd technoligii związanych z Hadoopem, wyjątkowo mizerna jako źródło przykładów kodu i zastosowań (bo "200 sposobów na wyliczenie temperatury maksymalnej" to nie jest to czego oczekiwałem).
Anatoliy Kaverin
Nov 26, 2014 Anatoliy Kaverin rated it really liked it  ·  review of another edition
Best book to dive into Hadoop world.
Of course hadoop API evolves pretty fast, but I was able with minor changes to launch most of code samples.
Very handy, especially provides guidance to use local/dev mode to start immediate implementation of M/R stuff
Michael Economy
Jan 29, 2012 Michael Economy rated it really liked it  ·  review of another edition
Shelves: work-related
Pretty good summary. Hadoop and it's ecosystem are incredibly complex. I'd be terrified to deploy it without reading this book first. I guess I'm still pretty terrified, but markedly less so.

Some of the writing was a bit wonky, but overall really good.
The layout is confusing and non-intuitive. The writing often omits important points. And there is much space given over to specific technologies and not to general Hadoop understanding and programming.
Frank Palardy
Aug 29, 2014 Frank Palardy rated it really liked it  ·  review of another edition
Shelves: coding-misc
This seemed like a good book, but not written in much of an order.
Aug 27, 2013 David rated it it was amazing  ·  review of another edition
Detailed and clear introduction to Hadoop. My main source for getting started with Hadoop.
Christopher Noyes
For those trying to learn hadoop, pig, hive and the like and other big data technology, it's a real useful book.
Anton Kaliaev
Dec 24, 2014 Anton Kaliaev rated it really liked it  ·  review of another edition
Книга написана простым языком, иллюстрации добротные - в общем книга понравилась
Jul 03, 2013 Nourlcn rated it it was amazing  ·  review of another edition
Good book to understand hadoop and mapreduce. As well as hbase, hive and zookeeper.
Giovanni Pelosi
Aug 20, 2012 Giovanni Pelosi rated it really liked it  ·  review of another edition
Shelves: comp-lang
i'd liked a flume chapter too ...
Aug 01, 2013 Ian rated it really liked it  ·  review of another edition
Ow, my head hurts.
Aug 04, 2012 David rated it it was amazing  ·  review of another edition
Shelves: computers-cloud
Just about every single page of this book is useful. My copy must be heavier with all the pencil marks and notes I've made. (3rd edition)
Jun 17, 2013 Karishma rated it it was amazing  ·  review of another edition
the best book for any newbie who wants to get started in the world of hadoop
Collin Rogowski
Sep 13, 2012 Collin Rogowski rated it it was amazing  ·  review of another edition
Very thorough and easily readable introduction for the whole Hadoop ecosystem. Can be read "as is" to get an overview, but can also be used as a reference while implementing projects with Hadoop.
Ravi Kumar
Jun 20, 2013 Ravi Kumar is currently reading it  ·  review of another edition
Started reading it today (2oth June)... #in
Mar 24, 2016 Arun rated it really liked it  ·  review of another edition
Shelves: 2016
Useful hadoop reference. Reads like a novel!
Harry Yeh
Dec 12, 2012 Harry Yeh rated it really liked it  ·  review of another edition
Good overview of Hadoop - Definitely a useful guide if you are looking at getting into Big Data, Map Reduce etc
Ambarish Hazarnis
Jun 10, 2014 Ambarish Hazarnis rated it it was amazing  ·  review of another edition
Shelves: technical
A good book if you want to get started hands on with Hadoop.
Joe Feeney
Feb 13, 2016 Joe Feeney rated it really liked it  ·  review of another edition
This was not about about an elephant named Hadoop.
« previous 1 3 4 5 6 7 8 9 next »
topics  posts  views  last activity   
Hadoop Training 1 1 Aug 15, 2016 09:54PM  
  • Hadoop Operations
  • MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems
  • Hadoop in Action
  • HBase: The Definitive Guide
  • MongoDB: The Definitive Guide
  • Mining of Massive Datasets
  • Seven Databases in Seven Weeks: A Guide to Modern Databases and the NoSQL Movement
  • Beautiful Data: The Stories Behind Elegant Data Solutions (Theory In Practice, #31)
  • Learning Spark
  • Natural Language Processing with Python
  • Programming Scala: Scalability = Functional Programming + Objects
  • The Art of Multiprocessor Programming
  • Machine Learning in Action
  • Scala in Depth
  • Lucene in Action
  • RESTful Web Services
  • Data Analysis with Open Source Tools
  • Big Data

Goodreads is hiring!

If you like books and love to build cool products, we may be looking for you.
Learn more »

Share This Book

“cluster.” 0 likes
“Chapter 10. Setting Up a Hadoop Cluster” 0 likes
More quotes…