With the almost unfathomable increase in web traffic over recent years, driven by millions of connected users, businesses are gaining access to massive amounts of complex, unstructured data from which to gain insight. When Hadoop was introduced by Yahoo in 2007, it brought with it a paradigm shift in how this data was stored and analysed. Hadoop allowed small and medium sized companies to store huge amounts of data on cheap commodity servers in racks. The introduction of Big Data has allowed businesses to make decisions based on quantifiable analysis. Hadoop is now implemented in major organizations such as Amazon, IBM, Cloudera, and Dell to name a few. This book introduces you to Hadoop and to concepts such as ‘MapReduce’, ‘Rack Awareness’, ‘Yarn’ and ‘HDFS Federation’, which will help you get acquainted with the technology.
A Content Specialist by profession, Aravind Shenoy’s core interests are Web Design, Marketing, Content Writing, Content Development, and Business Analysis. A music buff - he loves listening to Alternative, Electronic, and Techno-industrial stuff. ATB, Massive Attack, The Prodigy, Chemical Brothers, Fatboy Slim, Darude, Fragma, Bomfunk MC's, and Jamiroquai rule his playlists. An engineering graduate from the Manipal Institute of Technology and author of several books, he feels “The most important thing is to be happy - it’s all that matters. After all, we are here for a good time, not a long time".
Ok as a quick (gets read in less than an hour) introduction to Hadoop's architecture. Very shallow, it doesn't get into the details at all. But enough to understand what this technology is about and how it works at bird eye.
The book summarize the difficult subject of big data handling and analysis using Hadoop in a simple approach. I recommend it to anyone interested in starting to study this field.
I think the book was not so easy to understand for a beginner like me. There were so many technical terms which were not thoroughly explained. Also, the book was too brief. I believe there should have been more details added.
Nice explanation about hadoop framework and its components technology. Fairly explained concepts to better understanding for processing bug data. Nice book
The book claims to be 156 pages, but per page turn Kindle counted 2 or 3%. which means the book is more likely between 33 and 50 pages. The explanations are very high level and unfortunately not fully free of technical jargon.
I just finished this pamphlet (30 pages - what else can you call it?) in about 30 minutes.
If you are looking for an in-depth technical manual on what is Hadoop and how to implement it - find something else to read. if however you are looking for a high-level review of the genesis of Hadoop and it's relationship to the current 'Big Data' technological wave and want to investigate the basics this may be just what you need to read. There is enough technical background to allow you to determine if a more in-depth analysis is required. At the very least - you will come away with enough understanding to have an intelligable conversation about Hadoop with non-technical people (hate to say this, but non-Technical managers - no offense meant, implied or otherwise).
Too short and some language usage problems. I think English is not the primary language of the author. It does have some good basic info. It covers both Hadoop 1.x and 2.0 which is very helpful.
This book is mainly for beginners. This book was available for free download for kindle. If you are seriously interested in learning hadoop, buy the definitive guide instead of spending money on this book
Incredibly poorly written. I wouldn't be surprised to learn that it was a foreign language blog post simply pasted into Google translate. It's only 20 pages long, but I simply couldn't finish it.