Jump to ratings and reviews
Rate this book

Programming MapReduce with Scalding

Rate this book
A practical guide to designing, testing, and implementing complex MapReduce applications in ScalaAbout This BookDevelop MapReduce applications using a functional development language in a lightweight, high-performance, and testable wayRecognize the Scalding capabilities to communicate with external data stores and perform machine learning operationsFull of illustrations and diagrams, practical examples, and tips for deeper understanding of MapReduce application developmentWho This Book Is ForThis book is for developers who are willing to discover how to effectively develop MapReduce applications. Prior knowledge of Hadoop or Scala is not required; however, investing some time on those topics would certainly be beneficial.

What You Will LearnSet up an environment to execute jobs in local and Hadoop modePreview the complete Scalding API through examples and illustrationsLearn about Scalding capabilities, testing, and pipelining jobsUnderstand the concepts of MapReduce patterns and the applications of its ecosystemImplement logfile analysis and ad-targeting applications using best practicesApply a test-driven development (TDD) methodology and structure Scalding applications in a modular and testable wayInteract with external NoSQL and SQL data stores from ScaldingDeploy, schedule, monitor, and maintain production systemsIn DetailProgramming MapReduce with Scalding is a practical guide to setting up a development environment and implementing simple and complex MapReduce transformations in Scalding, using a test-driven development methodology and other best practices.

This book will first introduce you to how the Cascading framework allows for higher abstraction reasoning over MapReduce applications and then dive into how Scala DSL Scalding enables us to develop elegant and testable applications. It will then teach you how to test Scalding jobs and how to define specifications and behavior-driven development (BDD) with Scalding. This book will also demonstrate how to monitor and maintain cluster stability and efficiently access SQL, NoSQL, and search platforms.

Programming MapReduce with Scalding provides hands-on information starting from proof of concept applications and progressing to production-ready implementations.

229 pages, Kindle Edition

First published June 25, 2014

7 people are currently reading
1 person want to read

About the author

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
2 (40%)
4 stars
1 (20%)
3 stars
0 (0%)
2 stars
2 (40%)
1 star
0 (0%)
No one has reviewed this book yet.

Can't find what you're looking for?

Get help and learn more about the design.