Creating a Hadoop Pseudo-Distributed Environment

Hadoop developers usually test their scripts and code on a pseudo-distributed environment (also known as a single node setup), which is a virtual machine that runs all of the Hadoop daemons simultaneously on a single machine. This allows you to quickly write scripts and test them on limited data sets without having to connect to a remote cluster or pay the expense of EC2. If you're learning Hadoop, you'll probably also want to set up a pseudo-distributed environment to facilitate your...

 •  0 comments  •  flag
Share on Twitter
Published on January 08, 2015 06:56
No comments have been added yet.