Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale
28%
One rule of thumb is to aim for reducers that each run for five minutes or so, and which produce at least one HDFS block’s worth of output.
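The rule of thumb above can be turned into a rough back-of-the-envelope estimate. The sketch below is only an illustration: the function name, its parameters, and the 128 MiB block size are assumptions (check the cluster's actual `dfs.blocksize`), and the total reduce time has to come from your own measurements of a previous run.

```python
import math

MIB = 1024 * 1024


def suggest_reducer_count(total_output_bytes, total_reduce_seconds,
                          block_size_bytes=128 * MIB, target_seconds=300):
    """Hypothetical helper: estimate a reducer count from the rule of thumb.

    total_output_bytes   -- expected total size of the job's reduce output
    total_reduce_seconds -- measured or estimated reduce work, summed over
                            all reducers (an assumption you must supply)
    block_size_bytes     -- HDFS block size; 128 MiB is assumed here
    target_seconds       -- desired runtime per reducer (five minutes)
    """
    if total_output_bytes < 0 or total_reduce_seconds < 0:
        raise ValueError("sizes and durations must be non-negative")
    if block_size_bytes <= 0 or target_seconds <= 0:
        raise ValueError("block size and target runtime must be positive")

    # Enough reducers that each one runs for roughly target_seconds.
    by_time = math.ceil(total_reduce_seconds / target_seconds)

    # No more reducers than there are full blocks of output, so that each
    # reducer still writes at least one block's worth of data.
    by_output = total_output_bytes // block_size_bytes

    # Always run at least one reducer.
    return max(1, min(by_time, by_output))
```

For example, a job with 10 GiB of output and 6,000 seconds of total reduce work gives 20 reducers under these assumptions: the five-minute target is the binding limit, since the output alone would allow up to 80. The result is a starting point for tuning, not a setting to apply blindly.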
38%
HDFS clusters do not benefit from using RAID.
38%
To work seamlessly, SSH needs to be set up to allow passwordless login for the hdfs and yarn users from machines in the cluster.