Spurred by Page’s ambition, Ghemawat and Jeff Dean came up with a dramatic improvement in handling massive amounts of information spread over multiple data centers. It split tasks among machines in a faster manner, in the same way a programmer performing an operation on large collections of data can spread the work over many computers without worrying about how to apportion the work. The program worked in two steps—first by mapping the system (figuring out how the information was spread out and duplicated in various locations—basically an indexing process) and then by reducing the information
...more