This book covers everything you need to build your first Hadoop cluster and begin analyzing and deriving value from your business and scientific data. Learn to solve big-data problems the MapReduce way, by breaking a big problem into chunks and creating small-scale solutions that can be flung across thousands upon thousands of nodes to analyze large data volumes in a short amount of wall-clock time. Learn how to let Hadoop take care of distributing and parallelizing your software―you just focus on the code; Hadoop takes care of the rest.
- Covers all that is new in Hadoop 2.0
- Written by a professional involved in Hadoop since day one
- Takes you quickly to the seasoned pro level on the hottest cloud-computing framework