Hadoop in “Plain English”

Tuesday, April 10, 2012

In a nutshell, Hadoop is... an open-source Java framework that brings compute as physically close to the data as possible. Hadoop basically consists of two parts: the Hadoop Distributed File System (HDFS) and Hadoop MapReduce.

Plain & Simple... What is HDFS?

It takes a file and splits it up into many small 'chunks', then distributes and stores these ...
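
To make the split-and-distribute idea concrete, here is a toy sketch in Python. It is not Hadoop code and does not use any Hadoop API: the function names, the node names, and the round-robin placement are all assumptions made for illustration. Real HDFS calls the chunks "blocks" and uses its own placement policy.

```python
from collections import defaultdict


def split_into_chunks(data, chunk_size):
    """Cut data into consecutive pieces of at most chunk_size bytes."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]


def distribute(chunks, nodes):
    """Assign chunk indices to nodes round-robin.

    Round-robin is an assumption for this sketch; it is not the
    placement policy HDFS actually uses.
    """
    if not nodes:
        raise ValueError("at least one node is required")
    placement = defaultdict(list)
    for index in range(len(chunks)):
        placement[nodes[index % len(nodes)]].append(index)
    return dict(placement)


# Hypothetical 10-byte "file" and two hypothetical storage nodes.
data = b"abcdefghij"
chunks = split_into_chunks(data, 4)
placement = distribute(chunks, ["node-a", "node-b"])

print(chunks)     # [b'abcd', b'efgh', b'ij']
print(placement)  # {'node-a': [0, 2], 'node-b': [1]}
```

Joining the chunks back together in index order gives the original file, which is why each node only has to hold its own pieces.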