In a nutshell Hadoop is.... An opensource java framework bringing compute as physically close to the data as possible. Hadoop basically constists of two parts: Hadoop Distributed File System (HDFS) and Hadoop MapReduce Plain & Simple... What is HDFS? It will take a file and split it up into many small 'chunks', it then distributes and stores these ...
Sam
Gadget girl, Tech Lover, Baker & Running enthusiast