Question : What is data localization ? 1. Before processing the data, bringing them to the local node. 2. Hadoop will start the Map task on the node where data block is kept via HDFS 3. Access Mostly Uused Products by 50000+ Subscribers 4. None of the 1 and 2 is correct
Map Reduce Data Locality: Whenever possible, Hadoop will attempt to ensure that a MapTask on a node is working on a block of data stored locally On that node via HDFS.
There is no concept of data locality for the Reducers. All mappers in general have to communicate with all reducers.
Refer HadoopExam.com Recorded Training Module : 3
Question : All the mappers, have to communicate with all the reducers... 1. True 2. False
Explanation: If Mappers and Reducer runs on the same node, then there is ni need of transferring the data over the network. Which will reduce lot of network overhead.
Refer HadoopExam.com Recorded Training Module : 3 and 4