Question : What is map - side join? 1. Map-side join is done in the map phase and done in memory 2. Map-side join is a technique in which data is eliminated at the map step 3. Access Mostly Uused Products by 50000+ Subscribers 4. None of these answers are correct
Explanation: The map-side join is a technique that allows for splitting map file between different data nodes. The data will be loaded into memory. This technique allow very fast performance for the join
Question : How can you disable the reduce step?
1. The Hadoop administrator has to set the number of the reducer slot to zero on all slave nodes. This will disable the reduce step. 2. It is impossible to disable the reduce step since it is critical part of the Map-Reduce abstraction. 3. Access Mostly Uused Products by 50000+ Subscribers 4. While you cannot completely disable reducers you can set output to one. There needs to be at least one reduce step in Map-Reduce abstraction.
Explanation: If developer uses MapReduce API he has full access to any number of mappers and reducers for job execution
Question : Why would one create a map-reduce without the reduce step? 1. Developers should design Map-Reduce jobs without reducers only if no reduce slots are available on the cluster 2. Developers should never design Map-Reduce jobs without reducers. An error will occur upon compile 3. Access Mostly Uused Products by 50000+ Subscribers 4. It is not possible to create a map-reduce job without at least one reduce step. A developer may decide to limit to one reducer for debugging purposes
Explanation: This is a map step only. MapReduce jobs are very common. They normally are used to perform transformations on data without sorting and aggregations