Question : Which one of the following statements describes the relationship between the ResourceManager and the ApplicationMaster?
1. The ApplicationMaster requests resources from the ResourceManager 2. The ApplicationMaster starts a single instance of the ResourceManager 3. Access Mostly Uused Products by 50000+ Subscribers 4. The ApplicationMaster starts an instance of the ResourceManager within each Container
Correct Answer : Get Lastest Questions and Answer : Explanation: ResourceManager (RM) is the master that arbitrates all the available cluster resources and thus helps manage the distributed applications running on the YARN system. It works together with the per-node NodeManagers (NMs) and the per-application ApplicationMasters (AMs).
NodeManagers take instructions from the ResourceManager and manage resources available on a single node. ApplicationMasters are responsible for negotiating resources with the ResourceManager and for working with the NodeManagers to start the containers.
ResourceManager is the central authority that manages resources and schedules applications running atop of YARN. Hence, it is potentially a single point of failure in a Apache YARN cluster. ` This document gives an overview of ResourceManager Restart, a feature that enhances ResourceManager to keep functioning across restarts and also makes ResourceManager down-time invisible to end-users.
ResourceManager Restart feature is divided into two phases:
ResourceManager Restart Phase 1 (Non-work-preserving RM restart): Enhance RM to persist application/attempt state and other credentials information in a pluggable state-store. RM will reload this information from state-store upon restart and re-kick the previously running applications. Users are not required to re-submit the applications.
ResourceManager Restart Phase 2 (Work-preserving RM restart): Focus on re-constructing the running state of ResourceManager by combining the container statuses from NodeManagers and container requests from ApplicationMasters upon restart. The key difference from phase 1 is that previously running applications will not be killed after RM restarts, and so applications won't lose its work because of RM outage.
Question : When can a reduce class also serve as a combiner without affecting the output of a MapReduce program?
1. When the types of the reduce operation's input key and input value match the types of the reducer's output key and output value and when the reduce operation is both communicative and associative. 2. When the signature of the reduce method matches the signature of the combine method. 3. Access Mostly Uused Products by 50000+ Subscribers 4. Always. The point of a combiner is to serve as a mini-reducer directly after the map phase to increase performance. 5. Never. Combiners and reducers must be implemented separately because they serve different purposes.
Correct Answer : Get Lastest Questions and Answer : Explanation: You can use your reducer code as a combiner if the operation performed is commutative and associative.
1. 1. Iterate over the DistributedCache instance in the Mapper and add all the cached file paths to an array. 2. 2. There is a direct method available on the DistributedCache.getAllFilePath() 3. Access Mostly Uused Products by 50000+ Subscribers 4. 4. All of the above
Question : Select the correct statement while reading/writing the data in RDBMS using MapReduce 1. In order to use DBInputFormat you need to write a class that deserializes the columns from the database record into individual data fields to work with 2. The DBOutputFormat writes to the database by generating a set of INSERT statements in each reducer 3. Access Mostly Uused Products by 50000+ Subscribers 4. If you want to export a very large volume of data, you may be better off generating the INSERT statements into a text file, and then using a bulk data import tool provided by your database to do the database import. 5. All of the above