MapR (HP) Hadoop Developer Certification Questions and Answers (Dumps and Practice Questions)



Question : What is the default input format?

1. The default input format is XML. A developer can specify other input formats as appropriate if XML is not the correct input
2. There is no default input format. The input format always should be specified.
4. The default input format is TextInputFormat with byte offset as a key and entire line as a value



Correct Answer : 4


Explanation: Hadoop supports a wide range of input formats. The default is TextInputFormat, the simplest way to read data as lines of text: each record's key is the byte offset of the line and its value is the line itself.
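The key/value layout described above can be imitated in plain Java. This is only a sketch of the record shape, not Hadoop code: the class name TextRecords and the method read are made up for illustration, and the real TextInputFormat additionally handles input splits, compression and other line terminators.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

/** Plain-Java imitation of the (byte offset, line) records; hypothetical names, not Hadoop API. */
class TextRecords {

    /** Splits the input on '\n' and maps each line's starting byte offset to its text. */
    static Map<Long, String> read(byte[] input) {
        Map<Long, String> records = new LinkedHashMap<>();
        int start = 0;
        for (int i = 0; i <= input.length; i++) {
            boolean atEnd = (i == input.length);
            // Emit a record at every newline, and at end of input if a last line is pending.
            if (atEnd ? i > start : input[i] == '\n') {
                String line = new String(input, start, i - start, StandardCharsets.UTF_8);
                records.put((long) start, line);
                start = i + 1; // the next line begins after the newline byte
            }
        }
        return records;
    }

    public static void main(String[] args) {
        byte[] data = "apple\nkiwi\nfig".getBytes(StandardCharsets.UTF_8);
        read(data).forEach((offset, line) -> System.out.println(offset + "\t" + line));
    }
}
```

For the three-line sample, the keys are 0, 6 and 11: each key is the position of the line's first byte in the input, not a line number.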





Question : How can you override the default input format?


1. To override the default input format, the Hadoop administrator has to change the default settings in the config file
2. To override the default input format, a developer has to set the new input format on the job config before submitting the job to a cluster
4. None of these answers are correct

Correct Answer : 2

Explanation: A developer can always set a different input format on the job configuration (e.g. sequence files, binary files, compressed formats).





Question : What are the common problems with map-side join?

1. The most common problem with map-side joins is introducing a high level of code complexity.
This complexity has several downsides: increased risk of bugs and performance degradation.
Developers are cautioned to rarely use map-side joins.
2. The most common problem with map-side joins is a lack of available map slots, since map-side joins require a lot of mappers.
4. The most common problem with map-side joins is not clearly specifying a primary index in the join.
This can lead to very slow performance on large datasets.




Explanation: A map-side join holds data in memory while joining on a key. As a result, the data size is limited to the size of the available memory. If the data exceeds the available memory, an out-of-memory error will occur.
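The memory limit can be seen in a small plain-Java sketch. It assumes the common variant in which one input is small enough to be loaded into a hash map while the other is streamed record by record; in a real Hadoop job that loading would typically happen once per mapper, before any records are processed. The class and method names here are hypothetical and no Hadoop API is used.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Plain-Java sketch of an in-memory map-side join; illustrative names, not Hadoop API. */
class MapSideJoinSketch {

    /** Loads the small side completely into memory. This table is what must fit in the heap. */
    static Map<String, String> loadSmallSide(List<String[]> rows) {
        Map<String, String> lookup = new HashMap<>();
        for (String[] row : rows) {
            lookup.put(row[0], row[1]); // join key -> value
        }
        return lookup;
    }

    /** Streams the large side one record at a time and joins against the in-memory table. */
    static List<String> join(Map<String, String> lookup, List<String[]> largeSide) {
        List<String> joined = new ArrayList<>();
        for (String[] row : largeSide) {
            String match = lookup.get(row[0]);
            if (match != null) { // inner join: records without a match are dropped
                joined.add(row[0] + "," + row[1] + "," + match);
            }
        }
        return joined;
    }

    public static void main(String[] args) {
        Map<String, String> users = loadSmallSide(List.of(
                new String[] {"u1", "Alice"},
                new String[] {"u2", "Bob"}));
        List<String> out = join(users, List.of(
                new String[] {"u1", "login"},
                new String[] {"u3", "click"},
                new String[] {"u2", "logout"}));
        out.forEach(System.out::println);
    }
}
```

Only the lookup table grows with the size of the small input; the streamed side needs memory for one record at a time. If the table built in loadSmallSide outgrows the heap, the JVM fails with an OutOfMemoryError, which is the failure mode the explanation describes.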





Related Questions


Question : Put the statements in the correct order

A. The Application Master talks directly to the Node Managers on those nodes to launch the containers for tasks.
B. The Application Master, on boot-up, registers with the Resource Manager.
C. The Resource Manager allocates a container for the Application Master on a node and tells the Node Manager in charge of that node to launch the Application Master container.
D. Resource Manager allocates the containers on different nodes in the cluster.
E. A client program submits the application



1. A,B,C,D,E
2. E,C,B,D,A
4. A,E,B,D,C
5. D,A,B,E,C


Question : Select the correct order of steps in which non-MapReduce jobs are submitted on a YARN cluster
A. The Resource Manager starts the Application Master to allocate a container to the job.
B. User submits an app request by passing configuration to the Application Master and to the Resource Manager.
C. When the Application Master is done, it then unregisters from the Resource Manager.
D. Then the Application Master launches the container and monitors it.

1. D,B,A,C
2. D,A,B,C
4. C,D,A,B
5. D,C,B,A


Question : Which of the following components is part of the Resource Manager?


1. Application Master

2. Application Manager

4. Container



Question : Place the following steps in order of execution for MapR Direct Shuffle work flow in YARN.

A. After that, the Node Manager on each node launches containers using information about the node's local volume from the LocalVolumeAuxiliaryService.
B. The Application Master service initializes the application by calling initializeApplication() on the LocalVolumeAuxiliaryService.
C. Application Master Service requests task containers from the Resource Manager.
D. Then the Resource Manager sends the App Master information that AppMaster uses to request containers from the NodeManager.

1. A,B,C,D
2. D,C,B,A
4. B,C,D,A
5. C,D,B,A


Question : You have Hadoop ecosystem components that use MapReduce under the hood. How can you define which version of MapReduce they use, either classic or YARN?


1. Set a parameter called default_mode in the /opt/mapr/conf/hadoop_version configuration file of each component

2. While submitting the job, use the maprcli command and set the mode argument

4. None of the above



Question : In a YARN cluster, which of the following components can you use to monitor your job?


1. Job Tracker

2. Task Tracker

4. 1,3
5. 1,2,3