Premium

Mapr (HP) Hadoop Developer Certification Questions and Answers (Dumps and Practice Questions)



Question : When a JobTracker stops receiving heartbeats from the task tracker, the JobTracker


  : When a JobTracker stops receiving heartbeats from the task tracker, the JobTracker
1. Retry three times to schedule the task on same task tracker

2. Reschedules the tasks on failed TaskTracker to other TaskTracker

3. Report Failures and stops

4. Restarts the failed TaskTracker

Correct Answer : 2
Explanation:




Question : Which is the default scheduler in YARN architecture

  : Which is the default  scheduler  in YARN architecture
1. Fair Scheduler

2. Capacity Scheduler

3. Map Scheduler

4. Hash scheduler


Correct Answer : 1
Explanation:




Question : Which of the following is a correct pattern to use Hadoop framework

1. Summarizing the data for instance statistical summaries, counts, and indexes for group of data.
2. Filtering data for instance Sample, sanitize, identify top n , and filter unique data
3. Optimize data for instance Transform, Partition, sort and generate data


  :  Which of the following is a correct pattern to use Hadoop framework
1. 1,2

2. 2,3

3. 1,3

4. 1,2 and 3


Correct Answer : 4
Explanation:


Related Questions


Question : Which HDFS command copies an HDFS file named foo to the local filesystem as localFoo?

  : Which HDFS command copies an HDFS file named foo to the local filesystem as localFoo?
1. hadoop fs -get foo LocalFoo
2. hadoop -cp foo LocalFoo
3. Access Mostly Uused Products by 50000+ Subscribers
4. hadoop fs -put foo LocalFoo


Question : Which one of the following statements describes the relationship between the NodeManager and the ApplicationMaster?

  : Which one of the following statements describes the relationship between the NodeManager and the ApplicationMaster?
1. The ApplicationMaster starts the NodeManager in a Container
2. The NodeManager requests resources from the ApplicationMaster
3. Access Mostly Uused Products by 50000+ Subscribers
4. The NodeManager creates an instance of the ApplicationMaster


Question : Which one of the following statements is FALSE regarding the communication between DataNodes and a federation of NameNodes in Hadoop .?

  : Which one of the following statements is FALSE regarding the communication between DataNodes and a federation of NameNodes in Hadoop .?
1. Each DataNode receives commands from one designated master NameNode.
2. DataNodes send periodic heartbeats to all the NameNodes.
3. Access Mostly Uused Products by 50000+ Subscribers
4. DataNodes send periodic block reports to all the NameNodes.


Question : You need to run the same job many times with minor variations. Rather than hardcoding all
job configuration options in your drive code, you've decided to have your Driver subclass
org.apache.hadoop.conf.Configured and implement the org.apache.hadoop.util.Toolinterface.
Identify which invocation correctly passes.mapred.job.name with a value of Example to Hadoop?

  : You need to run the same job many times with minor variations. Rather than hardcoding all
1. hadoop "mapred.job.name=Example" MyDriver input output
2. hadoop MyDriver mapred.job.name=Example input output
3. Access Mostly Uused Products by 50000+ Subscribers
4. hadoop setproperty mapred.job.name=Example MyDriver input output
5. hadoop setproperty ("mapred.job.name=Example") MyDriver input output


Question : What types of algorithms are difficult to express in MapReduce v (MRv)?


  : What types of algorithms are difficult to express in MapReduce v (MRv)?
1. Algorithms that require applying the same mathematical function to large numbers of individual binary records.
2. Relational operations on large amounts of structured and semi-structured data.
3. Access Mostly Uused Products by 50000+ Subscribers
4. Large-scale graph algorithms that require one-step link traversal.
5. Text analysis algorithms on large collections of unstructured text (e.g, Web crawls).



Question : Which project gives you a distributed, Scalable, data store that allows you random, , real-time read/write access to hundreds of terabytes of data?

  : Which project gives you a distributed, Scalable, data store that allows you random, , real-time read/write access to hundreds of terabytes of data?
1. HBase
2. Hue
3. Access Mostly Uused Products by 50000+ Subscribers
4. Hive
5. Sqoop