Oozie is a workflow engine:
- Runs on a server, typically outside the cluster
- Runs workflows of Hadoop jobs, including Pig, Hive, and Sqoop jobs
- Submits those jobs to the cluster based on a workflow definition
- Workflow definitions are submitted via HTTP
- Jobs can be run at specific times (one-time or recurring)
- Jobs can be run when data is present in a directory
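The time-based and data-availability triggers above are handled by Oozie coordinators. A minimal sketch of a coordinator definition follows; the app name, dates, and HDFS paths are hypothetical placeholders:

```xml
<!-- Coordinator: run a workflow daily, but only once the day's input data exists -->
<coordinator-app name="daily-wf" frequency="${coord:days(1)}"
                 start="2024-01-01T00:00Z" end="2024-12-31T00:00Z"
                 timezone="UTC" xmlns="uri:oozie:coordinator:0.4">
  <datasets>
    <dataset name="logs" frequency="${coord:days(1)}"
             initial-instance="2024-01-01T00:00Z" timezone="UTC">
      <!-- the coordinator waits until data is present at this path -->
      <uri-template>hdfs:///data/logs/${YEAR}/${MONTH}/${DAY}</uri-template>
    </dataset>
  </datasets>
  <input-events>
    <data-in name="input" dataset="logs">
      <instance>${coord:current(0)}</instance>
    </data-in>
  </input-events>
  <action>
    <workflow>
      <!-- path to the workflow definition this coordinator launches -->
      <app-path>hdfs:///apps/my-workflow</app-path>
    </workflow>
  </action>
</coordinator-app>
```

The coordinator materializes one workflow run per day and holds each run until the corresponding dataset instance appears in HDFS.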
Select the correct statement:
1. In an Oozie workflow, all the MapReduce jobs can run in sequence only
2. Jobs can run in parallel as well as in sequence
3. (option text missing in the source)
4. All of the above
5. 2 and 3
Oozie is a system for describing the workflow of a job, where that job may contain a set of MapReduce jobs, Pig scripts, filesystem operations, etc., and it supports forking and joining of the data flow.
It doesn't, however, allow you to stream the output of one MR job directly as the input to another: the map-reduce action in Oozie still requires an output format of some type, typically a file-based one, so the output of job 1 is still serialized via HDFS before being processed by job 2.
Oozie can run jobs sequentially (one after the other) and in parallel (multiple at a time).
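The fork/join behaviour described above can be sketched as a minimal workflow definition. The action names and paths are hypothetical, and lightweight `fs` actions stand in for real Pig/Hive/MapReduce actions:

```xml
<workflow-app name="fork-join-demo" xmlns="uri:oozie:workflow:0.5">
  <start to="split"/>
  <!-- fork: both branches run in parallel -->
  <fork name="split">
    <path start="step-a"/>
    <path start="step-b"/>
  </fork>
  <action name="step-a">
    <fs><mkdir path="${nameNode}/tmp/demo/a"/></fs>
    <ok to="merge"/>
    <error to="fail"/>
  </action>
  <action name="step-b">
    <fs><mkdir path="${nameNode}/tmp/demo/b"/></fs>
    <ok to="merge"/>
    <error to="fail"/>
  </action>
  <!-- join: waits for every forked branch to succeed -->
  <join name="merge" to="final-step"/>
  <!-- this action runs sequentially, after both branches complete -->
  <action name="final-step">
    <fs><mkdir path="${nameNode}/tmp/demo/done"/></fs>
    <ok to="end"/>
    <error to="fail"/>
  </action>
  <kill name="fail">
    <message>Workflow failed</message>
  </kill>
  <end name="end"/>
</workflow-app>
```

Every path leaving a `fork` must eventually reach the matching `join`; the join node fires only once all branches have completed successfully.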
(Question stem missing in the source.)
1. No reducer can start until the last mapper has finished
2. If a mapper is running slowly, Hadoop will start another instance of that mapper on another machine
3. (option text missing in the source)
4. The result of whichever mapper instance finishes first will be used
5. All of the above
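Options 2 and 4 describe Hadoop's speculative execution: a duplicate attempt of a slow task is launched on another node, and the first attempt to finish wins while the other is killed. A sketch of the configuration properties that control it, assuming Hadoop 2 (MRv2) property names:

```xml
<!-- mapred-site.xml: speculative execution is enabled by default -->
<property>
  <name>mapreduce.map.speculative</name>
  <value>true</value> <!-- allow a second attempt of a straggling map task -->
</property>
<property>
  <name>mapreduce.reduce.speculative</name>
  <value>true</value> <!-- same mechanism for reduce tasks -->
</property>
```

Setting either property to `false` disables speculative attempts for that task type, which can be useful when tasks have side effects that must not run twice.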