Premium

Cloudera Hadoop Developer Certification Questions and Answer (Dumps and Practice Questions)



Question :

If X and Y are two MapReduce jobs and their dependency is set as below

x.addDependingJob(y)

What does it mean ?
 :
1. X will not start until y has finished
2. Y will not start until x has finished
3. Access Mostly Uused Products by 50000+ Subscribers
4. All of the above



Correct Answer : Get Lastest Questions and Answer :

Using addDependingJob(Job job) method you can define the dependecy of the MapReuce Job, to control the flow.




Question :

The option or switch in "hadoop fs" command for detailed help is
 :
1. '-show'
2. '-help'
3. Access Mostly Uused Products by 50000+ Subscribers
4. Any of the above

Correct Answer : Get Lastest Questions and Answer :

You have to use hadoop fs -help to show all the command related to hadoop file system.





Question :

Which of the following method or methods of JobControl object can be used to track the execution state of Jobs
 :
1. allFinished()
2. getFailedJobs()
3. Access Mostly Uused Products by 50000+ Subscribers
4. All of the above

Correct Answer : Get Lastest Questions and Answer :

JobControl object is used to control the group of Jobs, This class encapsulates a set of MapReduce jobs and its dependency
and there are various method get the status. It tracks the states of the jobs by placing them into different tables according to their states.
This class has a thread that submits jobs when they become ready, monitors the states of the running jobs, and updates the states of jobs based on the state changes of their depending jobs states.
e.g. getFailedJobs(),getReadyJobs(),getRunningJobs() ,getState(),getSuccessfulJobs(),getWaitingJobs()


Related Questions


Question :

Is there anyway in the MapReduce model that reducers communicate with each other in Hadoo framework ?

 :
1. Yes, using JobConf confguration object it is possible
2. Using distributed cache it is possible
3. Access Mostly Uused Products by 50000+ Subscribers
4. No, each reducers runs independently and in isolation.





Question :

What is true about combiner ?

 :
1. Combiner does the local aggregation of data, thereby allowing the number of mappers to process input data faster.
2. Combiner does the local aggregation of data, thereby reducing the number of mappers that need to run.
3. Access Mostly Uused Products by 50000+ Subscribers




Question :

in 3 mappers and 2 reducers how many distinct copy operations will be there in the sort or shuffle phase

 :
1. 3
2. 2
3. Access Mostly Uused Products by 50000+ Subscribers
4. 5




Question :

In which scenerio MapReduce is not suitable..

 :
1. text mining on the unstructured documents
2. Analyzing web documents
3. Access Mostly Uused Products by 50000+ Subscribers
4. for a large computation of finacial risk modeling and performance analysis.




Question : How can you use binary data in MapReduce?

  : How can you use binary data in MapReduce?
1. Binary data can be used directly by a map-reduce job. Often binary data is added to a sequence file
2. Binary data cannot be used by Hadoop fremework. Binary data should be converted to a Hadoop compatible format prior to loading
3. Access Mostly Uused Products by 50000+ Subscribers
4. Hadoop can freely use binary files with map-reduce jobs so long as the files have headers



Question : What is map - side join?
  : What is map - side join?
1. Map-side join is done in the map phase and done in memory
2. Map-side join is a technique in which data is eliminated at the map step
3. Access Mostly Uused Products by 50000+ Subscribers
4. None of these answers are correct