Cloudera Hadoop Developer Certification Questions and Answer (Dumps and Practice Questions)

Question :

You have following data in a hive table

ID:INT,COLOR:TEXT,WIDTH:INT
1,green,190
2,blue,300
3,yellow,220
4,blue,199
5,green,199
6,yellow,299
7,green,799

Select the correct MapReduce program which
can produce the output similar to below Hive Query.

Select id,color from table where width >=220;

1. 1
2. 2
3. Access Mostly Uused Products by 50000+ Subscribers
4. 4

Correct Answer : Get Lastest Questions and Answer :

Exp:

Mapper produces the output like

2,blue 1
3,yellow 1
6,yellow 1
7,green 1

And reducer will produce the output only key part from the Mapper

2,blue
3,yellow
6,yellow
7,green

Question :

You have following data in a hive table

ID:INT,COLOR:TEXT,WIDTH:INT
1,green,190
2,blue,300
3,yellow,220
4,blue,199
5,green,199
6,yellow,299
7,green,799

Select the correct MapReduce program which can produce
the output similar to below Hive Query
(Assuming single reducer is configured).

Select color,max(width) from table group by color;

1. 1
2. 2
3. Access Mostly Uused Products by 50000+ Subscribers
4. 4

Correct Answer : Get Lastest Questions and Answer :

Explanation: From the mapper, it emits the color as key and its width part as value.
In the reducer, for each color we iterate over all the widths value and calculate the maximum width.
And emits the color as key and max width as value.

Question :

You have following data in a hive table

ID:INT,COLOR:TEXT,WIDTH:INT
1,green,190
2,blue,300
3,yellow,220
4,blue,199
5,green,199
6,yellow,299
7,green,799

Select the correct MapReduce program
which can produce the output similar to below Hive Query

Select id,color,max(width) from table ;

1. 1
2. 2
3. Access Mostly Uused Products by 50000+ Subscribers
4. 4

Correct Answer : Get Lastest Questions and Answer :

Explanation: Mappper produces output as below, every key value is as 1, hence all values go to same reducer.

1 1,green,190
1 2,blue,300
1 3,yellow,220
1 4,blue,199
1 5,green,199
1 6,yellow,299
1 7,green,799

In the reducer part we will calculate the required result, by splitting the each value based on "," and
Stores the max value, its id and color, which is grouped by color.

Related Questions

Question :

Workflows expressed in Oozie can contain:

1. Iterative repetition of MapReduce jobs until a desired answer or state is reached.
2. Sequences of MapReduce and Pig jobs. These are limited to linear sequences of actions with exception handlers but no forks.
3. Sequences of MapReduce jobs only; no Pig or Hive tasks or jobs. These MapReduce sequences can be combined with forks and path joins.
4. Sequences of MapReduce and Pig. These sequences can be combined with other actions including forks, decision points, and path joins.

Question :

You have an employee who is a Date Analyst and is very comfortable with SQL.
He would like to run ad-hoc analysis on data in your HDFS duster.
Which of the following is a data warehousing software built on top of
Apache Hadoop that defines a simple SQL-like query language well-suited for this kind of user?
A. Pig B. Hue C. Hive D. Sqoop E. Oozie

1. A
2. B
3. C
4. D
5. E

Question :

You need to import a portion of a relational database every day as files to HDFS,
and generate Java classes to Interact with your imported data. Which of the following tools should you use to accomplish this?
A. Pig B. Hue C. Hive D. Flume E. Sqoop F. Oozie G. fuse-dfs

1. A,B
2. B,C
3. C,E
4. F,G

Question :

The size of block in HDFS is

1. 512 Bytes
2. 64 MB
3. 1024 KB
4. None of the above

Question :

To Create Sequence of Multiple MapReduce job (Chaining) same JobConf object is used

1. True
2. False

Question :

Which class/object represent a MapReduce job

1. Job
2. JobControl
3. Access Mostly Uused Products by 50000+ Subscribers