Premium

IBM Certified Data Architect - Big Data Certification Questions and Answers (Dumps and Practice Questions)



Question : Which of the following scheduler ensures that certain users, groups, or production applications always get sufficient resources.
When a queue contains waiting applications, it gets at least its minimum share
 : Which of the following scheduler ensures that certain users, groups, or production applications always get sufficient resources.
1. Fair Scheduler
2. Capacity Scheduler
3. FIFO Scheduler
4. Both 1 and 2
5. Both 2 and 3


Correct Answer : 1
Exp: In addition to providing fair sharing, the Fair scheduler allows guaranteed minimum shares to be assigned to queues, which is useful for ensuring that
certain users, groups, or production applications always get sufficient resources. When a queue contains waiting applications, it gets at least its minimum
share; in contrast, when the queue does not need its full guaranteed share, the excess is split between other running applications. To avoid a single user
flooding the clusters with hundreds of jobs, the Fair scheduler can limit the number of running applications per user and per queue through the
configurations file. Using this limit, user applications will wait in the queue until previously submitted jobs finish.





Question :

Select the correct statement for the Fair Scheduler



 :
1. allows containers to request variable amounts of memory and schedules based on those requirements
2. If an application is given a container that it cannot use immediately due to a shortage of memory, it can reserve that container, and no other application can use it until the container is released.
3. works best when the workloads are well known, which helps in assigning the minimum capacity.
4. 1 and 2
5. 1 and 3

Correct Answer : 2
The YARN Fair scheduler allows containers to request variable amounts of memory and schedules based on those requirements. Support for other
resource specifications, such as type of CPU, is under development. To prevent multiple smaller memory applications from starving a single large memory
application, a "reserved container" has been introduced. If an application is given a container that it cannot use immediately due to a shortage of memory, it
can reserve that container, and no other application can use it until the container is released. The reserved container will wait until other local containers
are released and then use this additional capacity (i.e., extra RAM) to complete the job. One reserved container is allowed per node, and each node may
have only one reserved container. The total reserved memory amount is reported in the ResourceManager UI. A larger number means that it may take
longer for new jobs to get space.

The Capacity scheduler works best when the workloads are well known, which helps in assigning the minimum capacity. For this scheduler to work most
effectively, each queue should be assigned a minimal capacity that is less than the maximal expected workload. Within each queue, multiple applications
are scheduled using hierarchical FIFO queues similar to the approach used with the stand-alone FIFO scheduler.






Question : The Fair scheduler works best when there is a

 : The Fair scheduler works best when there is a
1. When there is a need of Higher Memory
2. lot of variability between queues
3. workloads are well known
4. When there is a need of Higher CPU
5. When all the Jobs needs to be processed in submission order

Correct Answer : 2

Explanation: A new feature in the YARN Fair scheduler is support for hierarchical queues. Queues may now be nested inside other queues, with each queue splitting
the resources allotted to it among its subqueues in a fair scheduling fashion. One use of hierarchical queues is to represent organizational boundaries and
hierarchies. For example, Marketing and Engineering departments may now arrange a queue structure to reflect their own organization. A queue can also
be divided into subqueues by job characteristics, such as short, medium, and long run times.
The Fair scheduler works best when there is a lot of variability between queues. Unlike with the Capacity scheduler, all jobs make progress rather than
proceeding in a FIFO fashion in their respective queues.



Related Questions


Question : Performing a mathematical operation on a Big R vector variable will automatically loop through each item in the vector

 : Performing a mathematical operation on a Big R vector variable will automatically loop through each item in the vector
1. True
2. False


Question : You are working with an online e-commerce company. Where you need to find recommendations for each user based on their interest. Which of the following tool will be
most useful
  : You are working with an online e-commerce company. Where you need to find recommendations for each user based on their interest. Which of the following tool will be
1. Spark

2. Hadoop

3. Access Mostly Uused Products by 50000+ Subscribers

4. Cloudant


Question : Which of the following is a correct statement for IBM Business Data Model?
 : Which of the following is a correct statement for IBM Business Data Model?
1. Enterprise-wide and applies to the industry, independently of line-of-business considerations

2. Independent of organizational or technological considerations, providing a stable basis for business modeling

3. Access Mostly Uused Products by 50000+ Subscribers

4. 1 and 2
5. 1,2,and 3


Question : Which of the following master component will take care of replicating data across the nodes in a Hadoop Cluster
 : Which of the following master component will take care of replicating data across the nodes in a Hadoop Cluster
1. Job Tracker

2. Edge Node

3. Access Mostly Uused Products by 50000+ Subscribers

4. Data Node


Question : Select the correct statement for the NameNode ?

 :  Select the correct statement for the NameNode ?
1. NameNode daemon must be running at all the times
2. NameNode holds all its metadata in RAM for fast access.
3. Access Mostly Uused Products by 50000+ Subscribers
4. 1,2 and 3 are correct
5. 1 and 2 are correct




Question : If a file which is MB how much space block space it will used ?
  : If a file which is MB how much space block space it will used ?
1. 33 MB
2. 64 MB
3. Access Mostly Uused Products by 50000+ Subscribers
4. None of the Above