IBM Certified Data Architect - Big Data Certification Questions and Answers (Dumps and Practice Questions)



Question : Which of the following statements is TRUE regarding IaaS vs. PaaS?
1. Performance and scalability requirements are a critical factor in deciding between Platform as a Service and Infrastructure as a Service deployment models
2. In PaaS, you get root access to the operating system

3. Web applications with very high transaction volumes are good candidates for Platform as a Service

4. In an Infrastructure as a Service deployment, the cloud provider provides security patching, monitoring and failover capabilities


Correct Answer : 1
Explanation: Managed cloud providers and some unmanaged cloud providers will either automatically, or with some manual control, provide installation and maintenance of
operating system patches for your workloads that are hosted in the cloud. While operating system patching will affect infrastructure as a service (IaaS), platform as a service
(PaaS) and software as a service (SaaS) clouds, in PaaS and SaaS offerings the middleware and applications are often under the provider's control. IaaS, however, is generally
there to host your applications, and, as we all know, some of our applications can be quite inflexible when it comes to their operating environment.





Question : You are an enterprise architect at ARINIKA Inc. Periodically, there is a big spike of new data amounting to multiple TB. Regulations require that data
older than one year be archived and that data older than three years be removed. Which of the following is the best low-cost solution?
1. Estimate the peak volume over a 3 year period and set up a Hadoop system with commodity HW and storage to accommodate that volume.

2. Estimate the peak volume over a 3 year period and set up a Hadoop system with NAS to accommodate the expected volume

3. Use Cloud elasticity capabilities to handle the peak and valley data volume

4. Use SAN storage with compression to handle the peak and valley data volume

Correct Answer : 1
Explanation: The cheapest solution is Hadoop on commodity hardware. Because data older than three years can be deleted, it is enough to estimate the peak data volume over a
three-year period and size the Hadoop cluster accordingly.
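
The sizing reasoning above can be sketched in a few lines of Python. This is only an illustrative estimate, not a sizing method from the exam material: the function names, the spike size, the number of spikes per year, and the 25% headroom are hypothetical values chosen for the example. The replication factor of 3 is the HDFS default, and the sketch assumes archived data stays on the cluster until it is deleted.

```python
from datetime import date

HDFS_DEFAULT_REPLICATION = 3  # default value of dfs.replication in HDFS


def raw_capacity_tb(spike_tb, spikes_per_year, retention_years=3,
                    replication=HDFS_DEFAULT_REPLICATION, headroom=0.25):
    """Estimate raw disk capacity (TB) needed at the end of the retention window.

    spike_tb, spikes_per_year and headroom are hypothetical inputs; the
    question does not state them.
    """
    logical_tb = spike_tb * spikes_per_year * retention_years
    return logical_tb * replication * (1 + headroom)


def retention_action(created, today):
    """Apply the stated rule: archive after one year, remove after three."""
    age_days = (today - created).days
    if age_days > 3 * 365:
        return "delete"
    if age_days > 365:
        return "archive"
    return "keep"


if __name__ == "__main__":
    # Hypothetical workload: 2 TB per spike, 12 spikes per year.
    print(raw_capacity_tb(spike_tb=2, spikes_per_year=12))
    print(retention_action(date(2022, 6, 1), today=date(2024, 1, 1)))
```

Because data is removed after three years, the estimate levels off at the end of the retention window instead of growing without bound, which is why a fixed commodity cluster can be sized up front.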





Question : SAN or NAS should not be used to set up HDFS
1. True
2. False

Correct Answer : 1
Explanation: Why use DAS when you can use Network Attached Storage (NAS) or SAN? For starters Hadoop is designed to be a "shared nothing" architecture where DAS will
suffice. Shared network storage like NAS or SAN could very well be overkill from a cost point of view if all you plan to store is petabytes of log and web 2.0 data in the Hadoop
cluster.



Related Questions


Question : Which of the following statements is TRUE regarding cloud computing solutions?
1. Cloud security is planned, developed, and layered on top of an application after
the application development process is complete

2. Stateless applications are better candidates for cloud services than applications
that maintain state

3. [...] scaling

4. Server virtualization is a requirement in a cloud implementation


Question : You are an independent software vendor and need to create a BigData solution for a customer. You want to select the software
components that will provide the most compatibility with new open source
components. In addition to Hadoop and any other software components you may
need, which one of the following would you select that is part of the initial release
of the Open Data Platform (ODP)?
1. Spark

2. Hive

3. [...]

4. Ambari


Question : Need to analyze your data with ZERO programming? Need to analyze your data from different data sources? Need to be able to visually display your data to find
patterns? BigSheets allows you to do all of that and more! BigSheets comes with all editions of IBM InfoSphere BigInsights.

Which of the following are valid sheet types that contain predefined logic for analyzing data?

A. Complement
B. Limit
C. Group
D. Update
E. Insert
1. A,B,C
2. B,C,D
3. [...]
4. A,D,E
5. A,C,E


Question : Which of the following is NOT a valid Service Level Agreement (SLA) metric?
1. Mean time between failures

2. Mean time to repair

3. [...]

4. Identification of failing component
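
For context on the first two metrics listed above: mean time between failures (MTBF) and mean time to repair (MTTR) are commonly combined into a steady-state availability figure, MTBF / (MTBF + MTTR). The short Python sketch below shows that arithmetic; the function name and the sample numbers are hypothetical and do not come from the exam material.

```python
def availability(mtbf_hours, mttr_hours):
    """Steady-state availability as the fraction of time the service is up."""
    if mtbf_hours <= 0 or mttr_hours < 0:
        raise ValueError("MTBF must be positive and MTTR non-negative")
    return mtbf_hours / (mtbf_hours + mttr_hours)


if __name__ == "__main__":
    # Hypothetical service: fails every 999 hours on average, 1 hour to repair.
    print(f"{availability(999, 1):.3%}")
```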


Question : You are working as chief solution architect at a software company. For the solution you have designed, you want to organize projects, manage the complexity of the solution,
and ensure that all architecture requirements have been addressed. Which of the following documents will help you do this?
1. Class Diagram

2. Composition Model

3. [...]

4. Operational Model

5. Component Model


Question : While working at IRINIKA INC, you find that another team is using the latest version of Hadoop to run many BigData processes and is benefiting from the
platform. You own a large application that you want to port to Hadoop. You also want to monitor this application and be able to deploy
replacements for any failed components. Which of the following will meet these requirements?
1. Ganglia and YARN

2. NAGIOS and YARN

3. [...]

4. Spark and YARN

5. Slider and YARN