Premium

IBM Certified Data Architect - Big Data Certification Questions and Answers (Dumps and Practice Questions)



Question : You are working in an organization, which provide data storage solutions for many companies and government. However, their are various types of data , which of the
following approach can help to solve the impact on performance data and capacity
 : You are working in an organization, which provide data storage solutions for many companies and government. However, their are various types of data , which of the
1. Define a data catalog in a traditional data warehouse

2. Create different solutions to handle every kind of data

3. Store a wide range of data formats on the same platform

4. Define a comprehensive taxonomy and constantly review

Correct Answer : 4
Explanation:




Question : Which of the following is a browser based virtualization tool?
 : Which of the following is a browser based virtualization tool?
1. BigR

2. BigSheets

3. Analytics Workbench

4. Watson Explorer

Correct Answer : 2
Explanation: IBM BigSheets
A revolutionary browser-based analytics tool





Question : You are working in a Financial Risk Analytics company, where you have last years of history data, which is stored in TeraData data warehouse system and underline
storage is very costly. Hence, you decided to move this data to Hadoop commodity hardware for historical data and for ongoing data they will still use Teradata and uses federation
method to
access both sets of data. which of the following Big Data value proposition for this use case?
 : You are working in a Financial Risk Analytics company, where you have last  years of history data, which is stored in TeraData data warehouse system and underline
1. IBM Logical Data Warehouse and IBM Big SQL
2. Enterprise Data Warehouse
3. Pure Data for Analytics
4. InfoSphere Information Server

Correct Answer : 1
Explanation:


Related Questions


Question : Which statement is true about the storing files in HDFS


 : Which statement is true about the storing files in HDFS
1. Files are split in the block
2. All the blocks of the files should remain on same machine
3. Access Mostly Uused Products by 50000+ Subscribers
4. All of the above
5. 1 and 3 are correct


Question : You are designing an Apache Spark applications, which will be executed on IBM ODP. However, you know Spark Application can run on various cluster managers. Which will you find the best suited with IBM ODP
 : You are designing an Apache Spark applications, which will be executed on IBM ODP. However, you know Spark Application can run on various cluster managers. Which will you find the best suited with IBM ODP
1. Job Tracker

2. Apache Mesos

3. Access Mostly Uused Products by 50000+ Subscribers

4. Standalone deploy mode


Question : You are working as a researcher in a media company. This media company has access to various social network data. Now, they are planning to change the theme of their
existing stand-up talk show and want to check current trends and discussions going on among the people on various social networking site. Hence, they need to research this data and
apply analytics on social data. How this can be done? Using

  : You are working as a researcher in a media company. This media company has access to various social network data. Now, they are planning to change the theme of their
1. Hadoop

2. Netezza

3. Access Mostly Uused Products by 50000+ Subscribers

4. IBM Enterprise Content management


Question : Which of the following statements is TRUE regarding cloud applications?


  : Which of the following statements is TRUE regarding cloud applications?
1. Migrating a legacy application to the cloud is a simple solution to drive down cost

2. Architecting and deploying a scalable cloud application requires a private cloud implementation

3. Access Mostly Uused Products by 50000+ Subscribers

4. Leveraging a private vs. public cloud may result in sacrificing some of the core advantages of cloud computing



Question : In Hadoop framework , you know there are 's of nodes, working as a data nodes. Now, while putiing data in the cluster, it will be decided by NameNode , on which
node and which rack data should be copied? Which of the following will help NameNode to find the correct node in a rack ?
 : In Hadoop framework , you know there are 's of nodes, working as a data nodes. Now, while putiing data in the cluster, it will be decided by NameNode , on which
1. Admin has to do the pre-configuration on a NameNode

2. ResourceManager Will help

3. Access Mostly Uused Products by 50000+ Subscribers

4. YARN Cluster Manager


Question : You are working as a Chief Data Architect in a Retail Bank. And you are being asked to do following activities

- Monitor Each ATM transaction
- Monitor Each online Transaction

Also, you need to create a personalized model for each customer, using existing customer data and also using Customer facebook data. And system should be able to learn by this and
provide, highly targeted promotions. Which of the following system will help you to implement this
 : You are working as a Chief Data Architect in a Retail Bank. And you are being asked to do following activities
1. Apache Spark

2. Apache Hive

3. Access Mostly Uused Products by 50000+ Subscribers

4. Netezza