
IBM Certified Data Architect - Big Data Certification Questions and Answers (Dumps and Practice Questions)



Question : Which statement is true about storing files in HDFS?


1. Files are split into blocks
2. All the blocks of a file must remain on the same machine
4. All of the above
5. 1 and 3 are correct

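Illustration: the idea of splitting a file into blocks can be sketched as follows. This is only a rough model, not HDFS code. The 128 MB block size is an assumption (it is the usual default in Hadoop 2.x and later), the names `split_into_blocks` and `place_blocks` and the DataNode names are hypothetical, and the round-robin placement is a simplification: real HDFS placement is rack-aware and replicates each block.

```python
# Assumed block size: 128 MB (common default in Hadoop 2.x and later).
BLOCK_SIZE = 128 * 1024 * 1024


def split_into_blocks(file_size, block_size=BLOCK_SIZE):
    """Return a list of (offset, length) pairs, one per block of the file."""
    blocks = []
    offset = 0
    while offset < file_size:
        # The last block may be smaller than the configured block size.
        length = min(block_size, file_size - offset)
        blocks.append((offset, length))
        offset += length
    return blocks


def place_blocks(blocks, datanodes):
    """Assign block indexes to DataNodes round-robin (illustrative only)."""
    return {i: datanodes[i % len(datanodes)] for i in range(len(blocks))}


# A hypothetical 300 MB file and three hypothetical DataNodes.
blocks = split_into_blocks(300 * 1024 * 1024)
placement = place_blocks(blocks, ["dn1", "dn2", "dn3"])
print(len(blocks), placement)
```

In this sketch the blocks of one file end up on different machines, which is the point of block-based storage: no single node has to hold the whole file.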





Question : You are designing an Apache Spark application that will run on IBM ODP. Spark applications can run on various cluster managers. Which one is best suited to IBM ODP?
1. Job Tracker

2. Apache Mesos

4. Standalone deploy mode

Explanation: To run on a cluster, the SparkContext can connect to several types of cluster managers (Spark's own standalone cluster manager, Mesos,
or YARN), which allocate resources across applications. Once connected, Spark acquires executors on nodes in the cluster; these are processes that run computations and store data
for your application. Next, it sends your application code (defined by JAR or Python files passed to SparkContext) to the executors. Finally, SparkContext sends tasks to the
executors to run.

IBM ODP supports the open source Hadoop framework. Hence, the YARN cluster manager will be used.
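As a rough sketch of how the choice of cluster manager shows up in practice, the snippet below builds a `spark-submit` command line for each manager named in the explanation. The helper names `master_url` and `spark_submit_command`, the host name `master-host`, and the script name `app.py` are hypothetical; the ports shown are the conventional defaults and may differ in a given installation.

```python
def master_url(manager, host="master-host"):
    """Return the value passed to --master for the given cluster manager."""
    urls = {
        # Spark's own standalone manager (7077 is the conventional port).
        "standalone": f"spark://{host}:7077",
        # Apache Mesos (5050 is the conventional port).
        "mesos": f"mesos://{host}:5050",
        # YARN takes no host: the cluster is located through the Hadoop
        # configuration (HADOOP_CONF_DIR or YARN_CONF_DIR).
        "yarn": "yarn",
    }
    return urls[manager]


def spark_submit_command(manager, app="app.py", host="master-host"):
    """Build the spark-submit argument list (hypothetical app and host)."""
    return ["spark-submit", "--master", master_url(manager, host), app]


# On a Hadoop-based platform, the YARN variant would be the one to use.
print(" ".join(spark_submit_command("yarn")))
```

The application code itself does not change between managers; only the master setting does, which is why the question comes down to what the platform provides.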




Question : You are working as a researcher in a media company that has access to data from various social networks. The company plans to change the theme of its
existing stand-up talk show and wants to check the current trends and discussions among people on various social networking sites. Hence, it needs to research this data and
apply analytics to it. Which of the following can be used to do this?

1. Hadoop

2. Netezza

4. IBM Enterprise Content management

Explanation: Successful organizations understand that business content matters more than ever as mobile, social and cloud technologies transform their business models.
They are discovering new value with business content solutions that bring the power of analytics, process optimization and collaboration to all forms of content to achieve greater
degrees of insight and customer engagement.
See for yourself how IBM enterprise content management solutions use analytics to provide access to content for the right people at the right time to help make better decisions.



Related Questions


Question : Operational modeling elements represent the parts of the application and describe how they communicate with each other. Some of these elements include components
(pieces of the application itself), nodes (pieces of infrastructure that can run the application), locations (security zones or physical places), and connections (communication
links between elements). In the operational model, two levels are well documented. One of them is the theoretical level, and the other is ...

1. Regional

2. Physical

4. Logical


Question : Arinika Inc has large web server farms that continuously generate data in log files, and your data science team wants to analyze these logs. Which of the
following is recommended?
1. Apache Pig and Hive

2. Apache Spark

4. IBM InfoSphere Streams and BigInsights



Question : You are working in a financial organization that has a strict policy of keeping data for a minimum number of years. Hence, as a solution architect, you will be
asked to put data retention and archival in place. What are the requirements for data retention and archival?

1. A format and storage repository for archived data

2. Public cloud

4. Solid-state technology


Question : The Annotation Query Language (AQL) is the easiest and most flexible tool to pull structured output from which of the following?
1. Hive data structures

2. Unstructured text

4. JDBC connected relational data marts


Question : You have been storing data in IBM's NoSQL solution, IBM Cloudant, and you want to pre-create some functions as views so that they can be used later
to fetch data, e.g. the average sale price for a product ID. Which language will you use to write views for Cloudant?
1. Go

2. Java

4. Python

5. Scala



Question : Cloudant is a graph database.
1. True
2. False