
Dell EMC Data Science and BigData Certification Questions and Answers



Question : Which of the following statements are correct with regard to association rules?
A. It is an unsupervised learning method
B. It is a supervised learning method
C. It is a predictive method
D. It is a non-predictive method

1. A,B
2. B,C
3. C,D
4. A,D
5. B,D

Correct Answer : 4
Explanation: Association rule mining is an unsupervised learning method. It is a descriptive, non-predictive method used to discover interesting relationships hidden in large datasets. Once found,
these relationships can be represented as rules or frequent itemsets. Association rules are commonly used to mine the transactions stored in databases.
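
As a minimal sketch of what "rules or frequent itemsets" means in practice, the Python below computes the support of an itemset and the confidence of a rule. The transactions and item names are hypothetical toy data, not part of the original question.

```python
# Hypothetical toy transactions; each set holds the items bought together.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Support of antecedent and consequent together, divided by support of the antecedent."""
    both = set(antecedent) | set(consequent)
    return support(both, transactions) / support(antecedent, transactions)

# Rule under test: {bread} -> {milk}
rule_support = support({"bread", "milk"}, transactions)
rule_confidence = confidence({"bread"}, {"milk"}, transactions)
print(rule_support, rule_confidence)
```

No labels are supplied anywhere: the measures come only from co-occurrence counts, which is why the method is described as unsupervised and descriptive.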




Question : Which of the following questions can be answered using the Association Rules?


1. Which books are purchased together on a portal?

2. Of those readers who are similar to this reader, what books do they tend to buy?

3. Of those readers who have purchased this book, what other similar books do they tend to buy?

4. 1,2

5. 1,2,3

Correct Answer : 5
Explanation: Association rules can help in answering all the questions listed in the options.




Question : Which of the following statements apply to the Apriori algorithm?


1. It can be used to generate clustering.

2. It can be used for classification

3. It can be used to reduce the dimensionality

4. It can be used to generate association rules


Correct Answer : 4
Explanation: The Apriori algorithm can be used to generate association rules.
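
The frequent-itemset step that Apriori performs before rules are derived can be sketched as follows. This is a simplified illustration under stated assumptions: the function name `apriori`, the toy transactions, and the 0.4 support threshold are all hypothetical, and no attempt is made at the optimizations a production implementation would use.

```python
from itertools import combinations

# Hypothetical toy transactions and threshold, for illustration only.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]

def apriori(transactions, min_support):
    """Return {frozenset(itemset): support} for every frequent itemset."""
    n = len(transactions)
    transactions = [frozenset(t) for t in transactions]

    def frequent(candidates):
        # Keep only candidates whose support meets the threshold.
        counts = {c: sum(c <= t for t in transactions) for c in candidates}
        return {c: k / n for c, k in counts.items() if k / n >= min_support}

    items = {item for t in transactions for item in t}
    current = frequent({frozenset([i]) for i in items})
    result = dict(current)
    k = 2
    while current:
        # Join step: merge frequent (k-1)-itemsets into size-k candidates.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        current = frequent(candidates)
        result.update(current)
        k += 1
    return result

frequent_itemsets = apriori(transactions, min_support=0.4)
```

The prune step is the core idea: an itemset can only be frequent if all of its subsets are frequent, so most candidates are discarded without counting them. Rules are then read off the surviving itemsets by checking their confidence.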


Related Questions


Question : Since R factors are categorical variables, they are most closely related to which data classification level?
1. interval
2. ordinal
4. ratio




Question : In which phase of the analytic lifecycle would you expect to spend most of the project time?


1. Discovery
2. Data preparation
4. Operationalize



Question : You are building a logistic regression model to predict whether a tax filer will be audited within the
next two years. Your training set population is 1000 filers. The audit rate in your training data is
4.2%. What is the sum of the probabilities that the model assigns to all the filers in your training set
that have been audited?
1. 42.0
2. 4.2
4. 0.042
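
One property that may be relevant here: for a logistic regression fitted by maximum likelihood with an intercept term, the fitted probabilities over the whole training set sum to the number of positive cases. The sketch below checks this numerically on hypothetical toy data (not the 1000-filer set from the question), using a small hand-written Newton's method; the function names are illustrative.

```python
import math

# Hypothetical toy data: one feature x and a binary outcome y.
x = [0, 1, 2, 3, 4, 5, 6, 7]
y = [0, 0, 1, 0, 1, 0, 1, 1]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(x, y, iterations=25):
    """Fit p = sigmoid(b0 + b1 * x) by Newton's method (maximum likelihood)."""
    b0 = b1 = 0.0
    for _ in range(iterations):
        p = [sigmoid(b0 + b1 * xi) for xi in x]
        # Gradient of the log-likelihood with respect to b0 and b1.
        g0 = sum(yi - pi for yi, pi in zip(y, p))
        g1 = sum((yi - pi) * xi for xi, yi, pi in zip(x, y, p))
        # Entries of the 2x2 information matrix.
        w = [pi * (1 - pi) for pi in p]
        h00 = sum(w)
        h01 = sum(wi * xi for wi, xi in zip(w, x))
        h11 = sum(wi * xi * xi for wi, xi in zip(w, x))
        det = h00 * h11 - h01 * h01
        # Newton update: solve the 2x2 system in closed form.
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1

b0, b1 = fit_logistic(x, y)
probs = [sigmoid(b0 + b1 * xi) for xi in x]
print(sum(probs), sum(y))

# Arithmetic from the question: 4.2% of 1000 filers.
audited_count = round(1000 * 0.042)
```

The equality follows from the intercept's score equation, which forces the residuals `y - p` to sum to zero at the optimum.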




Question : Refer to the exhibit.

You are asked to write a report on how specific variables impact your client's sales using a data
set provided to you by the client. The data includes 15 variables that the client views as directly
related to sales, and you are restricted to these variables only.
After a preliminary analysis of the data, the following findings were made:
1. Multicollinearity is not an issue among the variables
2. Only three variables (A, B, and C) have significant correlation with sales
You build a linear regression model on the dependent variable of sales with the independent
variables of A, B, and C. The results of the regression are seen in the exhibit.
You cannot request additional data. What is a way that you could try to increase the R² of the
model without artificially inflating it?

1. Create clusters based on the data and use them as model inputs
2. Force all 15 variables into the model as independent variables
4. Break variables A, B, and C into their own univariate models



Question : You have two tables of customers in your database. Customers in cust_table_ were sent an email
promotion last year, and customers in cust_table_2 received a newsletter last year.
Customers can only be entered in once per table. You want to create a table that includes all
customers, and any of the communications they received last year. Which type of join would you
use for this table?


1. Full outer join
2. Inner join
4. Cross join
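
To make the difference between the join types concrete, here is a small sketch that emulates a full outer join and an inner join on customer IDs in plain Python. The IDs and variable names are hypothetical; the question does not give the table contents. Sets are used because each customer appears at most once per table.

```python
# Hypothetical customer IDs for the two tables.
email_promo = {101, 102, 103}   # customers who were sent the email promotion
newsletter = {102, 103, 104}    # customers who received the newsletter

def full_outer_join(left_ids, right_ids):
    """One row per customer found in either table, flagging each communication."""
    return [
        {
            "customer_id": cid,
            "email_promo": cid in left_ids,
            "newsletter": cid in right_ids,
        }
        for cid in sorted(left_ids | right_ids)
    ]

rows = full_outer_join(email_promo, newsletter)

# For contrast, an inner join keeps only customers present in both tables.
inner_ids = sorted(email_promo & newsletter)

for row in rows:
    print(row)
```

Customers 101 and 104 appear in the full outer join result with one flag set to False, but would be dropped by the inner join.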



Question : In which lifecycle stage are initial hypotheses formed?


1. Model planning
2. Discovery
4. Data preparation