Question : You have been given a huge datasets with the following occurrences Bread is 80% of the time in all transactions, combination of bread and milk is 60% of the time in all transactions. Which of the following statement is correct with regards to Apriori?
1. Support for {bread} is 0.8
2. Support for {bread} is 0.6
3. Support for {bread} is 1.4
4. Support for {bread} is 0.2
Correct Answer : Get Lastest Questions and Answer : Explanation: As bread occurs 80% of the time in all the transactions, hence you can say that support for bread is 0.8, similarly combination of {bread, milk} is 60% time, so we can say that support is 0.6 for combination of {bread, milk}
Question : For Apriori algorithm you have decided that minimum support value is ., which of the following are frequent itemsets, if following percentage occurrences are given? Bread->80% Milk->70% Bread,Milk -> 55% Bread, Banana -> 30% A. Bread B. Milk C. Bread, Milk D. Banana E. Bread, Banana
1. A,B,C 2. B,C,D 3. C,D,E 4. A,D,E 5. A,C,E
Correct Answer : Get Lastest Questions and Answer : Explanation: A frequent itemset has items that appear together often enough. The term "often enough" is formally defined with a minimum support criterion. Suppose minimum support is 0.5 then any itemset appear more than 0.5 are considered frequent itemsets. Hence Bread, Milk and combination of this Bread and Milk are considered frequent dataset.
Question : You have been given combination of three item sets as {a,b,c} are having . support and minimum support is defined as .. So which of the following statement is correct? A. Combination of {a,b} are frequent item sets B. Combination of {b,c} are frequent item sets C. Combination of {a,c} are frequent item sets D. Item {a} is a frequent dataset E. Item {c} is a frequent dataset
Correct Answer : Get Lastest Questions and Answer : Explanation: : In the question it is given that itemset {a,b,c} has 0.8 support, which is higher than 0.7 so it is a frequent item set. Similarly any subset of frequent dataset is also a frequent itemset.
1. Divide the data into sets of 1, 000 user profiles, and run K-means clustering in RHadoop iteratively. 2. Run MapReduce to transform the data, and find relevant key value pairs. 3. Access Mostly Uused Products by 50000+ Subscribers 4. Partition the data by XML file size, and run K-means clustering in each partition.
1. Use the extracted text and your regular expression to perform a sentiment analysis based on mentions of the new product 2. Convert the extracted text into a suitable document representation and index into a review corpus 3. Access Mostly Uused Products by 50000+ Subscribers 4. Group the reviews using Naive Bayesian classification