Logistic regression is a model used for prediction of the probability of occurrence of an event. It makes use of several variables that may be___________ 1. Numerical 2. Categorical 3. Access Mostly Uused Products by 50000+ Subscribers 4. None of the 1 and 2 are correct
Explanation: Logistic regression is a model used for prediction of the probability of occurrence of an event. It makes use of several predictor variables that may be either numerical or categories.
Question : Select the correct statement regarding the naive Bayes classification
1. it only requires a small amount of training data to estimate the parameters 2. Independent variables can be assumed 3. Access Mostly Uused Products by 50000+ Subscribers 4. for each class entire covariance matrix need to be determined
Explanation: An advantage of naive Bayes is that it only requires a small amount of training data to estimate the parameters (means and variances of the variables) necessary for classification. Because independent variables are assumed, only the variances of the variables for each class need to be determined and not the entire covariance matrix.
Explanation: Clustering is an example of unsupervised learning. The clustering algorithm finds groups within the data without being told what to look for upfront. This contrasts with classification, an example of supervised machine learning, which is the process of determining to which class an observation belongs. A common application of classification is spam filtering. With spam filtering we use labeled data to train the classifier: e-mails marked as spam or ham.
1. Select one of the four datasets and begin planning and building a model 2. Combine the data from all four of the datasets and begin planning and bulding a model 3. Access Mostly Uused Products by 50000+ Subscribers 4. Visualize the data to further explore the characteristics of each data set