Premium

SAS Certified BI Content Developer for SAS 9 and Business Analytics Questions and Answer (Dumps and Practice Questions)



Question : Consider the boxplot below.
Which of the following statements are true?
I. The distribution is skewed right.
II. The interquartile range is about 8.
III. The median is about 10.
 :  Consider the boxplot below.
1. I only
2. II only
3. III only
4. I and III


Correct Answer : 2

Explanation : Most of the observations are on the high end of the scale, so the distribution is skewed left. The interquartile range is indicated by the length of the box, which is 18 minus 10 or 8. And the median is indicated by the vertical line running through the middle of the box, which is roughly centered over 15. So the median is about 15.






Question : Assume some output variable "y" is a linear combination of some independent input variables "A" plus some independent noise "e".
The way the independent variables are combined is defined by a parameter vector B
y=AB+e
where X is an m x n matrix, B is a vector of n unknowns, and b is a vector of m values.
Assuming that m is not equal to n and the columns of X are linearly independent, which expression correctly solves for B?

 : Assume some output variable
1. A
2. B
3. C
4. D



Correct Answer : 4

This is the standard solution of the normal equations for linear regression. Because A is not square, you cannot simply take its inverse.






Question : This question will ask you to provide missing code segments.
A logistic regression model was fit on a data set where 40% of the outcomes
were events(TARGET=1) and 60% were non-events (TARGET=0).
The analyst knows that the population where the model
will be deployed has 5% events and 95% non-events.
The analyst also knows that the company's profit margin for correctly
targeted events is nine times higher than the company's loss for incorrectly targeted non-event.
Given the following SAS program:
What X and Y values should be added to the program to correctly score the data?
 : This question will ask you to provide missing code segments.
1. X=40, Y=10
2. X=.05, Y=10
3. X=.05, Y=.40
4. X=.10.Y=05


Correct Answer : 2



Related Questions


Question : Which of the following describes a concordant pair of observations in the LOGISTIC procedure?
 : Which of the following describes a concordant pair of observations in the LOGISTIC procedure?
1. An observation with the event has an equal probability as another observation with the event.
2. An observation with the event has a lower predicted probability than the observation without the event.
3. Access Mostly Uused Products by 50000+ Subscribers
4. An observation with the event has a higher predicted probability than the observation without the event


Question : Refer to the exhibit
An analyst examined logistic regression models for predicting whether a customer
would make a purchase. The ROC curve displayed summarizes the models. Using the
selected model and the analyst's decision rule, 25% of the customers who did not
make a purchase are incorrectly classified as purchasers. What can be concluded from the graph?
 : Refer to the exhibit
1. About 25% of the customers who did make a purchase are correctly classified as making a purchase.
2. About 50% of the customers who did make a purchase are correctly classified as making a purchase.
3. Access Mostly Uused Products by 50000+ Subscribers
4. About 95% of the customers who did make a purchase are correctly classified as making a purchase.


Question : One common approach for predicting rare events in the LOGISTIC procedure is to build a model that disproportionately
over-re presents those cases with an event occurring (e.g. a 50-50 event/non-event split).
What problem does this present?

 : One common approach for predicting rare events in the LOGISTIC procedure is to build a model that disproportionately
1. All parameter estimates are biased.
2. Only the intercept estimate is biased.
3. Access Mostly Uused Products by 50000+ Subscribers
4. Sensitivity estimates are biased.



Question : A financial services manager wants to assess the probability that certain clients will
default on their Home Equity Line of Credit (HELOC). A former employee left the code listed below.
The training data set is named HELOC, while a similar data set of more recent clients is named RECENT_HELOC.
Which SAS data steps will calculate the predicted probability of default on recent clients? (Choose two.)

 : A financial services manager wants to assess the probability that certain clients will
1. A,B
2. B,C
3. Access Mostly Uused Products by 50000+ Subscribers
4. D,E