Question : Consider the boxplot below. Which of the following statements are true? I. The distribution is skewed right. II. The interquartile range is about 8. III. The median is about 10. 1. I only 2. II only 3. III only 4. I and III
Correct Answer : 2
Explanation : Most of the observations are on the high end of the scale, so the distribution is skewed left. The interquartile range is indicated by the length of the box, which is 18 minus 10 or 8. And the median is indicated by the vertical line running through the middle of the box, which is roughly centered over 15. So the median is about 15.
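The arithmetic in this explanation can be checked with a minimal Python sketch. It takes the quartile values read off the boxplot in the explanation (Q1 = 10, median = 15, Q3 = 18). The function names are hypothetical. The skew check uses Bowley's quartile skewness coefficient, which is a swapped-in summary measure and not something the question itself mentions.

```python
def iqr(q1, q3):
    """Interquartile range: the length of the box in a boxplot."""
    return q3 - q1


def bowley_skewness(q1, median, q3):
    """Bowley's quartile skewness: negative suggests left skew, positive right skew."""
    return (q3 + q1 - 2 * median) / (q3 - q1)


# Values as read from the boxplot in the explanation above
q1, median, q3 = 10, 15, 18

box_length = iqr(q1, q3)                 # 18 - 10
skew = bowley_skewness(q1, median, q3)   # (18 + 10 - 30) / 8

print("IQR:", box_length)
print("Bowley skewness:", skew)
```

The median sits closer to Q3 than to Q1, so the coefficient comes out negative, which is consistent with statement I being false and statement II being true.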
Question : Assume some output variable "y" is a linear combination of some independent input variables "A" plus some independent noise "e". The way the independent variables are combined is defined by a parameter vector B: y = AB + e, where A is an m x n matrix, B is a vector of n unknowns, and y is a vector of m values. Assuming that m is not equal to n and the columns of A are linearly independent, which expression correctly solves for B?
1. A 2. B 3. C 4. D
Correct Answer : 4
Explanation : This is the standard solution of the normal equations for linear regression. Because A is not square, you cannot simply take its inverse.
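The normal-equations solution is usually written B = (A^T A)^(-1) A^T y. The sketch below illustrates it in plain Python with exact fractions. The helper names and the small data set are hypothetical, chosen so that y = 1 + 2x holds exactly with no noise. Instead of forming the inverse explicitly, it solves the square system (A^T A) B = A^T y, which gives the same result.

```python
from fractions import Fraction


def transpose(M):
    return [list(row) for row in zip(*M)]


def matmul(P, Q):
    Qt = transpose(Q)
    return [[sum(p * q for p, q in zip(row, col)) for col in Qt] for row in P]


def solve(M, rhs):
    """Solve M x = rhs for a square, invertible M by Gauss-Jordan elimination."""
    n = len(M)
    aug = [list(M[i]) + [rhs[i]] for i in range(n)]
    for col in range(n):
        # Find a row with a nonzero pivot and move it into place
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pv = aug[col][col]
        aug[col] = [v / pv for v in aug[col]]
        # Eliminate this column from every other row
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][n] for i in range(n)]


def normal_equations(A, y):
    """Return B satisfying (A^T A) B = A^T y."""
    At = transpose(A)
    AtA = matmul(At, A)  # n x n, invertible when the columns of A are independent
    Aty = [sum(a * v for a, v in zip(row, y)) for row in At]
    return solve(AtA, Aty)


# Hypothetical data: m = 4 rows, n = 2 columns (intercept and one input)
A = [[Fraction(1), Fraction(x)] for x in (0, 1, 2, 3)]
y = [Fraction(v) for v in (1, 3, 5, 7)]

B = normal_equations(A, y)
print(B)
```

A is 4 x 2 here, so it has no inverse, but A^T A is 2 x 2 and invertible because the two columns of A are linearly independent.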
Question : This question will ask you to provide missing code segments. A logistic regression model was fit on a data set where 40% of the outcomes were events (TARGET=1) and 60% were non-events (TARGET=0). The analyst knows that the population where the model will be deployed has 5% events and 95% non-events. The analyst also knows that the company's profit margin for correctly targeted events is nine times higher than the company's loss for incorrectly targeted non-events. Given the following SAS program: What X and Y values should be added to the program to correctly score the data? 1. X=40, Y=10 2. X=.05, Y=10 3. X=.05, Y=.40 4. X=.10, Y=.05