Statistics: The Institute for Statistics Education
A 10-question quiz drawing from various analytics areas.
In fitting a model to predict whether a person viewing an ecommerce web site will click on a particular link, a certain company drew the training data from web logs of the browsing records of prior visitors. Various variables were found to be useful in predicting the target, including a binary variable indicating whether or not the person made a purchase. How should that variable be handled:
It should be included as a predictor due to its likely predictive power.
It should be excluded since it is uncorrelated with the target variable.
It should be excluded since it will not be available in new data.
It should be included, but only in models that rely on binary input variables.