1.

We have a dataset in which the number of variables is greater than the number of observations (rows). Can we use classical regression techniques here? How would you deal with this situation?

Answer»

No, classical regression techniques cannot be used here.

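Since the number of variables is greater than the number of observations, this is a high-dimensional dataset. Ordinary least squares cannot produce a usable estimate here: infinitely many coefficient vectors fit the data equally well, so there is no unique solution, and the variance and standard deviation of the coefficient estimates are effectively infinite.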

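We will have to use penalized regression techniques such as Lasso or Ridge, which penalize the size of the coefficients and thereby reduce their variance and standard deviation. Subset selection and/or stepwise regression can also be explored with a forward selection approach, which starts from an empty model and adds variables one at a time (backward elimination would require fitting the full model first, which is not possible here).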
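As a rough illustration of the answer above, here is a minimal pure-Python sketch on a made-up toy dataset with 2 observations and 3 variables. It shows two different coefficient vectors that both fit the data with zero error (so least squares has no unique answer), and then a ridge fit, which does have a single solution. The data, the helper names (`predict`, `rss`, `solve`, `ridge`), and the penalty value `lam=1.0` are all assumptions chosen for illustration; in practice you would more likely reach for a library implementation such as scikit-learn's `Ridge` or `Lasso`.

```python
# Toy data (hypothetical): n = 2 observations, p = 3 variables, so p > n.
X = [[1.0, 2.0, 3.0],
     [4.0, 5.0, 6.0]]
y = [1.0, 2.0]


def predict(X, beta):
    """Fitted values X @ beta for a list-of-rows matrix X."""
    return [sum(x * b for x, b in zip(row, beta)) for row in X]


def rss(X, y, beta):
    """Residual sum of squares of the linear fit."""
    return sum((yi - fi) ** 2 for yi, fi in zip(y, predict(X, beta)))


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting.
    Assumes A is square and non-singular."""
    n = len(A)
    M = [list(A[i]) + [b[i]] for i in range(n)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def ridge(X, y, lam):
    """Ridge estimate: solve (X^T X + lam * I) beta = X^T y (no intercept)."""
    p = len(X[0])
    A = [[sum(row[i] * row[j] for row in X) + (lam if i == j else 0.0)
          for j in range(p)] for i in range(p)]
    b = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(p)]
    return solve(A, b)


# Two different coefficient vectors that both reproduce y exactly.
# They differ by a multiple of (1, -2, 1), which X maps to zero.
beta_a = [-1 / 3, 2 / 3, 0.0]
beta_b = [beta_a[0] + 5.0, beta_a[1] - 10.0, beta_a[2] + 5.0]
rss_a = rss(X, y, beta_a)
rss_b = rss(X, y, beta_b)

# The penalty makes the system non-singular, so ridge has one solution.
beta_ridge = ridge(X, y, lam=1.0)

print("RSS of two different 'perfect' fits:", rss_a, rss_b)
print("Ridge coefficients:", beta_ridge)
```

The ridge fit no longer reproduces the training data exactly; it trades a small amount of bias for a stable, unique set of coefficients. Lasso follows the same idea with an absolute-value penalty, which can also set some coefficients exactly to zero, but it needs an iterative solver and is not shown here.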


