When a large number of variables/predictors are avaialable for predicting the response, the conventional MLE/Least Square Error
Used when we want to penlize the model complexity to encourage a more simpler model
Ridge Regression
The ordinary least square estimate of is the solution that minimizes the residual sum of aquare
Minimizing the terms of error + square betas, we minimize the effect of relying on specific betas to explain y (overfitting). The act of minimizing approximates all the betas with its dimensions to a circle/ x-d spheres
show
Least Absolute Shrinkage and Selection Operation (LASSO)