Machine Learning · Regression

MSE cost function

Regression

Introduction

Mean Squared Error (MSE) is the default regression loss. In this lesson we decompose it: why squared (and not absolute), what the cost surface looks like (a convex paraboloid) and why convexity is invaluable for optimization, what units it has (squared units of y), how it relates to RMSE and MAE, when it fails (outliers, heavy tails), and how it leads to the bias-variance decomposition. We show its equivalence with MLE under Gaussian noise and the geometric intuition: an orthogonal projection of y onto the column space of X.