Linear models: Misspecification

In our discussion of linear model inference in Unit 2, we assumed the normal linear model throughout:

\[ \boldsymbol{y} = \boldsymbol{X} \boldsymbol{\beta} + \boldsymbol{\epsilon}, \quad \text{where} \ \boldsymbol{\epsilon} \sim N(\boldsymbol{0}, \sigma^2 \boldsymbol{I}_n). \]

In this unit, we will discuss what happens when this model is misspecified.

For each type of misspecification, we will discuss its origins, consequences, detection, and fixes (Non-normality through Outliers). We will then turn to methodological approaches for addressing model misspecification: asymptotic robust inference methods (14 Asymptotic methods), the bootstrap (15 The bootstrap), the permutation test (16 The permutation test), and robust estimation (17 Robust estimation and inference). We conclude with an R demo (18 R demo).