📘 Section 12.5 – Model Adequacy Checking
  🔎 12.5.1 Residual Analysis
  Residual: eᵢ = yᵢ − ŷᵢ
  
    - Use residual plots to detect non-linearity or omitted variables
- Standardized Residual: dᵢ = eᵢ / √MSE
- Studentized Residual: rᵢ = eᵢ / √(MSE × (1 − hᵢᵢ))
    Example: d₁₅ = 5.84 / √5.2352 = 2.55
    d₁₇ = 4.33 / √5.2352 = 1.89
    r₁₅ = 5.84 / √(5.2352 × (1 − 0.0737)) = 2.65
    r₁₇ = 4.33 / √(5.2352 × (1 − 0.2593)) = 2.20
  
 
  🎯 Hat Matrix and Leverage
  Hat Matrix: H = X(X′X)⁻¹X′
  Leverage: hᵢᵢ = x′ᵢ(X′X)⁻¹xᵢ
 
  🔥 12.5.2 Influential Observations
  Cook’s Distance identifies influential points in regression analysis:
  Dᵢ = r²ᵢ × [hᵢᵢ / (p × (1 − hᵢᵢ))]
  If Dᵢ > 1 → point is potentially influential
  
    | Observation | hᵢᵢ | Dᵢ | 
|---|
    | 1 | 0.1573 | 0.035 | 
    | 15 | 0.0737 | 0.187 | 
    | 17 | 0.2593 | 0.565 | 
  
 
  📘 Summary
  
    - Residual plots help assess model fit and nonlinearity
- Standardized and studentized residuals detect outliers
- Hat matrix shows leverage
- Cook’s Distance finds influential data points
 
  🎓 Student GPA Example
  We are modeling GPA based on:
  
    - x₁: Number of study hours per week
- x₂: Total number of credits this semester
- x₃: Sleep hours per night
The multiple regression model is:
  GPA = β₀ + β₁x₁ + β₂x₂ + β₃x₃ + ε
  Suppose after fitting, we have the following estimated residuals and leverage:
  
    | Student | ŷᵢ | yᵢ | eᵢ | hᵢᵢ | 
|---|
    | 5 | 3.0 | 2.3 | -0.7 | 0.12 | 
    | 12 | 3.4 | 4.0 | 0.6 | 0.08 | 
    | 17 | 2.5 | 4.1 | 1.6 | 0.28 | 
  
  Let MSE = 0.45, p = 4 (3 predictors + intercept)
 
  🔎 Residual Analysis
  
    - Standardized Residual: dᵢ = eᵢ / √MSE
- Studentized Residual: rᵢ = eᵢ / √(MSE × (1 − hᵢᵢ))
    d₁₇ = 1.6 / √0.45 = 2.38
    r₁₇ = 1.6 / √(0.45 × (1 − 0.28)) = 2.64
  
 
  🎯 Hat Matrix and Leverage
  Leverage: hᵢᵢ = x′ᵢ(X′X)⁻¹xᵢ
 
  🔥 Cook’s Distance for GPA Data
  Formula:
  Dᵢ = r²ᵢ × [hᵢᵢ / (p × (1 − hᵢᵢ))]
  
    D₁₇ = (2.64)² × [0.28 / (4 × (1 − 0.28))] = 6.97 × 0.0972 = 0.678
  
  Since D₁₇ < 1, the point is not highly influential, but it should be monitored.
 
  📘 Summary
  
    - Use residual plots to assess fit and detect outliers
- Standardized residuals scale residuals for comparison
- Studentized residuals adjust for leverage
- Cook’s Distance flags influential points