📘 Section 11.9 – Regression on Transformed Variables
🔍 Why Transform Variables?
Linear regression assumes a straight-line relationship between x and y. If the data are non-linear, applying linear regression without transformation can lead to:
- Biased parameter estimates
- Low R² (explained variance)
- Non-random residuals (model inadequacy)
🧠 When to Suspect Nonlinearity?
- Scatterplot of
x vs. y is curved.
- Residuals vs. fitted values show a bow or funnel shape.
- Theoretical model suggests exponential or inverse relationships.
🔁 Common Transformations
| Original Form | Transformation | Resulting Linear Form |
| Exponential: Y = β₀e^{β₁x}ε | ln(Y) | ln Y = ln β₀ + β₁x + ln ε |
| Inverse: Y = β₀ + β₁(1/x) + ε | z = 1/x | Y = β₀ + β₁z + ε |
| Reciprocal Exponential: Y = 1 / exp(β₀ + β₁x + ε) | ln(1/Y) | ln Y* = β₀ + β₁x + ε |
| Power: Y = β₀x^β₁ε | log-log | ln Y = ln β₀ + β₁ ln x |
📌 Tip: Always check scatterplots and residual plots before and after transformation to verify model improvement.
📊 Example – Windmill Power
A research engineer wants to model the relationship between wind velocity (x) and DC output (y).
Before Transformation (Linear Fit)
After Transformation (1/x)
✅ Summary
- Always check residual plots for signs of model inadequacy.
- Try transformations if the relationship is not linear.
- Compare R² and residual plots before and after transformation.
- The best transformation is often guided by the shape of the scatterplot and theory.
Transformation = Rescue when linear regression fails!