20 Prediction and Estimation

In modern statistics, econometrics, and machine learning, two primary goals often motivate data analysis:

  1. Prediction: To build a function \(\hat{f}\) that accurately predicts an outcome \(Y\) from observed features (predictors) \(X\).

  2. Estimation or Causal Inference: To uncover and quantify the relationship (often causal) between \(X\) and \(Y\), typically by estimating parameters like \(\beta\) in a model \(Y = g(X; \beta)\).

These goals, while superficially similar, rest on distinct philosophical and mathematical foundations. Below, we explore the difference in detail, illustrating key ideas with formal definitions, theorems, proofs (where relevant), and references to seminal works.


This chapter is fully available in the published Springer volumes.
The online preview is limited per publisher guidelines.

To access the complete content, purchase the book on Springer:

Vol. Title Link
1 Foundations of Data Analysis Buy on Springer
2 Regression Techniques for Data Analysis Buy on Springer
3 Advanced Modeling and Data Challenges Buy on Springer
4 Experimental Design Buy on Springer
📖 Free preview — limited per publisher guidelines. Purchase the complete A Guide on Data Analysis series (Vols. 1–4) on Springer.
Vol. 1 Vol. 2 Vol. 3 Vol. 4