From the textbook — core definitions you must know
If a population has unknown parameter \( \theta \) and you observe a random sample \(X_1,\ldots,X_n\), any statistic \( \widehat{\Theta} = h(X_1,\ldots,X_n) \) is a point estimator of \( \theta \). After you see the data, the computed number \( \hat{\theta} \) is the point estimate.
\( \bar{X} = \frac{1}{n}\sum X_i \), \( \mathbb{E}[\bar{X}] = \mu \), \( \mathrm{Var}(\bar{X}) = \sigma^2/n \), \( \mathrm{SE}(\bar{X}) \approx s/\sqrt{n} \).
\( \hat{p} = \frac{1}{n}\sum I\{\text{success}\} \), \( \mathbb{E}[\hat{p}] = p \), \( \mathrm{SE}(\hat{p}) \approx \sqrt{\hat{p}(1-\hat{p})/n} \).
\( s^2 = \frac{1}{n-1}\sum (X_i - \bar{X})^2 \).
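These estimators are straightforward to compute directly. A minimal Python sketch, using a small hypothetical sample (the values and the "success" cutoff are invented for illustration):

```python
import math

# Hypothetical sample of n = 12 measurements (values invented for illustration)
x = [9.8, 10.2, 10.1, 9.7, 10.4, 9.9, 10.0, 10.3, 9.6, 10.1, 9.9, 10.2]
n = len(x)

xbar = sum(x) / n                                  # point estimate of mu
s2 = sum((xi - xbar) ** 2 for xi in x) / (n - 1)   # sample variance s^2 (n - 1 divisor)
s = math.sqrt(s2)
se_mean = s / math.sqrt(n)                         # estimated SE of x-bar

# Sample proportion: here "success" = reading >= 10.0 (an arbitrary cutoff)
phat = sum(1 for xi in x if xi >= 10.0) / n
se_phat = math.sqrt(phat * (1 - phat) / n)
```

Each quantity above is the estimate computed from one observed sample; rerunning with a fresh sample would give different numbers, which is exactly the estimator-vs-estimate distinction.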
① Configure & simulate
② 2D view — Sampling distribution of your estimator
③ Central Limit Theorem — small n vs large n (overlay)
If \(X_1,\ldots,X_n\) are i.i.d. with mean \(\mu\) and finite variance \(\sigma^2\), then
$$ Z \;=\; \frac{\bar X - \mu}{\sigma/\sqrt{n}} \;\Rightarrow\; \mathcal{N}(0,1)\ \text{as}\ n\to\infty $$
SE depends on n, not on how many times you repeat the study (M).
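A quick way to see both points (approximate normality of x̄, and SE = σ/√n shrinking with n but not with M) is simulation. This sketch draws from a skewed Exponential(1) population, chosen here purely for illustration, and compares n = 5 with n = 50:

```python
import random
import statistics

random.seed(42)

def sample_means(n, M, lam=1.0):
    """Means of M simulated samples of size n from Exponential(lam)."""
    return [statistics.fmean(random.expovariate(lam) for _ in range(n))
            for _ in range(M)]

# Exponential(1) is skewed, with mean mu = 1 and sd sigma = 1.
M = 20000   # number of simulated repeats: smooths the histogram, does not change SE
results = {}
for n in (5, 50):
    means = sample_means(n, M)
    # (average of the means, spread of the means) -- spread should track 1/sqrt(n)
    results[n] = (statistics.fmean(means), statistics.stdev(means))
```

With these settings the spread of the simulated means lands near σ/√n (about 0.45 for n = 5 and 0.14 for n = 50), while the average of the means stays near μ = 1 for both.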
④ CLT in student life — Compare scenarios
Use CLT to decide if a claim about an average is reasonable. Choose a scenario, draw a single sample (without CLT), then compute a confidence interval (with CLT). Finally, simulate many sample means to visualize variability.
What this teaches (without vs with CLT)
- Without CLT: One x̄ can be misleading (especially with skew & small n).
- With CLT: Use a CI for μ and compare to the target T with the correct direction (≤ or ≥).
- Skewed data need bigger n; the n ≥ 30 rule of thumb helps, and the simulation shows why.
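The last point can be checked by simulation: build the t-interval on skewed Exponential(1) data and count how often it covers the true mean μ = 1. The critical values below are ordinary t-table entries; everything else is illustrative:

```python
import math
import random
import statistics

random.seed(7)

# Two-sided 95% t critical values (t-table): df = 4 -> 2.776, df = 49 -> 2.010
TCRIT = {4: 2.776, 49: 2.010}
MU = 1.0    # true mean of Exponential(1)
M = 5000    # simulated studies per sample size

coverage = {}
for n in (5, 50):
    hits = 0
    for _ in range(M):
        x = [random.expovariate(1.0) for _ in range(n)]
        xbar = statistics.fmean(x)
        me = TCRIT[n - 1] * statistics.stdev(x) / math.sqrt(n)
        if xbar - me <= MU <= xbar + me:
            hits += 1
    coverage[n] = hits / M   # fraction of intervals that cover the true mean
```

For this skewed population the n = 5 intervals cover μ noticeably less often than the nominal 95%, while the n = 50 intervals come much closer, which is the "skewed data need bigger n" point in numbers.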
⑤ Using Excel for a 95% CI (and how to conclude)
If your data are roughly Normal (or n ≥ 30 so CLT kicks in), use the t-interval for the mean: x̄ ± t(α/2, n−1)·s/√n (because σ is unknown).
A) Step-by-step (Excel)
- Put your sample in one column (e.g., A2:A(n+1)). Count n rows.
- Compute x̄, s, n:
  x̄: =AVERAGE(A2:A(n+1))
  s: =STDEV.S(A2:A(n+1))
  n: =COUNT(A2:A(n+1))
- Get the t critical for 95% CI (two-sided α=0.05):
  t*: =T.INV.2T(0.05, n-1)
  (Older Excel: TINV(0.05, n-1). If σ is truly known—rare—use z = NORM.S.INV(0.975) and CONFIDENCE.NORM.)
- Compute SE and margin:
  SE: =s/SQRT(n)
  ME: =t* * SE (or in one step: =CONFIDENCE.T(0.05, s, n))
- Compute the 95% CI:
  Lower: =x̄ - ME
  Upper: =x̄ + ME
- Compare to target T using the right direction (≤ or ≥) and write a sentence conclusion.
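The same steps can be mirrored outside Excel. A Python sketch with a hypothetical 12-value sample (the numbers, and the t-table value standing in for T.INV.2T(0.05, 11), are for illustration only):

```python
import math
import statistics

# Hypothetical sample, i.e. the contents of A2:A13 in the sheet (invented values)
x = [48.2, 51.5, 49.9, 52.3, 50.8, 47.6, 50.1, 51.0, 49.4, 52.0, 48.8, 50.5]

n = len(x)                    # =COUNT(A2:A13)
xbar = statistics.fmean(x)    # =AVERAGE(A2:A13)
s = statistics.stdev(x)       # =STDEV.S(A2:A13)
tcrit = 2.201                 # =T.INV.2T(0.05, 11), from a t-table for df = 11
se = s / math.sqrt(n)         # =s/SQRT(n)
me = tcrit * se               # margin of error, =t* * SE
ci = (xbar - me, xbar + me)   # the 95% CI (Lower, Upper)
```

Swap in your own data and the matching t critical value for df = n − 1; the rest of the arithmetic is identical.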
Meaning of 95% CI: If you repeated this sampling many times, ~95% of such intervals would cover the true μ. For one computed interval, don’t say “95% probability μ is inside”; the interval either covers μ or not—we don’t know which.
B) Worked examples (plug your own data)
1) Dorm Wi‑Fi speed ≥ 50 Mbps
Treat speeds as roughly Normal here. After measuring n speeds, build a 95% CI for μ. Decision rule: if the entire CI lies at or above 50, the claim is supported; if the entire CI lies below 50, it is contradicted; if it straddles 50, the evidence is mixed → collect more data.
2) Textbook spending ≤ $120
Spending data are often right-skewed, but this demo uses Normal data by default. Compute x̄, s, n → 95% CI. Decision: CI entirely at or below 120 → supports the claim; entirely above 120 → contradicts it; straddles 120 → unclear. Aim for n ≥ 30 if possible.
3) Walk time between classes ≤ 8 minutes
Times may be mildly skewed; the CLT approximation is usually fine by n ≥ 30. Same t‑interval steps. Decision: CI entirely at or below 8 → supports; entirely above 8 → contradicts; straddles 8 → mixed.
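The decision rule shared by all three examples can be written as a small helper; the function name and the example intervals are hypothetical:

```python
def conclude(ci, target, direction):
    """Compare a CI (lo, hi) to target T for a claim 'mu <= T' or 'mu >= T'."""
    lo, hi = ci
    if direction == "<=":
        if hi <= target:
            return "supported"
        if lo > target:
            return "contradicted"
    elif direction == ">=":
        if lo >= target:
            return "supported"
        if hi < target:
            return "contradicted"
    return "mixed - collect more data"

# Wi-Fi example: claim mu >= 50 Mbps, with a made-up CI of (48.9, 51.4)
verdict = conclude((48.9, 51.4), 50, ">=")   # straddles 50 -> mixed
```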
C) Super‑short checklist
- Plan: Define parameter μ and target T with direction (≤ / ≥). Choose n (≥ 30 if skewed).
- Compute: x̄ = AVERAGE, s = STDEV.S, n = COUNT.
- Critical value: t* = T.INV.2T(0.05, n-1) (use z only if σ known).
- SE & Margin: SE = s/√n; ME = t*·SE (or CONFIDENCE.T).
- CI: [x̄ − ME, x̄ + ME].
- Conclude: Compare the CI to T with the correct inequality and write one sentence with context + uncertainty.
Q&A • Why collect multiple years? (CLT & SAT example)
“JU assigned you to estimate the average SAT of incoming students. Is one year enough?”
Short answer: One year might be fine for that cohort, but if you want the long‑run JU average, using multiple years is better. By the Central Limit Theorem (CLT), as your total sample size \(N\) grows, the sampling distribution of the mean is approximately normal with standard error \(\text{SE}=s/\sqrt{N}\). Bigger \(N\) ⇒ smaller SE ⇒ a tighter confidence interval around the true average.
Important: CLT shrinks random error (variance). It does not fix bias. If your data exclude some groups (e.g., only honors students), the estimate can still be systematically off even with large \(N\). Fix bias with better sampling (random/stratified/weighting), not with more of the same biased data.
Mini calculator: Years × Cohort size ⇒ SE & 95% CI
Note: Using \(z=1.96\) for a 95% CI. For small \(N\), use the t critical value.
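The calculator's arithmetic is just SE = s/√N with N = years × cohort size. A sketch with made-up inputs (x̄ = 1150, s = 120, 3 years of roughly 500 students):

```python
import math

def se_and_ci(xbar, s, years, cohort_size, z=1.96):
    """SE and 95% z-CI for the pooled mean over `years` cohorts of `cohort_size`."""
    N = years * cohort_size          # total pooled sample size
    se = s / math.sqrt(N)            # SE shrinks as N grows
    return se, (xbar - z * se, xbar + z * se)

# Hypothetical SAT numbers, for illustration only
se, ci = se_and_ci(1150, 120, 3, 500)
```

Doubling the number of years here multiplies N by 2 and narrows the interval by a factor of 1/√2, assuming each year's cohort is sampled in the same representative way.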
Notes: when does CLT help?
- Precision: Pooling more independent, representative observations (more years or bigger cohorts) reduces \(\text{SE}=s/\sqrt{N}\) and narrows the CI.
- Bias: If your sample is systematically off (e.g., missing test‑optional admits), CLT won’t fix it. Use representative sampling or weights.
- Stationarity: If admissions policies changed drastically, “true mean” may shift; compare means by year and consider modeling trends.
Excel playbook (95% CI for the overall mean)
- Put all SAT scores (all years) in one column (e.g., A2:A9999).
- \(\bar x\): =AVERAGE(A2:A9999)
- \(s\): =STDEV.S(A2:A9999)
- \(N\): =COUNT(A2:A9999)
- \(\text{SE}\): =STDEV.S(A2:A9999)/SQRT(COUNT(A2:A9999))
- 95% t-critical (two-tailed): =T.INV.2T(0.05, COUNT(A2:A9999)-1)
- CI lower/upper: =AVERAGE(...) - tcrit*SE and =AVERAGE(...) + tcrit*SE
Takeaway: More years (or bigger \(N\)) ⇒ smaller SE ⇒ tighter CI around the long‑run mean—provided your sampling is representative.
Check your understanding — 10 Q&A (click to expand)
1) Why do we need point estimation at all?
We rarely observe the whole population. We estimate unknown parameters (μ, p, σ²) from samples so we can make decisions (tolerances, budgets, quality gates) with a single, usable number.
2) Estimator vs estimate — what’s the difference?
Estimator Θ̂ = h(X₁,…,Xₙ) is a random variable (changes with the sample). After you collect data, the computed number θ̂ is the point estimate.
3) What is a sampling distribution and why do we use M?
The distribution you’d see if you could repeat the entire study many times and plot the estimator. Here, M controls how many repeats we simulate to visualize that distribution (smoother histogram with larger M). SE depends on n, not on M.
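This is easy to verify numerically: holding n fixed while increasing M leaves the spread of the simulated sample means essentially unchanged, while increasing n shrinks it. A sketch using a Normal(0, 1) population (an illustrative choice):

```python
import random
import statistics

random.seed(1)

def sd_of_means(n, M):
    """Spread of M simulated sample means of size n from a Normal(0, 1) population."""
    return statistics.stdev(
        statistics.fmean(random.gauss(0, 1) for _ in range(n)) for _ in range(M)
    )

# More repeats M does not shrink the spread of x-bar; a larger n does.
spread_small_M = sd_of_means(25, 500)    # about 1/sqrt(25) = 0.20
spread_big_M = sd_of_means(25, 5000)     # still about 0.20
spread_big_n = sd_of_means(100, 500)     # about 1/sqrt(100) = 0.10
```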
4) What exactly does the CLT say (plain English + formula)?
Plain: The sample mean x̄ is approximately Normal with mean μ and SD σ/√n for moderate/large n (i.i.d., finite variance).
Symbolically: x̄ ≈ Normal(μ, σ²/n).
5) Why is CLT important in engineering?
- SPC/control charts: predictable false‑alarm rates for averages.
- Tolerances & acceptance sampling: tractable probability a lot’s mean passes specs.
- Sensor fusion: averaging reduces noise ~1/√n.
- DOE/ANOVA: valid inference on mean differences even if raw data are skewed.
- Reliability/throughput: confidence intervals for average lifetimes/cycle times.
6) When should I be cautious using the CLT?
- Heavy tails/outliers → need larger n; consider trimming/transforming or bootstrap.
- Dependence (time series/spatial) → slows convergence.
- Non‑identical data (mixing distributions) → approximation may degrade.
7) Why does larger n help, and how fast?
For means/proportions, SE shrinks like 1/√n. Double n → SE × 1/√2 (~0.71). To halve SE, need ~4× n.
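The 1/√n arithmetic in one tiny sketch (the SD value is arbitrary; only the ratios matter):

```python
import math

def se_mean(s, n):
    """Standard error of the mean for sample SD s and sample size n."""
    return s / math.sqrt(n)

s = 10.0                                           # any SD; ratios below ignore it
ratio_double = se_mean(s, 80) / se_mean(s, 40)     # doubling n -> 1/sqrt(2), ~0.707
ratio_quad = se_mean(s, 160) / se_mean(s, 40)      # quadrupling n -> exactly 1/2
```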
8) How big should n be?
Mild skew: n ≈ 30 often okay for means; strong skew/heavy tails: aim 50–200+. Use this app to compare n=5 vs n=50; check Normal fit; see SE vs n.
9) SD vs SE — what’s the difference?
SD is spread of raw data (σ or s). SE is spread of an estimator. For means, SE(x̄) ≈ s/√n.
10) Why talk about repeated studies (M) if I only sample once?
Repeated sampling is a conceptual device: it defines what uncertainty means and justifies the formulas for SE, confidence intervals, and tests. In this app, M is a teaching tool.