Session 4.5 Study Guide — Normal Distribution with Class Grades

1) Introduction: Why Normal Distribution?

This dataset represents 150 students’ Statics grades.
Many real‑world outcomes (exam scores, heights, measurement errors) cluster around an average and form a bell‑shaped curve.
Goal: summarize the distribution with mean $\mu$ and standard deviation $\sigma$, then use them to answer probability questions.

2) Step 1: Compute Descriptive Statistics in Excel

Assume grades are in B2:B151.

Measure	Excel Formula	Meaning
Mean ($\mu$)	=AVERAGE(B2:B151)	Center of the distribution
Standard Deviation ($\sigma$)	=STDEV.S(B2:B151)	Spread of the grades (use STDEV.S for a sample)
Min / Max	=MIN(B2:B151) / =MAX(B2:B151)	Range check
Count	=COUNT(B2:B151)	Sample size

For our dataset: $n=150$, $\mu\approx83.54$, $\sigma\approx7.44$, min = 69, max = 100.

3) Step 2: Create a Histogram

Select the grade column → Insert → Statistical → Histogram.
Edit binning: right‑click the horizontal axis → Format Axis → set Bin width = 5 (≈ 65–70, 70–75, …, 95–100).
Observe how the shape approaches a bell curve centered near $\mu$.
(Optional) Overlay a normal curve:
- Compute expected frequency per bin using NORM.DIST() at bin centers and multiply by bin width and $n$.
- Or add a smooth line series using those expected counts.

4) Step 3: z‑Scores and Probabilities

Standardize any grade $x$ to a z‑score:

\[ z = \frac{x - \mu}{\sigma} \]

Excel (if mean in D2 and stdev in D3): =(B2-$D$2)/$D$3

Find Probabilities

Task	Excel Formula	Math Equivalent
Left‑tail $P(X\le x)$	=NORM.DIST(x, μ, σ, TRUE)	$F(x)$
Right‑tail $P(X\ge x)$	=1-NORM.DIST(x, μ, σ, TRUE)	$1-F(x)$
Between $P(a\le X\le b)$	=NORM.DIST(b, μ, σ, TRUE)-NORM.DIST(a, μ, σ, TRUE)	$F(b)-F(a)$
Percentile → x	=NORM.INV(p, μ, σ)	solve $x=\mu+\sigma z_p$

What is a z‑Score? (And Why It’s Important)

The z‑score transforms any normal random variable $X\sim\mathcal{N}(μ,σ^2)$ into the standard normal $Z\sim\mathcal{N}(0,1)$:

\[ z = \frac{x - μ}{σ} \]

This rescaling centers the distribution at 0 and makes the spread 1. Historically, before computers, students used printed z‑tables to look up $Φ(z)=P(Z≤z)$ and find probabilities. Today, Excel and calculators compute $P(X≤x)$ directly, but z‑scores are still crucial for:

Universality: One set of tables (or one function) works for any normal distribution once you convert to z.
Interpretability: z tells you how many standard deviations above/below the mean you are — an intuitive measure of extremeness.
Mathematical proofs: In statistics, derivations (CLT, sampling distributions, hypothesis tests) always standardize to Z for simplicity.
Inference foundations: Confidence intervals and z‑tests rely on comparing a computed z to critical values (e.g. ±1.96 for 95%).

Bottom line: Excel doesn’t require you to compute z to get probabilities, but learning z builds conceptual understanding and prepares students for hypothesis testing and theoretical statistics.

5) Step 4: ICE Examples (using our dataset)

What % scored below 80?
Excel: =NORM.DIST(80, μ, σ, TRUE)
With $\mu\approx83.54$, $\sigma\approx7.44$: $\approx 0.317$ → 31.7%
What grade is the top 10% cutoff?
Excel: =NORM.INV(0.90, μ, σ)
Result with our data: $\approx 93.07$
What % scored between 70 and 90?
Excel: =NORM.DIST(90, μ, σ, TRUE)-NORM.DIST(70, μ, σ, TRUE)
Result with our data: $\approx 0.773$ → 77.3%

6) Step 5: Concept Reinforcement

68‑95‑99.7 Rule: $\mu\pm\sigma\approx[76.10, 90.98]$, $\mu\pm2\sigma\approx[68.67, 98.41]$, $\mu\pm3\sigma\approx[61.23, 105.85]$ (cap at 100 in practice).
Outliers: look for $|z|>2$ (unusual) or $|z|>3$ (rare).
Model check: compare the histogram bars to the smooth normal curve. If tails are heavier/lighter or skewed, normal may be an approximation.

Appendix — Copy‑Block for Excel

Mean: =AVERAGE(B2:B151)
Stdev (sample): =STDEV.S(B2:B151)
Left tail P(X≤x): =NORM.DIST(x, $D$2, $D$3, TRUE)
Right tail P(X≥x): =1-NORM.DIST(x, $D$2, $D$3, TRUE)
Between a..b: =NORM.DIST(b,$D$2,$D$3,TRUE)-NORM.DIST(a,$D$2,$D$3,TRUE)
Percentile → x: =NORM.INV(p, $D$2, $D$3)
z-score for cell B2: =(B2-$D$2)/$D$3

Replace x, a, b, and p with your chosen values. Cells D2 and D3 hold the computed mean and stdev.

Dataset — Statics Grades (n=150)

Get the full dataset and open it in Excel for practice.

student id	Statics_Grade
1	75
2	71
3	74
4	90
5	74
6	100
7	86
8	83
9	75
10	88
11	77
12	83
13	100
14	81
15	91
16	76
17	82
18	96
19	77
20	96
21	88
22	78
23	87
24	90
25	87
26	69
27	76
28	80
29	81
30	87
31	83
32	71
33	85
34	87
35	86
36	91
37	89
38	86
39	81
40	69
41	85
42	84
43	84
44	72
45	73
46	90
47	82
48	87
49	82
50	82
51	90
52	78
53	83
54	78
55	79
56	80
57	84
58	78
59	92
60	75
61	81
62	78
63	94
64	84
65	90
66	70
67	84
68	89
69	83
70	91
71	78
72	93
73	100
74	79
75	78
76	94
77	95
78	78
79	79
80	80
81	71
82	75
83	74
84	76
85	82
86	84
87	94
88	74
89	90
90	80
91	82
92	87
93	73
94	85
95	83
96	86
97	84
98	100
99	77
100	78
101	77
102	78
103	77
104	92
105	93
106	77
107	75
108	86
109	78
110	87
111	84
112	70
113	94
114	96
115	77
116	79
117	84
118	85
119	87
120	98
121	81
122	76
123	71
124	76
125	82
126	96
127	78
128	84
129	82
130	92
131	100
132	78
133	78
134	90
135	87
136	97
137	87
138	79
139	87
140	91
141	89
142	86
143	91
144	91
145	93
146	87
147	81
148	83
149	92
150	75

7) Mini Calculator — z, Probabilities, and Inverse (works with any dataset)

Enter your dataset’s mean (μ) and stdev (σ), or click “Use practice dataset”. Once μ and σ are known, the entire distribution is defined — no need to reference every data point!

Mean (μ)
Stdev (σ)

A) z & Left-Tail Probability

x
z = (x−μ)/σ
P(X ≤ x)

B) Inverse: Given p, find z and x

p (0–1)
z_p = Φ⁻¹(p)
x = μ + σ·z_p

C) Middle Probability

a
b
P(a ≤ X ≤ b)

Concept Note: Once we have μ and σ, we no longer need the original data to answer probability questions.

The normal curve is completely determined by μ and σ.
Any x can be turned into a z-score and looked up on the standard normal curve.
This is why Excel’s NORM.DIST() works directly with x, μ, and σ.

Excel equivalents: NORM.DIST(x, μ, σ, TRUE), 1−NORM.DIST(x, μ, σ, TRUE), NORM.DIST(b, μ, σ, TRUE)−NORM.DIST(a, μ, σ, TRUE), NORM.INV(p, μ, σ).

Measure	Excel Formula	Meaning
Mean (\(\mu\))	=AVERAGE(B2:B151)	Center of the distribution
Standard Deviation (\(\sigma\))	=STDEV.S(B2:B151)	Spread of the grades (use STDEV.S for a sample)
Min / Max	=MIN(B2:B151) / =MAX(B2:B151)	Range check
Count	=COUNT(B2:B151)	Sample size

Task	Excel Formula	Math Equivalent
Left‑tail \(P(X\le x)\)	=NORM.DIST(x, μ, σ, TRUE)	\(F(x)\)
Right‑tail \(P(X\ge x)\)	=1-NORM.DIST(x, μ, σ, TRUE)	\(1-F(x)\)
Between \(P(a\le X\le b)\)	=NORM.DIST(b, μ, σ, TRUE)-NORM.DIST(a, μ, σ, TRUE)	\(F(b)-F(a)\)
Percentile → x	=NORM.INV(p, μ, σ)	solve \(x=\mu+\sigma z_p\)

student id	Statics_Grade
1	75
2	71
3	74
4	90
5	74
6	100
7	86
8	83
9	75
10	88
11	77
12	83
13	100
14	81
15	91
16	76
17	82
18	96
19	77
20	96
21	88
22	78
23	87
24	90
25	87
26	69
27	76
28	80
29	81
30	87
31	83
32	71
33	85
34	87
35	86
36	91
37	89
38	86
39	81
40	69
41	85
42	84
43	84
44	72
45	73
46	90
47	82
48	87
49	82
50	82
51	90
52	78
53	83
54	78
55	79
56	80
57	84
58	78
59	92
60	75
61	81
62	78
63	94
64	84
65	90
66	70
67	84
68	89
69	83
70	91
71	78
72	93
73	100
74	79
75	78
76	94
77	95
78	78
79	79
80	80
81	71
82	75
83	74
84	76
85	82
86	84
87	94
88	74
89	90
90	80
91	82
92	87
93	73
94	85
95	83
96	86
97	84
98	100
99	77
100	78
101	77
102	78
103	77
104	92
105	93
106	77
107	75
108	86
109	78
110	87
111	84
112	70
113	94
114	96
115	77
116	79
117	84
118	85
119	87
120	98
121	81
122	76
123	71
124	76
125	82
126	96
127	78
128	84
129	82
130	92
131	100
132	78
133	78
134	90
135	87
136	97
137	87
138	79
139	87
140	91
141	89
142	86
143	91
144	91
145	93
146	87
147	81
148	83
149	92
150	75

student id	Statics_Grade
1	75
2	71
3	74
4	90
5	74
6	100
7	86
8	83
9	75
10	88
11	77
12	83
13	100
14	81
15	91
16	76
17	82
18	96
19	77
20	96
21	88
22	78
23	87
24	90
25	87
26	69
27	76
28	80
29	81
30	87
31	83
32	71
33	85
34	87
35	86
36	91
37	89
38	86
39	81
40	69
41	85
42	84
43	84
44	72
45	73
46	90
47	82
48	87
49	82
50	82
51	90
52	78
53	83
54	78
55	79
56	80
57	84
58	78
59	92
60	75
61	81
62	78
63	94
64	84
65	90
66	70
67	84
68	89
69	83
70	91
71	78
72	93
73	100
74	79
75	78
76	94
77	95
78	78
79	79
80	80
81	71
82	75
83	74
84	76
85	82
86	84
87	94
88	74
89	90
90	80
91	82
92	87
93	73
94	85
95	83
96	86
97	84
98	100
99	77
100	78
101	77
102	78
103	77
104	92
105	93
106	77
107	75
108	86
109	78
110	87
111	84
112	70
113	94
114	96
115	77
116	79
117	84
118	85
119	87
120	98
121	81
122	76
123	71
124	76
125	82
126	96
127	78
128	84
129	82
130	92
131	100
132	78
133	78
134	90
135	87
136	97
137	87
138	79
139	87
140	91
141	89
142	86
143	91
144	91
145	93
146	87
147	81
148	83
149	92
150	75

Session 4.5 — Normal Distribution - ICE