Key
The 3rd Hourly
Math 1107
Spring 2009
Protocol
You will use only the
following resources: Your individual calculator; individual tool-sheet (one (1)
8.5 by 11 inch sheet), writing utensils, blank paper
(provided by me) and this copy of the hourly. Do not share these
resources with anyone else. Show complete detail
and work for full credit. Follow case
study solutions and sample hourly keys in presenting your solutions. Work
all four cases. Using only one side of the blank sheets provided, present
your work. Do not write on both sides of the sheets provided, and present your
work only on these sheets. Do not share information with any other students
during this hourly.
When you
are finished: Prepare a Cover Sheet: Print your name on an otherwise blank
sheet of paper. Then stack your stuff as follows: Cover Sheet (Top),
Your Work Sheets, The Test Papers, Your Toolsheet.
Then hand all of this in to me.
Sign and
Acknowledge: I agree to follow
this protocol.
________________________________________________________________________
Name
(PRINTED)
Signature
Date
Case One | Confidence
Interval: Population Mean | The
The objective of the
Framingham Heart Study was to identify the common factors or characteristics
that contribute to Cardiovascular disease (CVD) by following its development
over a long period of time (since 1948)
in a large group of participants who had not yet developed overt
symptoms of CVD or suffered a heart attack or stroke. Blood pressure is a
measurement of the force applied to the walls of the arteries as the heart
pumps blood through the body. Blood pressure readings are measured in
millimeters of mercury (mm Hg) and usually given as 2 numbers. The top
number is the systolic blood pressure reading. It represents the maximum
pressure exerted when the heart contracts. The bottom
number is the diastolic blood pressure reading. It represents the pressure in
the arteries when the heart is at rest. A
sample of FHS adult subjects yields the following readings: (top/bottom)
130/79, 175/75, 136/84, 124/84, 144/88,
154/90, 164/97, 210/120, 110/75, 166/108, 100/79, 172/110, 160/90, 122/84,
162/90, 155/85, 120/65, 128/84, 130/90, 210/110, 110/68, 160/106, 132/72,
120/80, 200/114, 165/105, 138/92, 134/84, 152/74, 118/70, 122/80, 155/90,
166/108, 120/80, 210/130, 121/85, 160/100, 135/75, 140/78, 142/85, 146/94,
185/90, 166/78, 193/116, 160/85, 140/90, 150/110, 140/84, 130/82, 130/80,
238/122, 128/92, 220/118, 165/95, 208/120, 126/80, 140/90, 166/104, 130/70,
131/88
Estimate the population mean diastolic blood pressure with 95% confidence. That is,
compute and discuss a 95% confidence interval for this population mean. Show your work. Fully
discuss the results. This
discussion must include a clear discussion of the population and the population
mean, the family of samples, the family of intervals and the interpretation of
the interval.
Numbers
n m sd
se Z Lower95
Upper95
60 90.85
15.2503 1.96880 2
86.9124 94.7876
n = 60
m » 90.85
sd » 15.2503
se = sd/sqrt(n) = sd/sqrt(60) » 15.2503/sqrt(60) »
1.96880
Z = 2 from the mean/proportion
row: 2.00 0.022750 0.95450
lower95 = m –
Z*se » 90.85 – 2*(15.2503/sqrt(60)) »
90.85 – 2*1.96880 »
86.9124
upper95 = m + Z*se » 90.85 + 2*(15.2503/sqrt(60)) »
90.85 + 2*1.96880 »
94.7876
Report the interval as [86.9,94.7].
Interpretation
Our population is the population of Framingham
Heart Study subjects and our population mean is the mean diastolic blood pressure
(DBP, mm Hg).
Our Family of Samples (FoS) consists of
every possible random sample of 60 Framingham Heart Study subjects.
From each member
sample of the FoS, we compute the sample mean (m) and standard deviation (sd)
for FHS DBP, and then compute the interval
[m – 2.00*( sd/sqrt(n)), m + 2.00*(
sd/sqrt(n))].
Computing this
interval for each member sample of the FoS, we obtain a Family of Intervals (FoI),
approximately 95% of which cover the true population mean systolic blood
pressure for Framingham Heart Study subjects.
If our interval, [86.9, 94.7] is among the approximate 95%
super-majority of intervals that cover the population mean, then the true
population mean diastolic blood pressure for Framingham Heart Study subjects is between 86.9 And 94.7 mm Hg.
Case Two | Confidence
Interval: Population Proportion | Prenatal Care
A random
sample of Year 2000 Georgia resident live births are checked for prenatal care status,
in the following categories:
Prenatal
Care Status |
Number in
Sample |
Prenatal
Care Began 1st Trimester (Months 1-3 of Pregnancy) |
410 |
Prenatal
Care Began 2nd Trimester (Months 4-6 of Pregnancy) |
52 |
Prenatal
Care Began 3rd Trimester (Months 7-9 of Pregnancy) |
12 |
No Prenatal
Care |
6 |
Prenatal
Care Status Unknown |
20 |
Total |
500 |
Compute and
interpret a 97% confidence interval for the population proportion of year 2000
Numbers
n = 500
event = “Prenatal Care Begins in 3rd Trimester or No
Prenatal Care”
nevent = 12 + 6 = 18
pevent =nevent/n = 18/500 = .036
sdp = sqrt(pevent *(1 – pevent )/ n) »
sqrt(.036*(1 – .036 )/ 500) » .0083311464
Z = 2.20 from row: 2.20 0.013903 0.97219
lower99 = pevent – Z*sdp » .036 – 2.2*.0083311464
» .01767
upper99 = pevent + Z*sdp » .036 + 2.2*.0083311464
» .05432
Report the
interval as [.017, .054].
Interpretation
Our population is the population of
Our Family of Samples (FoS) consists of
every possible random sample of 500 year 2000
From each member
sample of the FoS, we compute the sample proportion p of live births where prenatal care is absent or began in the 3rd
trimester, sdp = sqrt(p*(1–p)/n) and then compute the interval
[p – (2.20*sdp), p + (2.20*sdp)].
Computing this
interval for each member sample of the FoS, we obtain a Family of Intervals (FoI),
approximately 97% of which cover the true population proportion of year 2000
If our interval, [.017, .054] is among the
approximate 97% super-majority of intervals that cover the population
proportion, then between 1.7% and 5.4% of year 2000 Georgia resident live
births either did not have prenatal care, or had prenatal care beginning in the
3rd trimester.
Case Three |
Hypothesis Test: Population Median | The
Using the
Numbers
Here is the
data from case one:
The top
number is the systolic blood pressure reading.
130/79, 175/75, 136/84,
124/84, 144/88, 154/90, 164/97, 210/120, 110/75, 166/108, 100/79, 172/110,
160/90, 122/84, 162/90, 155/85, 120/65, 128/84, 130/90, 210/110, 110/68,
160/106, 132/72, 120/80, 200/114, 165/105, 138/92, 134/84, 152/74, 118/70,
122/80, 155/90, 166/108, 120/80, 210/130, 121/85, 160/100, 135/75, 140/78,
142/85, 146/94, 185/90, 166/78, 193/116, 160/85, 140/90, 150/110, 140/84,
130/82, 130/80, 238/122, 128/92, 220/118, 165/95, 208/120, 126/80, 140/90,
166/104, 130/70, 131/88
n = 60
Error Form:
“Guess is too large” – Count strictly below the guess:
130/79, 136/84, 124/84,
144/88, 154/90, 164/97, 110/75, 166/108, 100/79,
172/110, 160/90, 122/84, 162/90, 155/85, 120/65,
128/84, 130/90, 110/68, 160/106, 132/72, 120/80,
165/105, 138/92, 134/84, 152/74, 118/70, 122/80,
155/90, 166/108, 120/80, 121/85, 160/100, 135/75, 140/78, 142/85, 146/94,
166/78, 160/85, 140/90, 150/110, 140/84, 130/82, 130/80, 128/92, 165/95, 126/80,
140/90, 166/104, 130/70, 131/88
error =
#FHS subjects in the sample whose systolic blood pressure is strictly below 175
= 50
From the
row: 60 50
<0.00001, p-value <0.001%
Interpretation
Our population
is the population of Framingham Heart Study subjects.
Our Family of
Samples (FoS) consists of every possible random sample of 60 Framingham
Heart Study subjects.
From each member
sample of the FoS, we compute Error = Number of sample subjects whose
systolic blood pressure is strictly below 175 mm Hg. Computing this error
for each member sample of the FoS, we obtain a Family of Errors (FoE).
If the true
population systolic blood pressure for Framingham Heart Study subjects is 175 mm
Hg, then less than .001% of the Family of Samples yield errors as bad as or
worse than our single error. The sample presents highly significant evidence
against the null hypothesis.
Case Four | Hypothesis
Test: Categorical Goodness of Fit | Prenatal Care
A random
sample of Year 2000 Georgia resident live births are checked for prenatal care
status, in the following categories – consider the portion of the sample
reporting prenatal care:
Prenatal
Care Status |
Number in
Sample |
Prenatal Care
Began 1st Trimester (Months 1-3 of Pregnancy) |
410 |
Prenatal
Care Began 2nd Trimester (Months 4-6 of Pregnancy) |
52 |
Prenatal
Care Began 3rd Trimester (Months 7-9 of Pregnancy) |
12 |
Test the hypothesis that:
Pr
Pr
Pr
Show your work. Completely discuss and interpret your test
results, as indicated in class and case study summaries. Fully discuss the
testing procedure and results. This discussion must include a clear
discussion of the population and the null hypothesis, the family of samples,
the family of errors and the interpretation of the p-value. Show all
work and detail for full credit.
Numbers
Prenatal Care Status |
n |
P |
E=nP |
Error |
PNC T1 |
410 |
0.75 |
355.5 |
8.355133615 |
PNC T2 |
52 |
0.15 |
71.1 |
5.130942335 |
PNC T3 |
12 |
0.1 |
47.4 |
26.43797468 |
Total Sample with PNC |
474 |
1 |
474 |
39.92405063 |
n=474
Prenatal Starts 1st Trimester
Observed=410
Expected = n*PT1= 474*.75 » 355.5
Error = (Observed – Expected)2/Expected » (410 –
355.5)2/355.5 » 8.355133615
Prenatal Starts 2nd Trimester
Observed=52
Expected = n*PT2= 474*.15 » 71.1
Error = (Observed – Expected)2/Expected » (52 –
71.1)2/71.1 » 5.130942335
Prenatal Starts 3rd Trimester
Observed=12
Expected = n*PT3= 474*.10 » 47.4
Error = (Observed – Expected)2/Expected » (12 – 47.4)2/47.4
» 26.43797468
Total Error » 8.355133615 +
5.130942335 + 26.43797468 » 39.92405063
over three categories.
From row: 3 9.2103 0.010, p-value < .01, since our error
exceeds 9.2103.
Interpretation
Our population is the population of Year 2000
Our Family of Samples (FoS) consists of
every possible random sample of 474 Year 2000
ET1 = N*PT1 = 474*.75 » 355.5
ET2 = N*PT2 = 474*.15 » 71.1
ET3 = N*PT3
= 474*.10 » 47.4
From each member sample of the FoS, we compute sample counts and
errors for each level of survival:
ET1 = N*PT1 = 474*.75 » 355.5
ErrorT1 = (OT1 ─ ET1)2/
ET1
ET2 = N*PT2 = 474*.15 » 71.1
ErrorT2 = (OT2
─ ET2 )2/
ET2
ET3 = N*PT3
= 474*.10 » 47.4
ErrorT3 = (OT3 ─ ET3)2/
ET3
Then add the individual errors for the total error as Total Error =
ErrorT1 + ErrorT2 + ErrorT3
Computing this error for each member sample of the FoS, we obtain a
Family of Errors (FoE).
If the prenatal
care categories are distributed as: 75%
T1, 15% T2 and 10% T3, then fewer than 1% of the member samples of the
Family of Samples yields errors as large as or larger than that of our single
sample. Our sample presents highly significant evidence against the null
hypothesis.
Work all four (4)
cases.
Table 1. Means and Proportions
Z(k)
PROBRT PROBCENT 0.05 0.48006
0.03988 0.10 0.46017
0.07966 0.15 0.44038
0.11924 0.20 0.42074
0.15852 0.25 0.40129
0.19741 0.30 0.38209
0.23582 0.35 0.36317
0.27366 0.40 0.34458
0.31084 0.45 0.32636
0.34729 0.50 0.30854 0.38292 0.55 0.29116
0.41768 0.60 0.27425
0.45149 0.65 0.25785
0.48431 0.70 0.24196
0.51607 0.75 0.22663
0.54675 0.80 0.21186
0.57629 0.85 0.19766
0.60467 0.90 0.18406
0.63188 0.95 0.17106
0.65789 1.00 0.15866
0.68269 |
Z(k)
PROBRT PROBCENT 1.05 0.14686 0.70628 1.10 0.13567
0.72867 1.15 0.12507
0.74986 1.20 0.11507
0.76986 1.25 0.10565
0.78870 1.30 0.09680
0.80640 1.35 0.088508
0.82298 1.40 0.080757
0.83849 1.45 0.073529
0.85294 1.50 0.066807
0.86639 1.55 0.060571
0.87886 1.60 0.054799
0.89040 1.65 0.049471
0.90106 1.70 0.044565
0.91087 1.75 0.040059
0.91988 1.80 0.035930
0.92814 1.85 0.032157
0.93569 1.90 0.028717
0.94257 1.95 0.025588
0.94882 2.00
0.022750 0.95450 |
Z(k)
PROBRT PROBCENT 2.05 0.020182
0.95964 2.10 0.017864
0.96427 2.15 0.015778
0.96844 2.20
0.013903 0.97219 2.25 0.012224
0.97555 2.30 0.010724
0.97855 2.35 0.009387
0.98123 2.40 0.008198
0.98360 2.45 0.007143
0.98571 2.50 0.006210
0.98758 2.55 0.005386
0.98923 2.60 0.004661
0.99068 2.65 0.004025
0.99195 2.70 .0034670
0.99307 2.75 .0029798 0.99404 2.80 .0025551
0.99489 2.85 .0021860
0.99563 2.90 .0018658
0.99627 2.95 .0015889
0.99682 3.00 .0013499
0.99730 |
Table 2. Medians
n error base p-value 60
0 1.00000 60 1
1.00000 60 2
1.00000 60 3
1.00000 60 4
1.00000 60 5
1.00000 60 6
1.00000 60 7
1.00000 60 8
1.00000 60 9
1.00000
60 10
1.00000 60 11
1.00000 60 12
1.00000 60 13
0.99999 60 14
0.99998 60 15
0.99993 60 16
0.99980 60 17
0.99947 60 18
0.99866 60 19
0.99689 60 20
0.99326 |
n error base p-value 60
21 0.98633 60 22
0.97405 60 23
0.95377 60 24
0.92250 60 25
0.87747 60 26
0.81685 60 27
0.74052 60 28
0.65056 60 29
0.55129 60 30
0.44871
60 31
0.34944 60 32
0.25948 60 33
0.18315 60 34
0.12253 60 35
0.07750 60 36
0.04623 60 37
0.02595 60 38
0.01367 60 39
0.00674 60 40
0.00311 |
n error base p-value 60
41 0.00134 60 42
0.00053 60 43
0.00020 60 44
0.00007
60 45 0.00002 60 46
0.00001 60 47
<0.00001 60 48
<0.00001 60 49
<0.00001 |
Table 3. Categories/Goodness of Fit
Categories
ERROR p-value 3 0.0000 1.000 3 0.2107
0.900
3 0.4463 0.800 3 0.7133
0.700 3 1.0217 0.600 3 1.3863
0.500
3 1.5970 0.450 3 1.8326
0.400
3 2.0996 0.350 3 2.4079
0.300
3 2.7726 0.250 3 3.2189
0.200
3 4.6052 0.100 3 4.8159
0.090
3 5.0515 0.080 3 5.3185
0.070
3 5.6268 0.060 3 5.9915
0.050
3 6.4378 0.040 3 7.0131
0.030 3 7.8240 0.020 3
9.2103 0.010 |
Categories
ERROR p-value 4 0.0000 1.000 4 0.5844 0.900 4 1.0052 0.800 4 1.4237 0.700 4 1.8692 0.600 4 2.3660 0.500 4 2.6430 0.450 4 2.9462 0.400 4 3.2831 0.350 4 3.6649 0.300 4 4.1083 0.250 4 4.6416 0.200 4 4.9566 0.175 4 5.3170 0.150 4 5.7394 0.125 4 6.2514 0.100 4 6.4915 0.090 4 6.7587 0.080 4 7.0603 0.070 4 7.4069 0.060 4 7.8147 0.050 4 8.3112 0.040 4 8.9473 0.030 4 9.8374 0.020 4 11.3449 0.010 |
Categories ERROR
p-value 5 0.0000 1.000 5 1.0636 0.900 5 1.6488 0.800 5 2.1947 0.700 5 2.7528 0.600 5 3.3567 0.500 5 3.6871 0.450 5 4.0446 0.400 5 4.4377 0.350 5 4.8784 0.300 5 5.3853 0.250 5 5.9886 0.200 5 6.3423 0.175 5 6.7449 0.150 5 7.2140 0.125 5 7.7794 0.100 5 8.0434 0.090 5 8.3365 0.080 5 8.6664 0.070 5 9.0444 0.060 5 9.4877 0.050 5 10.0255 0.040 5 10.7119 0.030 5 11.6678 0.020 5 13.2767 0.010 |