The 3rd Hourly

Key

The 3^rd Hourly

Math 1107

Spring 2009

Protocol

You will use only the following resources: Your individual calculator; individual tool-sheet (one (1) 8.5 by 11 inch sheet), writing utensils, blank paper (provided by me) and this copy of the hourly. Do not share these resources with anyone else. Show complete detail and work for full credit. Follow case study solutions and sample hourly keys in presenting your solutions. Work all four cases. Using only one side of the blank sheets provided, present your work. Do not write on both sides of the sheets provided, and present your work only on these sheets. Do not share information with any other students during this hourly.

When you are finished: Prepare a Cover Sheet: Print your name on an otherwise blank sheet of paper. Then stack your stuff as follows: Cover Sheet (Top), Your Work Sheets, The Test Papers, Your Toolsheet. Then hand all of this in to me.

Sign and Acknowledge: I agree to follow this protocol.

________________________________________________________________________

Name (PRINTED) Signature Date

Case One | Confidence Interval: Population Mean | The Framingham Heart Study

The objective of the Framingham Heart Study was to identify the common factors or characteristics that contribute to Cardiovascular disease (CVD) by following its development over a long period of time (since 1948) in a large group of participants who had not yet developed overt symptoms of CVD or suffered a heart attack or stroke. Blood pressure is a measurement of the force applied to the walls of the arteries as the heart pumps blood through the body. Blood pressure readings are measured in millimeters of mercury (mm Hg) and usually given as 2 numbers. The top number is the systolic blood pressure reading. It represents the maximum pressure exerted when the heart contracts. The bottom number is the diastolic blood pressure reading. It represents the pressure in the arteries when the heart is at rest. A sample of FHS adult subjects yields the following readings: (top/bottom)

130/79, 175/75, 136/84, 124/84, 144/88, 154/90, 164/97, 210/120, 110/75, 166/108, 100/79, 172/110, 160/90, 122/84, 162/90, 155/85, 120/65, 128/84, 130/90, 210/110, 110/68, 160/106, 132/72, 120/80, 200/114, 165/105, 138/92, 134/84, 152/74, 118/70, 122/80, 155/90, 166/108, 120/80, 210/130, 121/85, 160/100, 135/75, 140/78, 142/85, 146/94, 185/90, 166/78, 193/116, 160/85, 140/90, 150/110, 140/84, 130/82, 130/80, 238/122, 128/92, 220/118, 165/95, 208/120, 126/80, 140/90, 166/104, 130/70, 131/88

Estimate the population mean diastolic blood pressure with 95% confidence. That is, compute and discuss a 95% confidence interval for this population mean. Show your work. Fully discuss the results. This discussion must include a clear discussion of the population and the population mean, the family of samples, the family of intervals and the interpretation of the interval.

Numbers

n m sd se Z Lower95 Upper95

60 90.85 15.2503 1.96880 2 86.9124 94.7876

n = 60

m » 90.85

sd » 15.2503

se = sd/sqrt(n) = sd/sqrt(60) » 15.2503/sqrt(60) » 1.96880

Z = 2 from the mean/proportion row: 2.00 0.022750 0.95450

lower95 = m – Z*se » 90.85 – 2*(15.2503/sqrt(60)) » 90.85 – 2*1.96880 » 86.9124

upper95 = m + Z*se » 90.85 + 2*(15.2503/sqrt(60)) » 90.85 + 2*1.96880 » 94.7876

Report the interval as [86.9,94.7].

Interpretation

Our population is the population of Framingham Heart Study subjects and our population mean is the mean diastolic blood pressure (DBP, mm Hg).

Our Family of Samples (FoS) consists of every possible random sample of 60 Framingham Heart Study subjects.

From each member sample of the FoS, we compute the sample mean (m) and standard deviation (sd) for FHS DBP, and then compute the interval

[m – 2.00*( sd/sqrt(n)), m + 2.00*( sd/sqrt(n))].

Computing this interval for each member sample of the FoS, we obtain a Family of Intervals (FoI), approximately 95% of which cover the true population mean systolic blood pressure for Framingham Heart Study subjects.

If our interval, [86.9, 94.7] is among the approximate 95% super-majority of intervals that cover the population mean, then the true population mean diastolic blood pressure for Framingham Heart Study subjects is between 86.9 And 94.7 mm Hg.

Case Two | Confidence Interval: Population Proportion | Prenatal Care

A random sample of Year 2000 Georgia resident live births are checked for prenatal care status, in the following categories:

Prenatal Care Status	Number in Sample
Prenatal Care Began 1^st Trimester (Months 1-3 of Pregnancy)	410
Prenatal Care Began 2^nd Trimester (Months 4-6 of Pregnancy)	52
Prenatal Care Began 3^rd Trimester (Months 7-9 of Pregnancy)	12
No Prenatal Care	6
Prenatal Care Status Unknown	20
Total	500

Compute and interpret a 97% confidence interval for the population proportion of year 2000 Georgia resident live births reporting 3^rd trimester or no prenatal care. That is, compute and discuss a 97% confidence interval for this population proportion. Show your work. Fully discuss the results. This discussion must include a clear discussion of the population and the population proportion, the family of samples, the family of intervals and the interpretation of the interval.

Numbers

n = 500

event = “Prenatal Care Begins in 3^rd Trimester or No Prenatal Care”

n_event = 12 + 6 = 18

p_event =n_event/n = 18/500 = .036

sdp = sqrt(p_event *(1 – p_event )/ n) » sqrt(.036*(1 – .036 )/ 500) » .0083311464

Z = 2.20 from row: 2.20 0.013903 0.97219

lower99 = p_event – Z*sdp » .036 – 2.2*.0083311464 » .01767

upper99 = p_event + Z*sdp » .036 + 2.2*.0083311464 » .05432

Report the interval as [.017, .054].

Interpretation

Our population is the population of Georgia resident live births occurring in the year 2000. We seek to estimated the population proportion of year 2000 Georgia resident live births where prenatal is absent or began in the 3^rd trimester.

Our Family of Samples (FoS) consists of every possible random sample of 500 year 2000 Georgia resident live births.

From each member sample of the FoS, we compute the sample proportion p of live births where prenatal care is absent or began in the 3^rd trimester, sdp = sqrt(p*(1–p)/n) and then compute the interval

[p – (2.20*sdp), p + (2.20*sdp)].

Computing this interval for each member sample of the FoS, we obtain a Family of Intervals (FoI), approximately 97% of which cover the true population proportion of year 2000 Georgia resident live births where prenatal care is absent or began in the 3^rd trimester.

If our interval, [.017, .054] is among the approximate 97% super-majority of intervals that cover the population proportion, then between 1.7% and 5.4% of year 2000 Georgia resident live births either did not have prenatal care, or had prenatal care beginning in the 3^rd trimester.

Case Three | Hypothesis Test: Population Median | The Framingham Heart Study

Using the Framingham Heart Study sample from Case One, test the following: null (H₀): The median systolic blood pressure (SBP) is 175 (h = 175) against the alternative (H₁): h < 175. Show your work. Fully discuss the results. This discussion must include a clear discussion of the population and the population median, the family of samples, the family of errors and the interpretation of the p-value.

Numbers

Here is the data from case one:

The top number is the systolic blood pressure reading.

n = 60

Error Form: “Guess is too large” – Count strictly below the guess:

130/79, 136/84, 124/84, 144/88, 154/90, 164/97, 110/75, 166/108, 100/79, 172/110, 160/90, 122/84, 162/90, 155/85, 120/65, 128/84, 130/90, 110/68, 160/106, 132/72, 120/80, 165/105, 138/92, 134/84, 152/74, 118/70, 122/80, 155/90, 166/108, 120/80, 121/85, 160/100, 135/75, 140/78, 142/85, 146/94, 166/78, 160/85, 140/90, 150/110, 140/84, 130/82, 130/80, 128/92, 165/95, 126/80, 140/90, 166/104, 130/70, 131/88

error = #FHS subjects in the sample whose systolic blood pressure is strictly below 175 = 50

From the row: 60 50 <0.00001, p-value <0.001%

Interpretation

Our population is the population of Framingham Heart Study subjects.

Our Family of Samples (FoS) consists of every possible random sample of 60 Framingham Heart Study subjects.

From each member sample of the FoS, we compute Error = Number of sample subjects whose systolic blood pressure is strictly below 175 mm Hg. Computing this error for each member sample of the FoS, we obtain a Family of Errors (FoE).

If the true population systolic blood pressure for Framingham Heart Study subjects is 175 mm Hg, then less than .001% of the Family of Samples yield errors as bad as or worse than our single error. The sample presents highly significant evidence against the null hypothesis.

Case Four | Hypothesis Test: Categorical Goodness of Fit | Prenatal Care

A random sample of Year 2000 Georgia resident live births are checked for prenatal care status, in the following categories – consider the portion of the sample reporting prenatal care:

Prenatal Care Status	Number in Sample
Prenatal Care Began 1^st Trimester (Months 1-3 of Pregnancy)	410
Prenatal Care Began 2^nd Trimester (Months 4-6 of Pregnancy)	52
Prenatal Care Began 3^rd Trimester (Months 7-9 of Pregnancy)	12

Test the hypothesis that:

Pr{ Prenatal Care Began 1^st Trimester (Months 1-3 of Pregnancy)} = .75

Pr{ Prenatal Care Began 2^ndTrimester (Months 4-6 of Pregnancy)} = .15

Pr{ Prenatal Care Began 3^rd Trimester (Months 7-9 of Pregnancy)} = .10 .

Show your work. Completely discuss and interpret your test results, as indicated in class and case study summaries. Fully discuss the testing procedure and results. This discussion must include a clear discussion of the population and the null hypothesis, the family of samples, the family of errors and the interpretation of the p-value. Show all work and detail for full credit.

Numbers

Prenatal Care Status	n	P	E=nP	Error
PNC T1	410	0.75	355.5	8.355133615
PNC T2	52	0.15	71.1	5.130942335
PNC T3	12	0.1	47.4	26.43797468
Total Sample with PNC	474	1	474	39.92405063

n=474

Prenatal Starts 1^st Trimester

Observed=410

Expected = n*PT1= 474*.75 » 355.5

Error = (Observed – Expected)²/Expected » (410 – 355.5)²/355.5 » 8.355133615

Prenatal Starts 2^nd Trimester

Observed=52

Expected = n*PT2= 474*.15 » 71.1

Error = (Observed – Expected)²/Expected » (52 – 71.1)²/71.1 » 5.130942335

Prenatal Starts 3^rd Trimester

Observed=12

Expected = n*PT3= 474*.10 » 47.4

Error = (Observed – Expected)²/Expected » (12 – 47.4)²/47.4 » 26.43797468

Total Error » 8.355133615 + 5.130942335 + 26.43797468 » 39.92405063 over three categories.

From row: 3 9.2103 0.010, p-value < .01, since our error exceeds 9.2103.

Interpretation

Our population is the population of Year 2000 Georgia resident live births. Our categories are based on those who reported receiving Prenatal Care and include: (T1)Prenatal Care Began 1st Trimester (Months 1-3 of Pregnancy), (T2)Prenatal Care Began 2nd Trimester (Months 4-6 of Pregnancy) and (T3)Prenatal Care Began 3rd Trimester (Months 7-9 of Pregnancy). Our null hypothesis is that the categories are distributed as: 75% T1, 15% T2 and 10% T3.

Our Family of Samples (FoS) consists of every possible random sample of 474 Year 2000 Georgia resident live births. Under the null hypothesis, within each member of the FoS, we expect approximately:

E_T1 = N*P_T1= 474*.75 » 355.5

E_T2 = N*P_T2= 474*.15 » 71.1

E_T3 = N*P_T3 = 474*.10 » 47.4

From each member sample of the FoS, we compute sample counts and errors for each level of survival:

E_T1 = N*P_T1= 474*.75 » 355.5

Error_T1 = (O_T1 ─ E_T1)²/ E_T1

E_T2 = N*P_T2= 474*.15 » 71.1

Error_T2 = (O_T2 ─ E_T2)²/ E_T2

E_T3 = N*P_T3 = 474*.10 » 47.4

Error_T3 = (O_T3 ─ E_T3)²/ E_T3

Then add the individual errors for the total error as Total Error = Error_T1 + Error_T2 + Error_T3

Computing this error for each member sample of the FoS, we obtain a Family of Errors (FoE).

If the prenatal care categories are distributed as: 75% T1, 15% T2 and 10% T3, then fewer than 1% of the member samples of the Family of Samples yields errors as large as or larger than that of our single sample. Our sample presents highly significant evidence against the null hypothesis.

Work all four (4) cases.

Table 1. Means and Proportions

Z(k) PROBRT PROBCENT

0.05 0.48006 0.03988

0.10 0.46017 0.07966

0.15 0.44038 0.11924

0.20 0.42074 0.15852

0.25 0.40129 0.19741

0.30 0.38209 0.23582

0.35 0.36317 0.27366

0.40 0.34458 0.31084

0.45 0.32636 0.34729

0.50 0.30854 0.38292

0.55 0.29116 0.41768

0.60 0.27425 0.45149

0.65 0.25785 0.48431

0.70 0.24196 0.51607

0.75 0.22663 0.54675

0.80 0.21186 0.57629

0.85 0.19766 0.60467

0.90 0.18406 0.63188

0.95 0.17106 0.65789

1.00 0.15866 0.68269

Z(k) PROBRT PROBCENT

1.05 0.14686 0.70628

1.10 0.13567 0.72867

1.15 0.12507 0.74986

1.20 0.11507 0.76986

1.25 0.10565 0.78870

1.30 0.09680 0.80640

1.35 0.088508 0.82298

1.40 0.080757 0.83849

1.45 0.073529 0.85294

1.50 0.066807 0.86639

1.55 0.060571 0.87886

1.60 0.054799 0.89040

1.65 0.049471 0.90106

1.70 0.044565 0.91087

1.75 0.040059 0.91988

1.80 0.035930 0.92814

1.85 0.032157 0.93569

1.90 0.028717 0.94257

1.95 0.025588 0.94882

2.00 0.022750 0.95450

Z(k) PROBRT PROBCENT

2.05 0.020182 0.95964

2.10 0.017864 0.96427

2.15 0.015778 0.96844

2.20 0.013903 0.97219

2.25 0.012224 0.97555

2.30 0.010724 0.97855

2.35 0.009387 0.98123

2.40 0.008198 0.98360

2.45 0.007143 0.98571

2.50 0.006210 0.98758

2.55 0.005386 0.98923

2.60 0.004661 0.99068

2.65 0.004025 0.99195

2.70 .0034670 0.99307

2.75 .0029798 0.99404

2.80 .0025551 0.99489

2.85 .0021860 0.99563

2.90 .0018658 0.99627

2.95 .0015889 0.99682

3.00 .0013499 0.99730

Table 2. Medians

n error base p-value

60 0 1.00000 60 1 1.00000 60 2 1.00000 60 3 1.00000 60 4 1.00000 60 5 1.00000 60 6 1.00000 60 7 1.00000 60 8 1.00000 60 9 1.00000 60 10 1.00000 60 11 1.00000 60 12 1.00000 60 13 0.99999 60 14 0.99998 60 15 0.99993 60 16 0.99980 60 17 0.99947 60 18 0.99866 60 19 0.99689 60 20 0.99326

n error base p-value

60 21 0.98633 60 22 0.97405 60 23 0.95377 60 24 0.92250 60 25 0.87747 60 26 0.81685 60 27 0.74052 60 28 0.65056 60 29 0.55129 60 30 0.44871 60 31 0.34944 60 32 0.25948 60 33 0.18315 60 34 0.12253 60 35 0.07750 60 36 0.04623 60 37 0.02595 60 38 0.01367 60 39 0.00674 60 40 0.00311

n error base p-value

60 41 0.00134 60 42 0.00053 60 43 0.00020 60 44 0.00007 60 45 0.00002 60 46 0.00001 60 47 <0.00001 60 48 <0.00001 60 49 <0.00001
60 50 <0.00001 60 51 <0.00001 60 52 <0.00001 60 53 <0.00001 60 54 <0.00001 60 55 <0.00001 60 56 <0.00001 60 57 <0.00001 60 58 <0.00001 60 59 <0.00001 60 60 <0.00001

Table 3. Categories/Goodness of Fit

Categories ERROR p-value

3 0.0000 1.000 3 0.2107 0.900 3 0.4463 0.800 3 0.7133 0.700 3 1.0217 0.600 3 1.3863 0.500 3 1.5970 0.450 3 1.8326 0.400 3 2.0996 0.350 3 2.4079 0.300 3 2.7726 0.250 3 3.2189 0.200 3 4.6052 0.100 3 4.8159 0.090 3 5.0515 0.080 3 5.3185 0.070 3 5.6268 0.060 3 5.9915 0.050 3 6.4378 0.040 3 7.0131 0.030 3 7.8240 0.020 3 9.2103 0.010

Categories ERROR p-value

4 0.0000 1.000

4 0.5844 0.900

4 1.0052 0.800

4 1.4237 0.700

4 1.8692 0.600

4 2.3660 0.500

4 2.6430 0.450

4 2.9462 0.400

4 3.2831 0.350

4 3.6649 0.300

4 4.1083 0.250

4 4.6416 0.200

4 4.9566 0.175

4 5.3170 0.150

4 5.7394 0.125

4 6.2514 0.100

4 6.4915 0.090

4 6.7587 0.080

4 7.0603 0.070

4 7.4069 0.060

4 7.8147 0.050

4 8.3112 0.040

4 8.9473 0.030

4 9.8374 0.020

4 11.3449 0.010

Categories ERROR p-value

5 0.0000 1.000

5 1.0636 0.900

5 1.6488 0.800

5 2.1947 0.700

5 2.7528 0.600

5 3.3567 0.500

5 3.6871 0.450

5 4.0446 0.400

5 4.4377 0.350

5 4.8784 0.300

5 5.3853 0.250

5 5.9886 0.200

5 6.3423 0.175

5 6.7449 0.150

5 7.2140 0.125

5 7.7794 0.100

5 8.0434 0.090

5 8.3365 0.080

5 8.6664 0.070

5 9.0444 0.060

5 9.4877 0.050

5 10.0255 0.040

5 10.7119 0.030

5 11.6678 0.020

5 13.2767 0.010