Key

The 3rd Hourly

Math 1107

Spring Semester 2010

Protocol: You will use only the following resources: Your individual calculator; individual tool-sheet (single 8.5 by 11 inch sheet); your writing utensils; blank paper (provided by me) and this copy of the hourly. Do not share these resources with anyone else. In each case, show complete detail and work for full credit. Follow case study solutions and sample hourly keys in presenting your solutions. Work all four cases. Using only one side of the blank sheets provided, present your work. Do not write on both sides of the sheets provided, and present your work only on these sheets. All of your work goes on one side each of the blank sheets provided. Space out your work. Do not share information with any other students during this hourly. Do not use any external resources during this hourly.

Sign and Acknowledge: I agree to follow this protocol. 

 

Name (PRINTED)                                             Signature                                             Date

Case One | Hypothesis Test: Population Median | Fictitious Spotted Toad

The Fictitious Spotted Toad is a native species of Toad Island, and is noteworthy for the both the quantity and quality of its spots. Consider a random sample of toads, in which the number of spots per toad is noted:

12, 14, 15, 17, 18, 20, 20, 21, 22, 23, 24, 24, 25, 26, 26 , 31, 33, 33, 35, 42, 42, 45, 45, 47, 49, 49, 58, 58, 62, 65

Test the following: null (H0): The median spot count for Fictitious Spotted Toads is 25 (h = 25 ) against the alternative (H1): h > 25.

Show your work. Completely discuss and interpret your test results, as indicated in class and case study summaries. Fully discuss the testing procedure and results. This discussion must include a clear discussion of the population and the null hypothesis, the family of samples, the family of errors and the interpretation of the p-value.

Null Hypothesis: Median Spot Count = 25 Spots

Alternative  Hypothesis: Median Spot Count > 25 Spots (Guess is too Small)

Error Function: Number of Sample Toads with Strictly More Than 25 Spots

 

12, 14, 15, 17, 18 | 20, 20, 21, 22, 23 | 24, 24, 25, 26, 26 | 31, 33, 33, 35, 42 | 42, 45, 45, 47, 49 | 49, 58, 58, 62, 65

 

sample error = Number of Sample Toads with Strictly More Than 25 Spots = 17

n = sample size = 30

from 30   17   0.29233 , p ≈ .29233 (~29.2%)

 

Our population consists of Fictitious Spotted Toads. Our null hypothesis is that the population median spot count for Fictitious Spotted Toads is 25 spots per toad.

 

Each member of the family of samples (FoS) is a single random sample of 30 Fictitious Spotted Toads. The FoS consists of all possible samples of this type.

 

From each member of the (FoS), compute an error as the number of sample toads with strictly more than 25 spots. Computing this error for each member of the FoS forms a family of errors (FoE).

 

If the true population median spot count for Fictitious Spotted Toads is 35 spots per toad, then approximately 29.2% member samples from the FoS yield errors as bas as or worse than our error. The sample does not appear to present significant evidence against the null hypothesis.

 

Case Two | Hypothesis Test: Categorical Goodness-of-Fit | Fictitious Striped Lizard

The Fictitious Striped Lizard is a native species of Lizard Mountain, and is noteworthy for the both the quantity and quality of its stripes. Consider a random sample of stripes, in which the number of stripes per lizard is noted:

0, 0, 0, 0, 0, 5, 6, 7, 7, 7, 7, 8, 9, 11, 11, 12, 12, 12, 13, 13, 14, 14, 17, 17, 18, 18, 18, 18, 18, 20, 21, 22, 23, 23, 24, 27, 29, 29, 30, 30

Consider the following stripe count categories

Stripe Count Range

Category

0

Stripe-less

1-10

Sparsely Striped

11-25

Normally Striped

26-29

Densely Striped

30 or more

Very Densely Striped

 

Case Two | Hypothesis Test: Categorical Goodness-of-Fit | Fictitious Striped Lizard

Test the hypothesis that:

Pr{ Fictitious Striped Lizard is Stripe-less} = .05

Pr{ Fictitious Striped Lizard is Sparsely Striped} = 0.25

Pr{ Fictitious Striped Lizard is Normally Striped} = 0.40

Pr{ Fictitious Striped Lizard is Densely Striped} = 0.25

Pr{ Fictitious Striped Lizard is Very Densely Striped} = 0.05

Show your work. Completely discuss and interpret your test results, as indicated in class and case study summaries. Fully discuss the testing procedure and results. This discussion must include a clear discussion of the population and the null hypothesis, the family of samples, the family of errors and the interpretation of the p-value. Show all work and detail for full credit.

stripecat                o     e    error      sum

Densely Striped          3    10     4.90      4.90

Normally Striped        22    16     2.25      7.15

Sparsely Striped         8    10     0.40      7.55

Stripeless               5     2     4.50     12.05

Very Densely Striped     2     2     0.00     12.05

 

Stripeless{0 Stripes}: 0, 0, 0, 0, 0 (5 Observed)

Sparsely Striped{1-10 Stripes}: 5, 6, 7, 7, 7 | 7, 8, 9 (8 Observed)

Normally Striped{11-25 Stripes}: 11, 11, 12, 12, 12 | 13, 13, 14, 14, 17 | 17, 18, 18, 18, 18 | 18, 20, 21, 22, 23 | 23, 24 (22 Observed)

 

Densely Striped{26-29 stripes}: 27, 29, 29 (3 Observed)

Very Densely Striped{30 or more stripes}: 30, 30 (2 Observed)

 

Total = 5 + 8 + 22 + 5 = 13 + 27 = 40

 

Expected Counts from the Null Hypothesis for n=40

 

eStripeless = 40*0.05 = 2

eSparse = 40*0.25 = 10

eNormal = 40*.40 = 16

eDense = 40*.25 = 10

eVeryDense = 40*0.05 =2

 

Error Calculations

 

errorStripeless = (nStripelesseStripeless )2/ eStripeless = (5 – 2 )2/ 2 = 4.5

errorSparse = (nSparseeSparse )2/ eSparse = (8 – 10 )2/ 10 = 0.4

errorNormal = (nNormaleNormal )2/ eNormal = (22 – 16 )2/16 = 2.25

errorDense = (nDenseeDense )2/ eDense = (3 – 10 )2/ 10 = 4.90

errorVeryDense = (nVeryDenseeVeryDense )2/ eVeryDense = (2 –  2 )2/2 = 0

 

errorTotal =

 errorStripeless + errorSparse + errorNormal + errorDense + errorVeryDense = 4.5 + 0.4 + 2.25 + 4.90 + 0 = 12.05

 

From 5  11.6678  0.020 and 5  13.2767  0.010, .01 < p < 0.02 – the p-value is strictly between 1% and 2%.

 

Interpretation

 

Our population consists of Fictitious Striped Lizards. Our categories are based on stripe count: Stripeless{0 Stripes}, Sparsely Striped{1-10 Stripes}, Normally Striped{11-25 Stripes}, Densely Striped{26-29 Stripes} and Very Densely Striped{30 or more stripes}. Our null hypothesis is that the categories are distributed as: 5% Stripeless, 25% Sparsely Striped, 40% Normally Striped, 25% Densely Striped and 5% Very Densely Striped.

Our Family of Samples (FoS) consists of every possible random sample of 40 Fictitious Striped Lizards. Under the null hypothesis, within each member of the FoS, we expect approximately:

eStripeless = 40*0.05 = 2

eSparse = 40*0.25 = 10

eNormal = 40*.40 = 16

eDense = 40*.25 = 10

eVeryDense = 40*0.05 =2

 

From each member sample of the FoS, we compute sample counts and errors for each level of stripe count:

 

errorStripeless = (nStripelesseStripeless )2/ eStripeless

errorSparse = (nSparseeSparse )2/ eSparse

errorNormal = (nNormaleNormal )2/ eNormal

errorDense = (nDenseeDense )2/ eDense

errorVeryDense = (nVeryDenseeVeryDense )2/ eVeryDense

 

Then add the individual errors for the total error as

errorTotal = errorStripeless + errorSparse + errorNormal + errorDense + errorVeryDense

 

Computing this error for each member sample of the FoS, we obtain a Family of Errors (FoE).

 

If the stripe count categories for Fictitious Striped Lizard are distributed as 5% Stripeless, 25% Sparsely Striped, 40% Normally Striped, 25% Densely Striped and 5% Very Densely Striped then between 1% and 2% of the member samples of the Family of Samples yields errors as large as or larger than that of our single sample. Our sample presents significant evidence against the null hypothesis.

 

Case Three | Confidence Interval for the Population Proportion | Framingham Heart Study

The objective of the Framingham Heart Study (FHS) is to identify the common factors or characteristics that contribute to Cardiovascular disease (CVD) by following its development over a long period of time (since 1948)  in a large group of participants who had not yet developed overt symptoms of CVD or suffered a heart attack or stroke. Blood pressure is a measurement of the force applied to the walls of the arteries as the heart pumps blood through the body. Blood pressure readings are measured in millimeters of mercury (mm Hg) and usually given as two numbers: the systolic blood pressure (SBP) reading, representing the maximum pressure exerted when the heart contracts and the diastolic blood pressure (DBP) reading, representing the pressure in the arteries when the heart is at rest.

Consider the systolic to diastolic blood pressure ratio R = SBP/DBP. 

A sample of FHS adult subjects yields the following ratios:

1.86         1.71         1.43         1.82         1.62         1.57         1.53         1.42         1.55         1.75         1.95         1.58

2.33         1.69         1.56         1.85         1.51         1.50         1.72         1.50         2.06         1.36         1.78         1.56

1.62         1.75         1.78         1.52         1.83         1.60         1.54         1.80         2.13         1.67         1.86         1.60

1.48         1.47         1.45         1.44         1.50         2.05         1.50         1.79         1.82         1.59         1.74         1.86

1.64         1.54         2.03         1.91         1.92         1.97         1.54         1.67         2.00         1.63         1.82         1.49

 

Define the event “Framingham Heart Study Subject has Systolic to Diastolic Blood Pressure Ratio of 1.30 or greater.” Estimate the population proportion for this event with 98% confidence. That is, compute and discuss a 98% confidence interval for this population proportion. Provide concise and complete details and discussion as demonstrated in the case study summaries.

 

“Framingham Heart Study Subject has Systolic to Diastolic Blood Pressure Ratio of 1.30 or greater.”

 

1.86         1.71         1.43         1.82         1.62         1.57         1.53         1.42         1.55         1.75         1.95         1.58

2.33         1.69         1.56         1.85         1.51         1.50         1.72         1.50         2.06         1.36         1.78         1.56

1.62         1.75         1.78         1.52         1.83         1.60         1.54         1.80         2.13         1.67         1.86         1.60

1.48         1.47         1.45         1.44         1.50         2.05         1.50         1.79         1.82         1.59         1.74         1.86

1.64         1.54         2.03         1.91         1.92         1.97         1.54         1.67         2.00         1.63         1.82         1.49

 

 

sample size n = 60

 

event count = e = 60

p = e /n = 60/60 = 1

1 – p = 1 – 1 = 0

sdp = sqrt(p*(1 – p)/n) = sqrt(1*0/60)  = 0

 

from 2.35   0.009387   0.98123, Z 2.35

 

 lower98 = p – 2.35*sdp ≈ 1 – (2.35*0) = 1

upper98 = p + 2.35*sdp ≈ 1 + (2.35*0)  = 1

Our population consists of study subjects in the Framingham Heart Study (FHS). Our population proportion is the population proportion of Framingham Heart Study Subject having systolic to diastolic blood pressure ratios of 1.30 or greater.

 

Each member of the family of samples (FoS) is a single random sample of 60 FHS study subjects.The FoS consists of all possible samples of this type.

 

From each member of the (FoS), compute: 

 

e = sample count of FHS study subjects with systolic to diastolic blood pressure ratios of 1.30 or greater.

p = sample proportion of FHS study subjects with systolic to diastolic blood pressure ratios of 1.30 or greater. = e/60

sdp = square root of (p*(1 – p)/n )  

from 2.35   0.009387   0.98123, Z 2.35

 

and then compute the interval as: lower98  = p   (2.35*sdp), upper98 = p +  (2.35*sdp).

 

Computing this interval for each member of the FoS forms a family of intervals (FoI).

 

Approximately 98% of the FoI captures the true population proportion of Framingham Heart Study Subject having systolic to diastolic blood pressure ratios of 1.3 or greater. If our interval resides in this 98% supermajority, then 100% of Framingham Heart Study Subject have systolic to diastolic blood pressure ratios of 1.3 or greater.

 

Case Four | Confidence Interval for the Population Mean | Framingham Heart Study

 

Using the context and data from Case Three, estimate the population mean systolic to diastolic blood pressure ratio with 99% confidence. That is, compute and discuss a 99% confidence interval for this population mean. Show your work. Fully discuss the results. This discussion must include a clear discussion of the population and the population mean, the family of samples, the family of intervals and the interpretation of the interval.

 

n      m         sd         se        Z     lower99    upper99

60    1.696    0.20511    0.026479    2.6    1.62715    1.76485

from 2.60   0.004661   0.99068, Z ≈ 2.60

se = sd/sqrt(n) = 0.20511/sqrt(60) 0.026479;

 

lower93 = m - (z*se) = 1.696 - (2.60*0.026479) ≈ 1.62715

upper93 = m + (z*se) = 1.696 + (2.60*0.026479) ≈ 1.76485

 

Our population consists of study subjects in the Framingham Heart Study (FHS). Our population mean is the population mean systolic to diastolic blood pressure ratio.

 

Each member of the family of samples (FoS) is a single random sample of 60 FHS study subjects. The FoS consists of all possible samples of this type.

 

From each member of the (FoS), compute: 

 

m = sample mean systolic to diastolic blood pressure ratio

sd = sample standard deviation for the sample mean systolic to diastolic blood pressure ratio

se = sample standard error = sd/sqrt(60)

 

from 2.60   0.004661   0.99068, Z ≈ 2.60

 

and then compute the interval as:  lower99 = m - (2.60*se), upper99 = m + (2.60*se).

 

Computing this interval for each member of the FoS forms a family of intervals (FoI).

 

Approximately 99% of the FoI captures the true population mean systolic to diastolic blood pressure ratio for Framingham Heart Study subjects. If our interval resides in this 99% supermajority, then systolic to diastolic blood pressure ratio for Framingham Heart Study subjects.is between  1.627 and 1.765.

 

Work all four (4) cases.

Tble 1. Means and Proportions

 Z(k) PROBRT PROBCENT

0.05 0.48006 0.03988

0.10 0.46017 0.07966

0.15 0.44038 0.11924

0.20 0.42074 0.15852

0.25 0.40129 0.19741

0.30 0.38209 0.23582

0.35 0.36317 0.27366

0.40 0.34458 0.31084

0.45 0.32636 0.34729

0.50 0.30854 0.38292

0.55 0.29116 0.41768

0.60 0.27425 0.45149

0.65 0.25785 0.48431

0.70 0.24196 0.51607

0.75 0.22663 0.54675

0.80 0.21186 0.57629

0.85 0.19766 0.60467

0.90 0.18406 0.63188

0.95 0.17106 0.65789

1.00 0.15866 0.68269

Z(k) PROBRT PROBCENT

1.05 0.14686 0.70628

1.10 0.13567 0.72867

1.15 0.12507 0.74986

1.20 0.11507 0.76986

1.25 0.10565 0.78870

1.30 0.09680 0.80640

1.35 0.088508 0.82298

1.40 0.080757 0.83849

1.45 0.073529 0.85294

1.50 0.066807 0.86639

1.55 0.060571 0.87886

1.60 0.054799 0.89040

1.65 0.049471 0.90106

1.70 0.044565 0.91087

1.75 0.040059 0.91988

1.80 0.035930 0.92814

1.85 0.032157 0.93569

1.90 0.028717 0.94257

1.95 0.025588 0.94882

2.00 0.022750 0.95450

Z(k) PROBRT PROBCENT

2.05 0.020182 0.95964

2.10 0.017864 0.96427

2.15 0.015778 0.96844

2.20 0.013903 0.97219

2.25 0.012224 0.97555

2.30 0.010724 0.97855

2.35 0.009387 0.98123

2.40 0.008198 0.98360

2.45 0.007143 0.98571

2.50 0.006210 0.98758

2.55 0.005386 0.98923

2.60 0.004661 0.99068

2.65 0.004025 0.99195

2.70 .0034670 0.99307

2.75 .0029798 0.99404

2.80 .0025551 0.99489

2.85 .0021860 0.99563

2.90 .0018658 0.99627

2.95 .0015889 0.99682

3.00 .0013499 0.99730

 

Table 2. Medians

n error base p-value

25 1 1.00000

25 2 1.00000

25 3 0.99999

25 4 0.99992

25 5 0.99954

25 6 0.99796

25 7 0.99268

25 8 0.97836

25 9 0.94612

25 10 0.88524

25 11 0.78782

25 12 0.65498

25 13 0.50000

25 14 0.34502

25 15 0.21218

25 16 0.11476

25 17 0.05388

25 18 0.02164

25 19 0.00732

25 20 0.00204

25 21 0.00046 

 n error base p-value

25 22 0.00008

25 23 0.00001

25 23 0.00001

25 24 0.00000

25 25 0.00000

30 1 1.00000

30 2 1.00000

30 3 1.00000

30 4 1.00000

30 5 0.99997

30 6 0.99984

30 7 0.99928

30 8 0.99739

30 9 0.99194

30 10 0.97861

30 11 0.95063

30 12 0.89976

30 13 0.81920

30 14 0.70767

30 15 0.57223

 n error base p-value

30 16 0.42777

30 17 0.29233

30 18 0.18080

30 19 0.10024

30 20 0.04937

30 21 0.02139

30 22 0.00806

30 23 0.00261

30 24 0.00072

30 24 0.00072

30 25 0.00016

30 26 0.00003

30 27 “< 0.00001”

30 28 “< 0.00001”

30 29 “< 0.00001”

30 30 “< 0.00001”

   Table 3. Categories/Goodness of Fit 

 

Categories ERROR  p-value 3 0.0000 1.000                                  3 0.2107 0.900                                   3 0.4463 0.800                                   3 0.7133 0.700                                   3 1.0217 0.600                                   3 1.3863 0.500                                   3 1.5970 0.450                                   3 1.8326 0.400                                   3 2.0996 0.350                                   3 2.4079 0.300                                   3 2.7726 0.250                                   3 3.2189 0.200                                   3 4.6052 0.100                                   3 4.8159 0.090                                   3 5.0515 0.080                                   3 5.3185 0.070                                   3 5.6268 0.060                                   3 5.9915 0.050                                   3 6.4378 0.040                                   3 7.0131 0.030                                   3 7.8240 0.020                                   3 9.2103 0.010

Categories ERROR p-value

4 0.0000 1.000

4 0.5844 0.900

4 1.0052 0.800

4 1.4237 0.700

4 1.8692 0.600

4 2.3660 0.500

4 2.6430 0.450

4 2.9462 0.400

4 3.2831 0.350

4 3.6649 0.300

4 4.1083 0.250

4 4.6416 0.200

4 6.2514 0.100

4 6.4915 0.090

4 6.7587 0.080

4 7.0603 0.070

4 7.4069 0.060

4 7.8147 0.050

4 8.3112 0.040

4 8.9473 0.030

4 9.8374 0.020

4 11.3449 0.010

 

 Categories ERROR p-value

5 0.0000 1.000

5 1.0636 0.900

5 1.6488 0.800

5 2.1947 0.700

5 2.7528 0.600

5 3.3567 0.500

5 3.6871 0.450

5 4.0446 0.400

5 4.4377 0.350

5 4.8784 0.300

5 5.3853 0.250

5 5.9886 0.200

5 7.7794 0.100

5 8.0434 0.090

5 8.3365 0.080

5 8.6664 0.070

5 9.0444 0.060

5 9.4877 0.050

5 10.0255 0.040

5 10.7119 0.030

5 11.6678 0.020

5 13.2767 0.010