Key | Third Hourly | Math 1107| Spring Semester 2011

 

Protocol

 

You will use only the following resources: Your individual calculator; Your individual tool-sheet (one (1) 8.5 by 11 inch sheet); Your writing utensils; Blank Paper (provided by me); This copy of the hourly. Do not share these resources with anyone else. Show complete detail and work for full credit.  Follow case study solutions and sample hourly keys in presenting your solutions.

 

Work all four cases. Using only one side of the blank sheets provided, present your work. Do not write on both sides of the sheets provided, and present your work only on these sheets. Do not share information with any other students during this hourly. Neither give nor receive information with any other students during this hourly.

 

When you are finished: Prepare a Cover Sheet: Print your name on an otherwise blank sheet of paper. Then stack your stuff as follows: Cover Sheet (Top), Your Work Sheets, The Test Papers and Your Tool-sheet. Then hand all of this in to me.

 

Before you begin work, sign and acknowledge

 

I agree to follow this protocol. ______ (ÜInitial Here)

 

 

________________________________________________________________________

Name (Print Clearly)                                       Signature                                         Date

 

Case One | Confidence Interval, Mean | Duchenne Muscular Dystrophy

Duchenne muscular dystrophy (DMD) is an inherited disorder characterized by rapidly progressive muscle weakness which starts in the legs and pelvis and later affects the whole body. Suppose that we follow individuals diagnosed with DMD from diagnosis until death, noting age at death in months. Consider a random sample of individuals who were diagnosed with, and died with DMD. Age at death in months follows below:

17 22 37 45 50 65 73 87 93 102  112 123 127 130 137 138 143 150 156 161 169 173 177 179 180 181 181 182 184 185 186 186 188 189 190 190 190 192 193 193 194 194 195 196 196 197 197 199 199 200 200 201 203 204 207 210 213 215 217 219 220 223 227 228 230 231 233 234 235 237 239 240 240 241

Estimate the population mean age at death time for Duchenne muscular dystrophy patients with 97% confidence. That is, compute and discuss a 97% confidence interval for this population mean. Provide concise and complete details and discussion as demonstrated in the case study summaries.

 

Numbers

 

n       m          sd      Lower97    Upper97

74    175.270    55.4837    161.081    189.460

 

From the means/proportions table, row 2.20 0.013903 0.97219, Z ≈ 2.20.

Lower Bound = m – (Z*sd/sqrt(n))  ≈ 175.27 – (2.2*(55.4837/sqrt(74)) ≈ 161.081

Upper Bound = m + (Z*sd/sqrt(n))  ≈ 175.27 + (2.2*(55.4837/sqrt(74)) ≈ 189.460

 

Report the interval as [161.0, 189.4]

 

Interpretation

 

We estimate the population age at death for patients with Duchenne Muscular Dystrophy.

 

Each member of the family of samples is a single random sample of 74 deceased patients with DMD – the family consists of all possible samples of this type.

 

From each member sample, compute the sample mean m and sample standard deviation sd and then an interval as:

 

Lower Bound = m – (2.2*sd/sqrt(n))

Upper Bound = m + (2.2*sd/sqrt(n))

 

Doing this for each member of the family of samples yields a family of intervals, approximately 97% of which capture the true population mean age at death for patients with DMD. If our intervals in this supermajority, then the population mean age at death for patients with Duchenne Muuscular Dystrophy is between 161.1 and 189.4 months.

Case Two | Hypothesis Test, Median | Sea Weaselsä

Sea Weaselsä are instant pets, distributed as a kit - Sea Weasel Eggsä , Sea Weasel Water Conditionerä and Sea Weasel Foodä . A Sea Weaselä Kit is started by placing the Sea Weasel Eggsä , Sea Weasel Water Conditionerä and some water in a container. The eggs will then hatch, producing Sea Weaselsä , and some of them will survive. A random sample of Sea Weaselä Kits is selected, each selected kit is started, and the number of surviving Sea Weaselsä is counted one week after start. Here are the Sea Weaselä counts per kit (kit yields)

27 32 41 47 48 62 65 73 89 91 96 98 104 109 117 124 132 135 140 151 160 165 166 174 176 181 182 183 187 275

 

Test the following: null (H0): The median Sea Weaselä kit yield is 100 (h = 100) against the alternative (H1): h ≠ 100.  Show your work. Completely discuss and interpret your test results, as indicated in class and case study summaries.

 

Numbers

 

27 32 41 47 48 | 62 65 73 89 91 | 96 98 104 109 117 | 124 132 135 140 151 | 160 165 166 174 176 | 181 182 183 187 275

 

n = 30

Sample Count Below 100: 12

Sample Count Above 100: 18

Sample Error for Two-Sided Test = Max(12, 18) = 18

From the row 30 18 0.18080, base p-value = 0.18080

p-value for two-sided test = 2*0.18080 = 0.3616

 

Interpretation

 

Our test concerns the median Sea Weasel kit yield.

 

Each member of the family of samples is a single random sample of 30 kits. The family of samples consists of all possible samples of this type.

 

From each member of the family of samples, compute the sample error as the maximum of the number of kits in the sample whose yields are strictly less than 100 weasels, and the number of kits in the sample whose yields are strictly larger than 100. Computing this error for each member sample yields a family of errors.

 

If the true population median kit yield for the Sea Weasel kits is 100 weasels, then approximately 36.16% of the samples yield errors equal to or more extreme than our sample. The sample does not appear to present significant evidence against the null hypothesis.

 

Case Three | Confidence Interval, Proportion | Sea Weaselsä

 

Using the context and data from case two, consider the proportion of Sea Weaselä kits yielding 100 or more Sea Weaselsä. Compute and interpret a 95% confidence interval for this population proportion. Show your work. Completely discuss and interpret your test results, as indicated in class and case study summaries.

 

27 32 41 47 48 | 62 65 73 89 91 | 96 98 104 109 117 | 124 132 135 140 151 | 160 165 166 174 176 | 181 182 183 187 275

n = 30

Sample Count at or Above 100: 18

Sample Event Proportion: p = 18/30 » 0.60

Sample Standard Error for Proportion: sqrt(p*(1 – p)/n) = sqrt((18/30)*(12/30)/30) » 0.089443 

From the row 2.00 0.022750 0.95450 , Z = 2.00

Lower95 = p – (Z*sdp) = 0.60 – (2*0.089443) » 0.421115

Upper95 = p + (Z*sdp) = 0.60 + (2*0.089443) » 0.778885

 

Interpretation

 

We estimate the population proportion of Sea Weasel kits yielding 100 or more Sea Weasels.

 

Each member of the family of samples is a single random sample of 30 kits. The family of samples consists of all possible samples of this type.

 

From each member sample, compute the number e≥100 of kits in the sample whose yields equal or exceed 100 Sea Weasels, then compute an interval:

 

p≥100 = e≥100/30

sdp≥100 = sqrt(p≥100*(1 – p≥100)/30)

lower95 = p≥100 – (2*sdp≥100)

upper95 = p≥100 + (2*sdp≥100)

 

Computing these intervals for each member of the family of samples yields a family of intervals, approximately 95% of which capture the true population proportion of Sea Weasel kits yielding 100 or more Sea Weasels.

If our interval is in the supermajority, then between 42.1% and 77.8% of Sea Weasel kits yield 100 or more Sea Weasels. 

Case Four | Categorical Goodness of Fit | A Color Bowl

We have a color bowl, with black, blue, green, red and yellow marbles. Three hundred and fifty draws with replacement from the bowl yield the following sample counts:

50 black, 33 blue, 60 green, 74 red and 133 yellow.

Our null hypothesis is that the color bowl has the following color distribution: 25% Black 5% Blue, 10% Green, 20% Red and 40% Yellow. Test this Hypothesis. Show your work. Completely discuss and interpret your test results, as indicated in class and case study summaries. Fully discuss the testing procedure and results. This discussion must include a clear discussion of the population and the null hypothesis, the family of samples, the family of errors and the interpretation of the p-value. Show all work and detail for full credit.

 

Numbers

n = 50 + 33 + 60 + 74 + 133 = 350

EBlack = 350*PBlack = 350*0.25 = 87.5

ErrorBlack = (nBlackEBlack )2/ EBlack = (50 – 87.5 )2/ 87.5 » 16.07

 

EBlue = 350*PBlue = 350*0.05 = 17.5

ErrorBlue = (nBlueEBlue )2/ EBlue = (33 – 17.5 )2/ 17.5 » 13.72

 

EGreen = 350*PGreen = 350*0.10 = 35

ErrorGreen = (nGreenEGreen )2/ EGreen = (60 – 35 )2/ 35 » 17.86

 

ERed = 350*PRed = 350*0.20 = 70

ErrorRed = (nRedERed )2/ ERed = (74 – 70 )2/ 70  » 0.23

 

EYellow = 350*PYellow = 350*0.40 = 140

ErrorYellow = (nYellowEYellow )2/ EYellow = (133 – 140 )2/ 140  » 0.35

 

ErrorTotal = ErrorBlack + ErrorBlue + ErrorGreen + ErrorRed + ErrorYellow =

16.07 + 13.72 + 17.86 + 0.23 + 0.35 »  48.24 over 5 Categories

From the table row 5 13.2767 0.010, p < .01 

Interpretation

Each member of the family of samples is a single random sample of 350 draws. The family of samples consists of all possible samples of this type.

From each member sample, compute the color counts: nBlack , nBlue , nGreen , nRed and nYellow. Under the null hypothesis, we expect EBlack = 350*PBlack = 350*0.25 = 87.5 black, EBlue = 350*PBlue = 350*0.05 = 17.5 blue, EGreen = 350*PGreen = 350*0.10 = 35 green, ERed = 350*PRed = 350*0.20 = 70 red and EYellow = 350*PYellow = 350*0.40 = 140 yellow.

 

Compute a sample error as

 

ErrorTotal =

{(nBlackEBlack )2/ EBlack} + {(nBlueEBlue )2/ EBlue} + {(nGreenEGreen )2/ EGreen} + {(nRedERed )2/ ERed} + {(nYellowEYellow )2/ EYellow}.

 

Computing this error for each member sample yields a family of errors. If the true color distribution for the population is 25% Black 5% Blue, 10% Green, 20% Red and 40% Yellow, then fewer than 1% of the member samples yield errors equal to or larger than ours. The sample appears to present highly significant evidence against the null hypothesis.

 

Table: Means and Proportions

Z(k) PROBRT PROBCENT

0.05 0.48006 0.03988

0.10 0.46017 0.07966

0.15 0.44038 0.11924

0.20 0.42074 0.15852

0.25 0.40129 0.19741

0.30 0.38209 0.23582

0.35 0.36317 0.27366

0.40 0.34458 0.31084

0.45 0.32636 0.34729

0.50 0.30854 0.38292

0.55 0.29116 0.41768

0.60 0.27425 0.45149

0.65 0.25785 0.48431

0.70 0.24196 0.51607

0.75 0.22663 0.54675

0.80 0.21186 0.57629

0.85 0.19766 0.60467

0.90 0.18406 0.63188

0.95 0.17106 0.65789

1.00 0.15866 0.68269

Z(k) PROBRT PROBCENT

1.05 0.14686 0.70628

1.10 0.13567 0.72867

1.15 0.12507 0.74986

1.20 0.11507 0.76986

1.25 0.10565 0.78870

1.30 0.09680 0.80640

1.35 0.088508 0.82298

1.40 0.080757 0.83849

1.45 0.073529 0.85294

1.50 0.066807 0.86639

1.55 0.060571 0.87886

1.60 0.054799 0.89040

1.65 0.049471 0.90106

1.70 0.044565 0.91087

1.75 0.040059 0.91988

1.80 0.035930 0.92814

1.85 0.032157 0.93569

1.90 0.028717 0.94257

1.95 0.025588 0.94882

2.00 0.022750 0.95450

Z(k) PROBRT PROBCENT

2.05 0.020182 0.95964

2.10 0.017864 0.96427

2.15 0.015778 0.96844

2.20 0.013903 0.97219

2.25 0.012224 0.97555

2.30 0.010724 0.97855

2.35 0.009387 0.98123

2.40 0.008198 0.98360

2.45 0.007143 0.98571

2.50 0.006210 0.98758

2.55 0.005386 0.98923

2.60 0.004661 0.99068

2.65 0.004025 0.99195

2.70 .0034670 0.99307

2.75 .0029798 0.99404

2.80 .0025551 0.99489

2.85 .0021860 0.99563

2.90 .0018658 0.99627

2.95 .0015889 0.99682

3.00 .0013499 0.99730

 

Table: Medians

n error base p-value

25 1 1.00000

25 2 1.00000

25 3 0.99999

25 4 0.99992

25 5 0.99954

25 6 0.99796

25 7 0.99268

25 8 0.97836

25 9 0.94612

25 10 0.88524

25 11 0.78782

25 12 0.65498

25 13 0.50000

25 14 0.34502

25 15 0.21218

25 16 0.11476

25 17 0.05388

25 18 0.02164

n error base p-value

25 19 0.00732

25 20 0.00204

25 21 0.00046

25 22 0.00008

25 23 0.00001

25 24 0.00000

25 25 0.00000

30 1 1.00000

30 2 1.00000

30 3 1.00000

30 4 1.00000

30 5 0.99997

30 6 0.99984

30 7 0.99928

30 8 0.99739

30 9 0.99194

30 10 0.97861

30 11 0.95063

n error base p-value

30 12 0.89976

30 13 0.81920

30 14 0.70767

30 15 0.57223

30 16 0.42777

30 17 0.29233

30 18 0.18080

30 19 0.10024

30 20 0.04937

30 21 0.02139

30 22 0.00806

30 23 0.00261

30 24 0.00072

30 25 0.00016

30 26 0.00003

30 27 <0.00001

30 28 <0.00001

30 29 <0.00001

30 30 <0.00001

 

Table: Categories/Goodness of Fit

Categories ERROR p-value

4 0.0000 1.000

4 0.5844 0.900

4 1.0052 0.800

4 1.4237 0.700

4 1.8692 0.600

4 2.3660 0.500

4 2.6430 0.450

4 2.9462 0.400

4 3.2831 0.350

4 3.6649 0.300

4 4.1083 0.250

4 4.6416 0.200

4 4.9566 0.175

4 5.3170 0.150

4 5.7394 0.125

4 6.2514 0.100

4 6.4915 0.090

4 6.7587 0.080

4 7.0603 0.070

4 7.4069 0.060

4 7.8147 0.050

4 8.3112 0.040

4 8.9473 0.030

4 9.8374 0.020

4 11.3449 0.010

Categories ERROR p-value

5 0.0000 1.000

5 1.0636 0.900

5 1.6488 0.800

5 2.1947 0.700

5 2.7528 0.600

5 3.3567 0.500

5 3.6871 0.450

5 4.0446 0.400

5 4.4377 0.350

5 4.8784 0.300

5 5.3853 0.250

5 5.9886 0.200

5 6.3423 0.175

5 6.7449 0.150

5 7.2140 0.125

5 7.7794 0.100

5 8.0434 0.090

5 8.3365 0.080

5 8.6664 0.070

5 9.0444 0.060

5 9.4877 0.050

5 10.0255 0.040

5 10.7119 0.030

5 11.6678 0.020

5 13.2767 0.010

Categories ERROR p-value

6 0.0000 1.000

6 1.6103 0.900

6 2.3425 0.800

6 2.9999 0.700

6 3.6555 0.600

6 4.3515 0.500

6 4.7278 0.450

6 5.1319 0.400

6 5.5731 0.350

6 6.0644 0.300

6 6.6257 0.250

6 7.2893 0.200

6 7.6763 0.175

6 8.1152 0.150

6 8.6248 0.125

6 9.2364 0.100

6 9.5211 0.090

6 9.8366 0.080

6 10.1910 0.070

6 10.5962 0.060

6 11.0705 0.050

6 11.6443 0.040

6 12.3746 0.030

6 13.3882 0.020

6 15.0863 0.010