"

9.5 Review Questions

  1. Determine whether we should use a two-sample t test or a paired t test in the following applications.
    1. The data in the following table give the number of young per litter for 24 female cottonmouths in Florida and 44 female cottonmouths in Virginia.
      Florida Virginia
      8 6 7 5 12 7 7 6 8
      7 4 3 12 9 7 4 9 6
      5 5 4 5 4

      At the 1% significance level, do the data provide sufficient evidence to conclude that, on average, the number of young per litter of cottonmouths in Florida is less than that in Virginia?

    2. Independent random samples of 10 homes each in Atlantic City and Las Vegas yielded the following data on home prices in thousands of dollars. At the 5% significance level, can you conclude that the mean costs for existing single-family homes differ in Atlantic City and Las Vegas?
      Atlantic City Las Vegas
      234.0 192.8 226.4 231.5
      213.0 256.4 214.7 210.9
      236.1 301.9 349.4 178.5
    3. The following table gives data, for a sample of eight years, on the number of days that ice stayed on two lakes in Madison, Wisconsin-Lake Mendota and Lake Monona. At the 10% significance level, do the data provide sufficient evidence to conclude that a difference exists in the mean length of time that ice stays on these two lakes?
      Year Mendota Monona
      1 119 107
      2 115 108
      8 87 91
    4. The fiber density of 10 samples with varying fiber density was obtained using both an eye-piece method and a TV-screen method. The results, in fibers per square millimeter, are presented in the following table. Test at the 5% significance level whether, on average, the eyepiece method gives a greater fiber-density reading than the TV-screen method.
      Sample ID Eyepiece TV Screen
      1 182.2 177.8
      2 118.5 116.6
      10 85.4 86.6
    5. Compare the average IQ score of students from U of A and MacEwan. Randomly pick 30 students from each University and obtain their IQ scores.
    6. Compare weekly earnings of male and female workers. Randomly pick 40 male and 40 female workers and compare their average weekly earnings.
    7. Compare weekly earnings of male and female workers. Randomly pick 40 households and compare the husbands’ and wives’ average weekly earnings.
  2. The following table summarizes the operative times of neurosurgeries conducted by a dynamic system (Z-plate) and a static system (ALPS plate), respectively.
    Dynamic Static
    x¯1=400 x¯2=480
    s1=85 s2=40
    n1=60 n2=30
    1. Test at a 5% significance level whether the dynamic system (Z-plate) reduced the mean operative time relative to the static system (ALPS plate).
    2. Obtain a confidence interval for the difference in mean operative time between the dynamic and the static systems, μ1μ2, corresponding to the test in part (a).
  3. This table shows men and women’s winning times (in minutes) in the New York City Marathon between 1978 and 2006.
    Men(yi) Women(xi) Difference(di)
    132.2 152.5 20.3
    131.7 147.6 15.9
    129.7 145.7 16.0
    128.2 145.5 17.3
    129.5 147.2 17.7
    129.0 147.0 18.0
    134.9 149.5 14.6
    131.6 148.6 17.0
    131.1 148.1 17.0
    131.0 150.3 19.3
    128.3 148.1 19.8
    128.0 145.5 17.5
    132.7 150.8 18.1
    129.5 147.5 18.0
    129.5 144.7 15.2
    Men(yi) Women(xi) Difference(di)
    130.1 146.4 16.3
    131.4 147.6 16.2
    131.0 148.1 17.1
    129.9 148.3 18.4
    128.2 148.7 20.5
    128.8 145.3 16.5
    129.2 145.1 15.9
    130.2 145.8 15.6
    127.7 144.4 16.7
    128.1 145.9 17.8
    130.5 142.5 12.0
    129.5 143.2 13.7
    129.5 144.7 15.2
    130.0 145.1 15.1
    d¯=16.85,sd=1.98
    1. At the 1% significance level, do the data provide sufficient evidence that there is a difference in winning times between males and females?
    2. Obtain a confidence interval corresponding to the test in part (a).
    3. Does the interval in part (b) support the conclusion in part (a)?
    4. Based on the interval obtained in part (b), can we claim that the winning time of males is at least 15 minutes fast than that of females? How about 20 minutes fast?
Show/Hide Answer
    1. The data are two independent samples, use two-sample t test
    2. The data are two independent samples, use two-sample t test
    3. The data is a paired sample, use paired t test.
    4. The data is a paired sample, use paired t test.
    5. Two independent samples, use two-sample t test.
    6. Two independent samples, use two-sample t test.
    7. Paired sample, use paired t test.
    1. Check the assumptions:
      • We have simple random samples.
      • The two samples are independent.
      • We have large samples n1=60>30 and n2=30=30.

      Steps:

      1. Hypotheses. H0:μ1μ20 versus Ha:μ1μ2<0.
      2. Significance level α=0.05.
      3. Test statistic. to=(x¯1x¯2)Δ0s12n1+s22n2=(400480)085260+40230=6.069,df=min{n11,n21}=min{601,301}=29.
      4. P-value: for a left-tailed test, P-value = P(tto)=P(t6.069)=P(t6.069)<0.0005<0.05(α).
      5. Decision: Reject H0, since P-value<α.
      6. Conclusion: At the 5% significance level, we have sufficient evidence that the dynamic system (Z-plate) reduced the mean operative time relative to the static system (ALPS plate).
    2. We should construct a lower-tailed 95% confidence interval for a left-tailed test at the 5% significance level. The upper confidence bound for a 95% lower-tailed interval for μ1μ2 is
      (x¯1x¯2)+tα×s12n1+s22n2=(400480)+1.699×85260+40230=57.605.
      The 95% lower-tailed confidence interval is (,57.605).
      Interpretation: we can be 95% confident that the mean operative time of the dynamic system is at least 57.605 minutes shorter than the mean operative time of the static system.
    1. Check the assumptions:
      • We have simple random samples.
      • We have large number of pairs n=2930.

      Steps:

      1. Hypotheses. H0:μMμF=0 versus Ha:μMμF0.
      2. Significance level α=0.01.
      3. Test statistic. to=d¯δ0sdn=16.8501.9829=45.828,with df=n1=291=28.
      4. P-value: for a two-tailed test, P-value = 2P(t|to|)=2P(t45.828)<2×0.0005=0.001<0.01(α).
      5. Decision: Reject H0, since P-value<α.
      6. Conclusion: At the 1% significance level, we have sufficient evidence that there is a difference in winning times between males and females.
    2. A two-tailed test at the 1% significance level corresponds to a 99% two-tailed confidence interval. For df=28, 1α=0.99α=0.01α2=0.012=0.005tα/2=t0.005=2.763. A 99% two-tailed confidence interval for μd is d¯±tα/2×sdn=16.85±2.763×1.9829=(15.834,17.866).
      Interpretation: we can be 99% confident that the difference in the mean winning times between male and female is somewhere between 15.834 minutes and 17.866 minutes.
    3. Yes, because the hypothesized value δ0=0 is outside the entire interval; therefore, we can claim that μMμF0 which is the conclusion of the paired t test. Since the entire interval is above 0, we can further claim that μMμF>0.
    4. We can claim that the mean winning time of males is at least 15 minutes fast than that of females, since the entire interval is above 15, we can claim μMμF>15. However, we cannot claim that the mean winning time of males is at least 20 minutes faster than that of females since the 20 is inside the interval.
      Note: for a right-tailed test at the 1% significance level, it is more precise to construct a 99% upper-tailed interval with a lower bound: d¯±tα×sdn=16.852.467×1.9829=15.943. The 99% upper-tailed interval is (,15.943).
      Interpretation: we can be 99% confident that the winning time of males is at least 15.943 minutes faster than that of females. Since δ0=15 is outside the confidence interval, we can claim that μMμF>15; however, δ0=20 is inside the confidence interval, we cannot claim that μMμF>20. There is insufficient evidence that the mean winning time of males is at least 20 minutes faster than that of females.