11.7 Review Questions

  1. An American roulette wheel contains 18 red numbers, 18 black numbers, and 2 green numbers. The following table shows the frequency with which the ball landed on each color in 300 trials.
    Color Red Black Green
    Frequency 140 120 40

    At the 5% significance level, do the data suggest that the wheel is out of balance?

  2. A gambler thinks a die may be loaded and that the six numbers are not equally likely. To test his suspicion, he rolled the die 150 times and obtained the data in the following table.
    Number 1 2 3 4 5 6
    Frequency 23 26 23 21 31 26

    Do the data provide sufficient evidence to conclude that the die is loaded? Perform the hypothesis test at the 5% significance level.

  3. The following table reported the survey results on how members would prefer to receive ballots in annual elections. At the 5% significance level, do the data provide sufficient evidence to conclude that gender (column) and preference (row) are associated?
    Male Female Total
    Mail 60 30 90
    Email 150 90 240
    Both 70 40 110
    N/A 80 50 130
    Total 360 210 570
Show/Hide Answer
  1. If the wheel is balanced, the chance of landing on a red number is [latex]p_1=\frac{18}{18+18+2}[/latex], on a black number is [latex]p_2=\frac{18}{18+18+2}[/latex], and on a green number is [latex]p_3=\frac{2}{18+18+2}[/latex]. If the wheel is out of balance, at least one proportion differs from the specified value. We can use the chi-square goodness of fit test to test this. The assumptions of a goodness-of-fit test are:
    1. All expected frequencies are at least 1.
    2. At most 20% of the expected frequencies are less than 5.
    3. Simple random sample (if you need to generalize the conclusion to a larger population)

    The expected frequencies are [latex]E_1=np_1=300\times \frac{18}{38}=142.1053[/latex], [latex]E_2=np_2=300\times \frac{18}{38}=142.1053[/latex], [latex]E_3=np_3=300\times \frac{2}{38}=15.78947[/latex]. All the expected frequencies are greater than 5. We assume the trials were conducted randomly.

    Steps:

    1. Hypotheses. [latex]H_0: p_1=\frac{18}{38}, p_2=\frac{18}{38}, p_3=\frac{2}{38}[/latex] versus [latex]H_a:[/latex] at least one proportion is different from the specified value.
    2. Significance level [latex]\alpha=0.05[/latex]
    3. Test statistic.
      Color Observed Expected Contribution
      red 140 142.10526 0.0311891
      black 120 142.10526 3.4385965
      green 40 15.78947 37.1228070

      [latex]\begin{aligned} \chi_o^2 & =\sum_{i=1}^3 \frac{(O_i-E_i)^2}{E_i}=\frac{(140- 142.10526)^2}{142.10526}+\frac{(120-142.10526)^2}{142.10526}+\frac{(40- 15.78947)^2}{15.78947} \\ & =0.03118908+3.43859649+37.12280702 \\ & =40.59259. \end{aligned}[/latex]
      with [latex]df=k-1=3-1=2[/latex].

    4. P-value. The chi-square test is always right-tailed. P-value is the area to the right of the observed value [latex]\chi^2_o[/latex] under the chi-square density curve with degrees of freedom [latex]df=k-1=3-1=2[/latex]. [latex]\text{P-value}=P(\chi^2 \ge \chi_o^2)=P(\chi^2 \ge 40.593) \lt P(\chi^2 \ge 10.597)=0.005.[/latex]
      That is, P-value<0.005.
      If the critical value approach is used, we need to find the rejection region. The chi-square test is always right-tailed. Make sure that the area of the rejection region is [latex]\alpha[/latex], so the critical value is [latex]\chi^2_\alpha=\chi^2_{0.05}=5.991.[/latex]
    5. Decision. We reject [latex]H_0[/latex] since P-value<0.005<0.05 ([latex]\alpha[/latex]). If the critical approach is used, we reject [latex]H_0[/latex] since the observed test statistic [latex]\chi_o^2=40.593>5.991[/latex] falls in the rejection region.
    6. Conclusion. At the 5% significance level, we have sufficient evidence that the wheel is out of balance.
  2. If the die is balanced, the proportion of landing on each of 1, 2, [latex]\cdots[/latex], 6 should be the same, i.e., [latex]p_i=\frac{1}{6}, i=1, 2, \cdots, 6[/latex]. All expected frequencies are [latex]E_i=np_i=150\times \frac{1}{6}=25>5[/latex], the assumptions for a chi-square goodness-of-fit test are satisfied.
    Steps:

    1. Hypotheses. [latex]H_0: p_i=\frac{1}{6}, i=1, 2, \cdots 6[/latex] versus [latex]H_a[/latex]: at least one proportion is equal to [latex]\frac{1}{6}[/latex].
    2. Significance level [latex]\alpha=0.01[/latex]
    3. Test statistic. The following table is useful.
      [latex]\chi^2_o=\sum_{\mbox{all cells}} \frac{(O-E)^2}{E}=\frac{(23-25)^2}{25}+\frac{(26-25)^2}{25}+\frac{(23-25)^2}{25}+\frac{(21-25)^2}{25}+\frac{(31-25)^2}{25}+\frac{(26-25)^2}{25}=2.48[/latex]
      with the degrees of freedom of the test [latex]df=k-1=6-1=5.[/latex]
    4. P-value. The chi-square test is always right-tailed. P-value is the area to the right of the observed value [latex]\chi^2_o[/latex] under the chi-square density curve with degrees of freedom [latex]df=k-1=6-1=5[/latex]. P-value=[latex]P(\chi^2\ge \chi_o^2)=P(\chi^2\ge 2.48)>P(\chi^2\ge 9.236)=0.1[/latex], i.e., P-value>0.1.
      Or
      Rejection region. The chi-square test is always right-tailed. Make sure that the area of the rejection region is [latex]\alpha[/latex], so the critical value is [latex]\chi^2_{\alpha}=\chi^2_{0.01}=15.086[/latex].
    5. Decision: We cannot reject [latex]H_0[/latex] since P-value>0.1>0.01 ([latex]\alpha)[/latex].
      Or
      We cannot reject [latex]H_0[/latex] since [latex]\chi_o^2=2.48<15.086[/latex] falls in the non-rejection region.
    6. Conclusion: At the 5% significance level, we do not have sufficient evidence that the die is loaded.
  3. We should use the chi-square independence test. The expected frequencies are given in the following table. All the expected frequencies are greater than 5, the assumptions for a chi-square independence test are satisfied.
    Preference Male Female
    Mail 56.84211 33.15789
    Email 151.57895 88.42105
    Both 69.47368 40.52632
    N/A 82.10526 47.89474

    Steps:

    1. Hypotheses. [latex]H_0:[/latex] gender and preference are independent. versus [latex]H_a[/latex]: gender and preference are associated.
    2. Significance level [latex]\alpha=0.05[/latex].
    3. Test statistic.
      [latex]\begin{aligned} \chi^2_o &=\sum_\text{all cells} \frac{(O-E)^2}{E}\\&=\frac{(60-56.842)^2}{56.842}+\frac{(30-33.158)^2}{33.158}+\frac{(150-151.579)^2}{151.579} +\frac{(90-88.421)^2}{88.421}\\&+\frac{(70-69.474)^2}{69.474}+\frac{(40-40.526)^2}{40.526}+\frac{(80-82.105)^2}{82.105}+\frac{(50-47.895)^2}{47.895} \\ & =0.678 \end{aligned}[/latex]
      with the degrees of freedom of the test [latex]df=(r-1)\times (c-1)=(4-1)\times (2-1)=3.[/latex]
    4. P-value. The chi-square test is always right-tailed. P-value is the area to the right of the observed value [latex]\chi^2_o[/latex] under the chi-square density curve with degrees of freedom [latex]df=3[/latex]. P-value=[latex]P(\chi^2\ge \chi_o^2)=P(\chi^2\ge 0.678)>P(\chi^2\ge 6.251)=0.1[/latex], i.e., P-value >0.10.
      Or
      Rejection region. The chi-square test is always right-tailed. Make sure that the area of the rejection region is [latex]\alpha[/latex], so the critical value is [latex]\chi^2_{\alpha}=\chi^2_{0.05}=7.815[/latex].
    5. Decision: Don’t reject [latex]H_0[/latex], since [latex]P\text{-value} > 0.1 > 0.05(\alpha).[/latex]
      Or
      We cannot reject [latex]H_0[/latex] since [latex]\chi_o^2=0.678<7.815[/latex] falls in the non-rejection region.
    6. Conclusion: At the 5% significance level, we do not have sufficient evidence that gender and preference are associated.

 

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Introduction to Applied Statistics Copyright © 2024 by Wanhua Su is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted.