{"id":1083,"date":"2021-06-11T20:30:02","date_gmt":"2021-06-12T00:30:02","guid":{"rendered":"https:\/\/openbooks.macewan.ca\/rcommander\/?post_type=chapter&#038;p=1083"},"modified":"2025-06-24T18:34:38","modified_gmt":"2025-06-24T22:34:38","slug":"10-2-distribution-of-the-sample-proportion","status":"publish","type":"chapter","link":"https:\/\/openbooks.macewan.ca\/introstats\/chapter\/10-2-distribution-of-the-sample-proportion\/","title":{"raw":"10.2 Distribution of the Sample Proportion","rendered":"10.2 Distribution of the Sample Proportion"},"content":{"raw":"Inferences about the population mean [latex]\\mu[\/latex] are based on the distribution of the sample mean [latex]\\bar{X}[\/latex]. Similarly, inferences about the population proportion [latex]p[\/latex] are based on the distribution of the sample proportion [latex]\\hat{p}[\/latex].\r\n\r\nThe<strong> population proportion<\/strong> is defined as\r\n\r\n[latex] p = \\frac{\\text{\\# of individuals having a certain attribute}}{\\text{\\# of individuals in the population}} = \\frac{\\text{\\# of successes}}{N}. [\/latex]\r\n\r\nThe population proportion can be regarded as a special type of population mean if we let the variable of interest be an indicator variable as follows:\r\n<p align=\"center\">[latex] x_i = \\begin{cases}\r\n1 &amp; \\text{if the ith individual has the attribute (a success)}, \\\\\r\n0 &amp; \\text{if the ith individual does not have the attribute (a failure)}.\r\n\\end{cases}\r\n[\/latex]<\/p>\r\nThen, the population proportion can be rewritten as\r\n\r\n[latex]p=\\frac{\\text{\\# of individuals having a certain attribute}}{\\text{\\# of individuals in the population}} = \\frac{\\text{\\# of successes}}{N} = \\frac{\\sum x_i}{N}.[\/latex]\r\n\r\nThe variable of interest [latex]X[\/latex] has only two possible values: 1 if the individual has the attribute and 0 if not. Randomly select one individual and define [latex]p[\/latex] as the probability that this individual has the attribute. As a result, the probability distribution of [latex]X[\/latex] is\r\n<p style=\"text-align: center;\"><strong>Table 10.1<\/strong>: Probability Distribution of an Indicator Variable<\/p>\r\n\r\n<div align=\"center\">\r\n<table class=\"aligncenter first-col-border\" style=\"height: 30px;\" border=\"1\" cellspacing=\"0\" cellpadding=\"0\">\r\n<thead>\r\n<tr class=\"shaded\" style=\"height: 15px;\">\r\n<th style=\"height: 15px; width: 214.014px;\" scope=\"row\" valign=\"top\">\r\n<div align=\"center\"><em>[latex]x[\/latex]<\/em><\/div><\/th>\r\n<td style=\"height: 15px; width: 135.181px;\" valign=\"top\">\r\n<div align=\"center\">1<\/div><\/td>\r\n<td style=\"height: 15px; width: 191.306px;\" valign=\"top\">\r\n<div align=\"center\">0<\/div><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr style=\"height: 15px;\">\r\n<th style=\"height: 15px; width: 214.014px;\" scope=\"row\" valign=\"top\">\r\n<div align=\"center\"><em>[latex]P(X=x)[\/latex]<\/em><\/div><\/th>\r\n<td style=\"height: 15px; width: 135.181px;\" valign=\"top\">\r\n<div align=\"center\">[latex]p[\/latex]<\/div><\/td>\r\n<td style=\"height: 15px; width: 191.306px;\" valign=\"top\">\r\n<div align=\"center\">[latex]1 \u2013 p[\/latex]<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\nwith a population mean and population standard deviation:\r\n\r\n[latex]\\mu = \\sum x P(X=x) = 1 \\times p + 0 \\times (1-p) = p,[\/latex]\r\n\r\n[latex]\\sigma = \\sqrt{\\sum x^2 P(X=x) - \\mu^2} = \\sqrt{1^2 \\times p + 0^2 \\times (1-p) - \\mu^2} = \\sqrt{p - p^2} = \\sqrt{p(1-p)}. [\/latex]\r\n\r\nThe sample proportion can be viewed as a special type of sample mean (in the same way that the population proportion can be viewed as a special type of population mean). That is, in a simple random sample of size <em>n<\/em>, the proportion of individuals with the specific attribute is the sample proportion:\r\n<p align=\"center\">[latex]\\begin{align*} \\hat{p} &amp;= \\frac{\\text{\\# of individuals having a certain attribute in the sample}}{\\text{sample size}}\\\\&amp; = \\frac{\\text{\\# of successes in the sample}}{n} = \\frac{\\sum x_i}{n} = \\bar{x} \\end{align*}[\/latex]<\/p>\r\nwith [latex]x_i = 1[\/latex] if the individual has the attribute and [latex]x_i=0[\/latex] if not.\r\n\r\nRecall from Chapter 6, the sampling distribution of the sample mean [latex]\\bar{X}[\/latex]:\r\n<ul>\r\n \t<li>Centre: the mean of the sample mean [latex]\\bar{X}[\/latex] equals the population mean [latex]\\mu[\/latex]. That is,\r\n<p align=\"center\">[latex]\\mu_{\\scriptsize \\bar{X}} = \\mu.[\/latex]<\/p>\r\n<\/li>\r\n \t<li>Spread: the standard deviation of the sample mean equals the population standard deviation divided by the square root of the sample size. That is,\r\n<p align=\"center\">[latex]\\sigma_{\\scriptsize \\bar{X}} = \\frac{\\sigma}{\\sqrt{n}}.[\/latex]<\/p>\r\n<\/li>\r\n<\/ul>\r\nThese two arguments are true for any population distribution and sample size <em>n<\/em>.\r\n<ul>\r\n \t<li>Shape:\r\n<ul>\r\n \t<li>When the population distribution is normal, [latex]\\bar{X}[\/latex] is also normal regardless of <em>n<\/em>.<\/li>\r\n \t<li>When the population distribution is non-normal but the sample size <em>n<\/em> is large, [latex]\\bar{X}[\/latex] is approximately normally distributed. This is guaranteed by the central limit theorem (CLT).<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\nThe same conclusions can be applied to the sampling distribution of the sample proportion [latex]\\hat{p}[\/latex], where the variable of interest is\r\n<p align=\"center\">[latex] X = \\begin{cases}\r\n1 &amp; \\text{with probability } p \\\\\r\n0 &amp; \\text{with probability } 1-p\r\n\\end{cases}\r\n[\/latex]<\/p>\r\nwith the population mean [latex]\\mu = p[\/latex] and standard deviation [latex]\\sigma = \\sqrt{p(1-p)}[\/latex]. Therefore, the sampling distribution of the sample proportion [latex]\\hat{p}[\/latex] is summarized as follows.\r\n<div class=\"textbox textbox--key-takeaways\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Key Facts: Sampling Distribution of the Sample Proportion<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<ul>\r\n \t<li><strong>Centre<\/strong>: the mean of the sample proportion [latex]\\hat{p}[\/latex] equals the population mean [latex]\\mu[\/latex]. That is,\r\n<p align=\"center\">[latex]\\mu_{\\scriptsize \\hat{p}} = \\mu = p[\/latex].<\/p>\r\n<\/li>\r\n \t<li><strong>Spread<\/strong>: the standard deviation of the sample proportion [latex]\\hat{p}[\/latex] equals the population standard deviation [latex]\\sigma[\/latex] divided by the square root of the sample size. That is,\r\n<p align=\"center\">[latex]\\sigma_{\\scriptsize \\hat{p}} = \\frac{\\sigma}{\\sqrt{n}} = \\frac{\\sqrt{p(1-p)}}{\\sqrt{n}} = \\sqrt{ \\frac{p(1-p)}{n}}[\/latex].<\/p>\r\n<\/li>\r\n<\/ul>\r\nThese two arguments are true for any population proportion [latex]p[\/latex] and any sample size <em>n<\/em>.\r\n<ul>\r\n \t<li><strong>Shape<\/strong>: The population distribution is non-normal. By the central limit theorem (CLT), however, [latex]\\hat{p}[\/latex] is approximately normal if <em>n<\/em> is large enough. The rule of thumb is to guarantee both [latex]np \\geq 5[\/latex] and [latex]n(1-p) \\geq 5[\/latex], i.e., [latex] n \\geq \\max \\left\\{ \\frac{5}{p}, \\frac{5}{1-p} \\right\\}[\/latex]. Some textbooks require both [latex]np \\geq 10[\/latex] and [latex]n(1-p) \\geq 10[\/latex].<\/li>\r\n<\/ul>\r\n<\/div>\r\n<\/div>\r\n<strong>Central limit theorem for the sample proportion:<\/strong>\r\n\r\nIf the sample size <em>n<\/em> is large enough ([latex]np \\geq 5[\/latex] and [latex]n(1-p) \\geq 5[\/latex]), the sampling distribution of the sample proportion [latex]\\hat{p}[\/latex] is approximately normally distributed.\r\n\r\nFor example, suppose the population proportion is [latex]p=0.05[\/latex]. Then the sampling distribution of the sample proportion [latex]\\hat{p}[\/latex] is approximately normally distributed if the sample size is at least\r\n<p align=\"center\">[latex] n = \\max \\left\\{ \\frac{5}{p}, \\frac{5}{1-p} \\right\\} = \\max \\left\\{ \\frac{5}{0.05}, \\frac{5}{1-0.05} \\right\\} = \\max \\{ 100, 5.26 \\} = 100.[\/latex]<\/p>\r\nThe following figures show the sampling distribution of the sample proportion with [latex]p=0.05[\/latex] and sample sizes <em>n<\/em> = 50, 100, 200, and 1000.<a id=\"retfig10.1\"><\/a>\r\n<div align=\"center\">\r\n<table class=\"no-border\" style=\"width: 90%; height: 191px;\" border=\"1\" cellspacing=\"0\" cellpadding=\"0\">\r\n<tbody>\r\n<tr style=\"height: 176px;\">\r\n<td style=\"width: 25%; height: 176px;\" valign=\"top\"><a href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n50.png\" rel=\"noopener\"><img class=\"alignnone wp-image-1088 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n50-292x300.png\" alt=\"A histogram of sample proportion for sample size n = 50. Image description available.\" width=\"292\" height=\"300\" \/><\/a><\/td>\r\n<td style=\"width: 25%; height: 176px;\" valign=\"top\"><a class=\"alignnone size-medium wp-image-1089\" href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n100.png\" rel=\"noopener\"><img class=\"alignnone wp-image-1089 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n100-294x300.png\" alt=\"A histogram of sample proportion for sample size n = 100. Image description available.\" width=\"294\" height=\"300\" \/><\/a><\/td>\r\n<td style=\"width: 25%; height: 176px;\" valign=\"top\"><a class=\"alignnone size-medium wp-image-1090\" href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n200.png\" rel=\"noopener\"><img class=\"alignnone wp-image-1090 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n200-290x300.png\" alt=\"A histogram for sample proportion for sample size n = 200. Image description available.\" width=\"290\" height=\"300\" \/><\/a><\/td>\r\n<td style=\"width: 25%; height: 176px;\" valign=\"top\" width=\"178\"><a class=\"alignnone size-medium wp-image-1091\" href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n1000.png\" rel=\"noopener\"><img class=\"alignnone wp-image-1091 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n1000-289x300.png\" alt=\"A histogram of sample proportion for sample size n = 1000. Image description available.\" width=\"289\" height=\"300\" \/><\/a><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px;\">\r\n<td style=\"height: 15px;\" colspan=\"4\" valign=\"top\">\r\n<div style=\"text-align: center;\" align=\"center\"><strong>Figure 10.1<\/strong>: Histograms of Sample Proportions with Different Sample Size. [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig10.1\">Image Description (See Appendix D Figure 10.1)<\/a>] Click on the image to enlarge it.<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nThere are several findings:\r\n\r\n<\/div>\r\n<ul>\r\n \t<li>The sampling distribution of the sample proportion becomes increasingly normal as the sample size <em>n<\/em> increases. When <em>n<\/em> = 50, the sampling distribution of sample proportion is skewed. When <em>n<\/em> = 100, the distribution is still slightly right skewed. For <em>n<\/em> = 200 and <em>n<\/em> = 1000, the sampling distribution appears bell-shaped and symmetric (indicative of a normal distribution).<\/li>\r\n \t<li>The mean of the sample proportion (blue dashed line) is always identical to the population proportion <em>p<\/em> = 0.05 (red solid line) regardless of the sample size <em>n<\/em>.<\/li>\r\n \t<li>The standard deviation of the sample proportion decreases as\u00a0<em>n<\/em> increases.<\/li>\r\n<\/ul>\r\nTo summarize, for [latex]np \\geq 5[\/latex] and [latex]n(1-p) \\geq 5[\/latex], [latex]\\hat{p} \\sim N \\left( p , \\sqrt{\\frac{p(1-p)}{n}} \\right)[\/latex]. The standardized version of [latex]\\hat{p}[\/latex] is [latex]Z = \\frac{\\hat{p} - p}{\\sqrt{\\frac{p(1-p)}{n}}} \\sim N(0,1)[\/latex]. As a result, inferences about the population proportions are based on the standard normal distribution.","rendered":"<p>Inferences about the population mean [latex]\\mu[\/latex] are based on the distribution of the sample mean [latex]\\bar{X}[\/latex]. Similarly, inferences about the population proportion [latex]p[\/latex] are based on the distribution of the sample proportion [latex]\\hat{p}[\/latex].<\/p>\n<p>The<strong> population proportion<\/strong> is defined as<\/p>\n<p>[latex]p = \\frac{\\text{\\# of individuals having a certain attribute}}{\\text{\\# of individuals in the population}} = \\frac{\\text{\\# of successes}}{N}.[\/latex]<\/p>\n<p>The population proportion can be regarded as a special type of population mean if we let the variable of interest be an indicator variable as follows:<\/p>\n<p style=\"text-align: center;\">[latex]x_i = \\begin{cases}  1 & \\text{if the ith individual has the attribute (a success)}, \\\\  0 & \\text{if the ith individual does not have the attribute (a failure)}.  \\end{cases}[\/latex]<\/p>\n<p>Then, the population proportion can be rewritten as<\/p>\n<p>[latex]p=\\frac{\\text{\\# of individuals having a certain attribute}}{\\text{\\# of individuals in the population}} = \\frac{\\text{\\# of successes}}{N} = \\frac{\\sum x_i}{N}.[\/latex]<\/p>\n<p>The variable of interest [latex]X[\/latex] has only two possible values: 1 if the individual has the attribute and 0 if not. Randomly select one individual and define [latex]p[\/latex] as the probability that this individual has the attribute. As a result, the probability distribution of [latex]X[\/latex] is<\/p>\n<p style=\"text-align: center;\"><strong>Table 10.1<\/strong>: Probability Distribution of an Indicator Variable<\/p>\n<div style=\"margin: auto;\">\n<table class=\"aligncenter first-col-border\" style=\"height: 30px; border-spacing: 0px;\" cellpadding=\"0\">\n<thead>\n<tr class=\"shaded\" style=\"height: 15px;\">\n<th style=\"height: 15px; width: 214.014px;\" scope=\"row\" valign=\"top\">\n<div style=\"margin: auto;\"><em>[latex]x[\/latex]<\/em><\/div>\n<\/th>\n<td style=\"height: 15px; width: 135.181px;\" valign=\"top\">\n<div style=\"margin: auto;\">1<\/div>\n<\/td>\n<td style=\"height: 15px; width: 191.306px;\" valign=\"top\">\n<div style=\"margin: auto;\">0<\/div>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr style=\"height: 15px;\">\n<th style=\"height: 15px; width: 214.014px;\" scope=\"row\" valign=\"top\">\n<div style=\"margin: auto;\"><em>[latex]P(X=x)[\/latex]<\/em><\/div>\n<\/th>\n<td style=\"height: 15px; width: 135.181px;\" valign=\"top\">\n<div style=\"margin: auto;\">[latex]p[\/latex]<\/div>\n<\/td>\n<td style=\"height: 15px; width: 191.306px;\" valign=\"top\">\n<div style=\"margin: auto;\">[latex]1 \u2013 p[\/latex]<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>with a population mean and population standard deviation:<\/p>\n<p>[latex]\\mu = \\sum x P(X=x) = 1 \\times p + 0 \\times (1-p) = p,[\/latex]<\/p>\n<p>[latex]\\sigma = \\sqrt{\\sum x^2 P(X=x) - \\mu^2} = \\sqrt{1^2 \\times p + 0^2 \\times (1-p) - \\mu^2} = \\sqrt{p - p^2} = \\sqrt{p(1-p)}.[\/latex]<\/p>\n<p>The sample proportion can be viewed as a special type of sample mean (in the same way that the population proportion can be viewed as a special type of population mean). That is, in a simple random sample of size <em>n<\/em>, the proportion of individuals with the specific attribute is the sample proportion:<\/p>\n<p style=\"text-align: center;\">[latex]\\begin{align*} \\hat{p} &= \\frac{\\text{\\# of individuals having a certain attribute in the sample}}{\\text{sample size}}\\\\& = \\frac{\\text{\\# of successes in the sample}}{n} = \\frac{\\sum x_i}{n} = \\bar{x} \\end{align*}[\/latex]<\/p>\n<p>with [latex]x_i = 1[\/latex] if the individual has the attribute and [latex]x_i=0[\/latex] if not.<\/p>\n<p>Recall from Chapter 6, the sampling distribution of the sample mean [latex]\\bar{X}[\/latex]:<\/p>\n<ul>\n<li>Centre: the mean of the sample mean [latex]\\bar{X}[\/latex] equals the population mean [latex]\\mu[\/latex]. That is,\n<p style=\"text-align: center;\">[latex]\\mu_{\\scriptsize \\bar{X}} = \\mu.[\/latex]<\/p>\n<\/li>\n<li>Spread: the standard deviation of the sample mean equals the population standard deviation divided by the square root of the sample size. That is,\n<p style=\"text-align: center;\">[latex]\\sigma_{\\scriptsize \\bar{X}} = \\frac{\\sigma}{\\sqrt{n}}.[\/latex]<\/p>\n<\/li>\n<\/ul>\n<p>These two arguments are true for any population distribution and sample size <em>n<\/em>.<\/p>\n<ul>\n<li>Shape:\n<ul>\n<li>When the population distribution is normal, [latex]\\bar{X}[\/latex] is also normal regardless of <em>n<\/em>.<\/li>\n<li>When the population distribution is non-normal but the sample size <em>n<\/em> is large, [latex]\\bar{X}[\/latex] is approximately normally distributed. This is guaranteed by the central limit theorem (CLT).<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>The same conclusions can be applied to the sampling distribution of the sample proportion [latex]\\hat{p}[\/latex], where the variable of interest is<\/p>\n<p style=\"text-align: center;\">[latex]X = \\begin{cases}  1 & \\text{with probability } p \\\\  0 & \\text{with probability } 1-p  \\end{cases}[\/latex]<\/p>\n<p>with the population mean [latex]\\mu = p[\/latex] and standard deviation [latex]\\sigma = \\sqrt{p(1-p)}[\/latex]. Therefore, the sampling distribution of the sample proportion [latex]\\hat{p}[\/latex] is summarized as follows.<\/p>\n<div class=\"textbox textbox--key-takeaways\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Key Facts: Sampling Distribution of the Sample Proportion<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<ul>\n<li><strong>Centre<\/strong>: the mean of the sample proportion [latex]\\hat{p}[\/latex] equals the population mean [latex]\\mu[\/latex]. That is,\n<p style=\"text-align: center;\">[latex]\\mu_{\\scriptsize \\hat{p}} = \\mu = p[\/latex].<\/p>\n<\/li>\n<li><strong>Spread<\/strong>: the standard deviation of the sample proportion [latex]\\hat{p}[\/latex] equals the population standard deviation [latex]\\sigma[\/latex] divided by the square root of the sample size. That is,\n<p style=\"text-align: center;\">[latex]\\sigma_{\\scriptsize \\hat{p}} = \\frac{\\sigma}{\\sqrt{n}} = \\frac{\\sqrt{p(1-p)}}{\\sqrt{n}} = \\sqrt{ \\frac{p(1-p)}{n}}[\/latex].<\/p>\n<\/li>\n<\/ul>\n<p>These two arguments are true for any population proportion [latex]p[\/latex] and any sample size <em>n<\/em>.<\/p>\n<ul>\n<li><strong>Shape<\/strong>: The population distribution is non-normal. By the central limit theorem (CLT), however, [latex]\\hat{p}[\/latex] is approximately normal if <em>n<\/em> is large enough. The rule of thumb is to guarantee both [latex]np \\geq 5[\/latex] and [latex]n(1-p) \\geq 5[\/latex], i.e., [latex]n \\geq \\max \\left\\{ \\frac{5}{p}, \\frac{5}{1-p} \\right\\}[\/latex]. Some textbooks require both [latex]np \\geq 10[\/latex] and [latex]n(1-p) \\geq 10[\/latex].<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<p><strong>Central limit theorem for the sample proportion:<\/strong><\/p>\n<p>If the sample size <em>n<\/em> is large enough ([latex]np \\geq 5[\/latex] and [latex]n(1-p) \\geq 5[\/latex]), the sampling distribution of the sample proportion [latex]\\hat{p}[\/latex] is approximately normally distributed.<\/p>\n<p>For example, suppose the population proportion is [latex]p=0.05[\/latex]. Then the sampling distribution of the sample proportion [latex]\\hat{p}[\/latex] is approximately normally distributed if the sample size is at least<\/p>\n<p style=\"text-align: center;\">[latex]n = \\max \\left\\{ \\frac{5}{p}, \\frac{5}{1-p} \\right\\} = \\max \\left\\{ \\frac{5}{0.05}, \\frac{5}{1-0.05} \\right\\} = \\max \\{ 100, 5.26 \\} = 100.[\/latex]<\/p>\n<p>The following figures show the sampling distribution of the sample proportion with [latex]p=0.05[\/latex] and sample sizes <em>n<\/em> = 50, 100, 200, and 1000.<a id=\"retfig10.1\"><\/a><\/p>\n<div style=\"margin: auto;\">\n<table class=\"no-border\" style=\"width: 90%; height: 191px; border-spacing: 0px;\" cellpadding=\"0\">\n<tbody>\n<tr style=\"height: 176px;\">\n<td style=\"width: 25%; height: 176px;\" valign=\"top\"><a href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n50.png\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1088 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n50-292x300.png\" alt=\"A histogram of sample proportion for sample size n = 50. Image description available.\" width=\"292\" height=\"300\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n50-292x300.png 292w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n50-65x67.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n50-225x231.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n50-350x360.png 350w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n50.png 614w\" sizes=\"auto, (max-width: 292px) 100vw, 292px\" \/><\/a><\/td>\n<td style=\"width: 25%; height: 176px;\" valign=\"top\"><a class=\"alignnone size-medium wp-image-1089\" href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n100.png\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1089 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n100-294x300.png\" alt=\"A histogram of sample proportion for sample size n = 100. Image description available.\" width=\"294\" height=\"300\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n100-294x300.png 294w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n100-65x66.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n100-225x230.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n100-350x357.png 350w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n100.png 614w\" sizes=\"auto, (max-width: 294px) 100vw, 294px\" \/><\/a><\/td>\n<td style=\"width: 25%; height: 176px;\" valign=\"top\"><a class=\"alignnone size-medium wp-image-1090\" href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n200.png\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1090 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n200-290x300.png\" alt=\"A histogram for sample proportion for sample size n = 200. Image description available.\" width=\"290\" height=\"300\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n200-290x300.png 290w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n200-65x67.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n200-225x233.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n200-350x362.png 350w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n200.png 612w\" sizes=\"auto, (max-width: 290px) 100vw, 290px\" \/><\/a><\/td>\n<td style=\"width: 25%; height: 176px; width: 178px;\" valign=\"top\"><a class=\"alignnone size-medium wp-image-1091\" href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n1000.png\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1091 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n1000-289x300.png\" alt=\"A histogram of sample proportion for sample size n = 1000. Image description available.\" width=\"289\" height=\"300\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n1000-289x300.png 289w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n1000-65x67.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n1000-225x233.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n1000-350x363.png 350w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/06\/m10_SampleProportion_SampleDistribution_n1000.png 622w\" sizes=\"auto, (max-width: 289px) 100vw, 289px\" \/><\/a><\/td>\n<\/tr>\n<tr style=\"height: 15px;\">\n<td style=\"height: 15px;\" colspan=\"4\" valign=\"top\">\n<div style=\"text-align: center; margin: auto;\"><strong>Figure 10.1<\/strong>: Histograms of Sample Proportions with Different Sample Size. [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig10.1\">Image Description (See Appendix D Figure 10.1)<\/a>] Click on the image to enlarge it.<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>There are several findings:<\/p>\n<\/div>\n<ul>\n<li>The sampling distribution of the sample proportion becomes increasingly normal as the sample size <em>n<\/em> increases. When <em>n<\/em> = 50, the sampling distribution of sample proportion is skewed. When <em>n<\/em> = 100, the distribution is still slightly right skewed. For <em>n<\/em> = 200 and <em>n<\/em> = 1000, the sampling distribution appears bell-shaped and symmetric (indicative of a normal distribution).<\/li>\n<li>The mean of the sample proportion (blue dashed line) is always identical to the population proportion <em>p<\/em> = 0.05 (red solid line) regardless of the sample size <em>n<\/em>.<\/li>\n<li>The standard deviation of the sample proportion decreases as\u00a0<em>n<\/em> increases.<\/li>\n<\/ul>\n<p>To summarize, for [latex]np \\geq 5[\/latex] and [latex]n(1-p) \\geq 5[\/latex], [latex]\\hat{p} \\sim N \\left( p , \\sqrt{\\frac{p(1-p)}{n}} \\right)[\/latex]. The standardized version of [latex]\\hat{p}[\/latex] is [latex]Z = \\frac{\\hat{p} - p}{\\sqrt{\\frac{p(1-p)}{n}}} \\sim N(0,1)[\/latex]. As a result, inferences about the population proportions are based on the standard normal distribution.<\/p>\n","protected":false},"author":19,"menu_order":2,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-1083","chapter","type-chapter","status-publish","hentry"],"part":1075,"_links":{"self":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/1083","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/users\/19"}],"version-history":[{"count":44,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/1083\/revisions"}],"predecessor-version":[{"id":5433,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/1083\/revisions\/5433"}],"part":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/parts\/1075"}],"metadata":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/1083\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/media?parent=1083"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapter-type?post=1083"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/contributor?post=1083"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/license?post=1083"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}