{"id":663,"date":"2020-07-22T15:10:58","date_gmt":"2020-07-22T19:10:58","guid":{"rendered":"https:\/\/openbooks.macewan.ca\/rcommander\/?post_type=chapter&#038;p=663"},"modified":"2024-02-08T13:40:45","modified_gmt":"2024-02-08T18:40:45","slug":"5-6-assessing-normality","status":"publish","type":"chapter","link":"https:\/\/openbooks.macewan.ca\/introstats\/chapter\/5-6-assessing-normality\/","title":{"raw":"5.6 Assessing Normality: Normal Probability Plot","rendered":"5.6 Assessing Normality: Normal Probability Plot"},"content":{"raw":"In later chapters, it will be necessary for us to assume our sample is selected from a normally distributed population; an easy way to check this assumption is to do so via graphical methods. For example, we can construct a histogram of our sampled data and if the histogram looks to be somewhat bell-shaped, then it is reasonable for us to assume the population is normally distributed (or at least approximately normally distributed). However, histograms only tend to inherit features of the population when the sample size is reasonably large.\r\n\r\nA more effective alternative to a histogram is a normal probability plot, which plots observed data points against normal quantiles (for this reason, normal probability plots are often referred to as normal Q-Q plots, where \u201cQ\u201d stands for \u201cQuantile.\u201d). If the distribution of the data is roughly normal, the points on a normal probability plot will roughly fall on a straight line. Deviations from a straight line indicate that the underlying distribution is not normal.\r\n\r\nTypically, software such as R commander is used to make a normal probability plot. Some software plots the observed quantiles in the y-axis by default (e.g., R), and some plots the normal quantiles in the y-axis (e.g., Minitab). The steps to draw a normal probability plot are illustrated in the following example.\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Example: Accessing Normality Using Normal Probability Plot<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nSuppose the data are: 75, 80, 90, 85, 75, and 40. Check whether the data are from a normal distribution by drawing a normal probability plot.\r\n\r\nSteps:\r\n<ol>\r\n \t<li>Sort the data from smallest to largest. We refer to the sorted data as the observed quantiles and put them in the first column of a table.<\/li>\r\n \t<li>Refer to a table of normal scores (such as Table III in the appendix of the course textbook) in order to find the normal quantiles (sometimes called theoretical quantiles). In this example, there are [latex]n=6[\/latex] data points and so we copy the column with [latex]n=6[\/latex] into the second column of our table.<\/li>\r\n \t<li>\u00a0Plot the observed quantiles (y-axis) versus the theoretical quantiles (x-axis) or the other way.<\/li>\r\n \t<li>If the data points roughly fall on a straight line, then we assume the data are from a normal distribution; otherwise, the data are not from a normal distribution.<a id=\"retfig5.10\"><\/a><\/li>\r\n<\/ol>\r\n<div align=\"center\">\r\n<table class=\"aligncenter no-border\" style=\"width: 100%; height: 396px;\" border=\"0\" cellspacing=\"0\" cellpadding=\"0\">\r\n<tbody>\r\n<tr style=\"height: 396px;\">\r\n<td style=\"width: 436.163px; height: 396px;\" valign=\"top\"><strong>Table 5.2<\/strong>: Observed and Theoretical Quantiles for a Normal Q-Q plot\r\n<table class=\"aligncenter\" style=\"width: 100%; height: 112px;\" border=\"1\" cellspacing=\"0\" cellpadding=\"0\" align=\"center\">\r\n<thead>\r\n<tr class=\"shaded\" style=\"height: 28px;\">\r\n<td style=\"height: 28px; width: 137.84375px;\" valign=\"top\">\r\n<div align=\"center\">Sorted value\r\n(observed quantile)<\/div><\/td>\r\n<td style=\"height: 28px; width: 147.859375px;\" valign=\"top\">\r\n<div align=\"center\">Normal score\r\n(theoretical quantile)<\/div><\/td>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr style=\"height: 14px;\">\r\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\r\n<div align=\"center\">40<\/div><\/td>\r\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\r\n<div align=\"center\">-1.28<\/div><\/td>\r\n<\/tr>\r\n<tr style=\"height: 14px;\">\r\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\r\n<div align=\"center\">75<\/div><\/td>\r\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\r\n<div align=\"center\">-0.64<\/div><\/td>\r\n<\/tr>\r\n<tr style=\"height: 14px;\">\r\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\r\n<div align=\"center\">75<\/div><\/td>\r\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\r\n<div align=\"center\">-0.20<\/div><\/td>\r\n<\/tr>\r\n<tr style=\"height: 14px;\">\r\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\r\n<div align=\"center\">80<\/div><\/td>\r\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\r\n<div align=\"center\">0.20<\/div><\/td>\r\n<\/tr>\r\n<tr style=\"height: 14px;\">\r\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\r\n<div align=\"center\">85<\/div><\/td>\r\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\r\n<div align=\"center\">0.64<\/div><\/td>\r\n<\/tr>\r\n<tr style=\"height: 14px;\">\r\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\r\n<div align=\"center\">90<\/div><\/td>\r\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\r\n<div align=\"center\">1.28<\/div><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/td>\r\n<td style=\"width: 474.263px; height: 396px;\" valign=\"top\">&nbsp;\r\n\r\n[caption id=\"attachment_617\" align=\"aligncenter\" width=\"331\"]<img class=\"wp-image-617\" src=\"https:\/\/openbooks.macewan.ca\/rcommander\/wp-content\/uploads\/sites\/8\/2020\/07\/M05-07-AssessingNormalityNormalProbabilityPlot_clip_image002.png\" alt=\"A scatter plot of the data given in Table 5.2. Image description available.\" width=\"331\" height=\"331\" \/> <strong>Figure 5.10<\/strong>: Normal Probability Plot for Six Grades. [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig5.10\">Image Description (See Appendix D Figure 5.10)<\/a>][\/caption]<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\nThe six points do not fall in a straight line; the data do not seem to come from a normal distribution. However, the point on the lower left corner might be an outlier. If we remove this potential outlier, the other five points roughly fall on a straight line.\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Exercise: Normal Probability Plot<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nComment on the following normal probability plots and answer whether the data seem to come from a normal distribution.<a id=\"retfig5.11\"><\/a>\r\n\r\n[caption id=\"attachment_3066\" align=\"aligncenter\" width=\"900\"]<img class=\"wp-image-3066 size-full\" src=\"https:\/\/openbooks.macewan.ca\/rcommander\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1.png\" alt=\"Three normal probability plots are shown in a row. Image description available.\" width=\"900\" height=\"300\" \/> <strong>Figure 5.11<\/strong>: Example of Normal Probability Plots. [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig5.11\">Image Description (See Appendix D Figure 5.11)<\/a>][\/caption]\u00a0<details><summary>Show\/Hide Answer<\/summary><img class=\"alignnone size-full wp-image-3066\" src=\"https:\/\/openbooks.macewan.ca\/rcommander\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1.png\" alt=\"\" width=\"900\" height=\"300\" \/>\r\n<table class=\"aligncenter no-border\" style=\"width: 96.5292%;\" border=\"1\" cellspacing=\"0\" cellpadding=\"3\">\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 34.8663%;\" valign=\"top\"><\/td>\r\n<td style=\"width: 34.9878%;\" valign=\"top\"><\/td>\r\n<td style=\"width: 37.1741%;\" valign=\"top\"><\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 34.8663%; vertical-align: top;\" valign=\"top\">The points form an approximate straight line. Thus, it is reasonable to assume the data are from a normal distribution.<\/td>\r\n<td style=\"width: 34.9878%; vertical-align: top;\" valign=\"top\">The points do not form a straight line (there is obvious curvature). This suggests that the data are not from a normal distribution.<\/td>\r\n<td style=\"width: 37.1741%; vertical-align: top;\" valign=\"top\">Excluding the outliers, the points form an approximate straight line. Thus, it is reasonable to assume the data are from a normal distribution (if we disregard the outlier).<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/details><\/div>\r\n<\/div>\r\nHistogram, boxplot and normal probability (Q-Q) plot are popular graphs used to explore the distribution of data. If the data are taken from a normal population, the histogram should appear to be bell-shaped, the boxplot should be symmetric, the normal probability plot should show a linear pattern. When the number of observations is not large, however, the histogram might not show bell-shape with different bin widths. Note that a boxplot cannot be used to confirm that data follow a normal distribution since some distributions, such as uniform and multimodal, are also symmetric. Therefore, the normal probability plot is the best graphical method to assess normality. Figure 5.12 shows histograms, normal probability plots, and boxplots for six typical distributions: left skewed, normal, right skewed, multimodal and symmetric, normal with outliers, and uniform. Based on the graphs, we can see how histogram features correspond to boxplot and normal probability features.<a id=\"retfig5.12\"><\/a>\r\n\r\n&nbsp;\r\n<div><img class=\"aligncenter wp-image-2797\" src=\"https:\/\/openbooks.macewan.ca\/rcommander\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare1.png\" alt=\"Nine graphs showing three types of distributions. The first column shows a left skewed distribution, the middle a normal distribution, and the right a right skewed distribution. The first row is histograms, the second is probability plots and the third is box plots. Image description available.\" width=\"700\" height=\"700\" \/><\/div>\r\n<div>[caption id=\"attachment_2798\" align=\"aligncenter\" width=\"700\"]<img class=\"wp-image-2798\" src=\"https:\/\/openbooks.macewan.ca\/rcommander\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare2.png\" alt=\"Nine graphs in a 3 by 3 matrix. The left column is a multimodal distribution, the middle is normal with outliers, and the left is uniform. The first row shows histograms, the second shows probability plots, and the third shows box plots. Image description available.\" width=\"700\" height=\"700\" \/> <strong>Figure 5.12<\/strong>: Histograms, Normal Probability Plots and Boxplot for Six Typical Distributions. [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig5.12\">Image Description (See Appendix D Figure 5.12)<\/a>][\/caption]<\/div>\r\n<div><\/div>","rendered":"<p>In later chapters, it will be necessary for us to assume our sample is selected from a normally distributed population; an easy way to check this assumption is to do so via graphical methods. For example, we can construct a histogram of our sampled data and if the histogram looks to be somewhat bell-shaped, then it is reasonable for us to assume the population is normally distributed (or at least approximately normally distributed). However, histograms only tend to inherit features of the population when the sample size is reasonably large.<\/p>\n<p>A more effective alternative to a histogram is a normal probability plot, which plots observed data points against normal quantiles (for this reason, normal probability plots are often referred to as normal Q-Q plots, where \u201cQ\u201d stands for \u201cQuantile.\u201d). If the distribution of the data is roughly normal, the points on a normal probability plot will roughly fall on a straight line. Deviations from a straight line indicate that the underlying distribution is not normal.<\/p>\n<p>Typically, software such as R commander is used to make a normal probability plot. Some software plots the observed quantiles in the y-axis by default (e.g., R), and some plots the normal quantiles in the y-axis (e.g., Minitab). The steps to draw a normal probability plot are illustrated in the following example.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Example: Accessing Normality Using Normal Probability Plot<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>Suppose the data are: 75, 80, 90, 85, 75, and 40. Check whether the data are from a normal distribution by drawing a normal probability plot.<\/p>\n<p>Steps:<\/p>\n<ol>\n<li>Sort the data from smallest to largest. We refer to the sorted data as the observed quantiles and put them in the first column of a table.<\/li>\n<li>Refer to a table of normal scores (such as Table III in the appendix of the course textbook) in order to find the normal quantiles (sometimes called theoretical quantiles). In this example, there are [latex]n=6[\/latex] data points and so we copy the column with [latex]n=6[\/latex] into the second column of our table.<\/li>\n<li>\u00a0Plot the observed quantiles (y-axis) versus the theoretical quantiles (x-axis) or the other way.<\/li>\n<li>If the data points roughly fall on a straight line, then we assume the data are from a normal distribution; otherwise, the data are not from a normal distribution.<a id=\"retfig5.10\"><\/a><\/li>\n<\/ol>\n<div style=\"margin: auto;\">\n<table class=\"aligncenter no-border\" style=\"width: 100%; height: 396px; border-spacing: 0px;\" cellpadding=\"0\">\n<tbody>\n<tr style=\"height: 396px;\">\n<td style=\"width: 436.163px; height: 396px;\" valign=\"top\"><strong>Table 5.2<\/strong>: Observed and Theoretical Quantiles for a Normal Q-Q plot<\/p>\n<table class=\"aligncenter\" style=\"width: 100%; height: 112px; border-spacing: 0px; margin: auto;\" cellpadding=\"0\">\n<thead>\n<tr class=\"shaded\" style=\"height: 28px;\">\n<td style=\"height: 28px; width: 137.84375px;\" valign=\"top\">\n<div style=\"margin: auto;\">Sorted value<br \/>\n(observed quantile)<\/div>\n<\/td>\n<td style=\"height: 28px; width: 147.859375px;\" valign=\"top\">\n<div style=\"margin: auto;\">Normal score<br \/>\n(theoretical quantile)<\/div>\n<\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr style=\"height: 14px;\">\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\n<div style=\"margin: auto;\">40<\/div>\n<\/td>\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\n<div style=\"margin: auto;\">-1.28<\/div>\n<\/td>\n<\/tr>\n<tr style=\"height: 14px;\">\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\n<div style=\"margin: auto;\">75<\/div>\n<\/td>\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\n<div style=\"margin: auto;\">-0.64<\/div>\n<\/td>\n<\/tr>\n<tr style=\"height: 14px;\">\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\n<div style=\"margin: auto;\">75<\/div>\n<\/td>\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\n<div style=\"margin: auto;\">-0.20<\/div>\n<\/td>\n<\/tr>\n<tr style=\"height: 14px;\">\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\n<div style=\"margin: auto;\">80<\/div>\n<\/td>\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\n<div style=\"margin: auto;\">0.20<\/div>\n<\/td>\n<\/tr>\n<tr style=\"height: 14px;\">\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\n<div style=\"margin: auto;\">85<\/div>\n<\/td>\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\n<div style=\"margin: auto;\">0.64<\/div>\n<\/td>\n<\/tr>\n<tr style=\"height: 14px;\">\n<td style=\"height: 14px; width: 137.84375px;\" valign=\"top\">\n<div style=\"margin: auto;\">90<\/div>\n<\/td>\n<td style=\"height: 14px; width: 147.859375px;\" valign=\"top\">\n<div style=\"margin: auto;\">1.28<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/td>\n<td style=\"width: 474.263px; height: 396px;\" valign=\"top\">&nbsp;<\/p>\n<figure id=\"attachment_617\" aria-describedby=\"caption-attachment-617\" style=\"width: 331px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-617\" src=\"https:\/\/openbooks.macewan.ca\/rcommander\/wp-content\/uploads\/sites\/8\/2020\/07\/M05-07-AssessingNormalityNormalProbabilityPlot_clip_image002.png\" alt=\"A scatter plot of the data given in Table 5.2. Image description available.\" width=\"331\" height=\"331\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/M05-07-AssessingNormalityNormalProbabilityPlot_clip_image002.png 357w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/M05-07-AssessingNormalityNormalProbabilityPlot_clip_image002-300x300.png 300w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/M05-07-AssessingNormalityNormalProbabilityPlot_clip_image002-150x150.png 150w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/M05-07-AssessingNormalityNormalProbabilityPlot_clip_image002-65x65.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/M05-07-AssessingNormalityNormalProbabilityPlot_clip_image002-225x225.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/M05-07-AssessingNormalityNormalProbabilityPlot_clip_image002-350x350.png 350w\" sizes=\"auto, (max-width: 331px) 100vw, 331px\" \/><figcaption id=\"caption-attachment-617\" class=\"wp-caption-text\"><strong>Figure 5.10<\/strong>: Normal Probability Plot for Six Grades. [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig5.10\">Image Description (See Appendix D Figure 5.10)<\/a>]<\/figcaption><\/figure>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>The six points do not fall in a straight line; the data do not seem to come from a normal distribution. However, the point on the lower left corner might be an outlier. If we remove this potential outlier, the other five points roughly fall on a straight line.<\/p>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Exercise: Normal Probability Plot<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>Comment on the following normal probability plots and answer whether the data seem to come from a normal distribution.<a id=\"retfig5.11\"><\/a><\/p>\n<figure id=\"attachment_3066\" aria-describedby=\"caption-attachment-3066\" style=\"width: 900px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-3066 size-full\" src=\"https:\/\/openbooks.macewan.ca\/rcommander\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1.png\" alt=\"Three normal probability plots are shown in a row. Image description available.\" width=\"900\" height=\"300\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1.png 900w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1-300x100.png 300w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1-768x256.png 768w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1-65x22.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1-225x75.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1-350x117.png 350w\" sizes=\"auto, (max-width: 900px) 100vw, 900px\" \/><figcaption id=\"caption-attachment-3066\" class=\"wp-caption-text\"><strong>Figure 5.11<\/strong>: Example of Normal Probability Plots. [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig5.11\">Image Description (See Appendix D Figure 5.11)<\/a>]<\/figcaption><\/figure>\n<p>\u00a0<\/p>\n<details>\n<summary>Show\/Hide Answer<\/summary>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-3066\" src=\"https:\/\/openbooks.macewan.ca\/rcommander\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1.png\" alt=\"\" width=\"900\" height=\"300\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1.png 900w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1-300x100.png 300w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1-768x256.png 768w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1-65x22.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1-225x75.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2020\/07\/qqplot_exercise-1-350x117.png 350w\" sizes=\"auto, (max-width: 900px) 100vw, 900px\" \/><\/p>\n<table class=\"aligncenter no-border\" style=\"width: 96.5292%; border-spacing: 0px;\" cellpadding=\"3\">\n<tbody>\n<tr>\n<td style=\"width: 34.8663%;\" valign=\"top\"><\/td>\n<td style=\"width: 34.9878%;\" valign=\"top\"><\/td>\n<td style=\"width: 37.1741%;\" valign=\"top\"><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 34.8663%; vertical-align: top;\" valign=\"top\">The points form an approximate straight line. Thus, it is reasonable to assume the data are from a normal distribution.<\/td>\n<td style=\"width: 34.9878%; vertical-align: top;\" valign=\"top\">The points do not form a straight line (there is obvious curvature). This suggests that the data are not from a normal distribution.<\/td>\n<td style=\"width: 37.1741%; vertical-align: top;\" valign=\"top\">Excluding the outliers, the points form an approximate straight line. Thus, it is reasonable to assume the data are from a normal distribution (if we disregard the outlier).<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/details>\n<\/div>\n<\/div>\n<p>Histogram, boxplot and normal probability (Q-Q) plot are popular graphs used to explore the distribution of data. If the data are taken from a normal population, the histogram should appear to be bell-shaped, the boxplot should be symmetric, the normal probability plot should show a linear pattern. When the number of observations is not large, however, the histogram might not show bell-shape with different bin widths. Note that a boxplot cannot be used to confirm that data follow a normal distribution since some distributions, such as uniform and multimodal, are also symmetric. Therefore, the normal probability plot is the best graphical method to assess normality. Figure 5.12 shows histograms, normal probability plots, and boxplots for six typical distributions: left skewed, normal, right skewed, multimodal and symmetric, normal with outliers, and uniform. Based on the graphs, we can see how histogram features correspond to boxplot and normal probability features.<a id=\"retfig5.12\"><\/a><\/p>\n<p>&nbsp;<\/p>\n<div><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-2797\" src=\"https:\/\/openbooks.macewan.ca\/rcommander\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare1.png\" alt=\"Nine graphs showing three types of distributions. The first column shows a left skewed distribution, the middle a normal distribution, and the right a right skewed distribution. The first row is histograms, the second is probability plots and the third is box plots. Image description available.\" width=\"700\" height=\"700\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare1.png 480w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare1-300x300.png 300w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare1-150x150.png 150w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare1-65x65.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare1-225x225.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare1-350x350.png 350w\" sizes=\"auto, (max-width: 700px) 100vw, 700px\" \/><\/div>\n<div>\n<figure id=\"attachment_2798\" aria-describedby=\"caption-attachment-2798\" style=\"width: 700px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-2798\" src=\"https:\/\/openbooks.macewan.ca\/rcommander\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare2.png\" alt=\"Nine graphs in a 3 by 3 matrix. The left column is a multimodal distribution, the middle is normal with outliers, and the left is uniform. The first row shows histograms, the second shows probability plots, and the third shows box plots. Image description available.\" width=\"700\" height=\"700\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare2.png 480w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare2-300x300.png 300w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare2-150x150.png 150w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare2-65x65.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare2-225x225.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2022\/05\/qqplot_compare2-350x350.png 350w\" sizes=\"auto, (max-width: 700px) 100vw, 700px\" \/><figcaption id=\"caption-attachment-2798\" class=\"wp-caption-text\"><strong>Figure 5.12<\/strong>: Histograms, Normal Probability Plots and Boxplot for Six Typical Distributions. [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig5.12\">Image Description (See Appendix D Figure 5.12)<\/a>]<\/figcaption><\/figure>\n<\/div>\n<div><\/div>\n","protected":false},"author":19,"menu_order":6,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-663","chapter","type-chapter","status-publish","hentry"],"part":588,"_links":{"self":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/663","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/users\/19"}],"version-history":[{"count":35,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/663\/revisions"}],"predecessor-version":[{"id":5240,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/663\/revisions\/5240"}],"part":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/parts\/588"}],"metadata":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/663\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/media?parent=663"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapter-type?post=663"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/contributor?post=663"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/license?post=663"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}