{"id":1271,"date":"2021-07-03T11:46:22","date_gmt":"2021-07-03T15:46:22","guid":{"rendered":"https:\/\/openbooks.macewan.ca\/rcommander\/?post_type=chapter&#038;p=1271"},"modified":"2024-02-08T14:49:10","modified_gmt":"2024-02-08T19:49:10","slug":"13-4-outliers-and-influential-observations","status":"publish","type":"chapter","link":"https:\/\/openbooks.macewan.ca\/introstats\/chapter\/13-4-outliers-and-influential-observations\/","title":{"raw":"13.4 Outliers and Influential Observations","rendered":"13.4 Outliers and Influential Observations"},"content":{"raw":"In simple linear regression, we must also watch out for outliers and influential observations. <strong>Outliers<\/strong> are observations that are far away from the majority of the data. An <strong>influential observation<\/strong> is a data point that changes the regression equation dramatically if included. Note that an outlier might or might not be an influential observation.\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Example: Outlier and Influential Observations<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nIn the following figures, identify whether the red point is an outlier or an influential observation.<a id=\"retfig13.5\"><\/a><a id=\"retfig13.6\"><\/a><span style=\"text-align: initial; font-size: 1em;\">\u200b<\/span>\r\n<div align=\"center\">\r\n<table class=\"aligncenter no-border\" style=\"height: 351px; width: 85%;\" border=\"0\" width=\"85%\" cellspacing=\"0\" cellpadding=\"0\">\r\n<tbody>\r\n<tr style=\"height: 351px;\">\r\n<td style=\"width: 50%; height: 351px;\" valign=\"top\">[caption id=\"attachment_2928\" align=\"aligncenter\" width=\"300\"]<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_outlier.png\"><img class=\"wp-image-2928 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_outlier-300x300.png\" alt=\"A scatter plot with an outlier. The outlier does not significantly change the regression line. Image description available.\" width=\"300\" height=\"300\" \/><\/a> <strong>Figure 13.5<\/strong>: An Outlier But Not Influential. [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig13.5\">Image Description (See Appendix D Figure 13.5)<\/a>] Click on the image to enlarge it.[\/caption]<\/td>\r\n<td style=\"width: 50%; height: 351px;\" valign=\"top\">[caption id=\"attachment_2927\" align=\"aligncenter\" width=\"300\"]<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_influential.png\"><img class=\"wp-image-2927 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_influential-300x300.png\" alt=\"A scatter plot with an outlier. The outlier does significantly change the regression line. Image description available.\" width=\"300\" height=\"300\" \/><\/a> <strong>Figure 13.6<\/strong>: An Outlier and Influential [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig13.6\">Image Description (See Appendix D Figure 13.6)<\/a>] Click on the image to enlarge.[\/caption]<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\nThe red point on the left panel is an outlier since it is far away from the majority of the data; however, it is not an influential observation since the regression lines are almost identical with and without the red point.\r\n\r\nThe red point on the right panel is an outlier and an influential observation since including the red point dramatically changes the regression line. Without the red point, the slope of the regression line is positive; the slope becomes negative when the red observation is included. The red observation is also far away from the majority of the data and hence is an outlier.\r\n\r\n<\/div>\r\n<\/div>","rendered":"<p>In simple linear regression, we must also watch out for outliers and influential observations. <strong>Outliers<\/strong> are observations that are far away from the majority of the data. An <strong>influential observation<\/strong> is a data point that changes the regression equation dramatically if included. Note that an outlier might or might not be an influential observation.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Example: Outlier and Influential Observations<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>In the following figures, identify whether the red point is an outlier or an influential observation.<a id=\"retfig13.5\"><\/a><a id=\"retfig13.6\"><\/a><span style=\"text-align: initial; font-size: 1em;\">\u200b<\/span><\/p>\n<div style=\"margin: auto;\">\n<table class=\"aligncenter no-border\" style=\"height: 351px; width: 85%; width: 85%; border-spacing: 0px;\" cellpadding=\"0\">\n<tbody>\n<tr style=\"height: 351px;\">\n<td style=\"width: 50%; height: 351px;\" valign=\"top\">\n<figure id=\"attachment_2928\" aria-describedby=\"caption-attachment-2928\" style=\"width: 300px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_outlier.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-2928 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_outlier-300x300.png\" alt=\"A scatter plot with an outlier. The outlier does not significantly change the regression line. Image description available.\" width=\"300\" height=\"300\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_outlier-300x300.png 300w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_outlier-150x150.png 150w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_outlier-65x65.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_outlier-225x225.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_outlier-350x350.png 350w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_outlier.png 480w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><figcaption id=\"caption-attachment-2928\" class=\"wp-caption-text\"><strong>Figure 13.5<\/strong>: An Outlier But Not Influential. [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig13.5\">Image Description (See Appendix D Figure 13.5)<\/a>] Click on the image to enlarge it.<\/figcaption><\/figure>\n<\/td>\n<td style=\"width: 50%; height: 351px;\" valign=\"top\">\n<figure id=\"attachment_2927\" aria-describedby=\"caption-attachment-2927\" style=\"width: 300px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_influential.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-2927 size-medium\" src=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_influential-300x300.png\" alt=\"A scatter plot with an outlier. The outlier does significantly change the regression line. Image description available.\" width=\"300\" height=\"300\" srcset=\"https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_influential-300x300.png 300w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_influential-150x150.png 150w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_influential-65x65.png 65w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_influential-225x225.png 225w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_influential-350x350.png 350w, https:\/\/openbooks.macewan.ca\/introstats\/wp-content\/uploads\/sites\/8\/2021\/07\/regression_influential.png 480w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><figcaption id=\"caption-attachment-2927\" class=\"wp-caption-text\"><strong>Figure 13.6<\/strong>: An Outlier and Influential [<a href=\"https:\/\/openbooks.macewan.ca\/introstats\/back-matter\/image-description\/#fig13.6\">Image Description (See Appendix D Figure 13.6)<\/a>] Click on the image to enlarge.<\/figcaption><\/figure>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p>The red point on the left panel is an outlier since it is far away from the majority of the data; however, it is not an influential observation since the regression lines are almost identical with and without the red point.<\/p>\n<p>The red point on the right panel is an outlier and an influential observation since including the red point dramatically changes the regression line. Without the red point, the slope of the regression line is positive; the slope becomes negative when the red observation is included. The red observation is also far away from the majority of the data and hence is an outlier.<\/p>\n<\/div>\n<\/div>\n","protected":false},"author":19,"menu_order":4,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-1271","chapter","type-chapter","status-publish","hentry"],"part":1246,"_links":{"self":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/1271","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/users\/19"}],"version-history":[{"count":24,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/1271\/revisions"}],"predecessor-version":[{"id":5320,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/1271\/revisions\/5320"}],"part":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/parts\/1246"}],"metadata":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapters\/1271\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/media?parent=1271"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/pressbooks\/v2\/chapter-type?post=1271"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/contributor?post=1271"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/openbooks.macewan.ca\/introstats\/wp-json\/wp\/v2\/license?post=1271"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}