5.1 Density Curve

We use a density curve to describe the distribution of a continuous variable. A density curve is the continuous analogue of a relative-frequency histogram with the density (scaled relative frequency) as the y-axis. For example, 1,000 grades are shown in the following grouping table and the relative-frequency histogram. The total area of the bins in the relative-frequency histogram is

[latex]\text{area} = 10 \times 0.003 + 10 \times 0.019 + 10 \times 0.150 + \cdots+ 10 \times 0.144 + 10 \times 0.022 = 10.[/latex]

The total area of the three leftmost bins is

[latex]\text{area}= 10 \times 0.003  + 10 \times 0.019 + 10 \times 0.150 = 1.72[/latex],

which accounts for [latex]1.72/10=0.172[/latex] or [latex]1.72\%[/latex] of the data. If we re-scale the y-axis of the relative-frequency histogram by dividing relative frequency (probability) by 10 (the bin width) and draw a smooth curve on the top of the histogram, we obtain the density curve of the grades with the y-axis density = relative frequency/width of the bins = relative frequency/10.

Sum=1.000
Interval Relative
Frequency
[30, 40) 0.003
[40, 50) 0.019
[50, 60) 0.150
[60, 70) 0.330
[70, 80) 0.332
[80, 90) 0.144
[90, 100] 0.022
1) A histogram of grade. The leftmost three bins are coloured black. Image description available. 2) A histogram of grade. The leftmost three bins are coloured black and there is a red bell curve over the bars. Image description available.
Table 5.1: Grouping Table of Grade Figure 5.1: Probability Histogram (Left) and Density Curve (Right) of Grade. [Image Description (See Appendix D Figure 5.1)] Click on image to enlarge.

Let [latex]X=[/latex] grade, the proportion of grades below 60 is given by

[latex]\begin{align*} P(X < 60) &= 0.003 + 0.019 + 0.150 = 0.172\\ &= \text{total area of the leftmost three bins}\\ &= \text{area to the left of 60 under the red curve}. \end{align*}[/latex]

Similarly, the percentage of grades above 80, [latex]P(X \: > \: 80) = 0.144 + 0.022 -= 0.166[/latex], which is the total area of the rightmost two bins or the area to the right of 80 under the red curve. The proportion (percentage) of grades between 60 and 80 is given by

[latex]\begin{align*} P(60 < X < 80) &= 0.330 + 0.332 = 0.662\\ &= \text{total are of the two bins in the middle}\\ &= \text{the area between 60 and 80 under the red curve}. \end{align*}[/latex]

A density curve has the following properties:

Key Facts: Properties of a density curve

  • Total area under the curve is one.
  • Area of a region under the curve gives the probability of an event.
A density curve. The area to the left of a vertical line x=a is shaded in grey. Image description available. A density curve. The area to the right of a vertical line x=b is shaded in grey. Image description available. A density curve. The area between vertical lines x=a and x=b (a<b) is shaded in grey. Image description available.

[latex]P(X \leq a)[/latex]

probability below a = area to the left of a

[latex]P(X \geq b)[/latex]

probability above b = area to the right of b

[latex]P(a \leq X \leq b)[/latex]

[latex]= P(X \leq b) - P(X \leq a)[/latex]

probability between a and b = area between a and b = area left of b minus area left of a.

[Image Description (See Appendix D Figure 5.1.1)]

 

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Introduction to Applied Statistics Copyright © 2024 by Wanhua Su is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted.