| 
Web Lesson: Statistics
 
#11 › Cumulative Frequency Curves
 The Interactive Web-Lesson below has questions embedded | 
| x | 5-15 | 15-20 | 20-23 | 23-26 | 26-29 | 29-33 | 33-38 | |
| f | 13 | 18 | 35 | 42 | 34 | 13 | 5 | 
| U.C.B. | up to 5 | up to 15 | up to 20 | up to 23 | up to 26 | up to 29 | up to 33 | up to 38 | 
|  |  |  |  |  |  |  | 
| U.C.B. | up to 5 | up to 15 | up to 20 | up to 10 | up to 15 | up to 20 | up to 25 | up to 30 | 
| F | 0 | 13 | 31 | 66 | 108 | 142 | 155 | 16 | 
               ▲          ▲          ▲          ▲          ▲         ▲          ▲
              ┐└─┐        │          │          │          ╚════════etc═════════╝
Use L.C.B. of │  │        │          │          │
the 1st class ├──┘        │          │          │
and F = 0     │      ┌────┴───┐      │          │
              ┘       1st freq       │          │
                                     │          │
                             ┌───────┴───────┐  │
                              1st + 2nd freqs   │
                               13 + 18  = 31    │
                                                │
                                     ┌──────────┴──────────┐
                                      1st + 2nd + 3rd freqs
                                       13 + 18  + 35  = 66
 	
 
Alternatively, click HERE to download all of the links in 1 go!
| Amount (£) | 5-20 | 20-30 | 30-40 | 40-50 | 50-70 | 70-120 | |
| Number of pupils | 8 | 15 | 18 | 11 | 6 | 2 | 
| U.C.B. | up to 5 | up to 20 | up to ⋯ | up to ⋯ | up to ⋯ | up to ⋯ | up to 120 | 
| Cumulative Frequency | 0 | 8 | ⋯ | ⋯ | ⋯ | ⋯ | 60 | 
 
Hints: 
 
Step 1: Always CHECK the data is written using CLASS BOUNDARIES (not class-limits) ✔  IT IS
        (i.e. there no GAP between the end of any class and the start of the next class)
 	
	
Step 2: To populate the TOP-ROW of the Cumulative Frequency Table, look at the LIMITS of each class:
 
                 The first class is SPECIAL:  ‘5 - 20’
                                               ▲    ▲
                                               │    │
  This is the only class where its L.C.B. ─────┘    │
  ISN'T deleted: its the 1st № in our CF            │
  table (the F-value below it is just ‘0’)          └────────── This is the 2nd № in the
                                                                top-row of our CF-table
 
        After that, fill in the U.C.B.s of the rest of classes to complete the top-row:
 		
 	
Step 3: To populate the BOTTOM-ROW, you need to work out the ‘running-totals’ of the frequency
                                                                ┌───┐
        We saw already that the first number in the F-row is:   |‘0’|
        After that - the values are: 8                        = | 8 |
                                     8 + 15                   = | ⋯	|
                                     8 + 15 + 18              = | ⋯	|
                                     8 + 15 + 18 + ……         = | ⋯	|
                                     8 + …… + …… + …… + …     = | ⋯	|
                                     8 + …… + …… + …… + … + … = | ⋯	|
                                                                └─┬─┘
                      these are our CUMULATIVE FREQUENCIES ───────┘
	
	 
Step 4: PRINT off the grid provided (see the link in the question), plot the points
        and connect them with a smooth curve (smooth out the curve, even if it means you slightly 
        miss one of your points
 
| Time (mins) | 10-15 | 15-30 | 30-40 | 40-50 | 50-65 | 65-90 | |
| Frequency | 4 | 25 | 36 | 27 | 7 | 1 | 
	
Hints:
	
Step 1: Always CHECK the data is written using CLASS BOUNDARIES (not class-limits)  ✔  IT IS
        (i.e. there no GAP between the end of any class and the start of the next class) 
 
	
Step 2: To populate the TOP-ROW of the Cumulative Frequency Table, look at the LIMITS of each class:
 
                 The first class is SPECIAL:  ‘10 - 20’
                                               ▲     ▲
                                               │     │
  This is the only class where its L.C.B. ─────┘     │
  ISN'T deleted: its the 1st № in our CF             │
  table (the F-value below it is just ‘0’)           └────────── This is the 2nd № in the
                                                                 top-row of our CF-table
 
        After that, fill in the U.C.B.s of the rest of classes to complete the top-row
 
        
        TIME (UCB) │   10     	15     	30    	40     	50     	65     	90
        ───────────┼───────────────────────────────────────────────────────
           F       │ 	0     	⋯     	⋯     	⋯     	⋯     	⋯     	⋯
	
	
Step 3: To populate the BOTTOM-ROW, you need to work out the ‘running-totals’ of the frequency
 	
	
Step 4: PRINT off the grid provided (see the link in the question), plot the points
        and connect them with a smooth curve (smooth out the curve, even if it means you slightly 
        miss one of your points
 	
| Mass (kg) | 10-16 | 16-19 | 19-22 | 22-26 | 26-40 | |
| frequency | 4 | 14 | 18 | 9 | 3 | 
Hints: Don't start your cumulative frequency curve from the point (0, 0) As we have done in the questions so far, this time, you have to start from the point (10, 0)
| x | 5-15 | 15-20 | 20-23 | 23-26 | 26-29 | 29-33 | 33-38 | |
| f | 13 | 18 | 35 | 42 | 34 | 13 | 5 | 
 
Note: The number on the y-axis is always the answer to the question, "how many got LESS THAN x=..."
 
 


Hints: Part (a): Firstly - PRINT OFF THE FULL PAGE CUMULATIVE FREQUENCY CURVE for this question… We want to know how many people waited LESS THAN 30 minutes So, we go to "30" on the x-axis Add in a vertical line up to meet the curve and then a horizontal line across to the y-axis Reading the number from the y-axis tell us how many people waited less than 30 minutes Part (b): When we read a y-value from the GRID, it always tells us the NUMBER that were LESS THAN the corresponding x-value… If we wanna know how many were MORE than that x-value, we need to subtract the y-value from ‘n’ (remember, ‘n’ is the total number in the sample, which is the LAST number in the cumulative frequencies and also the y-value of the TOP of the ogive…)

Hints: Part (a): If we draw a vertical line at x = 30 minutes and read across to the y-value - that will tell us how many passengers waited LESS THAN 30 minutes If we draw a vertical line at x = 15 minutes and read across to the y-value - that will tell us how many passengers waited LESS THAN 15 minutes SUBTRACT these two values, and we'll get the number that waited BETWEEN 15-30 mins How do we then turn that into a percentage? Part (b): Since the SAMPLE-SIZES are different, it's wrong to just compare the NUMBER that waited 30+ mins It makes much more sense to compare the PERCENTAGE that waited 30+ minutes Part (c): Again, we just need to compare the PERCENTAGES!

	
Hints:
	
Part (a):
 
Start by finding the NUMBER of students that were 1.4 - 1.7 m tall: By drawing vertical lines
at x=1.7 and x=1.4, reading across to the y-values and subtracting…
 
Then turn it into a percentage (by dividing by 4000)
	
	
	
Part (b): 
 
This is kinda a ‘trick’ question - but understanding the answer is gonna be crucial to your
understanding of continuous data…
	
It's not impossible to deduce: If you follow the logic of how you answered Part (a),
then it's kind-o-bvious  - a ‘prize’ to anyone that gets it right!
 
 
 
Part (c): The TALLEST 10% refers to the 10% at the TOP of the curve…
 
           …10% of 4000 is 400
          └─────────┬─────────┘
                    │
                    └────────────────────  The 400 at the ‘TOP’ of the curve are
                                           the 400 between y = 3600 and y = 4000
	
| Amount (£) | 5-20 | 20-30 | 30-40 | 40-50 | 50-70 | 70-120 | |
| Number of pupils | 8 | 15 | 18 | 11 | 6 | 2 | 
Hints: We already know that n = 60 (i.e. the total number in the sample was 60) The MEDIAN is the 30th value:

WARNING: ››› Students sometimes write: MEDIAN = 30 = 34 ‹‹‹ which you should obviously recognise gobbledegook (how and 30=34 ???) - it is BETTER to write: MEDIAN (30th value) = 34
Similarly, the 3rd QUARTILE is the 45th value, and the 1st QUARTILE is the 15th value
| Time (mins) | 10-15 | 15-30 | 30-40 | 40-50 | 50-65 | 65-90 | |
| Frequency | 4 | 25 | 36 | 27 | 7 | 1 | 
Hints: Sorry - no help for you here!
| Mass (kg) | 10-16 | 16-19 | 19-22 | 22-26 | 26-40 | |
| frequency | 4 | 14 | 18 | 9 | 3 | 
Hints: Firstly, before we can find median/quartiles etc we need a CUMULATIVE FREQUENCY TABLE and a CUMULATIVE FREQUENCY CURVE (remembering to show n = 48 on our curve) Americans call the MEDIAN the 50th percentile (we use: ½n, they use: ⁵⁰⁄₁₀₀n - same difference!) To find the 90th Percentile, start by working out ⁹⁰⁄₁₀₀n Then look up this value on the y-axis and read across to the x-axis
| Weight (kg) | 10-19 | 20-29 | 30-34 | 35-39 | 40-49 | 50-59 | |
| frequency | 7 | 21 | 28 | 32 | 23 | 9 | 
| U.C.B. | up to … | up to … | up to … | up to … | up to … | up to … | up to … | 
| F | 0 | 7 | 28 | ... | ... | ... | 120 | 
	
Hints:
 
This is NOT WRITTEN using CLASS BOUNDARIES:
 
 
                    these should
                    be the same!
                    ┌┬────────┬┐
                    ▼▼        ▼▼
 WEIGHT (kg) │   10-19        20-29        30-34        35-39        40-49        50-59
─────────────┼─────────────────────────────────────────────────────────────────────────
      f      │     7           21           28           32           23            9
 
The WEB LESSON showed you that, when that happens - you need to consider some values
between x = 19 and x = 20 and see which CLASS each value should be placed
                  ┌─────────────┐                          ┌─────────────┐
  WHICH CLASS:    │ 10 < x ≤ 19 │           OR             │ 20 < x ≤ 29 │
                 ┌┴────┬─┬─┬─┬──┘                        ┌─┴───┬─┬─┬──┬──┴─┐
            ┌────┘ ┌───┘ │ │ └───┐                  ┌────┘  ┌──┘ │ │  └──┐ └────┐
       ┌────┘ ┌────┘  ┌──┘ └──┐  └────┐       ┌─────┘ ┌─────┘ ┌──┘ └──┐  └────┐ └────┐ 
      19    19.1    19.2    19.3    19.4    19.5    19.6    19.7    19.8    19.9    20
                                           └─┬──┘
                                          ┌──┘
                   ┌──────────────────────┴────┐
                     THIS IS THE CUT-OFF VALUE
                   └────┬──┬───────────────────┘
So we change:           │  │
                     ┌──┘  └──┐
 WEIGHT (kg) │   ⋯-19.5     19.5-⋯         ⋯-⋯         ⋯-⋯         ⋯-⋯         ⋯-⋯
─────────────┼─────────────────────────────────────────────────────────────────────────
      f      │     7           21           28           32           23            9
 
 
Next, do the same to ever UPPER-LIMIT (i.e. add ½)
 
Then, do the same to all the LOWER-LIMITS (i.e. subtract ½)
 
There - now we've got it written using CLASS-BOUNDARIES, we can carry on as usual!