Use linear interpolation for median or quartiles

A question is this type if and only if it asks the student to estimate the median, quartiles, or percentiles from grouped data using interpolation within a class interval.

9 questions

OCR MEI S1 Q1
1 The heights \(x \mathrm {~cm}\) of 100 boys in Year 7 at a school are summarised in the table below.
Height\(125 \leqslant x \leqslant 140\)\(140 < x \leqslant 145\)\(145 < x \leqslant 150\)\(150 < x \leqslant 160\)\(160 < x \leqslant 170\)
Frequency252924184
  1. Estimate the number of boys who have heights of at least 155 cm .
  2. Calculate an estimate of the median height of the 100 boys.
  3. Draw a histogram to illustrate the data. The histogram below shows the heights of 100 girls in Year 7 at the same school.
    \includegraphics[max width=\textwidth, alt={}, center]{ab4d5ab1-e3b7-495f-9142-d37df7e712de-1_868_1361_1015_381}
  4. How many more girls than boys had heights exceeding 160 cm ?
  5. Calculate an estimate of the mean height of the 100 girls.
OCR MEI S1 2013 January Q6
6 The heights \(x \mathrm {~cm}\) of 100 boys in Year 7 at a school are summarised in the table below.
Height\(125 \leqslant x \leqslant 140\)\(140 < x \leqslant 145\)\(145 < x \leqslant 150\)\(150 < x \leqslant 160\)\(160 < x \leqslant 170\)
Frequency252924184
  1. Estimate the number of boys who have heights of at least 155 cm .
  2. Calculate an estimate of the median height of the 100 boys.
  3. Draw a histogram to illustrate the data. The histogram below shows the heights of 100 girls in Year 7 at the same school.
    \includegraphics[max width=\textwidth, alt={}, center]{76283206-687f-45d6-9204-952d60843cf1-3_865_1349_1297_349}
  4. How many more girls than boys had heights exceeding 160 cm ?
  5. Calculate an estimate of the mean height of the 100 girls.
OCR S1 2013 January Q6
6 The masses, \(x\) grams, of 800 apples are summarised in the histogram.
\includegraphics[max width=\textwidth, alt={}, center]{13d8d940-fd63-4b62-bd7a-aa7174f6af4b-4_592_1363_957_351}
  1. On the frequency density axis, 1 cm represents \(a\) units. Find the value of \(a\).
  2. Find an estimate of the median mass of the apples.
  3. Two judges rank \(n\) competitors, where \(n\) is an even number. Judge 2 reverses each consecutive pair of ranks given by Judge 1, as shown.
    Competitor\(C _ { 1 }\)\(C _ { 2 }\)\(C _ { 3 }\)\(C _ { 4 }\)\(C _ { 5 }\)\(C _ { 6 }\)\(\ldots \ldots\)\(C _ { n - 1 }\)\(C _ { n }\)
    Judge 1 rank123456\(\ldots \ldots\)\(n - 1\)\(n\)
    Judge 2 rank214365\(\ldots \ldots\)\(n\)\(n - 1\)
    Given that the value of Spearman's coefficient of rank correlation is \(\frac { 63 } { 65 }\), find \(n\).
  4. An experiment produced some data from a bivariate distribution. The product moment correlation coefficient is denoted by \(r\), and Spearman's rank correlation coefficient is denoted by \(r _ { s }\).
    (a) Explain whether the statement $$r = 1 \Rightarrow r _ { s } = 1$$ is true or false.
    (b) Use a diagram to explain whether the statement $$r \neq 1 \Rightarrow r _ { s } \neq 1$$ is true or false. 8 Sandra makes repeated, independent attempts to hit a target. On each attempt, the probability that she succeeds is 0.1 .
  5. Find the probability that
    (a) the first time she succeeds is on her 5th attempt,
    (b) the first time she succeeds is after her 5th attempt,
    (c) the second time she succeeds is before her 4th attempt. Jill also makes repeated attempts to hit the target. Each attempt of either Jill or Sandra is independent. Each time that Jill attempts to hit the target, the probability that she succeeds is 0.2 . Sandra and Jill take turns attempting to hit the target, with Sandra going first.
  6. Find the probability that the first person to hit the target is Sandra, on her
    (a) 2nd attempt,
    (b) 10th attempt. \href{http://physicsandmathstutor.com}{physicsandmathstutor.com}
Edexcel AS Paper 2 2022 June Q3
  1. The histogram summarises the heights of 256 seedlings two weeks after they were planted.
    \includegraphics[max width=\textwidth, alt={}, center]{08e3b0b0-2155-4b37-83e3-343c317ca10c-06_1242_1810_287_132}
    1. Use linear interpolation to estimate the median height of the seedlings.
      (4)
    Chris decides to model the frequency density for these 256 seedlings by a curve with equation $$y = k x ( 8 - x ) \quad 0 \leqslant x \leqslant 8$$ where \(k\) is a constant.
  2. Find the value of \(k\) Using this model,
  3. write down the median height of the seedlings.
Edexcel S1 2009 June Q4
4. A researcher measured the foot lengths of a random sample of 120 ten-year-old children. The lengths are summarised in the table below.
Foot length, \(l\), (cm)Number of children
\(10 \leqslant l < 12\)5
\(12 \leqslant l < 17\)53
\(17 \leqslant l < 19\)29
\(19 \leqslant l < 21\)15
\(21 \leqslant l < 23\)11
\(23 \leqslant l < 25\)7
  1. Use interpolation to estimate the median of this distribution.
  2. Calculate estimates for the mean and the standard deviation of these data. One measure of skewness is given by $$\text { Coefficient of skewness } = \frac { 3 ( \text { mean } - \text { median } ) } { \text { standard deviation } }$$
  3. Evaluate this coefficient and comment on the skewness of these data. Greg suggests that a normal distribution is a suitable model for the foot lengths of ten-year-old children.
  4. Using the value found in part (c), comment on Greg's suggestion, giving a reason for your answer.
Edexcel S1 Q3
3. The frequency distribution for the lengths of 108 fish in an aquarium is given by the following table. The lengths of the fish ranged from 5 cm to 90 cm .
Length \(( \mathrm { cm } )\)\(5 - 10\)\(10 - 20\)\(20 - 25\)\(25 - 30\)\(30 - 40\)\(40 - 60\)\(60 - 90\)
Frequency8162018201412
  1. Calculate estimates of the three quartiles of the distribution.
  2. On graph paper, draw a box and whisker plot of the data.
  3. Hence describe the skewness of the distribution.
  4. If the data were represented by a histogram, what would be the ratio of the heights of the shortest and highest bars?
Edexcel S1 Q7
7. The following table gives the weights, in grams, of 60 items delivered to a company in a day.
Weight (g)\(0 - 10\)\(10 - 20\)\(20 - 30\)\(30 - 40\)\(40 - 50\)\(50 - 60\)\(60 - 80\)
No. of items2111812962
  1. Use interpolation to calculate estimated values of (i) the median weight,
    (ii) the interquartile range,
    (iii) the thirty-third percentile. Outliers are defined to be outside the range from \(2 \cdot 5 Q _ { 1 } - 1 \cdot 5 Q _ { 3 }\) to \(2 \cdot 5 Q _ { 3 } - 1 \cdot 5 Q _ { 1 }\).
  2. Given that the lightest item weighed 3 g and the two heaviest weighed 65 g and 79 g , draw on graph paper an accurate box-and-whisker plot of the data. Indicate any outliers clearly.
  3. Describe the skewness of the distribution. The mean weight was 32.0 g and the standard deviation of the weights was 14.9 g .
  4. State, with a reason, whether you would choose to summarise the data by using the mean and standard deviation or the median and interquartile range. On another day, items were delivered whose weights ranged from 14 g to 58 g ; the median was 32 g , the lower quartile was 24 g and the interquartile range was 26 g .
  5. Draw a further box plot for these data on the same diagram. Briefly compare the two sets of data using your plots.
    ( 6 marks)
Edexcel S1 Q4
4. The ages of 300 houses in a village are recorded giving the following table of results.
Age (years)Number of houses
0 -36
20 -92
40 -74
60 -39
100 -14
200 -27
300-50018
Use linear interpolation to estimate for these data
  1. the median,
  2. the limits between which the middle \(80 \%\) of the ages lie. An estimate of the mean of these data is calculated to be 86.6 years.
  3. Explain why the mean and median are so different and hence say which you consider best represents the data.
Edexcel S1 Q7
7. A cyber-cafe recorded how long each user stayed during one day giving the following results.
Length of stay
(minutes)
\(0 -\)\(30 -\)\(60 -\)\(90 -\)\(120 -\)\(240 -\)\(360 -\)
Number of users153132231720
  1. Use linear interpolation to estimate the median and quartiles of these data. The results of a previous study had led to the suggestion that the length of time each user stays can be modelled by a normal distribution with a mean of 72 minutes and a standard deviation of 48 minutes.
  2. Find the median and quartiles that this model would predict.
  3. Comment on the suitability of the suggested model in the light of the new results.