Edexcel S1 Specimen — Question 5 16 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
SessionSpecimen
Marks16
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicData representation
TypeDraw box plot from summary statistics
DifficultyEasy -1.2 This is a straightforward S1 statistics question testing basic understanding of quartiles, box plots, and outliers. Parts (a) and (b) require recall of standard definitions. Part (c) involves routine calculation of outlier boundaries and drawing a box plot with given summary statistics. Part (d) requires standard comparative statements about center and spread. No problem-solving or novel insight needed—purely procedural application of learned concepts.
Spec2.02f Measures of average and spread2.02i Select/critique data presentation

  1. Explain briefly the advantages and disadvantages of using the quartiles to summarise a set of data. [4]
  2. Describe the main features and uses of a box plot. [3]
The distances, in kilometres, travelled to school by the teachers in two schools, \(A\) and \(B\), in the same town were recorded. The data for School \(A\) are summarised in Diagram 1. \includegraphics{figure_1} For School \(B\), the least distance travelled was 3 km and the longest distance travelled was 55 km. The three quartiles were 17, 24 and 31 respectively. An outlier is an observation that falls either \(1.5 \times\) (interquartile range) above the upper quartile or \(1.5 \times\) (interquartile range) below the lower quartile.
  1. Draw a box plot for School \(B\). [5]
  2. Compare and contrast the two box plots. [4]

(a)
Advantages:
- Uses central 50% of the data
- Not affected by extreme values (outliers)
- Provide an alternative measure of spread to the variance/standard deviation, i.e. IQR/STQR
Disadvantages:
- Not always a simple calculation, e.g. interpolation for a grouped frequency distribution
- Different measures of calculation – no single argued method
- Does not use all the data directly
AnswerMarks Guidance
Any 4 sensible comments – at least one advantage and one disadvantageB1 B1 B1 B1 (4 marks)
(b)
- Indicates maximum/minimum observations and possible outliers
- Indicates relative positions of the quartiles
- Indicates skewness
- When plotted on the same scale enables comparisons of distributions
AnswerMarks Guidance
Any 4 sensible commentsB1 B1 B1 (3 marks)
(c)
AnswerMarks Guidance
\(Q_1 - 1.5(Q_3 - Q_1) = -4 \Rightarrow\) no outlier below lower quartileB1
\(Q_2 + 1.5(Q_3 - Q_1) = 52 \Rightarrow\) an outlier (55) above upper quartileB1
[Boxplot for School B shown with appropriate scale and label]B1
\(Q_1, Q_2, Q_3, 3, 52\)B1
\(55\)B1 (3 marks)
(d)
AnswerMarks Guidance
\(A: Q_3 - Q_2 = 10; Q_2 - Q_1 = 10 \Rightarrow\) symmetrical\(\begin{rcases} \text{both distributions} \\ \text{are symmetrical} \end{rcases}\)
\(B: Q_3 - Q_2 = 7; Q_2 - Q_1 = 7 \Rightarrow\) symmetrical
Median B (24) > Median A (22) \(\Rightarrow\) on average teachers in B travel slightly further to school than those in AB1 B1 B1 B1 (4 marks)
Range of B is greater than that of A
25% of teachers in A travel 12 km or less compared with 25% of teachers in B who travel 17 km or less
50% of teachers in A travel between 12 km and 32 km as compared with 17 km and 31 km for B
AnswerMarks Guidance
Any 4 sensible commentsB1 B1 B1 B1 (4 marks)
(16 marks)
## (a)
**Advantages:**
- Uses central 50% of the data
- Not affected by extreme values (outliers)
- Provide an alternative measure of spread to the variance/standard deviation, i.e. IQR/STQR

**Disadvantages:**
- Not always a simple calculation, e.g. interpolation for a grouped frequency distribution
- Different measures of calculation – no single argued method
- Does not use all the data directly

Any 4 sensible comments – at least one advantage and one disadvantage | B1 B1 B1 B1 | (4 marks)

## (b)
- Indicates maximum/minimum observations and possible outliers
- Indicates relative positions of the quartiles
- Indicates skewness
- When plotted on the same scale enables comparisons of distributions

Any 4 sensible comments | B1 B1 B1 | (3 marks)

## (c)
$Q_1 - 1.5(Q_3 - Q_1) = -4 \Rightarrow$ no outlier below lower quartile | B1 |

$Q_2 + 1.5(Q_3 - Q_1) = 52 \Rightarrow$ an outlier (55) above upper quartile | B1 |

[Boxplot for School B shown with appropriate scale and label] | B1 |

$Q_1, Q_2, Q_3, 3, 52$ | B1 |

$55$ | B1 | (3 marks)

## (d)
$A: Q_3 - Q_2 = 10; Q_2 - Q_1 = 10 \Rightarrow$ symmetrical | $\begin{rcases} \text{both distributions} \\ \text{are symmetrical} \end{rcases}$ |

$B: Q_3 - Q_2 = 7; Q_2 - Q_1 = 7 \Rightarrow$ symmetrical |

Median B (24) > Median A (22) $\Rightarrow$ on average teachers in B travel slightly further to school than those in A | B1 B1 B1 B1 | (4 marks)

Range of B is greater than that of A

25% of teachers in A travel 12 km or less compared with 25% of teachers in B who travel 17 km or less

50% of teachers in A travel between 12 km and 32 km as compared with 17 km and 31 km for B

Any 4 sensible comments | B1 B1 B1 B1 | (4 marks)

**(16 marks)**

---
\begin{enumerate}[label=(\alph*)]
\item Explain briefly the advantages and disadvantages of using the quartiles to summarise a set of data. [4]
\item Describe the main features and uses of a box plot. [3]
\end{enumerate}

The distances, in kilometres, travelled to school by the teachers in two schools, $A$ and $B$, in the same town were recorded. The data for School $A$ are summarised in Diagram 1.

\includegraphics{figure_1}

For School $B$, the least distance travelled was 3 km and the longest distance travelled was 55 km. The three quartiles were 17, 24 and 31 respectively.

An outlier is an observation that falls either $1.5 \times$ (interquartile range) above the upper quartile or $1.5 \times$ (interquartile range) below the lower quartile.

\begin{enumerate}[label=(\alph*)]
\setcounter{enumi}{2}
\item Draw a box plot for School $B$. [5]
\item Compare and contrast the two box plots. [4]
\end{enumerate}

\hfill \mbox{\textit{Edexcel S1  Q5 [16]}}