Edexcel S1 2001 January — Question 5 17 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Year2001
SessionJanuary
Marks17
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicData representation
TypeDraw histogram then find median/quartiles from cumulative frequency
DifficultyModerate -0.3 This is a standard S1 statistics question covering routine histogram construction, mean/standard deviation calculation, and basic interpretation. While it has multiple parts, each part follows textbook procedures with no novel problem-solving required. The interpolation for median and skewness coefficient are standard A-level techniques, making this slightly easier than average overall.
Spec2.02b Histogram: area represents frequency2.02f Measures of average and spread2.02g Calculate mean and standard deviation2.02h Recognize outliers

5. The following grouped frequency distribution summarises the number of minutes, to the nearest minute, that a random sample of 200 motorists were delayed by roadworks on a stretch of motorway.
Delay (mins)Number of motorists
\(4 - 6\)15
\(7 - 8\)28
949
1053
\(11 - 12\)30
\(13 - 15\)15
\(16 - 20\)10
  1. Using graph paper represent these data by a histogram.
  2. Give a reason to justify the use of a histogram to represent these data.
  3. Use interpolation to estimate the median of this distribution.
  4. Calculate an estimate of the mean and an estimate of the standard deviation of these data. One coefficient of skewness is given by $$\frac { 3 ( \text { mean - median } ) } { \text { standard deviation } } .$$
  5. Evaluate this coefficient for the above data.
  6. Explain why the normal distribution may not be suitable to model the number of minutes that motorists are delayed by these roadworks.

Question 5:
Part (a)
AnswerMarks Guidance
AnswerMarks Guidance
Histogram with frequency densities: \(5, 14, 49, 53, 15, 5, 2\)B4 (4) B1 scales & labels; M1 histogram shape; A1 shape; A1 accuracy
Part (b)
AnswerMarks Guidance
AnswerMarks Guidance
The variable (minutes delayed) is continuousB1 (1)
Part (c)
AnswerMarks Guidance
AnswerMarks Guidance
Median \(= 9.5 + \dfrac{(100-92)}{53}\times1 = 9.65\) (or \(9.66\))M1 A1 (2) M1 must use \(100.5\); accept \(9.65\) or \(9.66\)
Part (d)
AnswerMarks Guidance
AnswerMarks Guidance
\(\sum fx = 1991\), \(\sum fx^2 = 21366.5\) (using midpoints)M1 M1 M1 for \(\sum fx\); M1 for \(\sum fx^2\) using midpoints
Mean \(= \dfrac{1991}{200} = 9.955\) or \(9.96\)M1 A1
\(s = \sqrt{\dfrac{21366.5}{200} - \left(\dfrac{1991}{200}\right)^2} = 2.78\) or \(2\text{ min }67\text{ sec}\)M1 A1 (6) Accept \(2.78\); (\(S_{n-1} = 2.79\))
Part (e)
AnswerMarks Guidance
AnswerMarks Guidance
\(\dfrac{3(9.955 - 9.65)}{2.78} = 0.329\)M1 A1 (2) Answer approximately \(0.3\)
Part (f)
AnswerMarks Guidance
AnswerMarks Guidance
For normal distribution skewness is zero. In this case skewness is \(0.329\), so normal may not be suitableB1, B1 (2)
## Question 5:

**Part (a)**
| Answer | Marks | Guidance |
|--------|-------|----------|
| Histogram with frequency densities: $5, 14, 49, 53, 15, 5, 2$ | B4 (4) | B1 scales & labels; M1 histogram shape; A1 shape; A1 accuracy |

**Part (b)**
| Answer | Marks | Guidance |
|--------|-------|----------|
| The variable (minutes delayed) is continuous | B1 (1) | |

**Part (c)**
| Answer | Marks | Guidance |
|--------|-------|----------|
| Median $= 9.5 + \dfrac{(100-92)}{53}\times1 = 9.65$ (or $9.66$) | M1 A1 (2) | M1 must use $100.5$; accept $9.65$ or $9.66$ |

**Part (d)**
| Answer | Marks | Guidance |
|--------|-------|----------|
| $\sum fx = 1991$, $\sum fx^2 = 21366.5$ (using midpoints) | M1 M1 | M1 for $\sum fx$; M1 for $\sum fx^2$ using midpoints |
| Mean $= \dfrac{1991}{200} = 9.955$ or $9.96$ | M1 A1 | |
| $s = \sqrt{\dfrac{21366.5}{200} - \left(\dfrac{1991}{200}\right)^2} = 2.78$ or $2\text{ min }67\text{ sec}$ | M1 A1 (6) | Accept $2.78$; ($S_{n-1} = 2.79$) |

**Part (e)**
| Answer | Marks | Guidance |
|--------|-------|----------|
| $\dfrac{3(9.955 - 9.65)}{2.78} = 0.329$ | M1 A1 (2) | Answer approximately $0.3$ |

**Part (f)**
| Answer | Marks | Guidance |
|--------|-------|----------|
| For normal distribution skewness is zero. In this case skewness is $0.329$, so normal may not be suitable | B1, B1 (2) | |

---
5. The following grouped frequency distribution summarises the number of minutes, to the nearest minute, that a random sample of 200 motorists were delayed by roadworks on a stretch of motorway.

\begin{center}
\begin{tabular}{ | c | c | }
\hline
Delay (mins) & Number of motorists \\
\hline
$4 - 6$ & 15 \\
\hline
$7 - 8$ & 28 \\
\hline
9 & 49 \\
\hline
10 & 53 \\
\hline
$11 - 12$ & 30 \\
\hline
$13 - 15$ & 15 \\
\hline
$16 - 20$ & 10 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item Using graph paper represent these data by a histogram.
\item Give a reason to justify the use of a histogram to represent these data.
\item Use interpolation to estimate the median of this distribution.
\item Calculate an estimate of the mean and an estimate of the standard deviation of these data.

One coefficient of skewness is given by

$$\frac { 3 ( \text { mean - median } ) } { \text { standard deviation } } .$$
\item Evaluate this coefficient for the above data.
\item Explain why the normal distribution may not be suitable to model the number of minutes that motorists are delayed by these roadworks.
\end{enumerate}

\hfill \mbox{\textit{Edexcel S1 2001 Q5 [17]}}