| Exam Board | CAIE |
|---|---|
| Module | S1 (Statistics 1) |
| Year | 2021 |
| Session | March |
| Marks | 9 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Data representation |
| Type | Draw cumulative frequency graph from frequency table (unequal class widths) |
| Difficulty | Easy -1.3 This is a routine S1 statistics question requiring standard procedures: calculating cumulative frequencies, plotting points, and reading from a graph. Part (a) is mechanical data processing, part (b) is straightforward graph reading (finding the 70th percentile), and part (c) uses the standard formula for mean from grouped data with midpoints. No problem-solving insight or conceptual challenge is required—purely procedural application of well-practiced techniques. |
| Spec | 2.02a Interpret single variable data: tables and diagrams2.02f Measures of average and spread2.02g Calculate mean and standard deviation |
| Distance \(( \mathrm { km } )\) | \(0 - 4\) | \(5 - 10\) | \(11 - 20\) | \(21 - 30\) | \(31 - 40\) | \(41 - 60\) |
| Frequency | 12 | 16 | 32 | 66 | 20 | 4 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| Cumulative frequencies: 12, 28, 60, 126, 146, 150 | B1 | Correct cumulative frequencies seen (may be by table or plotted accurately on graph), condone 12 not stated |
| Axes labelled 'distance (or d) [in] km' from 0 to 60 and 'cumulative frequency' (or cf) from 0 to 150 | B1 | |
| At least 5 points plotted at upper end points for \(d\) | M1 | Allow upper boundary \(\pm0.5\), condone \(0-4\) interval inaccurate, no scale break on axis. Not bar graph/histogram unless clear indication of upper end point only of each bar |
| All plotted correctly at correct upper end points (4.5 etc.) | A1 | Both scales linear (\(0\leqslant d\leqslant60\), \(0\leqslant\text{cf}\leqslant150\)), curve drawn accurately joined to (0,0), cf line>150, no daylight if >150 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| \(70\%\) of \(150 = 105\) | M1 | 105 seen or implied by indication on grid |
| Approx. 27 | A1 FT | Strict FT *their* increasing cumulative frequency graph, use of graph must be seen. If no clear evidence of use of graph: SC B1 FT correct value from *their* increasing cumulative frequency graph |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| Midpoints: 2.25, 7.5, 15.5, 25.5, 35.5, 50.5 | B1 | At least 5 correct midpoints seen |
| \(\text{Mean} = \frac{2.25\times12+7.5\times16+15.5\times32+25.5\times66+35.5\times20+50.5\times4}{150} = \frac{27+120+496+1683+710+202}{150}\) | M1 | Using 6 midpoint attempts (e.g. \(2.25\pm0.5\)), condone one error not omission, multiplied by frequency, accept unevaluated, denominator either correct or *their* \(\Sigma\) frequencies |
| \(\left[=\frac{3238}{150}\right] = 21.6,\ 21\frac{44}{75}\) | A1 | Evaluated, WWW, accept \(21.5[866\ldots]\) |
## Question 5(a):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Cumulative frequencies: 12, 28, 60, 126, 146, 150 | B1 | Correct cumulative frequencies seen (may be by table or plotted accurately on graph), condone 12 not stated |
| Axes labelled 'distance (or d) [in] km' from 0 to 60 and 'cumulative frequency' (or cf) from 0 to 150 | B1 | |
| At least 5 points plotted at upper end points for $d$ | M1 | Allow upper boundary $\pm0.5$, condone $0-4$ interval inaccurate, no scale break on axis. Not bar graph/histogram unless clear indication of upper end point only of each bar |
| All plotted correctly at correct upper end points (4.5 etc.) | A1 | Both scales linear ($0\leqslant d\leqslant60$, $0\leqslant\text{cf}\leqslant150$), curve drawn accurately joined to (0,0), cf line>150, no daylight if >150 |
## Question 5(b):
| Answer | Marks | Guidance |
|--------|-------|----------|
| $70\%$ of $150 = 105$ | M1 | 105 seen or implied by indication on grid |
| Approx. 27 | A1 FT | Strict FT *their* increasing cumulative frequency graph, use of graph must be seen. If no clear evidence of use of graph: SC B1 FT correct value from *their* increasing cumulative frequency graph |
## Question 5(c):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Midpoints: 2.25, 7.5, 15.5, 25.5, 35.5, 50.5 | B1 | At least 5 correct midpoints seen |
| $\text{Mean} = \frac{2.25\times12+7.5\times16+15.5\times32+25.5\times66+35.5\times20+50.5\times4}{150} = \frac{27+120+496+1683+710+202}{150}$ | M1 | Using 6 midpoint attempts (e.g. $2.25\pm0.5$), condone one error not omission, multiplied by frequency, accept unevaluated, denominator either correct or *their* $\Sigma$ frequencies |
| $\left[=\frac{3238}{150}\right] = 21.6,\ 21\frac{44}{75}$ | A1 | Evaluated, WWW, accept $21.5[866\ldots]$ |
5 A driver records the distance travelled in each of 150 journeys. These distances, correct to the nearest km , are summarised in the following table.
\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | }
\hline
Distance $( \mathrm { km } )$ & $0 - 4$ & $5 - 10$ & $11 - 20$ & $21 - 30$ & $31 - 40$ & $41 - 60$ \\
\hline
Frequency & 12 & 16 & 32 & 66 & 20 & 4 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item Draw a cumulative frequency graph to illustrate the data.\\
\includegraphics[max width=\textwidth, alt={}, center]{3f05dc2a-b466-40bc-9f5f-0fd2bff120c8-06_1593_1397_852_415}
\item For 30\% of these journeys the distance travelled is $d \mathrm {~km}$ or more.
Use your graph to estimate the value of $d$.
\item Calculate an estimate of the mean distance travelled for the 150 journeys.
\end{enumerate}
\hfill \mbox{\textit{CAIE S1 2021 Q5 [9]}}