| Exam Board | OCR MEI |
|---|---|
| Module | S1 (Statistics 1) |
| Marks | 17 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Measures of Location and Spread |
| Type | Direct cumulative frequency graph reading |
| Difficulty | Easy -1.2 This is a straightforward cumulative frequency question requiring only direct reading from a graph (median at 300th value, quartiles at 150th and 450th values), basic outlier checking using 1.5×IQR rule, completing a frequency table by subtraction, and applying standard formulas for mean and linear transformations of standard deviation. All techniques are routine S1 procedures with no problem-solving or novel insight required. |
| Spec | 2.02g Calculate mean and standard deviation2.02j Clean data: missing data, errors |
| \(3.0 \leqslant t \leqslant 3.4\) | \(3.4 < t \leqslant 3.8\) | \(3.8 < t \leqslant 4.2\) | \(4.2 < t \leqslant 4.6\) | \(4.6 < t \leqslant 5.0\) | ||
| Frequency | 243 | 157 |
| Answer | Marks |
|---|---|
| Median = 4.06 – 4.075 (inclusive) | B1cao |
| \(Q_1\) = 3.8 | B1 for \(Q_1\) (cao) |
| \(Q_3\) = 4.3 | B1 for \(Q_3\) (cao) |
| Inter-quartile range = 4.3 – 3.8 = 0.5 | B1 ft for IQR must be using t-values not locations to earn this mark |
| Answer | Marks |
|---|---|
| Lower limit 'their 3.8' \(-\) 1.5 \(\times\) 'their 0.5' = (3.05) | B1ft: must have \(-\)1.5 |
| Upper limit 'their 4.3' \(+\) 1.5 \(\times\) 'their 0.5' = (5.05) | B1ft: must have \(+\)1.5 |
| Very few if any temperatures below 3.05 (but not zero) | E1ft dep on \(-\)1.5 and \(Q_1\) |
| None above 5.05 | E1ft dep on \(+\)1.5 and \(Q_3\) |
| 'So few, if any outliers' scores SC1 | Must be using t-values NOT locations to earn these 4 marks |
| Answer | Marks |
|---|---|
| Valid argument such as 'Probably not, because there is nothing to suggest that they are not genuine data items; (they do not appear to form a separate pool of data.)' | E1 |
| Answer | Marks |
|---|---|
| Missing frequencies 25, 125, 50 | B1, B1, B1 (all cao) |
| Answer | Marks |
|---|---|
| Mean = \(\frac{3.2 \times 25 + 3.6 \times 125 + 4.0 \times 243 + 4.4 \times 157 + 4.8 \times 50}{600}\) = \(\frac{2432.8}{600}\) = 4.05(47) | M1 for at least 4 midpoints correct and being used in attempt to find \(\sum\) ft |
| A1cao: awfw (4.05 – 4.055) ISW or rounding |
| Answer | Marks |
|---|---|
| New mean = 1.8 \(\times\) 'their 4.05(47)' \(+\) 32 = 39.29(84) to 39.3 | B1 FT |
| New \(s\) = 1.8 \(\times\) 0.379 = 0.682 | M1 for 1.8 \(\times\) 0.379 |
| A1 CAO awfw (0.68 – 0.6822) |
# Question 1
## (i)
Median = 4.06 – 4.075 (inclusive) | B1cao
$Q_1$ = 3.8 | B1 for $Q_1$ (cao)
$Q_3$ = 4.3 | B1 for $Q_3$ (cao)
Inter-quartile range = 4.3 – 3.8 = 0.5 | B1 ft for IQR must be using t-values not locations to earn this mark
## (ii)
Lower limit 'their 3.8' $-$ 1.5 $\times$ 'their 0.5' = (3.05) | B1ft: must have $-$1.5
Upper limit 'their 4.3' $+$ 1.5 $\times$ 'their 0.5' = (5.05) | B1ft: must have $+$1.5
Very few if any temperatures below 3.05 (but not zero) | E1ft dep on $-$1.5 and $Q_1$
None above 5.05 | E1ft dep on $+$1.5 and $Q_3$
'So few, if any outliers' scores SC1 | Must be using t-values NOT locations to earn these 4 marks
## (iii)
Valid argument such as 'Probably not, because there is nothing to suggest that they are not genuine data items; (they do not appear to form a separate pool of data.)' | E1
Accept: exclude outlier – 'measuring equipment was wrong' or 'there was a power cut' or ref to hot / cold day
[Allow suitable valid alternative arguments]
## (iv)
Missing frequencies 25, 125, 50 | B1, B1, B1 (all cao)
## (v)
Mean = $\frac{3.2 \times 25 + 3.6 \times 125 + 4.0 \times 243 + 4.4 \times 157 + 4.8 \times 50}{600}$ = $\frac{2432.8}{600}$ = 4.05(47) | M1 for at least 4 midpoints correct and being used in attempt to find $\sum$ ft
| A1cao: awfw (4.05 – 4.055) ISW or rounding
## (vi)
New mean = 1.8 $\times$ 'their 4.05(47)' $+$ 32 = 39.29(84) to 39.3 | B1 FT
New $s$ = 1.8 $\times$ 0.379 = 0.682 | M1 for 1.8 $\times$ 0.379
| A1 CAO awfw (0.68 – 0.6822)
1 The temperature of a supermarket fridge is regularly checked to ensure that it is working correctly. Over a period of three months the temperature (measured in degrees Celsius) is checked 600 times. These temperatures are displayed in the cumulative frequency diagram below.\\
\includegraphics[max width=\textwidth, alt={}, center]{c7cb0f6b-7b6b-4c52-8287-7efc6bd70247-1_1052_1647_549_289}\\
(i) Use the diagram to estimate the median and interquartile range of the data.\\
(ii) Use your answers to part (i) to show that there are very few, if any, outliers in the sample.\\
(iii) Suppose that an outlier is identified in these data. Discuss whether it should be excluded from any further analysis.\\
(iv) Copy and complete the frequency table below for these data.
\begin{center}
\begin{tabular}{ | l | l | l | c | c | c | }
\hline
\begin{tabular}{ l }
Temperature \\
$( t$ degrees Celsius $)$ \\
\end{tabular} & $3.0 \leqslant t \leqslant 3.4$ & $3.4 < t \leqslant 3.8$ & $3.8 < t \leqslant 4.2$ & $4.2 < t \leqslant 4.6$ & $4.6 < t \leqslant 5.0$ \\
\hline
Frequency & & & 243 & 157 & \\
\hline
\end{tabular}
\end{center}
(v) Use your table to calculate an estimate of the mean.\\
(vi) The standard deviation of the temperatures in degrees Celsius is 0.379 . The temperatures are converted from degrees Celsius into degrees Fahrenheit using the formula $F = 1.8 C + 32$. Hence estimate the mean and find the standard deviation of the temperatures in degrees Fahrenheit.
\hfill \mbox{\textit{OCR MEI S1 Q1 [17]}}