| Exam Board | Edexcel |
|---|---|
| Module | S1 (Statistics 1) |
| Year | 2013 |
| Session | June |
| Marks | 14 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Data representation |
| Type | Estimate mean and standard deviation from frequency table |
| Difficulty | Moderate -0.8 This is a routine S1 statistics question testing standard procedures: calculating mean/SD from grouped data (with Σft² provided), linear interpolation for median/quartiles, and understanding the effect of linear transformations. All techniques are straightforward textbook exercises requiring no problem-solving insight, though the multi-part nature and coding of data adds minor complexity. |
| Spec | 2.02a Interpret single variable data: tables and diagrams2.02f Measures of average and spread2.02g Calculate mean and standard deviation2.02h Recognize outliers2.02i Select/critique data presentation2.02j Clean data: missing data, errors |
| Time (minutes) \(t\) | \(11 - 20\) | \(21 - 25\) | \(26 - 30\) | \(31 - 35\) | \(36 - 45\) | \(46 - 60\) |
| Number of students f | 62 | 88 | 16 | 13 | 11 | 10 |
| Time (minutes) \(t\) | \(6 - 15\) | \(16 - 20\) | \(21 - 25\) | \(26 - 30\) | \(31 - 40\) | \(41 - 55\) |
| Number of students f | 62 | 88 | 16 | 13 | 11 | 10 |
| Answer | Marks | Guidance |
|---|---|---|
| \(\sum ft = 4837.5\) (allow 4838 or 4840) | B1 | For 4837.5 or 4838 or 4840 seen |
| \(\text{Mean} = \frac{4837.5}{200} = 24.1875\), awrt \(\mathbf{24.2}\) or \(\frac{387}{16}\) | M1 A1 | For attempt at \(\frac{\sum ft}{\sum f}\); allow 1sf so \(\sum f =\) awrt 200 and \(\sum ft =\) awrt 5000 |
| \(\sigma = \sqrt{\frac{134281.25}{200} - \left(\frac{4837.5}{200}\right)^2} = 9.293\ldots\), accept \(s = 9.32\), awrt \(\mathbf{9.29}\) | M1 A1 | For correct expression including square root; allow transcription error in 134281.25 but not incorrect re-calculation |
| Answer | Marks | Guidance |
|---|---|---|
| \(Q_2 = [20.5] + \frac{(100/100.5 - 62)}{88} \times 5 = 22.659\ldots\), awrt \(\mathbf{22.7}\) | M1 A1 | For a correct fraction \(\times 5\); ignore end point but must be \(+\); allow use of \((n+1)\) giving \(100.5\ldots\) |
| Answer | Marks | Guidance |
|---|---|---|
| \(Q_1 = 10.5 + \frac{(50/50.25)}{62} \times 10 [= 18.56]\), \((n+1)\) gives \(18.604\ldots\) | B1 cso | For fully correct expression including end point; allow use of \((n+1)\) giving \(50.25\ldots\) but use of 50.5 scores B0 |
| Answer | Marks | Guidance |
|---|---|---|
| \(Q_3 = 25.5\) (use of \(n+1\) gives \(25.734\ldots\)) | B1 | For 25.5 (or awrt 25.7 using \(n+1\)) |
| \(IQR = 6.9\) (use of \(n+1\) gives 7.1) | B1 ft | For their \(Q_3\) - their \(Q_1\) (or 18.6) provided \(> 0\); accept awrt 2sf |
| Answer | Marks | Guidance |
|---|---|---|
| The data is skewed (condone "negative skew") | B1 | Must mention data is skewed or not symmetrical; do not award for "outliers" |
| Answer | Marks | Guidance |
|---|---|---|
| Mean decreases and st. dev. remains the same [Must mention mean and st. dev.] | B1 | For one correct comment |
| The median and quartiles would decrease [Must refer to median and at least \(Q_1\)] | B1 | For two correct comments |
| The IQR would remain unchanged | B1 | For all 3 correct comments |
# Question 4:
## Part (a)
| $\sum ft = 4837.5$ (allow 4838 or 4840) | B1 | For 4837.5 or 4838 or 4840 seen |
| $\text{Mean} = \frac{4837.5}{200} = 24.1875$, awrt $\mathbf{24.2}$ or $\frac{387}{16}$ | M1 A1 | For attempt at $\frac{\sum ft}{\sum f}$; allow 1sf so $\sum f =$ awrt 200 and $\sum ft =$ awrt 5000 |
| $\sigma = \sqrt{\frac{134281.25}{200} - \left(\frac{4837.5}{200}\right)^2} = 9.293\ldots$, accept $s = 9.32$, awrt $\mathbf{9.29}$ | M1 A1 | For correct expression including square root; allow transcription error in 134281.25 but not incorrect re-calculation |
## Part (b)
| $Q_2 = [20.5] + \frac{(100/100.5 - 62)}{88} \times 5 = 22.659\ldots$, awrt $\mathbf{22.7}$ | M1 A1 | For a correct fraction $\times 5$; ignore end point but must be $+$; allow use of $(n+1)$ giving $100.5\ldots$ |
## Part (c)
| $Q_1 = 10.5 + \frac{(50/50.25)}{62} \times 10 [= 18.56]$, $(n+1)$ gives $18.604\ldots$ | B1 cso | For fully correct expression including end point; allow use of $(n+1)$ giving $50.25\ldots$ but use of 50.5 scores B0 |
## Part (d)
| $Q_3 = 25.5$ (use of $n+1$ gives $25.734\ldots$) | B1 | For 25.5 (or awrt 25.7 using $n+1$) |
| $IQR = 6.9$ (use of $n+1$ gives 7.1) | B1 ft | For their $Q_3$ - their $Q_1$ (or 18.6) provided $> 0$; accept awrt 2sf |
## Part (e)
| The data is skewed (condone "negative skew") | B1 | Must mention data is skewed or not symmetrical; do not award for "outliers" |
## Part (f)
| Mean decreases and st. dev. remains the same [Must mention mean and st. dev.] | B1 | For one correct comment |
| The median and quartiles would decrease [Must refer to median and at least $Q_1$] | B1 | For two correct comments |
| The IQR would remain unchanged | B1 | For all 3 correct comments |
---
4. The following table summarises the times, $t$ minutes to the nearest minute, recorded for a group of students to complete an exam.
\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | }
\hline
Time (minutes) $t$ & $11 - 20$ & $21 - 25$ & $26 - 30$ & $31 - 35$ & $36 - 45$ & $46 - 60$ \\
\hline
Number of students f & 62 & 88 & 16 & 13 & 11 & 10 \\
\hline
\end{tabular}
\end{center}
$$\text { [You may use } \sum \mathrm { f } t ^ { 2 } = 134281.25 \text { ] }$$
\begin{enumerate}[label=(\alph*)]
\item Estimate the mean and standard deviation of these data.
\item Use linear interpolation to estimate the value of the median.
\item Show that the estimated value of the lower quartile is 18.6 to 3 significant figures.
\item Estimate the interquartile range of this distribution.
\item Give a reason why the mean and standard deviation are not the most appropriate summary statistics to use with these data.
The person timing the exam made an error and each student actually took 5 minutes less than the times recorded above. The table below summarises the actual times.
\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | }
\hline
Time (minutes) $t$ & $6 - 15$ & $16 - 20$ & $21 - 25$ & $26 - 30$ & $31 - 40$ & $41 - 55$ \\
\hline
Number of students f & 62 & 88 & 16 & 13 & 11 & 10 \\
\hline
\end{tabular}
\end{center}
\item Without further calculations, explain the effect this would have on each of the estimates found in parts (a), (b), (c) and (d).
\end{enumerate}
\hfill \mbox{\textit{Edexcel S1 2013 Q4 [14]}}