| Exam Board | Edexcel |
|---|---|
| Module | S1 (Statistics 1) |
| Year | 2019 |
| Session | January |
| Marks | 13 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Data representation |
| Type | Estimate mean and standard deviation from frequency table |
| Difficulty | Moderate -0.8 This is a routine S1 statistics question testing standard procedures: histogram calculations using frequency density, linear interpolation for median, and mean/standard deviation from grouped data using given formulas. All parts follow textbook methods with no problem-solving or novel insight required, making it easier than average but not trivial due to the multiple parts and some computational work. |
| Spec | 2.02a Interpret single variable data: tables and diagrams2.02b Histogram: area represents frequency2.02f Measures of average and spread2.02g Calculate mean and standard deviation |
| Time (t minutes) | Frequency (f) | Time midpoint (x) |
| \(0 \leqslant t < 15\) | 25 | 7.5 |
| \(15 \leqslant t < 30\) | 17 | 22.5 |
| \(30 \leqslant t < 60\) | 28 | 45 |
| \(60 \leqslant t < 120\) | 24 | 90 |
| \(120 \leqslant t \leqslant 240\) | 6 | 180 |
| Answer | Marks | Guidance |
|---|---|---|
| Width: 15 minutes is 0.5 cm so 60 mins will be \(4 \times 0.5 = \mathbf{2}\) (cm) | B1 | For a width of 2 cm |
| Height: freq of 25 represented by \(6 \times 0.5 = 3\ (\text{cm}^2)\) so freq of 24 is \(\frac{24}{25} \times 3\) | M1 | For some calculation linking area and frequency |
| So height \(= \frac{1}{2} \times \frac{24}{25} \times 3 = \mathbf{1.44}\) (cm) | A1 | For 1.44 or exact equivalent e.g. \(\frac{36}{25}\) |
| (3) |
| Answer | Marks | Guidance |
|---|---|---|
| \([Q_2 =]\ \{30\} + \dfrac{(50 - [25+17])}{28} \times 30\) or e.g. \(\dfrac{60-30}{70-42} = \dfrac{m - \{30\}}{50-42}\) | M1 | For \(+\frac{8}{28} \times 30\) |
| \(= 38.571...\ \ \ \text{awrt}\ \mathbf{38.6}\) | A1 | For awrt 38.6 |
| (2) |
| Answer | Marks | Guidance |
|---|---|---|
| Use of midpoints to get \(\sum fx = 5070\) (allow 5000 to 1 sf) | M1 | |
| \([\bar{t}\ \text{or}\ \bar{x}] = \mathbf{50.7}\) | A1 | For 50.7 or exact equivalent e.g. \(\frac{5070}{100}\) |
| (2) |
| Answer | Marks | Guidance |
|---|---|---|
| \([\sigma] = \sqrt{\dfrac{455512.5}{100} - \text{"50.7"}^2}\) or \(\sqrt{1984.635}\) | M1 | For correct expression including square root (ft their mean) |
| \(= 44.5492...\ \ \ \text{awrt}\ \mathbf{44.5}\) | A1 | For awrt 44.5 |
| (2) |
| Answer | Marks | Guidance |
|---|---|---|
| \(\bar{t}\) or \(\bar{x} > Q_2\) [allow "50.7" > "38.6" or formula] so positive (skew) | B1 | For positive skew plus a suitable correct reason |
| (1) |
| Answer | Marks | Guidance |
|---|---|---|
| Median: no change — since e.g. all 18 values are still below the median | B2/1/0 | All 3 correct: B2; only 2 correct: B1 |
| Answer | Marks |
|---|---|
| Mean: will be smaller — since e.g. changes reduce total of \(x\) (\(7 \times 7.5\) not \(25 \times 7.5\) in 1st class) | B1 |
| Answer | Marks | Guidance |
|---|---|---|
| Standard deviation: will be greater — since e.g. 18 zeros means data more spread out | (3) [13 marks] |
# Question 4:
## Part (a):
Width: 15 minutes is 0.5 cm so 60 mins will be $4 \times 0.5 = \mathbf{2}$ (cm) | B1 | For a width of 2 cm
Height: freq of 25 represented by $6 \times 0.5 = 3\ (\text{cm}^2)$ so freq of 24 is $\frac{24}{25} \times 3$ | M1 | For some calculation linking area and frequency
So height $= \frac{1}{2} \times \frac{24}{25} \times 3 = \mathbf{1.44}$ (cm) | A1 | For 1.44 or exact equivalent e.g. $\frac{36}{25}$
| | (3)
## Part (b):
$[Q_2 =]\ \{30\} + \dfrac{(50 - [25+17])}{28} \times 30$ or e.g. $\dfrac{60-30}{70-42} = \dfrac{m - \{30\}}{50-42}$ | M1 | For $+\frac{8}{28} \times 30$
$= 38.571...\ \ \ \text{awrt}\ \mathbf{38.6}$ | A1 | For awrt 38.6
| | (2)
## Part (c):
Use of midpoints to get $\sum fx = 5070$ (allow 5000 to 1 sf) | M1 |
$[\bar{t}\ \text{or}\ \bar{x}] = \mathbf{50.7}$ | A1 | For 50.7 or exact equivalent e.g. $\frac{5070}{100}$
| | (2)
## Part (d):
$[\sigma] = \sqrt{\dfrac{455512.5}{100} - \text{"50.7"}^2}$ or $\sqrt{1984.635}$ | M1 | For correct expression including square root (ft their mean)
$= 44.5492...\ \ \ \text{awrt}\ \mathbf{44.5}$ | A1 | For awrt 44.5
| | (2)
## Part (e):
$\bar{t}$ or $\bar{x} > Q_2$ [allow "50.7" > "38.6" or formula] so **positive** (skew) | B1 | For positive skew plus a suitable correct reason
| | (1)
## Part (f)(i):
Median: **no change** — since e.g. all 18 values are still below the median | B2/1/0 | All 3 correct: B2; only 2 correct: B1
## Part (f)(ii):
Mean: will be **smaller** — since e.g. changes reduce total of $x$ ($7 \times 7.5$ not $25 \times 7.5$ in 1st class) | B1 |
## Part (f)(iii):
Standard deviation: will be **greater** — since e.g. 18 zeros means data more spread out | | (3) **[13 marks]**
---
4. A group of 100 adults recorded the amount of time, $t$ minutes, they spent exercising each day. Their results are summarised in the table below.
\begin{center}
\begin{tabular}{|l|l|l|}
\hline
Time (t minutes) & Frequency (f) & Time midpoint (x) \\
\hline
$0 \leqslant t < 15$ & 25 & 7.5 \\
\hline
$15 \leqslant t < 30$ & 17 & 22.5 \\
\hline
$30 \leqslant t < 60$ & 28 & 45 \\
\hline
$60 \leqslant t < 120$ & 24 & 90 \\
\hline
$120 \leqslant t \leqslant 240$ & 6 & 180 \\
\hline
\end{tabular}
\end{center}
[You may use $\sum \mathrm { f } x ^ { 2 } = 455$ 512.5]\\
A histogram is drawn to represent these data.\\
The bar representing the time $0 \leqslant t < 15$ has width 0.5 cm and height 6 cm .
\begin{enumerate}[label=(\alph*)]
\item Calculate the width and height of the bar representing a time of $60 \leqslant t < 120$
\item Use linear interpolation to estimate the median time spent exercising by these adults each day.
\item Find an estimate of the mean time spent exercising by these adults each day.
\item Calculate an estimate for the standard deviation of these times.
\item Describe, giving a reason, the skewness of these data.
Further analysis of the above data revealed that 18 of the 25 adults in the $0 \leqslant t < 15$ group took no exercise each day.
\item State, giving a reason, what effect, if any, this new information would have on your answers to
\begin{enumerate}[label=(\roman*)]
\item the estimate of the median in part (b),
\item the estimate of the mean in part (c),
\item the estimate of the standard deviation in part (d).
\end{enumerate}\end{enumerate}
\hfill \mbox{\textit{Edexcel S1 2019 Q4 [13]}}