Edexcel S1 2019 January — Question 4 13 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Year2019
SessionJanuary
Marks13
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicData representation
TypeEstimate mean and standard deviation from frequency table
DifficultyModerate -0.8 This is a routine S1 statistics question testing standard procedures: histogram calculations using frequency density, linear interpolation for median, and mean/standard deviation from grouped data using given formulas. All parts follow textbook methods with no problem-solving or novel insight required, making it easier than average but not trivial due to the multiple parts and some computational work.
Spec2.02a Interpret single variable data: tables and diagrams2.02b Histogram: area represents frequency2.02f Measures of average and spread2.02g Calculate mean and standard deviation

4. A group of 100 adults recorded the amount of time, \(t\) minutes, they spent exercising each day. Their results are summarised in the table below.
Time (t minutes)Frequency (f)Time midpoint (x)
\(0 \leqslant t < 15\)257.5
\(15 \leqslant t < 30\)1722.5
\(30 \leqslant t < 60\)2845
\(60 \leqslant t < 120\)2490
\(120 \leqslant t \leqslant 240\)6180
[You may use \(\sum \mathrm { f } x ^ { 2 } = 455\) 512.5]
A histogram is drawn to represent these data.
The bar representing the time \(0 \leqslant t < 15\) has width 0.5 cm and height 6 cm .
  1. Calculate the width and height of the bar representing a time of \(60 \leqslant t < 120\)
  2. Use linear interpolation to estimate the median time spent exercising by these adults each day.
  3. Find an estimate of the mean time spent exercising by these adults each day.
  4. Calculate an estimate for the standard deviation of these times.
  5. Describe, giving a reason, the skewness of these data. Further analysis of the above data revealed that 18 of the 25 adults in the \(0 \leqslant t < 15\) group took no exercise each day.
  6. State, giving a reason, what effect, if any, this new information would have on your answers to
    1. the estimate of the median in part (b),
    2. the estimate of the mean in part (c),
    3. the estimate of the standard deviation in part (d).

Question 4:
Part (a):
AnswerMarks Guidance
Width: 15 minutes is 0.5 cm so 60 mins will be \(4 \times 0.5 = \mathbf{2}\) (cm)B1 For a width of 2 cm
Height: freq of 25 represented by \(6 \times 0.5 = 3\ (\text{cm}^2)\) so freq of 24 is \(\frac{24}{25} \times 3\)M1 For some calculation linking area and frequency
So height \(= \frac{1}{2} \times \frac{24}{25} \times 3 = \mathbf{1.44}\) (cm)A1 For 1.44 or exact equivalent e.g. \(\frac{36}{25}\)
(3)
Part (b):
AnswerMarks Guidance
\([Q_2 =]\ \{30\} + \dfrac{(50 - [25+17])}{28} \times 30\) or e.g. \(\dfrac{60-30}{70-42} = \dfrac{m - \{30\}}{50-42}\)M1 For \(+\frac{8}{28} \times 30\)
\(= 38.571...\ \ \ \text{awrt}\ \mathbf{38.6}\)A1 For awrt 38.6
(2)
Part (c):
AnswerMarks Guidance
Use of midpoints to get \(\sum fx = 5070\) (allow 5000 to 1 sf)M1
\([\bar{t}\ \text{or}\ \bar{x}] = \mathbf{50.7}\)A1 For 50.7 or exact equivalent e.g. \(\frac{5070}{100}\)
(2)
Part (d):
AnswerMarks Guidance
\([\sigma] = \sqrt{\dfrac{455512.5}{100} - \text{"50.7"}^2}\) or \(\sqrt{1984.635}\)M1 For correct expression including square root (ft their mean)
\(= 44.5492...\ \ \ \text{awrt}\ \mathbf{44.5}\)A1 For awrt 44.5
(2)
Part (e):
AnswerMarks Guidance
\(\bar{t}\) or \(\bar{x} > Q_2\) [allow "50.7" > "38.6" or formula] so positive (skew)B1 For positive skew plus a suitable correct reason
(1)
Part (f)(i):
AnswerMarks Guidance
Median: no change — since e.g. all 18 values are still below the medianB2/1/0 All 3 correct: B2; only 2 correct: B1
Part (f)(ii):
AnswerMarks
Mean: will be smaller — since e.g. changes reduce total of \(x\) (\(7 \times 7.5\) not \(25 \times 7.5\) in 1st class)B1
Part (f)(iii):
AnswerMarks Guidance
Standard deviation: will be greater — since e.g. 18 zeros means data more spread out (3) [13 marks]
# Question 4:

## Part (a):
Width: 15 minutes is 0.5 cm so 60 mins will be $4 \times 0.5 = \mathbf{2}$ (cm) | B1 | For a width of 2 cm
Height: freq of 25 represented by $6 \times 0.5 = 3\ (\text{cm}^2)$ so freq of 24 is $\frac{24}{25} \times 3$ | M1 | For some calculation linking area and frequency
So height $= \frac{1}{2} \times \frac{24}{25} \times 3 = \mathbf{1.44}$ (cm) | A1 | For 1.44 or exact equivalent e.g. $\frac{36}{25}$
| | (3)

## Part (b):
$[Q_2 =]\ \{30\} + \dfrac{(50 - [25+17])}{28} \times 30$ or e.g. $\dfrac{60-30}{70-42} = \dfrac{m - \{30\}}{50-42}$ | M1 | For $+\frac{8}{28} \times 30$
$= 38.571...\ \ \ \text{awrt}\ \mathbf{38.6}$ | A1 | For awrt 38.6
| | (2)

## Part (c):
Use of midpoints to get $\sum fx = 5070$ (allow 5000 to 1 sf) | M1 |
$[\bar{t}\ \text{or}\ \bar{x}] = \mathbf{50.7}$ | A1 | For 50.7 or exact equivalent e.g. $\frac{5070}{100}$
| | (2)

## Part (d):
$[\sigma] = \sqrt{\dfrac{455512.5}{100} - \text{"50.7"}^2}$ or $\sqrt{1984.635}$ | M1 | For correct expression including square root (ft their mean)
$= 44.5492...\ \ \ \text{awrt}\ \mathbf{44.5}$ | A1 | For awrt 44.5
| | (2)

## Part (e):
$\bar{t}$ or $\bar{x} > Q_2$ [allow "50.7" > "38.6" or formula] so **positive** (skew) | B1 | For positive skew plus a suitable correct reason
| | (1)

## Part (f)(i):
Median: **no change** — since e.g. all 18 values are still below the median | B2/1/0 | All 3 correct: B2; only 2 correct: B1

## Part (f)(ii):
Mean: will be **smaller** — since e.g. changes reduce total of $x$ ($7 \times 7.5$ not $25 \times 7.5$ in 1st class) | B1 |

## Part (f)(iii):
Standard deviation: will be **greater** — since e.g. 18 zeros means data more spread out | | (3) **[13 marks]**

---
4. A group of 100 adults recorded the amount of time, $t$ minutes, they spent exercising each day. Their results are summarised in the table below.

\begin{center}
\begin{tabular}{|l|l|l|}
\hline
Time (t minutes) & Frequency (f) & Time midpoint (x) \\
\hline
$0 \leqslant t < 15$ & 25 & 7.5 \\
\hline
$15 \leqslant t < 30$ & 17 & 22.5 \\
\hline
$30 \leqslant t < 60$ & 28 & 45 \\
\hline
$60 \leqslant t < 120$ & 24 & 90 \\
\hline
$120 \leqslant t \leqslant 240$ & 6 & 180 \\
\hline
\end{tabular}
\end{center}

[You may use $\sum \mathrm { f } x ^ { 2 } = 455$ 512.5]\\
A histogram is drawn to represent these data.\\
The bar representing the time $0 \leqslant t < 15$ has width 0.5 cm and height 6 cm .
\begin{enumerate}[label=(\alph*)]
\item Calculate the width and height of the bar representing a time of $60 \leqslant t < 120$
\item Use linear interpolation to estimate the median time spent exercising by these adults each day.
\item Find an estimate of the mean time spent exercising by these adults each day.
\item Calculate an estimate for the standard deviation of these times.
\item Describe, giving a reason, the skewness of these data.

Further analysis of the above data revealed that 18 of the 25 adults in the $0 \leqslant t < 15$ group took no exercise each day.
\item State, giving a reason, what effect, if any, this new information would have on your answers to
\begin{enumerate}[label=(\roman*)]
\item the estimate of the median in part (b),
\item the estimate of the mean in part (c),
\item the estimate of the standard deviation in part (d).
\end{enumerate}\end{enumerate}

\hfill \mbox{\textit{Edexcel S1 2019 Q4 [13]}}