Edexcel Paper 3 2018 June — Question 4 13 marks

Exam Board: Edexcel
Module: Paper 3
Year: 2018
Session: June
Marks: 13
Topic: Data representation
Type: Identify and compare sampling techniques
Difficulty: Easy (-1.3). This is a straightforward statistics question testing basic terminology (sampling methods, census) and routine calculations (IQR, mean, standard deviation). Parts (a)-(c) require simple recall of definitions, (d)-(e) are standard formula applications, (f) is a textbook comparison of measures, and (g) requires basic understanding of box plots. No problem-solving or novel insight is needed; the question is purely procedural, with multiple easy marks available.
Spec: 2.01a Population and sample: terminology; 2.01c Sampling techniques: simple random, opportunity, etc.; 2.02f Measures of average and spread; 2.02g Calculate mean and standard deviation; 2.02h Recognize outliers

  1. Charlie is studying the time it takes members of his company to travel to the office. He stands by the door to the office from 0840 to 0850 one morning and asks workers, as they arrive, how long their journey was.
    (a) State the sampling method Charlie used.
    (b) State and briefly describe an alternative method of non-random sampling Charlie could have used to obtain a sample of 40 workers.
    Taruni decided to ask every member of the company the time, \(x\) minutes, it takes them to travel to the office.
    (c) State the data selection process Taruni used.
    Taruni's results are summarised by the box plot and summary statistics below.
    \includegraphics[max width=\textwidth, alt={}, center]{65e4b254-fb7b-45c2-9702-32f034018193-10_378_1349_1050_367}
    $$n = 95 \quad \sum x = 4133 \quad \sum x ^ { 2 } = 202294$$
    (d) Write down the interquartile range for these data.
    (e) Calculate the mean and the standard deviation for these data.
    (f) State, giving a reason, whether you would recommend using the mean and standard deviation or the median and interquartile range to describe these data.
    Rana and David both work for the company and have both moved house since Taruni collected her data. Rana's journey to work has changed from 75 minutes to 35 minutes and David's journey to work has changed from 60 minutes to 33 minutes.
    Taruni drew her box plot again and only had to change two values.
    (g) Explain which two values Taruni must have changed and whether each of these values has increased or decreased.

# Question 4:

## Part (a):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Convenience or opportunity [sampling] | B1 | |

## Part (b):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Quota [sampling] | B1 | For quota sampling mentioned. "Stratified", "systematic" or "random" are B0B0 |
| e.g. Take 4 people every 10 minutes | B1 | For description of how such a system might work; requires suitable strata or categories e.g. time slots, departments, gender, age groups. Suggestion of randomness is B0 |
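
The quota idea in the example answer (a fixed number of people per time slot, with no random selection) can be sketched in code. This is only an illustration under stated assumptions: the function name `quota_sample`, its parameters and the arrival data are all hypothetical and do not come from the question or the mark scheme.

```python
from collections import defaultdict


def quota_sample(arrivals, slot_minutes=10, per_slot=4, target=40):
    """Take the first `per_slot` arrivals in each time slot until `target` is reached.

    `arrivals` is a list of (minutes_after_start, worker_id) pairs in arrival order.
    No randomness is involved, so this is a non-random (quota) method.
    """
    counts = defaultdict(int)  # number of workers already taken from each slot
    sample = []
    for minute, worker in arrivals:
        slot = minute // slot_minutes
        if counts[slot] < per_slot and len(sample) < target:
            counts[slot] += 1
            sample.append(worker)
    return sample


# Hypothetical arrivals: one worker per minute for two hours.
arrivals = [(m, f"w{m}") for m in range(120)]
sample = quota_sample(arrivals)
print(len(sample), sample[:5])
```

With these made-up arrivals, the quota of 4 per 10-minute slot fills the sample of 40 after ten slots; any other suitable categories (departments, age groups) could play the role of the time slots.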

## Part (c):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Census | B1 | |

## Part (d):
| Answer | Marks | Guidance |
|--------|-------|----------|
| $[58 - 26 =]\ \mathbf{32}$ (min) | B1 | |

## Part (e):
| Answer | Marks | Guidance |
|--------|-------|----------|
| $\mu = \frac{4133}{95} = 43.505\ldots$ awrt $\mathbf{43.5}$ (min) | B1 | For a correct mean (awrt 43.5) |
| $\sigma_x = \sqrt{\frac{202294}{95} - \mu^2} = \sqrt{236.7026\ldots}$ | M1 | For correct expression for sd (including $\sqrt{\ }$) ft their mean |
| $= 15.385\ldots$ awrt $\mathbf{15.4}$ (min) | A1 | For awrt 15.4. Allow $s = 15.4667\ldots$ awrt 15.5 |
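
The arithmetic in this part can be checked with a few lines of Python. This is a minimal sketch using only the summary statistics quoted in the question; the variable names are illustrative.

```python
import math

# Summary statistics quoted in the question.
n = 95
sum_x = 4133
sum_x2 = 202294

mean = sum_x / n
variance = sum_x2 / n - mean ** 2      # population variance (divisor n)
sigma = math.sqrt(variance)            # population standard deviation
s = math.sqrt(variance * n / (n - 1))  # sample standard deviation (divisor n - 1)

print(round(mean, 1), round(sigma, 1), round(s, 1))  # 43.5 15.4 15.5
```

The two standard deviations correspond to the two accepted answers in the guidance column: awrt 15.4 for $\sigma_x$ and awrt 15.5 for $s$.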

## Part (f):
| Answer | Marks | Guidance |
|--------|-------|----------|
| There are outliers in the data (or data is skew) which will affect mean and sd | B1 | For acknowledging outliers or skewness are a problem for mean and sd. "extreme values"/"anomalies" OK |
| Therefore use median and IQR | dB1 | Dependent on 1st B1, for therefore choosing median and IQR |
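
As a rough illustration of what counts as an outlier, the sketch below computes outlier fences from the quartiles used in Part (d), 26 and 58. It assumes the common $1.5 \times \text{IQR}$ convention; the box plot itself is not reproduced here, so the rule actually used to draw it is an assumption, and the function name `outlier_fences` is hypothetical.

```python
def outlier_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences at k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles taken from the Part (d) working: 58 - 26 = 32.
lower, upper = outlier_fences(26, 58)
print(lower, upper)  # -22.0 106.0
```

Under this assumed rule, only journey times above 106 minutes would be flagged as outliers, since no journey time can be below the negative lower fence.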

## Part (g):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Value of 20, LQ at 26 and outliers will not change; or state that median and upper quartile are the values that do change | B1 | For identifying 2 of these 3 groups of unchanged values or stating only $Q_2$ and $Q_3$ change |
| More values now below 40 than above so $Q_2$ or $Q_3$ will change and be lower | M1 | For explaining that median or UQ should be lower. E.g. the 2 values have moved to below 40 (or 58) and therefore more than 50% below 40 |
| Both $Q_2$ and $Q_3$ will be lower | A1 | For stating median and UQ are both lower with clear evidence of M1 scored |
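
The reasoning above can be illustrated with a small made-up data set. The 13 values below are hypothetical (they are not Taruni's 95 values); they are chosen so that the quartiles are 26, 40 and 58, the values the mark scheme works with. The `inclusive` quartile method from Python's `statistics` module is also a choice made for this sketch, not something specified in the question.

```python
from statistics import quantiles

# Hypothetical toy data (NOT the real survey data), with quartiles 26, 40, 58.
before = [20, 22, 24, 26, 30, 36, 40, 44, 50, 58, 60, 75, 110]

# Rana: 75 -> 35 minutes, David: 60 -> 33 minutes.
after = [35 if x == 75 else 33 if x == 60 else x for x in before]

q_before = quantiles(before, n=4, method="inclusive")
q_after = quantiles(after, n=4, method="inclusive")

print(min(before), q_before, max(before))  # 20 [26.0, 40.0, 58.0] 110
print(min(after), q_after, max(after))     # 20 [26.0, 35.0, 44.0] 110
```

In this toy case the minimum, lower quartile and largest value are unchanged, while the median and upper quartile both decrease, matching the pattern described in the table.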

---
\begin{enumerate}
  \item Charlie is studying the time it takes members of his company to travel to the office. He stands by the door to the office from 0840 to 0850 one morning and asks workers, as they arrive, how long their journey was.\\
(a) State the sampling method Charlie used.\\
(b) State and briefly describe an alternative method of non-random sampling Charlie could have used to obtain a sample of 40 workers.
\end{enumerate}

Taruni decided to ask every member of the company the time, $x$ minutes, it takes them to travel to the office.\\
(c) State the data selection process Taruni used.

Taruni's results are summarised by the box plot and summary statistics below.\\
\includegraphics[max width=\textwidth, alt={}, center]{65e4b254-fb7b-45c2-9702-32f034018193-10_378_1349_1050_367}

$$n = 95 \quad \sum x = 4133 \quad \sum x ^ { 2 } = 202294$$

(d) Write down the interquartile range for these data.\\
(e) Calculate the mean and the standard deviation for these data.\\
(f) State, giving a reason, whether you would recommend using the mean and standard deviation or the median and interquartile range to describe these data.

Rana and David both work for the company and have both moved house since Taruni collected her data.

Rana's journey to work has changed from 75 minutes to 35 minutes and David's journey to work has changed from 60 minutes to 33 minutes.

Taruni drew her box plot again and only had to change two values.\\
(g) Explain which two values Taruni must have changed and whether each of these values has increased or decreased.

\hfill \mbox{\textit{Edexcel Paper 3 2018 Q4 [13]}}