Edexcel S1 — Question 3 9 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Marks9
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicMeasures of Location and Spread
TypeCoding to simplify calculation
DifficultyModerate -0.8 This is a straightforward application of coding formulas for mean and standard deviation with grouped data. Part (a) requires direct substitution into standard formulas (reverse the coding transformation), part (b) tests understanding of why grouped data gives estimates (a bookwork explanation), and part (c) requires comparing mean to median for skewness. All steps are routine S1 techniques with no problem-solving or novel insight required.
Spec2.02f Measures of average and spread2.02g Calculate mean and standard deviation5.02c Linear coding: effects on mean and variance

3. A magazine collected data on the total cost of the reception at each of a random sample of 80 weddings. The data is grouped and coded using \(y = \frac { C - 3250 } { 250 }\), where \(C\) is the mid-point in pounds of each class, giving \(\sum f y = 37\) and \(\sum f y ^ { 2 } = 2317\).
  1. Using these values, calculate estimates of the mean and standard deviation of the cost of the receptions in the sample.
  2. Explain why your answers to part (a) are only estimates. The median of the data was \(\pounds 3050\).
  3. Comment on the skewness of the data and suggest a reason for it.

AnswerMarks Guidance
\(\bar{y} = \frac{37}{80} = 0.4625\)M1
\(\bar{C} = (250 \times 0.4625) + 3250 = £3366\) (nearest £)M1 A1
std. dev. of \(y = \sqrt{\frac{2317}{80} - 0.4625^2} = 5.3618\)M1
std. dev. of \(C = 250 \times 5.3618 = £1340\) (nearest £)M1 A1
used midpoints to represent data in each groupB1
median < mean \(\therefore\) +vely skewedB1
e.g. most cost a similar amount but some people spend a lot moreB1 (9)
| $\bar{y} = \frac{37}{80} = 0.4625$ | M1 | |
| $\bar{C} = (250 \times 0.4625) + 3250 = £3366$ (nearest £) | M1 A1 | |
| std. dev. of $y = \sqrt{\frac{2317}{80} - 0.4625^2} = 5.3618$ | M1 | |
| std. dev. of $C = 250 \times 5.3618 = £1340$ (nearest £) | M1 A1 | |
| used midpoints to represent data in each group | B1 | |
| median < mean $\therefore$ +vely skewed | B1 | |
| e.g. most cost a similar amount but some people spend a lot more | B1 | (9) |
3. A magazine collected data on the total cost of the reception at each of a random sample of 80 weddings.

The data is grouped and coded using $y = \frac { C - 3250 } { 250 }$, where $C$ is the mid-point in pounds of each class, giving $\sum f y = 37$ and $\sum f y ^ { 2 } = 2317$.
\begin{enumerate}[label=(\alph*)]
\item Using these values, calculate estimates of the mean and standard deviation of the cost of the receptions in the sample.
\item Explain why your answers to part (a) are only estimates.

The median of the data was $\pounds 3050$.
\item Comment on the skewness of the data and suggest a reason for it.
\end{enumerate}

\hfill \mbox{\textit{Edexcel S1  Q3 [9]}}