| Exam Board | Edexcel |
|---|---|
| Module | S1 (Statistics 1) |
| Year | 2018 |
| Session | June |
| Marks | 13 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Data representation |
| Type | Estimate mean and standard deviation from frequency table |
| Difficulty | Moderate -0.8 This is a standard S1 statistics question testing routine procedures: histogram drawing with unequal class widths, mean/SD from grouped data (with Σfy² given), linear interpolation for median, and basic probability. All techniques are textbook exercises requiring only formula application with no problem-solving or novel insight required. |
| Spec | 2.02b Histogram: area represents frequency2.02f Measures of average and spread2.02g Calculate mean and standard deviation |
| Weight in grams ( \(\boldsymbol { x }\) ) | Frequency (f) | Class midpoint (y) |
| \(0.9 < x \leqslant 1.1\) | 9 | 1.0 |
| \(1.1 < x \leqslant 1.3\) | 12 | 1.2 |
| \(1.3 < x \leqslant 1.5\) | 11 | 1.4 |
| \(1.5 < x \leqslant 1.7\) | 8 | 1.6 |
| \(1.7 < x \leqslant 1.9\) | 3 | 1.8 |
| \(1.9 < x \leqslant 2.1\) | 3 | 2.0 |
| \(2.1 < x \leqslant 2.7\) | 2 | 2.4 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| \(0.9 < x \leqslant 1.1\) group - width \(\mathbf{0.5}\) (cm) | B1 | |
| \(1.5\text{ cm}^2\) is 2 seeds or \(6.75\text{ cm}^2\) is 9 seeds or \(0.5c=6.75\) or \(\frac{9}{0.2}\times0.3\) | M1 | A correct statement comparing area and number of seeds; allow height \(\times\) width \(= 6.75\) for M1 |
| \(0.9 < x \leqslant 1.1\) group - height \(\mathbf{13.5}\) (cm) | A1 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| The data/weights are continuous | B1 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| Mean \(= 1.4125\) awrt \(\mathbf{1.41}\) | B1 | Allow \(\frac{113}{80}\) but not \(\frac{67.8}{48}\) |
| \(\sigma=\sqrt{\frac{101.56}{48}-\left(\frac{113}{80}\right)^2}=0.347\ldots\) awrt \(\mathbf{0.347}\) (\(s=\) awrt 0.351) | M1M1A1 | M1 attempt at \(\frac{101.56}{48}-'\mu'^2\); M1 using square root |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| Median \(= [1.3]+\dfrac{3}{11}\times0.2\), allow \((n+1)=[1.3]+\dfrac{3.5}{11}\times0.2\) | M1 | M1 for correct fraction: \([1.3]+\frac{3}{11}\times0.2\) or \(m=1.5-\frac{8}{11}\times0.2\); allow \(\frac{[m-1.3]}{1.5-1.3}=\frac{24-21}{32-21}\) |
| \(=\) awrt \(\mathbf{1.35}\) or \(\mathbf{1.355}\) (or if using \((n+1)\) allow awrt \(\mathbf{1.36}\)) | A1 | A1 from correct working |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| \(\dfrac{27}{48}\) oe 0.5625 (allow 0.563) | B1 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| Mean increases and standard deviation decreases | B1 | B1 for both correct comments |
| e.g. '\(\sum fy\) increases (so the mean increases) and an extreme value has been replaced/data is more concentrated around the mean (so the standard deviation decreases)' | depB1 | dep on previous B1 for complete reasoning for both cases |
# Question 5:
## Part (a):
| Answer | Marks | Guidance |
|--------|-------|----------|
| $0.9 < x \leqslant 1.1$ group - width $\mathbf{0.5}$ (cm) | B1 | |
| $1.5\text{ cm}^2$ is 2 seeds or $6.75\text{ cm}^2$ is 9 seeds or $0.5c=6.75$ or $\frac{9}{0.2}\times0.3$ | M1 | A correct statement comparing area and number of seeds; allow height $\times$ width $= 6.75$ for M1 |
| $0.9 < x \leqslant 1.1$ group - height $\mathbf{13.5}$ (cm) | A1 | |
## Part (b):
| Answer | Marks | Guidance |
|--------|-------|----------|
| The data/weights are continuous | B1 | |
## Part (c):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Mean $= 1.4125$ awrt $\mathbf{1.41}$ | B1 | Allow $\frac{113}{80}$ but not $\frac{67.8}{48}$ |
| $\sigma=\sqrt{\frac{101.56}{48}-\left(\frac{113}{80}\right)^2}=0.347\ldots$ awrt $\mathbf{0.347}$ ($s=$ awrt 0.351) | M1M1A1 | M1 attempt at $\frac{101.56}{48}-'\mu'^2$; M1 using square root |
## Part (d):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Median $= [1.3]+\dfrac{3}{11}\times0.2$, allow $(n+1)=[1.3]+\dfrac{3.5}{11}\times0.2$ | M1 | M1 for correct fraction: $[1.3]+\frac{3}{11}\times0.2$ or $m=1.5-\frac{8}{11}\times0.2$; allow $\frac{[m-1.3]}{1.5-1.3}=\frac{24-21}{32-21}$ |
| $=$ awrt $\mathbf{1.35}$ or $\mathbf{1.355}$ (or if using $(n+1)$ allow awrt $\mathbf{1.36}$) | A1 | A1 from correct working |
## Part (e):
| Answer | Marks | Guidance |
|--------|-------|----------|
| $\dfrac{27}{48}$ oe 0.5625 (allow 0.563) | B1 | |
## Part (f):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Mean increases **and** standard deviation decreases | B1 | B1 for both correct comments |
| e.g. '$\sum fy$ increases (so the mean increases) **and** an extreme value has been replaced/data is more concentrated around the mean (so the standard deviation decreases)' | depB1 | dep on previous B1 for **complete** reasoning for both cases |
---
5. The weights, in grams, of a random sample of 48 broad beans are summarised in the table.
\begin{center}
\begin{tabular}{|l|l|l|}
\hline
Weight in grams ( $\boldsymbol { x }$ ) & Frequency (f) & Class midpoint (y) \\
\hline
$0.9 < x \leqslant 1.1$ & 9 & 1.0 \\
\hline
$1.1 < x \leqslant 1.3$ & 12 & 1.2 \\
\hline
$1.3 < x \leqslant 1.5$ & 11 & 1.4 \\
\hline
$1.5 < x \leqslant 1.7$ & 8 & 1.6 \\
\hline
$1.7 < x \leqslant 1.9$ & 3 & 1.8 \\
\hline
$1.9 < x \leqslant 2.1$ & 3 & 2.0 \\
\hline
$2.1 < x \leqslant 2.7$ & 2 & 2.4 \\
\hline
\end{tabular}
\end{center}
(You may assume $\sum \mathrm { fy } { } ^ { 2 } = 101.56$ )
A histogram was drawn to represent these data. The $2.1 < x \leqslant 2.7$ class was represented by a bar of width 1.5 cm and height 1 cm .
\begin{enumerate}[label=(\alph*)]
\item Find the width and height of the $0.9 < x \leqslant 1.1$ class.
\item Give a reason to justify the use of a histogram to represent these data.
\item Estimate the mean and the standard deviation of the weights of these broad beans.
\item Use linear interpolation to estimate the median of the weights of these broad beans.
One of these broad beans is selected at random.
\item Estimate the probability that its weight lies between 1.1 grams and 1.6 grams.
One of these broad beans having a recorded weight of 0.95 grams was incorrectly weighed. The correct weight is 1.4 grams.
\item State, giving a reason, the effect this would have on your answers to part (c). Do not carry out any further calculations.
\end{enumerate}
\hfill \mbox{\textit{Edexcel S1 2018 Q5 [13]}}