| Exam Board | Edexcel |
|---|---|
| Module | S1 (Statistics 1) |
| Year | 2023 |
| Session | June |
| Marks | 13 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Bivariate data |
| Type | Effect of data transformation on correlation |
| Difficulty | Moderate -0.3 This is a standard S1 bivariate data question testing routine calculations (Sxx, Sxy, correlation coefficient, regression line) and understanding of coding transformations. All parts follow textbook procedures with no novel problem-solving required, though the multi-part structure and coding section elevate it slightly above the most basic exercises. |
| Spec | 5.08a Pearson correlation: calculate pmcc5.08b Linear coding: effect on pmcc5.09a Dependent/independent variables5.09b Least squares regression: concepts5.09c Calculate regression line5.09d Linear coding: effect on regression5.09e Use regression: for estimation in context |
| Answer | Marks |
|---|---|
| 2(a) | 297.8114.8 297.82 |
| Answer | Marks |
|---|---|
| 15 15 | M1 |
| Answer | Marks |
|---|---|
| tw | A1 |
| Answer | Marks |
|---|---|
| ww | A1 |
| Answer | Marks |
|---|---|
| (b) | "25.367" |
| Answer | Marks |
|---|---|
| 5.3173"176.797..." | M1 |
| = 0.82735…. awrt 0.827 or 0.828 | A1 |
| Answer | Marks |
|---|---|
| (c) | "25.367..." |
| Answer | Marks |
|---|---|
| 5.3173 | M1 |
| Answer | Marks |
|---|---|
| 15 5.3173 15 | M1 |
| b = 4.771 or better or a = –16.66 or better seen andw=−16.7+ 4.77t* | A1*cso |
| Answer | Marks | Guidance |
|---|---|---|
| (d) | [On average,] for each cm/1 cm of tail length/t the weight/w increases by 4.77 g/grams | B1 |
| Answer | Marks |
|---|---|
| (e) | 16.7 |
| Answer | Marks |
|---|---|
| 4.77 | M1 |
| Answer | Marks |
|---|---|
| or for sd extrapolation since a 2 cm tail is (approx 9 sd)/(more than 3 sd) from the mean | A1 |
| Answer | Marks | Guidance |
|---|---|---|
| (f) | 0.827 | B1ft |
| Answer | Marks |
|---|---|
| (g) | 2y+10=−16.7+4.77(x+6) |
| oe | B1ft |
| Answer | Marks |
|---|---|
| Notes | Total 13 |
| (a) | M1 for a correct expression for S or S |
| Answer | Marks |
|---|---|
| (b) | M1 for a valid attempt at r with their S not equal to 2304.53 and S not equal to 6089.12 |
| Answer | Marks |
|---|---|
| (c) | 1st M1 for a correct method to find the value of b |
| Answer | Marks |
|---|---|
| (d) | B1 For a suitable contextual comment that implies that as length increases by 1 cm weight increases by |
| Answer | Marks |
|---|---|
| (e) | M1 for a correct method to calculate the value of w (condone if written as a fraction) or |
| Answer | Marks |
|---|---|
| (f) | B1ft follow through their answer to (b) |
| (g) | −16.7+4.77(x+6) |
| Answer | Marks | Guidance |
|---|---|---|
| Qu | Scheme | Marks |
Question 2:
--- 2(a) ---
2(a) | 297.8114.8 297.82
S =2304.53− or S =6089.12−
tw ww
15 15 | M1
S =25.367... awrt 25.4
tw | A1
S =176.797 awrt 177
ww | A1
(3)
(b) | "25.367"
r=
5.3173"176.797..." | M1
= 0.82735…. awrt 0.827 or 0.828 | A1
(2)
(c) | "25.367..."
b= =4.77065...
5.3173 | M1
297.8 "25.367" 114.8
a= − =−16.658....
15 5.3173 15 | M1
b = 4.771 or better or a = –16.66 or better seen andw=−16.7+ 4.77t* | A1*cso
(3)
(d) | [On average,] for each cm/1 cm of tail length/t the weight/w increases by 4.77 g/grams | B1
(1)
(e) | 16.7
w=−16.7+4.772 =−7.16 or 4.772 =9.54 or t= =3.5 or sd = awrt 0.6
4.77 | M1
w=−7.16
or 9.54 < 16.7 or 2 < 3.5 which is negative/weight cannot be negative
or for sd extrapolation since a 2 cm tail is (approx 9 sd)/(more than 3 sd) from the mean | A1
(2)
(f) | 0.827 | B1ft
(1)
(g) | 2y+10=−16.7+4.77(x+6)
oe | B1ft
(1)
Notes | Total 13
(a) | M1 for a correct expression for S or S
tw ww
A1 awrt 25.4
A1 awrt 177
(b) | M1 for a valid attempt at r with their S not equal to 2304.53 and S not equal to 6089.12
tw ww
A1 (M2 on epen) awrt 0.827 or awrt 0.828
(c) | 1st M1 for a correct method to find the value of b
2nd M1 ft their b. For a correct method to find a. Minimum shown
a=awrt 19.9−"theirb"awrt 7.65 =−16.658
A1* Both method marks must be awarded, equation stated (no fractions) and sight of (4.771 or better)
or (−16.66 or better)
(d) | B1 For a suitable contextual comment that implies that as length increases by 1 cm weight increases by
4.77g. Allow multiples eg each 10 cm increase in tail length weight increases by 47.7g Allow in terms
of t and w
(e) | M1 for a correct method to calculate the value of w (condone if written as a fraction) or
4.772 =9.54 or correct method to find tail length when w = 0 or sd = awrt 0.6
A1 Method mark must be awarded. For –7.16 or 9.54 < 16.7 or 2 < 3.5 with a relevant explanation
stating that weight is negative. If sd = awrt 0.6 is given allow extrapolation since a 2 cm tail is (approx
9 sd)/(more than 3 sd) from the mean.
(f) | B1ft follow through their answer to (b)
(g) | −16.7+4.77(x+6)
B1 ISW no need to be simplified. Allow equivalent eg y= −5The correct
2
simplified equation is y=2.385x+0.96 allow awrt 2.39 and 0.96 – 0.98
Qu | Scheme | Marks
Two students, Olive and Shan, collect data on the weight, $w$ grams, and the tail length, $t$ cm, of 15 mice.
Olive summarised the data as follows
$S_tt = 5.3173$ \quad $\sum w^2 = 6089.12$ \quad $\sum tw = 2304.53$ \quad $\sum w = 297.8$ \quad $\sum t = 114.8$
\begin{enumerate}[label=(\alph*)]
\item Calculate the value of $S_{ww}$ and the value of $S_{tw}$ [3]
\item Calculate the value of the product moment correlation coefficient between $w$ and $t$ [2]
\item Show that the equation of the regression line of $w$ on $t$ can be written as
$$w = -16.7 + 4.77t$$ [3]
\item Give an interpretation of the gradient of the regression line. [1]
\item Explain why it would not be appropriate to use the regression line in part (c) to estimate the weight of a mouse with a tail length of 2cm. [2]
\end{enumerate}
Shan decided to code the data using $x = t - 6$ and $y = \frac{w}{2} - 5$
\begin{enumerate}[label=(\alph*)]
\setcounter{enumi}{5}
\item Write down the value of the product moment correlation coefficient between $x$ and $y$ [1]
\item Write down an equation of the regression line of $y$ on $x$
You do not need to simplify your equation. [1]
\end{enumerate}
\hfill \mbox{\textit{Edexcel S1 2023 Q2 [13]}}