Edexcel FS2 AS 2022 June — Question 3 10 marks

Exam BoardEdexcel
ModuleFS2 AS (Further Statistics 2 AS)
Year2022
SessionJune
Marks10
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicLinear regression
TypeCalculate regression line then predict
DifficultyStandard +0.3 This is a straightforward application of standard regression formulas from the formula booklet. Parts (a)-(c) involve direct substitution into given formulas for correlation coefficient and regression line. Parts (d)-(e) require reading a residual plot and making basic interpretations. While it's a multi-part question worth several marks, each step is routine for Further Statistics students with no novel problem-solving required. Slightly easier than average A-level due to the mechanical nature of the calculations.
Spec5.08a Pearson correlation: calculate pmcc5.09a Dependent/independent variables5.09c Calculate regression line5.09e Use regression: for estimation in context

  1. Gabriela is investigating a particular type of fish, called bream. She wants to create a model to predict the weight, \(w\) grams, of bream based on their length, \(x \mathrm {~cm}\).
For a sample of 27 bream, some summary statistics are given below. $$\begin{gathered} \bar { x } = 31.07 \quad \bar { w } = 628.59 \quad \sum w ^ { 2 } = 11386134 \\ \mathrm {~S} _ { x w } = 13082.3 \quad \mathrm {~S} _ { x x } = 260.8 \end{gathered}$$
  1. Find the value of the product moment correlation coefficient between \(x\) and \(w\)
  2. Explain whether the answer to part (a) is consistent with a linear model for these data.
  3. Find the equation of the regression line of \(w\) on \(x\) in the form \(w = a + b x\) A residual plot for these data is shown below. \includegraphics[max width=\textwidth, alt={}, center]{128c408d-3e08-4f74-8f19-d33ecd5c882f-06_931_1790_1107_139} One of the bream in the sample has a length of 32 cm .
  4. Find its weight.
  5. With reference to the residual plot, comment on the model for bream with lengths above 33 cm .

Question 3:
Part (a):
AnswerMarks Guidance
WorkingMark Guidance
\(S_{ww}=11\,386\,134 - 27(628.59)^2 [=717\,748.5213]\)B1 Correct expression for \(S_{ww}\) (implied by correct answer)
\(r=\frac{13082.3}{\sqrt{260.8\times 717\,748.5213}}\)M1 Complete method to find \(r\); use of \(S_{ww}=11\,386\,134\) is M0
\(r=0.95618\ldots\) awrt 0.956A1
(3 marks)
Part (b):
AnswerMarks Guidance
WorkingMark Guidance
Since \(r\) is close to 1, data is consistent with a linear model.B1 Correct explanation and conclusion
(1 mark)
Part (c):
AnswerMarks Guidance
WorkingMark Guidance
\(b=\frac{13082.3}{260.8}[=50.162\ldots]\)M1 Setting up linear model by finding gradient
\(a=628.59 - {'}b{'} (31.07)\)M1 Attempting \(y\)-intercept of linear model
\(w=-930+50.2x\)A1 Correct model with \(b=\) awrt 50.2 and \(a=\) awrt \(-930\) (must use \(w\) and \(x\))
(3 marks)
Part (d):
AnswerMarks Guidance
WorkingMark Guidance
\(w=-930+50.2(32)+80\)M1 Using the model with the residual; allow \(\pm80\)
\(w=756.4\)A1 awrt 756 (allow awrt 755 from use of exact values)
(2 marks)
Part (e):
AnswerMarks Guidance
WorkingMark Guidance
Negative residuals for all 5 observations with \(x>33\) suggests the model systematically overestimates weights for the longest bream.B1 Must reference both the residuals and the model; negative correlation between residuals and length is B0
(1 mark)
## Question 3:

### Part (a):
| Working | Mark | Guidance |
|---------|------|----------|
| $S_{ww}=11\,386\,134 - 27(628.59)^2 [=717\,748.5213]$ | B1 | Correct expression for $S_{ww}$ (implied by correct answer) |
| $r=\frac{13082.3}{\sqrt{260.8\times 717\,748.5213}}$ | M1 | Complete method to find $r$; use of $S_{ww}=11\,386\,134$ is M0 |
| $r=0.95618\ldots$ awrt **0.956** | A1 | |

**(3 marks)**

### Part (b):
| Working | Mark | Guidance |
|---------|------|----------|
| Since $r$ is close to 1, data is consistent with a linear model. | B1 | Correct explanation and conclusion |

**(1 mark)**

### Part (c):
| Working | Mark | Guidance |
|---------|------|----------|
| $b=\frac{13082.3}{260.8}[=50.162\ldots]$ | M1 | Setting up linear model by finding gradient |
| $a=628.59 - {'}b{'} (31.07)$ | M1 | Attempting $y$-intercept of linear model |
| $w=-930+50.2x$ | A1 | Correct model with $b=$ awrt 50.2 and $a=$ awrt $-930$ (must use $w$ and $x$) |

**(3 marks)**

### Part (d):
| Working | Mark | Guidance |
|---------|------|----------|
| $w=-930+50.2(32)+80$ | M1 | Using the model with the residual; allow $\pm80$ |
| $w=756.4$ | A1 | awrt 756 (allow awrt 755 from use of exact values) |

**(2 marks)**

### Part (e):
| Working | Mark | Guidance |
|---------|------|----------|
| Negative residuals for all 5 observations with $x>33$ suggests the model systematically overestimates weights for the longest bream. | B1 | Must reference both the residuals and the model; negative correlation between residuals and length is B0 |

**(1 mark)**

---
\begin{enumerate}
  \item Gabriela is investigating a particular type of fish, called bream. She wants to create a model to predict the weight, $w$ grams, of bream based on their length, $x \mathrm {~cm}$.
\end{enumerate}

For a sample of 27 bream, some summary statistics are given below.

$$\begin{gathered}
\bar { x } = 31.07 \quad \bar { w } = 628.59 \quad \sum w ^ { 2 } = 11386134 \\
\mathrm {~S} _ { x w } = 13082.3 \quad \mathrm {~S} _ { x x } = 260.8
\end{gathered}$$

(a) Find the value of the product moment correlation coefficient between $x$ and $w$\\
(b) Explain whether the answer to part (a) is consistent with a linear model for these data.\\
(c) Find the equation of the regression line of $w$ on $x$ in the form $w = a + b x$

A residual plot for these data is shown below.\\
\includegraphics[max width=\textwidth, alt={}, center]{128c408d-3e08-4f74-8f19-d33ecd5c882f-06_931_1790_1107_139}

One of the bream in the sample has a length of 32 cm .\\
(d) Find its weight.\\
(e) With reference to the residual plot, comment on the model for bream with lengths above 33 cm .

\hfill \mbox{\textit{Edexcel FS2 AS 2022 Q3 [10]}}