Edexcel FS2 Specimen — Question 7 8 marks

Exam BoardEdexcel
ModuleFS2 (Further Statistics 2)
SessionSpecimen
Marks8
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicBivariate data
TypeFind missing data values
DifficultyStandard +0.8 This Further Statistics 2 question requires understanding of coded data transformations, applying variance formulas through coding relationships (S_xx = 100·S_ss), and interpreting correlation. While methodical, it demands careful algebraic manipulation of summary statistics and understanding that correlation is invariant under linear coding—more sophisticated than standard A-level but accessible with FS2 knowledge.
Spec5.08a Pearson correlation: calculate pmcc5.08c Pearson: measure of straight-line fit

  1. Over a period of time, researchers took 10 blood samples from one patient with a blood disease. For each sample, they measured the levels of serum magnesium, \(s \mathrm { mg } / \mathrm { dl }\), in the blood and the corresponding level of the disease protein, \(d \mathrm { mg } / \mathrm { dl }\). One of the researchers coded the data for each sample using \(x = 10 s\) and \(y = 10 ( d - 9 )\) but spilt ink over his work.
The following summary statistics and unfinished scatter diagram are the only remaining information. $$\sum d ^ { 2 } = 1081.74 \quad \mathrm {~S} _ { d s } = 59.524$$ and $$\sum y = 64 \quad \mathrm {~S} _ { x x } = 2658.9$$ \(d \mathrm { mg } / \mathrm { dl }\) \includegraphics[max width=\textwidth, alt={}, center]{e777c787-0d39-4d84-a0f9-fc4a6712184f-22_983_1534_840_303}
  1. Use the formula for \(\mathrm { S } _ { x x }\) to show that \(\mathrm { S } _ { s s } = 26.589\)
  2. Find the value of the product moment correlation coefficient between \(s\) and \(d\).
  3. With reference to the unfinished scatter diagram, comment on your result in part (b).

(a)
AnswerMarks Guidance
\(S_{xx} = \sum(10s)^2 - \frac{(\sum 10s)^2}{10}\)M1 Attempting to use \(S_{xx} = \sum x^2 - \frac{(\sum x)^2}{10}\) with \(x = 10s\)
\(2658.9 = 100\sum(s)^2 - \frac{100(\sum s)^2}{10}\)M1 Substituting in 2658.9 and dealing with the 10 correctly
\(2658.9 = 100 S_{ss}\)
\(S_{ss} = 26.589 *\)A1*cso A complete solution with no errors leading to 26.589 only
(b)
AnswerMarks Guidance
\(64 = \sum_{1}^{10}(d_i - 9)\)M1 Realising that either \(64 = \sum_{1}^{10}(d_i - 9)\) or \(64 = 10\sum_{1}^{10}d_i - 900\) o.e. must be used. May be implied by seeing 96.4
\(64 = 10\sum_{1}^{10} d_i - 900\)
\(\sum_{1}^{10} d_i = 96.4\)A1 96.4 only
\(S_{dd} = 1081.74 - \frac{(\text{"96.4"})^2}{10}\)M1
\(= 152.444\)
\(r = 0.935\)A1ft awrt 0.935 ft "their 96.4"
(c)
AnswerMarks Guidance
Linear correlation is significant but scatter diagram suggests a non-linear relationship between the level of serum magnesium, and the level of the disease proteinB1 A correct comment comparing their value of \(r\) and the scatter diagram in context
**(a)**

| $S_{xx} = \sum(10s)^2 - \frac{(\sum 10s)^2}{10}$ | M1 | Attempting to use $S_{xx} = \sum x^2 - \frac{(\sum x)^2}{10}$ with $x = 10s$ |
| $2658.9 = 100\sum(s)^2 - \frac{100(\sum s)^2}{10}$ | M1 | Substituting in 2658.9 and dealing with the 10 correctly |
| $2658.9 = 100 S_{ss}$ | | |
| $S_{ss} = 26.589 *$ | A1*cso | A complete solution with no errors leading to 26.589 only |

**(b)**

| $64 = \sum_{1}^{10}(d_i - 9)$ | M1 | Realising that either $64 = \sum_{1}^{10}(d_i - 9)$ or $64 = 10\sum_{1}^{10}d_i - 900$ o.e. must be used. May be implied by seeing 96.4 |
| $64 = 10\sum_{1}^{10} d_i - 900$ | | |
| $\sum_{1}^{10} d_i = 96.4$ | A1 | 96.4 only |
| $S_{dd} = 1081.74 - \frac{(\text{"96.4"})^2}{10}$ | M1 | |
| $= 152.444$ | | |
| $r = 0.935$ | A1ft | awrt 0.935 ft "their 96.4" |

**(c)**

| Linear correlation is significant but scatter diagram suggests a non-linear relationship between the level of serum magnesium, and the level of the disease protein | B1 | A correct comment comparing their value of $r$ and the scatter diagram in context |
\begin{enumerate}
  \item Over a period of time, researchers took 10 blood samples from one patient with a blood disease. For each sample, they measured the levels of serum magnesium, $s \mathrm { mg } / \mathrm { dl }$, in the blood and the corresponding level of the disease protein, $d \mathrm { mg } / \mathrm { dl }$. One of the researchers coded the data for each sample using $x = 10 s$ and $y = 10 ( d - 9 )$ but spilt ink over his work.
\end{enumerate}

The following summary statistics and unfinished scatter diagram are the only remaining information.

$$\sum d ^ { 2 } = 1081.74 \quad \mathrm {~S} _ { d s } = 59.524$$

and

$$\sum y = 64 \quad \mathrm {~S} _ { x x } = 2658.9$$

$d \mathrm { mg } / \mathrm { dl }$\\
\includegraphics[max width=\textwidth, alt={}, center]{e777c787-0d39-4d84-a0f9-fc4a6712184f-22_983_1534_840_303}\\
(a) Use the formula for $\mathrm { S } _ { x x }$ to show that $\mathrm { S } _ { s s } = 26.589$\\
(b) Find the value of the product moment correlation coefficient between $s$ and $d$.\\
(c) With reference to the unfinished scatter diagram, comment on your result in part (b).\\

\hfill \mbox{\textit{Edexcel FS2  Q7 [8]}}