| Exam Board | Edexcel |
|---|---|
| Module | S1 (Statistics 1) |
| Year | 2017 |
| Session | January |
| Marks | 17 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Linear regression |
| Type | Calculate y on x from summary statistics |
| Difficulty | Moderate -0.3 This is a comprehensive but routine S1 linear regression question covering standard bookwork: calculating means, variance, PMCC, regression line, predictions, and transformations. All parts follow direct formula application with no conceptual challenges or novel problem-solving required. The multi-part structure and coding transformation (parts e-f) add length but not difficulty, as these are standard textbook exercises. Slightly easier than average due to being purely procedural. |
| Spec | 5.02c Linear coding: effects on mean and variance5.08a Pearson correlation: calculate pmcc5.08b Linear coding: effect on pmcc5.08c Pearson: measure of straight-line fit5.08d Hypothesis test: Pearson correlation5.09a Dependent/independent variables5.09b Least squares regression: concepts5.09c Calculate regression line5.09d Linear coding: effect on regression |
| Answer | Marks | Guidance |
|---|---|---|
| Part | Answer/Working | Marks |
| (a) | \([\bar{y} =] \frac{-27}{12} = -2.25\), \(\text{Var}(Y) = \frac{62.98}{12} - (-2.25)^2 = 0.1858333\ldots\) (allow \(\frac{223}{1200}\)) \(\underline{\text{awrt } 0.186}\) | B1, M1, A1 |
| (3) | ||
| (b)(i) | \(S_{xy} = -1190.7 - \frac{(504)(-27)}{12}\) or \(-56.7\) | B1 |
| \(r = \frac{\text{'}-56.7'}{\sqrt{(1674)(2.23)}} = \text{, } -0.9280105\ldots\) | M1, A1 | M1 for correct express' for \(r\) with 1674, 2.23 and their \(S_y\). [Correct ans. only 3/3, \(r = -0.93\) is 2/3] |
| \(\underline{\text{awrt } -0.928}\) | ||
| (ii) | Negative correlation, so Priya's belief is incorrect. | B1 |
| (4) | ||
| (c) | \(b = \frac{\text{'}-56.7'}{1674}\) [\(= -0.033870\ldots\)] | M1 |
| \(\frac{-27}{12} = a + b' \times \frac{504}{12}\) or \(a = -2.25 - \text{'}-0.03387...' \times 42\), \(a = \text{awrt } -0.827\) | M1, A1 | 1st M1 for correct expression for \(b\) f.t. their \(S_{yy}\) (May be implied by correct answer). 2nd M1 for correct use of \(a = \bar{y} - b\bar{x}\) to find \(a\) (f.t. their value of \(b\))(Implied by \(-0.827\)). 1st A1 for \(a = \text{awrt } -0.827\) (no fraction). 2nd A1 for an equ'n in the form \(y = a + bx\) with their \(a\) and \(b = \text{awrt } -0.0339\) (no fraction) |
| \(y = -0.827 - 0.0339x\) | A1 | |
| (4) | ||
| (d) | \([y = -0.827 - 0.0339(32) =] -1.9°C\) | B1 |
| (1) | ||
| (e) | \(\frac{(w-32)}{1.8} = -0.827 - 0.339x\) (o.e.) | M1 |
| \(w = 30.5 - 0.061x\) | A1 | A1 for a correct equation for \(w\) in terms of \(x\) with \(c = \text{awrt } 31\) and \(d = \text{awrt } -0.061\) |
| (2) | ||
| (f)(i) | \(\text{Var}(W) = 1.8^2 \text{ Var}(Y)\), \(= 0.602\ldots\) | M1, A1 |
| (ii) | \(r_{yx} = r_{wx} = -0.928\) | B1ft |
| [17 marks] |
| Part | Answer/Working | Marks | Guidance |
|------|---|---|---|
| (a) | $[\bar{y} =] \frac{-27}{12} = -2.25$, $\text{Var}(Y) = \frac{62.98}{12} - (-2.25)^2 = 0.1858333\ldots$ (allow $\frac{223}{1200}$) $\underline{\text{awrt } 0.186}$ | B1, M1, A1 | B1 either fraction or exact decimal equivalent [must see mean separately to earn this mark]. M1 for expr' for variance $\frac{62.98}{12} - \bar{y}^2$ [ft $\bar{y}$] or $\frac{S_{yy}}{11} = \text{awrt } 0.203$) [No $\sqrt{}$]. For M1 in (b)(i) and 1st M1 in (c) do not allow ft for $S_{yy} = -1190.7$ |
| | | (3) | |
| (b)(i) | $S_{xy} = -1190.7 - \frac{(504)(-27)}{12}$ or $-56.7$ | B1 | Correct expression for $S_{xy}$ or $-56.7$ (May be implied by a correct value for $r$) |
| | $r = \frac{\text{'}-56.7'}{\sqrt{(1674)(2.23)}} = \text{, } -0.9280105\ldots$ | M1, A1 | M1 for correct express' for $r$ with 1674, 2.23 and their $S_y$. [Correct ans. only 3/3, $r = -0.93$ is 2/3] |
| | | $\underline{\text{awrt } -0.928}$ | |
| (ii) | Negative correlation, so Priya's belief is incorrect. | B1 | B1 for Priya's belief not supported and reason e.g. negative correlation or $r$ is negative or $r$ is close to $-1$ or as salinity (or $x$) increases, temperature (or $y$) decreases |
| | | (4) | |
| (c) | $b = \frac{\text{'}-56.7'}{1674}$ [$= -0.033870\ldots$] | M1 | |
| | $\frac{-27}{12} = a + b' \times \frac{504}{12}$ or $a = -2.25 - \text{'}-0.03387...' \times 42$, $a = \text{awrt } -0.827$ | M1, A1 | 1st M1 for correct expression for $b$ f.t. their $S_{yy}$ (May be implied by correct answer). 2nd M1 for correct use of $a = \bar{y} - b\bar{x}$ to find $a$ (f.t. their value of $b$)(Implied by $-0.827$). 1st A1 for $a = \text{awrt } -0.827$ (no fraction). 2nd A1 for an equ'n in the form $y = a + bx$ with their $a$ and $b = \text{awrt } -0.0339$ (no fraction) |
| | $y = -0.827 - 0.0339x$ | A1 | |
| | | (4) | |
| (d) | $[y = -0.827 - 0.0339(32) =] -1.9°C$ | B1 | awrt $-1.9$ (no fractions) |
| | | (1) | |
| (e) | $\frac{(w-32)}{1.8} = -0.827 - 0.339x$ (o.e.) | M1 | M1 for substituting $\frac{(w-32)}{1.8}$ for $y$ (o.e.) in their regression equation |
| | | | |
| | $w = 30.5 - 0.061x$ | A1 | A1 for a correct equation for $w$ in terms of $x$ with $c = \text{awrt } 31$ and $d = \text{awrt } -0.061$ |
| | | (2) | |
| (f)(i) | $\text{Var}(W) = 1.8^2 \text{ Var}(Y)$, $= 0.602\ldots$ | M1, A1 | M1 for $1.8^2 \times \text{Var}(Y)$ f.t. their "(a)" (if $> 0$) [[Allow use of $s^2 = \text{awrt } 0.66$ to score M1A1] |
| (ii) | $r_{yx} = r_{wx} = -0.928$ | B1ft | (3) |
| | | [17 marks] | |
\begin{enumerate}
\item A scientist measured the salinity of water, $x \mathrm {~g} / \mathrm { kg }$, and recorded the temperature at which the water froze, $y ^ { \circ } \mathrm { C }$, for 12 different water samples. The summary statistics are listed below.
\end{enumerate}
$$\begin{gathered}
\sum x = 504 \quad \sum y = - 27 \quad \sum x ^ { 2 } = 22842 \quad \sum y ^ { 2 } = 62.98 \\
\sum x y = - 1190.7 \quad \mathrm {~S} _ { x x } = 1674 \quad \mathrm {~S} _ { y y } = 2.23
\end{gathered}$$
(a) Find the mean and variance of the recorded temperatures.\\
(3)
Priya believes that the higher the salinity of water, the higher the temperature at which the water freezes.\\
(b) (i) Calculate the product moment correlation coefficient between $x$ and $y$\\
(ii) State, with a reason, whether or not this value supports Priya's belief.\\
(c) Find the least squares regression line of $y$ on $x$ in the form $y = a + b x$ Give the value of $a$ and the value of $b$ to 3 significant figures.\\
(d) Estimate the temperature at which water freezes when the salinity is $32 \mathrm {~g} / \mathrm { kg }$
The coding $w = 1.8 y + 32$ is used to convert the recorded temperatures from ${ } ^ { \circ } \mathrm { C }$ to ${ } ^ { \circ } \mathrm { F }$\\
(e) Find an equation of the least squares regression line of $w$ on $x$ in the form $w = c + d x$\\
(f) Find\\
(i) the variance of the recorded temperatures when converted to ${ } ^ { \circ } \mathrm { F }$\\
(ii) the product moment correlation coefficient between $w$ and $x$\\
\href{http://PhysicsAndMathsTutor.com}{PhysicsAndMathsTutor.com}
\hfill \mbox{\textit{Edexcel S1 2017 Q3 [17]}}