OCR S1 2014 June — Question 5 9 marks

Exam BoardOCR
ModuleS1 (Statistics 1)
Year2014
SessionJune
Marks9
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicBivariate data
TypeCalculate regression line equation
DifficultyModerate -0.8 This is a standard S1 regression question requiring routine application of given formulas with all summary statistics provided. Parts (i) and (iv) involve direct substitution into standard formulas, while parts (ii), (iii), and (v) test basic conceptual understanding of correlation and regression. No problem-solving insight or novel reasoning is required—this is textbook exercise material, making it easier than average A-level questions.
Spec5.08a Pearson correlation: calculate pmcc5.08d Hypothesis test: Pearson correlation5.09a Dependent/independent variables5.09c Calculate regression line

5 Tariq collected information about typical prices, \(\pounds y\) million, of four-bedroomed houses at varying distances, \(x\) miles, from a large city. He chose houses at 10 -mile intervals from the city. His results are shown below.
\(x\)1020304050607080
\(y\)1.21.41.20.90.80.50.50.3
$$n = 8 \quad \Sigma x = 360 \quad \Sigma x ^ { 2 } = 20400 \quad \Sigma y = 6.8 \quad \Sigma y ^ { 2 } = 6.88 \quad \Sigma x y = 241$$
  1. Use an appropriate formula to calculate the product moment correlation coefficient, \(r\), showing that \(- 1.0 < r < - 0.9\).
  2. State what this value of \(r\) shows in this context.
  3. Tariq decides to recalculate the value of \(r\) with the house prices measured in hundreds of thousands of pounds, instead of millions of pounds. State what effect, if any, this will have on the value of \(r\).
  4. Calculate the equation of the regression line of \(y\) on \(x\).
  5. Explain why the regression line of \(y\) on \(x\), rather than \(x\) on \(y\), should be used for estimating a value of \(x\) from a given value of \(y\).

Question 5:
Part 5(i):
AnswerMarks Guidance
AnswerMarks Guidance
\(S_{xx} = 20400 - \frac{360^2}{8} \quad (=4200)\)
\(S_{yy} = 6.88 - \frac{6.8^2}{8} \quad (=1.1)\)
\(S_{xy} = 241 - \frac{360\times6.8}{8} \quad (=-65)\)M1 Correct sub in a correct \(S\) formula
\(r = \frac{"-65"}{\sqrt{"4200"\times"1.1"}}\)M1 Correct sub in 3 correct \(S\) formulae and a correct \(r\) formula
\(= -0.956\) (3 sf)A1 Correct ans with no working M2A1 — Ignore comment about \(-1 < r < -0.9\)
[3]
Part 5(ii):
AnswerMarks Guidance
AnswerMarks Guidance
eg As you move further away, prices dropB1 High prices go with short distances oe. Allow "Strong (or high or good or equiv) neg corr'n between price and distance" — Both variables must be in context; miles & £ enough. Ignore all else, even if incorrect. NOT just neg corr'n between price & dist
[1]
Part 5(iii):
AnswerMarks Guidance
AnswerMarks Guidance
NoneB1 Ignore all else, even if incorrect
[1]
Part 5(iv):
AnswerMarks Guidance
AnswerMarks Guidance
\(b = \frac{"-65"}{" 4200"} \quad (=-0.0154762)\)M1 ft their \(S_{xy}\) & \(S_{xx}\) from (i) for M-marks only — or fresh start correct method
\(Y - \frac{6.8}{8} = "-0.0154762"(x - \frac{360}{8})\) oeM1 or \(a = \frac{6.8}{8} + "0.0154762"\times\frac{360}{8}\) oe
\(y = -0.0155x + 1.55\) (3 sf) oeA1 allow \(y = -0.015x + 1.5\) (or figs which round to these). (NOT \(y=-0.016x+1.6\), NOT \(y=-0.02x+1.5\)). Must have "\(y=\)" — Allow figures in eqn which round to the correct figures to either 3 sf or 2 sf, even if they result from arith errors.
or \(y = \frac{433}{280} - \frac{13}{840}x\) oe
[3] Correct ans with no working M2A1
Part 5(v):
AnswerMarks Guidance
AnswerMarks Guidance
Values of \(x\) are chosen beforehand or \(x\) is independent or controlledB1 \(x\) is fixed or given or set or predetermined oe — Not "\(x\) is constant." Not just "\(y\) depends on \(x\)". Ignore all other, even if incorrect
[1]
# Question 5:

## Part 5(i):
| Answer | Marks | Guidance |
|--------|-------|----------|
| $S_{xx} = 20400 - \frac{360^2}{8} \quad (=4200)$ | | |
| $S_{yy} = 6.88 - \frac{6.8^2}{8} \quad (=1.1)$ | | |
| $S_{xy} = 241 - \frac{360\times6.8}{8} \quad (=-65)$ | M1 | Correct sub in a correct $S$ formula |
| $r = \frac{"-65"}{\sqrt{"4200"\times"1.1"}}$ | M1 | Correct sub in 3 correct $S$ formulae and a correct $r$ formula |
| $= -0.956$ (3 sf) | A1 | Correct ans with no working M2A1 — Ignore comment about $-1 < r < -0.9$ |
| **[3]** | | |

## Part 5(ii):
| Answer | Marks | Guidance |
|--------|-------|----------|
| eg As you move further away, prices drop | B1 | High prices go with short distances oe. Allow "Strong (or high or good or equiv) neg corr'n between price and distance" — Both variables must be in context; miles & £ enough. Ignore all else, even if incorrect. NOT just neg corr'n between price & dist |
| **[1]** | | |

## Part 5(iii):
| Answer | Marks | Guidance |
|--------|-------|----------|
| None | B1 | Ignore all else, even if incorrect |
| **[1]** | | |

## Part 5(iv):
| Answer | Marks | Guidance |
|--------|-------|----------|
| $b = \frac{"-65"}{" 4200"} \quad (=-0.0154762)$ | M1 | ft their $S_{xy}$ & $S_{xx}$ from (i) for M-marks only — or fresh start correct method |
| $Y - \frac{6.8}{8} = "-0.0154762"(x - \frac{360}{8})$ oe | M1 | or $a = \frac{6.8}{8} + "0.0154762"\times\frac{360}{8}$ oe |
| $y = -0.0155x + 1.55$ (3 sf) oe | A1 | allow $y = -0.015x + 1.5$ (or figs which round to these). (NOT $y=-0.016x+1.6$, NOT $y=-0.02x+1.5$). Must have "$y=$" — Allow figures in eqn which round to the correct figures to either 3 sf or 2 sf, even if they result from arith errors. |
| or $y = \frac{433}{280} - \frac{13}{840}x$ oe | | |
| **[3]** | | Correct ans with no working M2A1 |

## Part 5(v):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Values of $x$ are chosen beforehand or $x$ is independent or controlled | B1 | $x$ is fixed or given or set or predetermined oe — Not "$x$ is constant." Not just "$y$ depends on $x$". Ignore all other, even if incorrect |
| **[1]** | | |

---
5 Tariq collected information about typical prices, $\pounds y$ million, of four-bedroomed houses at varying distances, $x$ miles, from a large city. He chose houses at 10 -mile intervals from the city. His results are shown below.

\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | c | c | }
\hline
$x$ & 10 & 20 & 30 & 40 & 50 & 60 & 70 & 80 \\
\hline
$y$ & 1.2 & 1.4 & 1.2 & 0.9 & 0.8 & 0.5 & 0.5 & 0.3 \\
\hline
\end{tabular}
\end{center}

$$n = 8 \quad \Sigma x = 360 \quad \Sigma x ^ { 2 } = 20400 \quad \Sigma y = 6.8 \quad \Sigma y ^ { 2 } = 6.88 \quad \Sigma x y = 241$$

(i) Use an appropriate formula to calculate the product moment correlation coefficient, $r$, showing that $- 1.0 < r < - 0.9$.\\
(ii) State what this value of $r$ shows in this context.\\
(iii) Tariq decides to recalculate the value of $r$ with the house prices measured in hundreds of thousands of pounds, instead of millions of pounds. State what effect, if any, this will have on the value of $r$.\\
(iv) Calculate the equation of the regression line of $y$ on $x$.\\
(v) Explain why the regression line of $y$ on $x$, rather than $x$ on $y$, should be used for estimating a value of $x$ from a given value of $y$.

\hfill \mbox{\textit{OCR S1 2014 Q5 [9]}}