| Exam Board | OCR MEI |
|---|---|
| Module | Further Statistics A AS (Further Statistics A AS) |
| Year | 2019 |
| Session | June |
| Marks | 13 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Linear regression |
| Type | Interpret regression line parameters |
| Difficulty | Standard +0.3 This is a straightforward linear regression question covering standard AS-level techniques: calculating regression line equation from summary statistics, making predictions, and computing residuals. Part (a) requires recognizing non-linearity from a graph, and part (e) involves commenting on extrapolation—both routine interpretations. The calculations are mechanical with no conceptual challenges beyond basic formula application, making this slightly easier than average. |
| Spec | 5.09a Dependent/independent variables5.09b Least squares regression: concepts5.09c Calculate regression line5.09e Use regression: for estimation in context |
| Answer | Marks | Guidance |
|---|---|---|
| 6 | (a) | Because the test is only valid for random on |
| Answer | Marks |
|---|---|
| controlled | B1 |
| Answer | Marks |
|---|---|
| [2] | 3.5a |
| Answer | Marks | Guidance |
|---|---|---|
| (b) | Because altitude is the independent variable | B1 |
| [1] | 2.4 | Allow ‘controlled’, ‘selected’ for ‘independent’ |
| (c) | S 176090(21200105.4/12) |
| Answer | Marks |
|---|---|
| y = 0.00614x + 19.637 | M1 |
| Answer | Marks |
|---|---|
| [4] | 1.1a |
| Answer | Marks |
|---|---|
| 1.1 | For correct structure to find gradient (b) as written or |
| Answer | Marks |
|---|---|
| (d) | Prediction for 2000m is 7.35 C |
| Prediction for 3000m is 1.21 C | B1 |
| Answer | Marks |
|---|---|
| [2] | 3.4 |
| 1.1 | Allow 7.36 |
| Answer | Marks |
|---|---|
| (e) | Prediction for 2000m is more likely to be reliable |
| Answer | Marks |
|---|---|
| extrapolation | B1 |
| Answer | Marks |
|---|---|
| [2] | 3.5a |
| 3.5b | If ‘interpolation’ not stated then alternative wording must |
| Answer | Marks |
|---|---|
| (f) | Residual = 8.1 (0.00614×1600 + 19.637) |
| = 1.71 3 s.f. | M1 |
| Answer | Marks |
|---|---|
| [2] | 1.1 |
| 1.1 | For difference between 8.1 and their prediction for 1600 |
Question 6:
6 | (a) | Because the test is only valid for random on
random
and the altitudes are not random variables but
controlled | B1
B1
[2] | 3.5a
3.5b
(b) | Because altitude is the independent variable | B1
[1] | 2.4 | Allow ‘controlled’, ‘selected’ for ‘independent’
(c) | S 176090(21200105.4/12)
b xy
S 39100000212002 /12
xx
= 0.00614 3 s.f.
x = 1766.666…, y = 8.783333…
hence least squares regression line is:
y – 8.7833… = 0.00614(x – 1766.666..)
y = 0.00614x + 19.637 | M1
A1
M1
A1
[4] | 1.1a
1.1
1.1
1.1 | For correct structure to find gradient (b) as written or
176090/12(1766.78.783)
equivalent e.g.
39100000/121766.72
For b correct to 3 s.f. or better
For equation of line with their b < 0
Values correct to 3 s.f. or better
(d) | Prediction for 2000m is 7.35 C
Prediction for 3000m is 1.21 C | B1
B1
[2] | 3.4
1.1 | Allow 7.36
Allow 1.22
FT their equation with b < 0
(e) | Prediction for 2000m is more likely to be reliable
as interpolation
Prediction for 3000m is less likely to be reliable as
extrapolation | B1
B1
[2] | 3.5a
3.5b | If ‘interpolation’ not stated then alternative wording must
be convincing - e.g. ‘within domain of validity’
If ‘extrapolation’ not stated then alternative wording must
be convincing. e.g. allow ‘outside the range of the data
(values)’
(f) | Residual = 8.1 (0.00614×1600 + 19.637)
= 1.71 3 s.f. | M1
A1
[2] | 1.1
1.1 | For difference between 8.1 and their prediction for 1600
For – 1.71 FT their b < 0
PPMMTT
Y412/01 Mark Scheme June 2019
Additional notes RE:Non-assertive conclusions relating to H in hypothesis tests
1
There is insufficient evidence to believe/suggest/indicate that there is an association between…
There is insufficient evidence to believe/suggest/indicate that… …are not independent.
There is sufficient evidence to believe/suggest/indicate that there is an association between…
There is sufficient evidence to believe/suggest/indicate that… …are not independent.
9
PPMMTT
OCR (Oxford Cambridge and RSA Examinations)
The Triangle Building
Shaftesbury Road
Cambridge
CB2 8EA
OCR Customer Contact Centre
Education and Learning
Telephone: 01223 553998
Facsimile: 01223 552627
Email: general.qualifications@ocr.org.uk
www.ocr.org.uk
For staff training purposes and as part of our quality assurance
programme your call may be recorded or monitored
Oxford Cambridge and RSA Examinations
is a Company Limited by Guarantee
Registered in England
Registered Office; The Triangle Building, Shaftesbury Road, Cambridge, CB2 8EA
Registered Company Number: 3484466
OCR is an exempt Charity
OCR (Oxford Cambridge and RSA Examinations)
Head office
Telephone: 01223 552552
Facsimile: 01223 552553
© OCR 2019
6 A meteorologist is investigating the relationship between altitude $x$ metres and mean annual temperature $y ^ { \circ } \mathrm { C }$ in an American state.\\
She selects 12 locations at various altitudes and then stations a remote monitoring device at each of them to measure the temperature over the course of a year. Fig. 6 illustrates the data which she obtains.
\begin{figure}[h]
\begin{center}
\includegraphics[alt={},max width=\textwidth]{fd496303-10f1-450e-bbeb-421ab6f4de21-6_686_1477_486_292}
\captionsetup{labelformat=empty}
\caption{Fig. 6}
\end{center}
\end{figure}
\begin{enumerate}[label=(\alph*)]
\item Explain why it would not be appropriate to carry out a hypothesis test for correlation based on the product moment correlation coefficient.
\item Explain why altitude has been plotted on the horizontal axis in Fig. 6.
Summary statistics for $x$ and $y$ are as follows.
$$\sum x = 21200 \quad \sum y = 105.4 \quad \sum x ^ { 2 } = 39100000 \quad \sum y ^ { 2 } = 1004 \quad \sum x y = 176090$$
\item Calculate the equation of the regression line of $y$ on $x$.
\item Use the equation of the regression line to predict the values of the mean annual temperature at each of the following altitudes.
\begin{itemize}
\item 2000 metres
\item 3000 metres
\item Comment on the reliability of your predictions in part (d).
\item Calculate the value of the residual for the data point ( $1600,8.1$ ).
\end{itemize}
\end{enumerate}
\hfill \mbox{\textit{OCR MEI Further Statistics A AS 2019 Q6 [13]}}