OCR Further Statistics AS 2020 November — Question 3 9 marks

Exam BoardOCR
ModuleFurther Statistics AS (Further Statistics AS)
Year2020
SessionNovember
Marks9
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicLinear regression
TypeCalculate PMCC from raw data
DifficultyModerate -0.3 This is a straightforward application of standard PMCC and regression formulas with all summary statistics provided. Students need to recall formulas and perform arithmetic calculations, plus make basic comments about interpolation vs extrapolation. While it's a Further Maths topic, the computational and conceptual demands are routine, making it slightly easier than an average A-level question.
Spec5.09a Dependent/independent variables5.09b Least squares regression: concepts5.09c Calculate regression line

3 An investor obtains data about the profits of 8 randomly chosen investment accounts over two one-year periods. The profit in the first year for each account is \(p \%\) and the profit in the second year for each account is \(q \%\). The results are shown in the table and in the scatter diagram.
AccountABCDEFGH
\(p\)1.62.12.42.72.83.35.28.4
\(q\)1.62.32.22.23.12.97.64.8
\(n = 8 \quad \sum \mathrm { p } = 28.5 \quad \sum \mathrm { q } = 26.7 \quad \sum \mathrm { p } ^ { 2 } = 136.35 \quad \sum \mathrm { q } ^ { 2 } = 116.35 \quad \sum \mathrm { pq } = 116.70\) \includegraphics[max width=\textwidth, alt={}, center]{bf1468d1-e02e-47d2-bf41-5bc8f5b4d7c4-3_782_1280_998_242}
  1. State which, if either, of the variables \(p\) and \(q\) is independent.
  2. Calculate the equation of the regression line of \(q\) on \(p\).
    1. Use the regression line to estimate the value of \(q\) for an investment account for which \(p = 2.5\).
    2. Give two reasons why this estimate could be considered reliable.
  3. Comment on the reliability of using the regression line to predict the value of \(q\) when \(p = 7.0\).

Question 3:
Part (a)
AnswerMarks
NeitherB1 [1]
Part (b)
AnswerMarks Guidance
\(q = 1.13 + 0.620p\)B1B1, B1 [3] \(0.62(0)\) correct; both numbers correct; Fully correct answer including letters
Part (c)(i)
AnswerMarks Guidance
\(2.68\)B1ft [1] awrt \(2.68\), ft on their (b) if letters correct
Part (c)(ii)
AnswerMarks Guidance
\(2.5\) is within data range, and points (here) are close to line/well correlatedB1, B1 [2] At least one reason, allow "no because points not close to line"; Full argument, two reasons needed
Part (d)
AnswerMarks Guidance
Not much data here/points scattered/possible outliers; So not very reliableM1, A1 [2] Reason for not very reliable (*not* "extrapolation"); Full argument and conclusion, not too assertive (not wholly unreliable!)
# Question 3:

## Part (a)
Neither | **B1** [1] |

## Part (b)
$q = 1.13 + 0.620p$ | **B1B1, B1** [3] | $0.62(0)$ correct; both numbers correct; Fully correct answer including letters

## Part (c)(i)
$2.68$ | **B1ft** [1] | awrt $2.68$, ft on their **(b)** if letters correct

## Part (c)(ii)
$2.5$ is within data range, and points (here) are close to line/well correlated | **B1, B1** [2] | At least one reason, allow "no because points not close to line"; Full argument, two reasons needed

## Part (d)
Not much data here/points scattered/possible outliers; So not very reliable | **M1, A1** [2] | Reason for not very reliable (*not* "extrapolation"); Full argument and conclusion, not too assertive (not wholly unreliable!)

---
3 An investor obtains data about the profits of 8 randomly chosen investment accounts over two one-year periods.

The profit in the first year for each account is $p \%$ and the profit in the second year for each account is $q \%$.

The results are shown in the table and in the scatter diagram.

\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | c | c | }
\hline
Account & A & B & C & D & E & F & G & H \\
\hline
$p$ & 1.6 & 2.1 & 2.4 & 2.7 & 2.8 & 3.3 & 5.2 & 8.4 \\
\hline
$q$ & 1.6 & 2.3 & 2.2 & 2.2 & 3.1 & 2.9 & 7.6 & 4.8 \\
\hline
\end{tabular}
\end{center}

$n = 8 \quad \sum \mathrm { p } = 28.5 \quad \sum \mathrm { q } = 26.7 \quad \sum \mathrm { p } ^ { 2 } = 136.35 \quad \sum \mathrm { q } ^ { 2 } = 116.35 \quad \sum \mathrm { pq } = 116.70$\\
\includegraphics[max width=\textwidth, alt={}, center]{bf1468d1-e02e-47d2-bf41-5bc8f5b4d7c4-3_782_1280_998_242}
\begin{enumerate}[label=(\alph*)]
\item State which, if either, of the variables $p$ and $q$ is independent.
\item Calculate the equation of the regression line of $q$ on $p$.
\item \begin{enumerate}[label=(\roman*)]
\item Use the regression line to estimate the value of $q$ for an investment account for which $p = 2.5$.
\item Give two reasons why this estimate could be considered reliable.
\end{enumerate}\item Comment on the reliability of using the regression line to predict the value of $q$ when $p = 7.0$.
\end{enumerate}

\hfill \mbox{\textit{OCR Further Statistics AS 2020 Q3 [9]}}