AQA S1 2005 June — Question 4 12 marks

Exam BoardAQA
ModuleS1 (Statistics 1)
Year2005
SessionJune
Marks12
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicLinear regression
TypeCalculate y on x from raw data table
DifficultyModerate -0.8 This is a standard S1 regression question requiring routine calculations: finding regression line from summary statistics, computing residuals, and making predictions. All techniques are textbook procedures with no novel problem-solving required. The multi-part structure and context add length but not conceptual difficulty—this is easier than an average A-level question across all modules.
Spec5.09a Dependent/independent variables5.09c Calculate regression line5.09e Use regression: for estimation in context

4 The time taken for a fax machine to scan an A4 sheet of paper is dependent, in part, on the number of lines of print on the sheet. The table below shows, for each of a random sample of 8 sheets of A4 paper, the number, \(x\), of lines of print and the scanning time, \(y\) seconds, taken by the fax machine.
Sheet\(\mathbf { 1 }\)\(\mathbf { 2 }\)\(\mathbf { 3 }\)\(\mathbf { 4 }\)\(\mathbf { 5 }\)\(\mathbf { 6 }\)\(\mathbf { 7 }\)\(\mathbf { 8 }\)
\(\boldsymbol { x }\)1016232731353844
\(\boldsymbol { y }\)2.43.53.24.14.15.64.65.3
  1. Calculate the equation of the least squares regression line of \(y\) on \(x\).
  2. The following table lists some of the residuals for the regression line.
    Sheet\(\mathbf { 1 }\)\(\mathbf { 2 }\)\(\mathbf { 3 }\)\(\mathbf { 4 }\)\(\mathbf { 5 }\)\(\mathbf { 6 }\)\(\mathbf { 7 }\)\(\mathbf { 8 }\)
    Residual- 0.1740.4180.085- 0.2540.906- 0.157
    1. Calculate the values of the residuals for sheets 3 and 7 .
    2. Hence explain what can be deduced about the regression line.
  3. The time, \(z\) seconds, to transmit an A4 page after scanning is given by: $$z = 0.80 + 0.05 x$$ Estimate the total time to scan and transmit an A4 page containing:
    1. 15 lines of print;
    2. 75 lines of print. In each case comment on the likely reliability of your estimate.

Question 4:
Part (a)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
Gradient \(b = 0.0848\) OR \(b = 0.084\) to \(0.085\)B2 (B1) AWRT; AWFW
Intercept \(a = 1.72\) to \(1.73\) OR \(a = 1.7\)B2 (B1) AWFW; CAO
Attempt at \(\Sigma x\), \(\Sigma x^2\), \(\Sigma y\), \(\Sigma xy\) OR \(S_{xx}\), \(S_{xy}\)(M1) 224, 7180; 32.8; 995.4 OR 908; 77
Attempt at correct formula for \(b\)(m1)
\(b = 0.0848\)(A1) AWRT
\(a = 1.72\) to \(1.73\)(A1) AWFW
Part (b)(i)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
Residual \(= y - a - bx\)M1 \(\text{Res} =
\((\text{Residual})_3 = -0.465\) to \(-0.485\)A1 (A1) AWFW; both correct magnitude
\((\text{Residual})_7 = -0.335\) to \(-0.365\)A1 AWFW
Part (b)(ii)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
Residuals are small (relative to \(y\)-values); no pattern to residualsB1 Except for \((\text{Residual})_6\); any sensible comment; residuals random
Fitted equation is appropriate/suitableB1 Or equivalent; do not allow "equation is good", "equation is accurate"; allow "equation is suitable"
Part (c)(i)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
Total \(=\) Scan \(+\) Transmit \(= y + z\)M1 Use of; or equivalent
\(T_{15} = 4.45\) to \(4.6\)A1 AWFW
Reliable as interpolation or small residualsB1 Or equivalent
Part (c)(ii)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
\(T_{75} = 12.5\) to \(12.7\)A1 AWFW
Unreliable as extrapolation; cannot get 75 lines of print on A4 pageB1 Or equivalent
# Question 4:

## Part (a)

| Answer/Working | Marks | Guidance |
|---|---|---|
| Gradient $b = 0.0848$ OR $b = 0.084$ to $0.085$ | B2 (B1) | AWRT; AWFW |
| Intercept $a = 1.72$ to $1.73$ OR $a = 1.7$ | B2 (B1) | AWFW; CAO |
| Attempt at $\Sigma x$, $\Sigma x^2$, $\Sigma y$, $\Sigma xy$ OR $S_{xx}$, $S_{xy}$ | (M1) | 224, 7180; 32.8; 995.4 OR 908; 77 |
| Attempt at correct formula for $b$ | (m1) | |
| $b = 0.0848$ | (A1) | AWRT |
| $a = 1.72$ to $1.73$ | (A1) | AWFW |

## Part (b)(i)

| Answer/Working | Marks | Guidance |
|---|---|---|
| Residual $= y - a - bx$ | M1 | $\text{Res} = |(\text{Obs } y) - (\text{Pred } y)|$ used; allow use of $x=3$ and/or $x=7$ |
| $(\text{Residual})_3 = -0.465$ to $-0.485$ | A1 (A1) | AWFW; both correct magnitude |
| $(\text{Residual})_7 = -0.335$ to $-0.365$ | A1 | AWFW |

## Part (b)(ii)

| Answer/Working | Marks | Guidance |
|---|---|---|
| Residuals are small (relative to $y$-values); no pattern to residuals | B1 | Except for $(\text{Residual})_6$; any sensible comment; residuals random |
| Fitted equation is **appropriate/suitable** | B1 | Or equivalent; do **not** allow "equation is good", "equation is accurate"; allow "equation is suitable" |

## Part (c)(i)

| Answer/Working | Marks | Guidance |
|---|---|---|
| Total $=$ Scan $+$ Transmit $= y + z$ | M1 | Use of; or equivalent |
| $T_{15} = 4.45$ to $4.6$ | A1 | AWFW |
| Reliable as interpolation or small residuals | B1 | Or equivalent |

## Part (c)(ii)

| Answer/Working | Marks | Guidance |
|---|---|---|
| $T_{75} = 12.5$ to $12.7$ | A1 | AWFW |
| Unreliable as extrapolation; cannot get 75 lines of print on A4 page | B1 | Or equivalent |
4 The time taken for a fax machine to scan an A4 sheet of paper is dependent, in part, on the number of lines of print on the sheet. The table below shows, for each of a random sample of 8 sheets of A4 paper, the number, $x$, of lines of print and the scanning time, $y$ seconds, taken by the fax machine.

\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | c | c | }
\hline
Sheet & $\mathbf { 1 }$ & $\mathbf { 2 }$ & $\mathbf { 3 }$ & $\mathbf { 4 }$ & $\mathbf { 5 }$ & $\mathbf { 6 }$ & $\mathbf { 7 }$ & $\mathbf { 8 }$ \\
\hline
$\boldsymbol { x }$ & 10 & 16 & 23 & 27 & 31 & 35 & 38 & 44 \\
\hline
$\boldsymbol { y }$ & 2.4 & 3.5 & 3.2 & 4.1 & 4.1 & 5.6 & 4.6 & 5.3 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item Calculate the equation of the least squares regression line of $y$ on $x$.
\item The following table lists some of the residuals for the regression line.

\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | c | c | }
\hline
Sheet & $\mathbf { 1 }$ & $\mathbf { 2 }$ & $\mathbf { 3 }$ & $\mathbf { 4 }$ & $\mathbf { 5 }$ & $\mathbf { 6 }$ & $\mathbf { 7 }$ & $\mathbf { 8 }$ \\
\hline
Residual & - 0.174 & 0.418 &  & 0.085 & - 0.254 & 0.906 &  & - 0.157 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\roman*)]
\item Calculate the values of the residuals for sheets 3 and 7 .
\item Hence explain what can be deduced about the regression line.
\end{enumerate}\item The time, $z$ seconds, to transmit an A4 page after scanning is given by:

$$z = 0.80 + 0.05 x$$

Estimate the total time to scan and transmit an A4 page containing:
\begin{enumerate}[label=(\roman*)]
\item 15 lines of print;
\item 75 lines of print.

In each case comment on the likely reliability of your estimate.
\end{enumerate}\end{enumerate}

\hfill \mbox{\textit{AQA S1 2005 Q4 [12]}}