| Exam Board | Edexcel |
|---|---|
| Module | S1 (Statistics 1) |
| Session | Specimen |
| Marks | 14 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Linear regression |
| Type | Calculate y on x from raw data table |
| Difficulty | Moderate -0.8 This is a standard S1 linear regression question with all summary statistics provided. Students follow a routine algorithm: calculate Sdd and Sfd using given sums, find b and a using formulas, then interpret. The final part requires simple algebraic manipulation. No conceptual difficulty or novel insight required—purely procedural application of memorized formulas. |
| Spec | 5.09a Dependent/independent variables5.09b Least squares regression: concepts5.09c Calculate regression line5.09e Use regression: for estimation in context |
| Destination | \(A\) | \(B\) | \(C\) | \(D\) | \(E\) | \(F\) |
| \(d\) | 2.2 | 4.0 | 6.0 | 2.5 | 8.0 | 5.0 |
| \(f\) | 18 | 20 | 25 | 23 | 32 | 28 |
| Answer | Marks |
|---|---|
| Points plotted correctly | B1 B1 |
| Answer | Marks |
|---|---|
| The points lie reasonably close to a straight line | B1 |
| Answer | Marks |
|---|---|
| \(\bar{d} = 27.7\), \(\bar{f} = 146\) (both, may be implied) | B1 |
| \(S_{dd} = 152.09 - \frac{(27.7)^2}{6} = 24.208\ldots\) awrt 24.2 | M1 A1 |
| \(S_{fd} = 723.1 - \frac{27.7 \times 146}{6} = 49.06\ldots\) awrt 49.1 | A1 |
| Answer | Marks |
|---|---|
| \(b = \frac{S_{fd}}{S_{dd}} = 2.026\ldots\) awrt 2.03 | M1 A1 |
| \(a = \frac{146}{6} - b \times \frac{27.7}{6} = 14.97\ldots\) so \(f = \mathbf{15.0 + 2.03d}\) | M1 A1 |
| Answer | Marks |
|---|---|
| A flight costs £2.03 (or about £2) for every extra 100km or about 2p per km | B1ft |
| Answer | Marks |
|---|---|
| \(15.0 + 2.03d < 5d\) so \(d > \frac{15.0}{(5-2.03)} = 5.00 \sim 5.05\) | M1 |
| So \(t > 500\sim505\) | A1 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| At least 4 points correct | B1 | Allow \(\pm\) one 2mm square |
| All points correct | B1 | Allow \(\pm\) one 2mm square |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| Line drawn through correct points | B1 | Ignore extra points and lines; require reference to points and line |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| Correct method for either \(S_{dd}\) or \(S_{fd}\) | M1 | Correct expression seen |
| \(S_{dd}\) awrt 24.2 | A1 | |
| \(S_{fd}\) awrt 49.1 | A1 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| Correct expression for \(b\) | M1 | Can follow through from (c) |
| Correct method to find \(a\) using \(b\) and means | M1 | Follow through |
| \(f = \ldots\) in terms of \(d\), all values awrt | A1 | Accept 15 as rounding from correct answer only |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| Contextual interpretation of cost and distance | — | Follow through their value of \(b\) |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Mark | Guidance |
| Attempt intersection of 2 lines | M1 | \(t\) in range 500 to 505 seen |
| \(d\) in range 5 to 5.05 | A1 | Accept \(t\) greater than 500 to 505 inclusive for M1A1 |
# Question 6:
## Part (a)
| Points plotted correctly | B1 B1 | |
## Part (b)
| The points lie reasonably close to a straight **line** | B1 | |
## Part (c)
| $\bar{d} = 27.7$, $\bar{f} = 146$ (both, may be implied) | B1 | |
|---|---|---|
| $S_{dd} = 152.09 - \frac{(27.7)^2}{6} = 24.208\ldots$ awrt **24.2** | M1 A1 | |
| $S_{fd} = 723.1 - \frac{27.7 \times 146}{6} = 49.06\ldots$ awrt **49.1** | A1 | |
## Part (d)
| $b = \frac{S_{fd}}{S_{dd}} = 2.026\ldots$ awrt **2.03** | M1 A1 | |
|---|---|---|
| $a = \frac{146}{6} - b \times \frac{27.7}{6} = 14.97\ldots$ so $f = \mathbf{15.0 + 2.03d}$ | M1 A1 | |
## Part (e)
| A flight costs **£2.03 (or about £2)** for every extra **100km** or about 2p per km | B1ft | |
## Part (f)
| $15.0 + 2.03d < 5d$ so $d > \frac{15.0}{(5-2.03)} = 5.00 \sim 5.05$ | M1 | |
|---|---|---|
| So $t > 500\sim505$ | A1 | |
# WST01/01: Statistics S1 Mark Scheme
---
## Question 6 (Regression/Correlation):
**Part (a)**
| Answer | Mark | Guidance |
|--------|------|----------|
| At least 4 points correct | B1 | Allow $\pm$ one 2mm square |
| All points correct | B1 | Allow $\pm$ one 2mm square |
**Part (b)**
| Answer | Mark | Guidance |
|--------|------|----------|
| Line drawn through correct points | B1 | Ignore extra points and lines; require reference to points and line |
**Part (c)**
| Answer | Mark | Guidance |
|--------|------|----------|
| Correct method for either $S_{dd}$ or $S_{fd}$ | M1 | Correct expression seen |
| $S_{dd}$ awrt 24.2 | A1 | |
| $S_{fd}$ awrt 49.1 | A1 | |
**Part (d)**
| Answer | Mark | Guidance |
|--------|------|----------|
| Correct expression for $b$ | M1 | Can follow through from (c) |
| Correct method to find $a$ using $b$ and means | M1 | Follow through |
| $f = \ldots$ in terms of $d$, all values awrt | A1 | Accept 15 as rounding from correct answer only |
**Part (e)**
| Answer | Mark | Guidance |
|--------|------|----------|
| Contextual interpretation of cost and distance | — | Follow through their value of $b$ |
**Part (f)**
| Answer | Mark | Guidance |
|--------|------|----------|
| Attempt intersection of 2 lines | M1 | $t$ in range 500 to 505 seen |
| $d$ in range 5 to 5.05 | A1 | Accept $t$ greater than 500 to 505 inclusive for M1A1 |
---
\begin{enumerate}
\item A travel agent sells flights to different destinations from Beerow airport. The distance $d$, measured in 100 km , of the destination from the airport and the fare $\pounds f$ are recorded for a random sample of 6 destinations.
\end{enumerate}
\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | }
\hline
Destination & $A$ & $B$ & $C$ & $D$ & $E$ & $F$ \\
\hline
$d$ & 2.2 & 4.0 & 6.0 & 2.5 & 8.0 & 5.0 \\
\hline
$f$ & 18 & 20 & 25 & 23 & 32 & 28 \\
\hline
\end{tabular}
\end{center}
$$\text { [You may use } \sum d ^ { 2 } = 152.09 \quad \sum f ^ { 2 } = 3686 \quad \sum f d = 723.1 \text { ] }$$
(a) Using the axes below, complete a scatter diagram to illustrate this information.\\
(b) Explain why a linear regression model may be appropriate to describe the relationship between $f$ and $d$.\\
(c) Calculate $S _ { d d }$ and $S _ { f d }$\\
(d) Calculate the equation of the regression line of $f$ on $d$ giving your answer in the form $f = a + b d$.\\
(e) Give an interpretation of the value of $b$.
Jane is planning her holiday and wishes to fly from Beerow airport to a destination $t \mathrm {~km}$ away. A rival travel agent charges 5 p per km.\\
(f) Find the range of values of $t$ for which the first travel agent is cheaper than the rival.\\
\includegraphics[max width=\textwidth, alt={}, center]{61983561-79f7-4883-8ae7-ab1f4955d444-20_967_1630_1722_164}
\hfill \mbox{\textit{Edexcel S1 Q6 [14]}}