Edexcel S1 2003 January — Question 6 19 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Year2003
SessionJanuary
Marks19
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicLinear regression
TypeCalculate y on x from raw data table
DifficultyModerate -0.3 This is a standard S1 linear regression question with coding that follows a routine template. All summary statistics are provided, requiring only substitution into formulas for the regression line, then reverse-coding. The interpretation and prediction parts are straightforward applications. While it has multiple parts (5 marks total based on typical S1 marking), each step is mechanical with no novel problem-solving required, making it slightly easier than average.
Spec5.08a Pearson correlation: calculate pmcc5.09a Dependent/independent variables5.09b Least squares regression: concepts5.09c Calculate regression line5.09d Linear coding: effect on regression5.09e Use regression: for estimation in context

6. The chief executive of Rex cars wants to investigate the relationship between the number of new car sales and the amount of money spent on advertising. She collects data from company records on the number of new car sales, \(c\), and the cost of advertising each year, \(p\) (£000). The data are shown in the table below.
YearNumber of new car sale, \(c\)Cost of advertising (£000), \(p\)
19904240120
19914380126
19924420132
19934440134
19944430137
19954520144
19964590148
19974660150
19984700153
19994790158
  1. Using the coding \(x = ( p - 100 )\) and \(y = \frac { 1 } { 10 } ( c - 4000 )\), draw a scatter diagram to represent these data. Explain why \(x\) is the explanatory variable.
  2. Find the equation of the least squares regression line of \(y\) on \(x\). $$\text { [Use } \left. \Sigma x = 402 , \Sigma y = 517 , \Sigma x ^ { 2 } = 17538 \text { and } \Sigma x y = 22611 . \right]$$
  3. Deduce the equation of the least squares regression line of \(c\) on \(p\) in the form \(c = a + b p\).
  4. Interpret the value of \(a\).
  5. Predict the number of extra new cars sales for an increase of \(\pounds 2000\) in advertising budget. Comment on the validity of your answer.
    (2)

Question 6:
Part (a)
AnswerMarks Guidance
AnswerMarks Guidance
\(x\): 20 26 32 34 37 44 48 50 53 58B1
\(y\): 24 38 42 44 43 52 59 66 70 79B1
Change in cost of advertising influences number of new car salesB1
Graph: scale and labelsB1
Points all correctB2 (5 marks)
Part (b)
AnswerMarks Guidance
AnswerMarks Guidance
\(S_{xy} = 22611 - \frac{402 \times 517}{10} = 1827.6\)M1A1
\(S_{xx} = 17538 - \frac{402^2}{10} = 1377.6\)A1
\(b = \frac{S_{xy}}{S_{xx}} = \frac{1827.6}{1377.6} = 1.3267...\)M1A1
\(a = \frac{517}{10} - (1.326655...)\times\frac{402}{10} = -1.63153...\)B1
\(\therefore y = -1.63 + 1.33x\)B1ft (7 marks)
Part (c)
AnswerMarks Guidance
AnswerMarks Guidance
\(\frac{c-4000}{10} = -1.63 + 1.33(p-100)\)M1A1ft
\(c = 2653.7 + 13.3p\)A1 (3 marks)
Part (d)
AnswerMarks Guidance
AnswerMarks Guidance
No. sold if no money spent on advertisingB1
\(p=0\) is well outside valid range — meaninglessB1 (2 marks)
Part (e)
AnswerMarks Guidance
AnswerMarks Guidance
\(2 \times 13.3 = 27\) extra cars soldB1
Only valid in range of data for 1990sB1 (2 marks)
# Question 6:

## Part (a)
| Answer | Marks | Guidance |
|--------|-------|----------|
| $x$: 20 26 32 34 37 44 48 50 53 58 | B1 | |
| $y$: 24 38 42 44 43 52 59 66 70 79 | B1 | |
| Change in cost of advertising influences number of new car sales | B1 | |
| Graph: scale and labels | B1 | |
| Points all correct | B2 | **(5 marks)** |

## Part (b)
| Answer | Marks | Guidance |
|--------|-------|----------|
| $S_{xy} = 22611 - \frac{402 \times 517}{10} = 1827.6$ | M1A1 | |
| $S_{xx} = 17538 - \frac{402^2}{10} = 1377.6$ | A1 | |
| $b = \frac{S_{xy}}{S_{xx}} = \frac{1827.6}{1377.6} = 1.3267...$ | M1A1 | |
| $a = \frac{517}{10} - (1.326655...)\times\frac{402}{10} = -1.63153...$ | B1 | |
| $\therefore y = -1.63 + 1.33x$ | B1ft | **(7 marks)** |

## Part (c)
| Answer | Marks | Guidance |
|--------|-------|----------|
| $\frac{c-4000}{10} = -1.63 + 1.33(p-100)$ | M1A1ft | |
| $c = 2653.7 + 13.3p$ | A1 | **(3 marks)** |

## Part (d)
| Answer | Marks | Guidance |
|--------|-------|----------|
| No. sold if no money spent on advertising | B1 | |
| $p=0$ is well outside valid range — meaningless | B1 | **(2 marks)** |

## Part (e)
| Answer | Marks | Guidance |
|--------|-------|----------|
| $2 \times 13.3 = 27$ extra cars sold | B1 | |
| Only valid in range of data for 1990s | B1 | **(2 marks)** |
6. The chief executive of Rex cars wants to investigate the relationship between the number of new car sales and the amount of money spent on advertising. She collects data from company records on the number of new car sales, $c$, and the cost of advertising each year, $p$ (£000). The data are shown in the table below.

\begin{center}
\begin{tabular}{|l|l|l|}
\hline
Year & Number of new car sale, $c$ & Cost of advertising (£000), $p$ \\
\hline
1990 & 4240 & 120 \\
\hline
1991 & 4380 & 126 \\
\hline
1992 & 4420 & 132 \\
\hline
1993 & 4440 & 134 \\
\hline
1994 & 4430 & 137 \\
\hline
1995 & 4520 & 144 \\
\hline
1996 & 4590 & 148 \\
\hline
1997 & 4660 & 150 \\
\hline
1998 & 4700 & 153 \\
\hline
1999 & 4790 & 158 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item Using the coding $x = ( p - 100 )$ and $y = \frac { 1 } { 10 } ( c - 4000 )$, draw a scatter diagram to represent these data. Explain why $x$ is the explanatory variable.
\item Find the equation of the least squares regression line of $y$ on $x$.

$$\text { [Use } \left. \Sigma x = 402 , \Sigma y = 517 , \Sigma x ^ { 2 } = 17538 \text { and } \Sigma x y = 22611 . \right]$$
\item Deduce the equation of the least squares regression line of $c$ on $p$ in the form $c = a + b p$.
\item Interpret the value of $a$.
\item Predict the number of extra new cars sales for an increase of $\pounds 2000$ in advertising budget. Comment on the validity of your answer.\\
(2)
\end{enumerate}

\hfill \mbox{\textit{Edexcel S1 2003 Q6 [19]}}