Edexcel S1 2002 June — Question 7 16 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Year2002
SessionJune
Marks16
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicLinear regression
TypeCalculate y on x from raw data table
DifficultyModerate -0.8 This is a standard S1 linear regression question requiring routine application of formulae for correlation coefficient and regression line from given summary statistics. All calculations follow textbook procedures with no problem-solving insight needed, making it easier than average but not trivial due to the multi-step arithmetic involved.
Spec2.02c Scatter diagrams and regression lines2.02d Informal interpretation of correlation5.08a Pearson correlation: calculate pmcc5.08d Hypothesis test: Pearson correlation5.09b Least squares regression: concepts5.09c Calculate regression line5.09e Use regression: for estimation in context

7. An ice cream seller believes that there is a relationship between the temperature on a summer day and the number of ice creams sold. Over a period of 10 days he records the temperature at 1 p.m., \(t ^ { \circ } \mathrm { C }\), and the number of ice creams sold, \(c\), in the next hour. The data he collects is summarised in the table below.
\(t\)\(c\)
1324
2255
1735
2045
1020
1530
1939
1219
1836
2354
[Use \(\left. \Sigma t ^ { 2 } = 3025 , \Sigma c ^ { 2 } = 14245 , \Sigma c t = 6526 .\right]\)
  1. Calculate the value of the product moment correlation coefficient between \(t\) and \(c\).
  2. State whether or not your value supports the use of a regression equation to predict the number of ice creams sold. Give a reason for your answer.
  3. Find the equation of the least squares regression line of \(c\) on \(t\) in the form \(c = a + b t\).
  4. Interpret the value of \(b\).
  5. Estimate the number of ice creams sold between 1 p.m. and 2 p.m. when the temperature at 1 p.m. is \(16 ^ { \circ } \mathrm { C }\).
    (3)
  6. At 1 p.m. on a particular day, the highest temperature for 50 years was recorded. Give a reason why you should not use the regression equation to predict ice cream sales on that day.
    (1)

Question 7:
Part (a)
AnswerMarks Guidance
AnswerMarks Guidance
\(\Sigma t = 169;\ \Sigma c = 357\)
\(S_{cc} = 14245 - \dfrac{357^2}{10} = 1500.1\)M1, A1
\(S_{tt} = 168.9,\ S_{ct} = 492.7\)A1, A1
\(r = \dfrac{492.7}{\sqrt{1500.1 \times 168.9}} = 0.97883\ldots\) accept \(0.979\)M1, A1
Part (b)
AnswerMarks Guidance
AnswerMarks Guidance
Since \(r\) close to 1, value supports use of regression lineB1, B1
Part (c)
AnswerMarks Guidance
AnswerMarks Guidance
\(b = \dfrac{S_{ct}}{S_{tt}} = \dfrac{492.7}{168.9} = 2.91711\ldots\)B1
\(a = \bar{c} - b\bar{t} = \dfrac{357}{10} - \dfrac{492.7}{168.9} \times \dfrac{169}{10} = -13.59917\ldots\)B1
\(c = -13.6 + 2.92t\)B1
Part (d)
AnswerMarks Guidance
AnswerMarks Guidance
3 extra ice-creams are sold for every \(1\ ^\circ\text{C}\) increase in temperatureB1
Part (e)
AnswerMarks Guidance
AnswerMarks Guidance
\(c = -13.6 + 2.92 \times 16 = 33.12\), i.e. 33 ice-creamsM1, A1, A1
Part (f)
AnswerMarks Guidance
AnswerMarks Guidance
Temperature likely to be outside range of validityB1
## Question 7:

### Part (a)
| Answer | Marks | Guidance |
|--------|-------|----------|
| $\Sigma t = 169;\ \Sigma c = 357$ | | |
| $S_{cc} = 14245 - \dfrac{357^2}{10} = 1500.1$ | M1, A1 | |
| $S_{tt} = 168.9,\ S_{ct} = 492.7$ | A1, A1 | |
| $r = \dfrac{492.7}{\sqrt{1500.1 \times 168.9}} = 0.97883\ldots$ accept $0.979$ | M1, A1 | |

### Part (b)
| Answer | Marks | Guidance |
|--------|-------|----------|
| Since $r$ close to 1, value supports use of regression line | B1, B1 | |

### Part (c)
| Answer | Marks | Guidance |
|--------|-------|----------|
| $b = \dfrac{S_{ct}}{S_{tt}} = \dfrac{492.7}{168.9} = 2.91711\ldots$ | B1 | |
| $a = \bar{c} - b\bar{t} = \dfrac{357}{10} - \dfrac{492.7}{168.9} \times \dfrac{169}{10} = -13.59917\ldots$ | B1 | |
| $c = -13.6 + 2.92t$ | B1 | |

### Part (d)
| Answer | Marks | Guidance |
|--------|-------|----------|
| 3 extra ice-creams are sold for every $1\ ^\circ\text{C}$ increase in temperature | B1 | |

### Part (e)
| Answer | Marks | Guidance |
|--------|-------|----------|
| $c = -13.6 + 2.92 \times 16 = 33.12$, i.e. 33 ice-creams | M1, A1, A1 | |

### Part (f)
| Answer | Marks | Guidance |
|--------|-------|----------|
| Temperature likely to be outside range of validity | B1 | |
7. An ice cream seller believes that there is a relationship between the temperature on a summer day and the number of ice creams sold. Over a period of 10 days he records the temperature at 1 p.m., $t ^ { \circ } \mathrm { C }$, and the number of ice creams sold, $c$, in the next hour. The data he collects is summarised in the table below.

\begin{center}
\begin{tabular}{|l|l|}
\hline
$t$ & $c$ \\
\hline
13 & 24 \\
\hline
22 & 55 \\
\hline
17 & 35 \\
\hline
20 & 45 \\
\hline
10 & 20 \\
\hline
15 & 30 \\
\hline
19 & 39 \\
\hline
12 & 19 \\
\hline
18 & 36 \\
\hline
23 & 54 \\
\hline
\end{tabular}
\end{center}

[Use $\left. \Sigma t ^ { 2 } = 3025 , \Sigma c ^ { 2 } = 14245 , \Sigma c t = 6526 .\right]$
\begin{enumerate}[label=(\alph*)]
\item Calculate the value of the product moment correlation coefficient between $t$ and $c$.
\item State whether or not your value supports the use of a regression equation to predict the number of ice creams sold. Give a reason for your answer.
\item Find the equation of the least squares regression line of $c$ on $t$ in the form $c = a + b t$.
\item Interpret the value of $b$.
\item Estimate the number of ice creams sold between 1 p.m. and 2 p.m. when the temperature at 1 p.m. is $16 ^ { \circ } \mathrm { C }$.\\
(3)
\item At 1 p.m. on a particular day, the highest temperature for 50 years was recorded. Give a reason why you should not use the regression equation to predict ice cream sales on that day.\\
(1)
\end{enumerate}

\hfill \mbox{\textit{Edexcel S1 2002 Q7 [16]}}