Edexcel S1 2017 June — Question 5 15 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Year2017
SessionJune
Marks15
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicLinear regression
TypeCalculate from summary statistics
DifficultyModerate -0.3 This is a standard S1 linear regression question requiring routine application of formulas (S_ss, S_st, regression equation) with given summary statistics. All parts follow textbook procedures with no novel problem-solving required, though the multi-part structure and need to recall several formulas makes it slightly more demanding than pure recall questions.
Spec2.02g Calculate mean and standard deviation5.08a Pearson correlation: calculate pmcc5.09b Least squares regression: concepts5.09c Calculate regression line5.09e Use regression: for estimation in context

  1. Tomas is studying the relationship between temperature and hours of sunshine in Seapron. He records the midday temperature, \(t ^ { \circ } \mathrm { C }\), and the hours of sunshine, \(s\) hours, for a random sample of 9 days in October. He calculated the following statistics
$$\sum s = 15 \quad \sum s ^ { 2 } = 44.22 \quad \sum t = 127 \quad \mathrm {~S} _ { t t } = 10.89$$
  1. Calculate \(\mathrm { S } _ { s s }\) Tomas calculated the product moment correlation coefficient between \(s\) and \(t\) to be 0.832 correct to 3 decimal places.
  2. State, giving a reason, whether or not this correlation coefficient supports the use of a linear regression model to describe the relationship between midday temperature and hours of sunshine.
  3. State, giving a reason, why the hours of sunshine would be the explanatory variable in a linear regression model between midday temperature and hours of sunshine.
  4. Find \(\mathrm { S } _ { s t }\)
  5. Calculate a suitable linear regression equation to model the relationship between midday temperature and hours of sunshine.
  6. Calculate the standard deviation of \(s\) Tomas uses this model to estimate the midday temperature in Seapron for a day in October with 5 hours of sunshine.
  7. State the value of Tomas' estimate. Given that the values of \(s\) are all within 2 standard deviations of the mean,
  8. comment, giving your reason, on the reliability of this estimate.

\begin{enumerate}
  \item Tomas is studying the relationship between temperature and hours of sunshine in Seapron. He records the midday temperature, $t ^ { \circ } \mathrm { C }$, and the hours of sunshine, $s$ hours, for a random sample of 9 days in October. He calculated the following statistics
\end{enumerate}

$$\sum s = 15 \quad \sum s ^ { 2 } = 44.22 \quad \sum t = 127 \quad \mathrm {~S} _ { t t } = 10.89$$

(a) Calculate $\mathrm { S } _ { s s }$

Tomas calculated the product moment correlation coefficient between $s$ and $t$ to be 0.832 correct to 3 decimal places.\\
(b) State, giving a reason, whether or not this correlation coefficient supports the use of a linear regression model to describe the relationship between midday temperature and hours of sunshine.\\
(c) State, giving a reason, why the hours of sunshine would be the explanatory variable in a linear regression model between midday temperature and hours of sunshine.\\
(d) Find $\mathrm { S } _ { s t }$\\
(e) Calculate a suitable linear regression equation to model the relationship between midday temperature and hours of sunshine.\\
(f) Calculate the standard deviation of $s$

Tomas uses this model to estimate the midday temperature in Seapron for a day in October with 5 hours of sunshine.\\
(g) State the value of Tomas' estimate.

Given that the values of $s$ are all within 2 standard deviations of the mean,\\
(h) comment, giving your reason, on the reliability of this estimate.

\hfill \mbox{\textit{Edexcel S1 2017 Q5 [15]}}