Edexcel S1 — Question 6 17 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Marks17
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicLinear regression
TypeCalculate y on x from raw data table
DifficultyModerate -0.8 This is a straightforward application of standard S1 regression formulas with all summations provided. Students simply substitute into memorized formulas for regression line and correlation coefficient, then draw a line and make a basic comment. No problem-solving or interpretation challenges beyond routine textbook exercises.
Spec2.02c Scatter diagrams and regression lines2.02d Informal interpretation of correlation5.08a Pearson correlation: calculate pmcc5.08d Hypothesis test: Pearson correlation5.09b Least squares regression: concepts5.09c Calculate regression line

6. A school introduced a new programme of support lessons in 1994 with a view to improving grades in GCSE English. The table below shows the number of years since 1994, n, and the corresponding percentage of students achieving A to C grades in GCSE English, \(p\), for each year.
\(n\)123456
\(p ( \% )\)35.237.140.639.043.444.8
  1. Represent these data on a scatter diagram. You may use the following values. $$\Sigma n = 21 , \quad \Sigma p = 240.1 , \quad \Sigma n ^ { 2 } = 91 , \quad \Sigma p ^ { 2 } = 9675.41 , \quad \Sigma n p = 873 .$$
  2. Find an equation of the regression line of \(p\) on \(n\) and draw it on your graph.
  3. Calculate the product moment correlation coefficient for these data and comment on the suitability of a linear model for the relationship between \(n\) and \(p\) during this period.

Part (a)
AnswerMarks
Scatter diagram with line \(p = 33.5 + 1.87n\)B4
Part (b)
AnswerMarks
\(S_{yy} = 873 - \frac{21 \times 240.1}{6} = 32.65\)M1
\(S_{nn} = 91 - \frac{21^2}{6} = 17.5\)M1
\(b = \frac{32.65}{17.5} = 1.8657\)M1 A1
\(a = \frac{240.1}{6} - 1.8657 \times \frac{21}{6} = 33.4867\)M1 A1
\(p = 33.5 + 1.87n\)A1
line on graph aboveB2
Part (c)
AnswerMarks
\(S_{pp} = 9675.41 - \frac{240.1^2}{6} = 67.4083\)M1
\(r = \frac{\sqrt{17.5 \times 67.4083}}{32.65} = 0.9506\)M1 A1
\(r\) strongly +ve supporting linear modelB1
(17)
Total(75)
**Part (a)**

**Scatter diagram with line** $p = 33.5 + 1.87n$ | B4 |

**Part (b)**

$S_{yy} = 873 - \frac{21 \times 240.1}{6} = 32.65$ | M1 |

$S_{nn} = 91 - \frac{21^2}{6} = 17.5$ | M1 |

$b = \frac{32.65}{17.5} = 1.8657$ | M1 A1 |

$a = \frac{240.1}{6} - 1.8657 \times \frac{21}{6} = 33.4867$ | M1 A1 |

$p = 33.5 + 1.87n$ | A1 |

line on graph above | B2 |

**Part (c)**

$S_{pp} = 9675.41 - \frac{240.1^2}{6} = 67.4083$ | M1 |

$r = \frac{\sqrt{17.5 \times 67.4083}}{32.65} = 0.9506$ | M1 A1 |

$r$ strongly +ve supporting linear model | B1 |

| (17) |

**Total** | (75) |
6. A school introduced a new programme of support lessons in 1994 with a view to improving grades in GCSE English. The table below shows the number of years since 1994, n, and the corresponding percentage of students achieving A to C grades in GCSE English, $p$, for each year.

\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | }
\hline
$n$ & 1 & 2 & 3 & 4 & 5 & 6 \\
\hline
$p ( \% )$ & 35.2 & 37.1 & 40.6 & 39.0 & 43.4 & 44.8 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item Represent these data on a scatter diagram.

You may use the following values.

$$\Sigma n = 21 , \quad \Sigma p = 240.1 , \quad \Sigma n ^ { 2 } = 91 , \quad \Sigma p ^ { 2 } = 9675.41 , \quad \Sigma n p = 873 .$$
\item Find an equation of the regression line of $p$ on $n$ and draw it on your graph.
\item Calculate the product moment correlation coefficient for these data and comment on the suitability of a linear model for the relationship between $n$ and $p$ during this period.
\end{enumerate}

\hfill \mbox{\textit{Edexcel S1  Q6 [17]}}