Edexcel S1 2004 June — Question 2 18 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Year2004
SessionJune
Marks18
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicLinear regression
TypeCalculate y on x from raw data table
DifficultyModerate -0.8 This is a standard S1 linear regression question requiring routine application of formulae for Sxy, Sxx, correlation coefficient, and regression line. All necessary summations are provided, making it purely computational with no problem-solving insight required. The multi-part structure is typical but each step follows directly from textbook procedures.
Spec2.02c Scatter diagrams and regression lines5.08a Pearson correlation: calculate pmcc5.09b Least squares regression: concepts5.09c Calculate regression line5.09e Use regression: for estimation in context

2. A researcher thinks there is a link between a person's height and level of confidence. She measured the height \(h\), to the nearest cm , of a random sample of 9 people. She also devised a test to measure the level of confidence \(c\) of each person. The data are shown in the table below.
\(h\)179169187166162193161177168
\(c\)569561579561540598542565573
[You may use \(\Sigma h ^ { 2 } = 272094 , \Sigma c ^ { 2 } = 2878966 , \Sigma h c = 884484\) ]
  1. Draw a scatter diagram to illustrate these data.
  2. Find exact values of \(S _ { h c } S _ { h h }\) and \(S _ { c c }\).
  3. Calculate the value of the product moment correlation coefficient for these data.
  4. Give an interpretation of your correlation coefficient.
  5. Calculate the equation of the regression line of \(c\) on \(h\) in the form \(c = a + b h\).
  6. Estimate the level of confidence of a person of height 180 cm .
  7. State the range of values of \(h\) for which estimates of \(c\) are reliable.

Question 2:
Part (a)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
Scatter diagramB1 Labels (not \(x,y\))
B1Sensible scales; allow axis interchange
B2 \((-1\) ee\()\)Points (4 marks)
Part (b)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
\(S_{hc}=884484-\frac{1562\times 5088}{9}=1433\frac{1}{3}\)M1 Correct use of \(S\)
A1\(1433\frac{1}{3}\); \(1433.\dot{3}\)
\(S_{hh}=1000\frac{2}{9}\); \(S_{cc}=2550\)A1; A1 \(1000\frac{2}{9}\), \(1000.\dot{2}\); \(2550\) (4 marks)
(NB: accept \(-9\); i.e.: \(-159\frac{7}{27}\); \(111\frac{1}{81}\); \(283\frac{1}{3}\))
Part (c)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
\(r=\frac{1433\frac{1}{3}}{\sqrt{1000\frac{2}{9}\times 2550}}\)M1 Substitution in correct formula
A1 ft
\(=0.897488\ldots\)A1 AWRT \(0.897\) (accept \(0.8975\)) (3 marks)
Part (d)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
Taller people tend to be more confidentB1 Context (1 mark)
Part (e)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
\(b=\frac{1433.\dot{3}}{1000.\dot{2}}=1.433014\ldots\)M1
\(a=\frac{5088}{9}-\frac{1433.\dot{3}}{1000.\dot{2}}\times\frac{1562}{9}=316.6256\ldots\)M1 Allow use of their \(b\)
\(\therefore c=317+1.43h\)A1 3sf (3 marks)
Part (f)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
\(h=180\Rightarrow c=574.4\) or \(574.5683\ldots\)M1 Substitution of \(180\)
A1\(574\)–\(575\) (2 marks)
Part (g)
AnswerMarks Guidance
Answer/WorkingMarks Guidance
\(161\leq h\leq 193\)B1 (1 mark)
NB (a) No graph paper \(\Rightarrow\) 0/4
(18 marks total)
## Question 2:

### Part (a)
| Answer/Working | Marks | Guidance |
|---|---|---|
| Scatter diagram | B1 | Labels (not $x,y$) |
| | B1 | Sensible scales; allow axis interchange |
| | B2 $(-1$ ee$)$ | Points **(4 marks)** |

### Part (b)
| Answer/Working | Marks | Guidance |
|---|---|---|
| $S_{hc}=884484-\frac{1562\times 5088}{9}=1433\frac{1}{3}$ | M1 | Correct use of $S$ |
| | A1 | $1433\frac{1}{3}$; $1433.\dot{3}$ |
| $S_{hh}=1000\frac{2}{9}$; $S_{cc}=2550$ | A1; A1 | $1000\frac{2}{9}$, $1000.\dot{2}$; $2550$ **(4 marks)** |
| (NB: accept $-9$; i.e.: $-159\frac{7}{27}$; $111\frac{1}{81}$; $283\frac{1}{3}$) | | |

### Part (c)
| Answer/Working | Marks | Guidance |
|---|---|---|
| $r=\frac{1433\frac{1}{3}}{\sqrt{1000\frac{2}{9}\times 2550}}$ | M1 | Substitution in correct formula |
| | A1 ft | |
| $=0.897488\ldots$ | A1 | AWRT $0.897$ (accept $0.8975$) **(3 marks)** |

### Part (d)
| Answer/Working | Marks | Guidance |
|---|---|---|
| Taller people tend to be more confident | B1 | Context **(1 mark)** |

### Part (e)
| Answer/Working | Marks | Guidance |
|---|---|---|
| $b=\frac{1433.\dot{3}}{1000.\dot{2}}=1.433014\ldots$ | M1 | |
| $a=\frac{5088}{9}-\frac{1433.\dot{3}}{1000.\dot{2}}\times\frac{1562}{9}=316.6256\ldots$ | M1 | Allow use of their $b$ |
| $\therefore c=317+1.43h$ | A1 | 3sf **(3 marks)** |

### Part (f)
| Answer/Working | Marks | Guidance |
|---|---|---|
| $h=180\Rightarrow c=574.4$ or $574.5683\ldots$ | M1 | Substitution of $180$ |
| | A1 | $574$–$575$ **(2 marks)** |

### Part (g)
| Answer/Working | Marks | Guidance |
|---|---|---|
| $161\leq h\leq 193$ | B1 | **(1 mark)** |

**NB (a) No graph paper $\Rightarrow$ 0/4**

**(18 marks total)**

---
2. A researcher thinks there is a link between a person's height and level of confidence. She measured the height $h$, to the nearest cm , of a random sample of 9 people. She also devised a test to measure the level of confidence $c$ of each person. The data are shown in the table below.

\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | c | c | c | }
\hline
$h$ & 179 & 169 & 187 & 166 & 162 & 193 & 161 & 177 & 168 \\
\hline
$c$ & 569 & 561 & 579 & 561 & 540 & 598 & 542 & 565 & 573 \\
\hline
\end{tabular}
\end{center}

[You may use $\Sigma h ^ { 2 } = 272094 , \Sigma c ^ { 2 } = 2878966 , \Sigma h c = 884484$ ]
\begin{enumerate}[label=(\alph*)]
\item Draw a scatter diagram to illustrate these data.
\item Find exact values of $S _ { h c } S _ { h h }$ and $S _ { c c }$.
\item Calculate the value of the product moment correlation coefficient for these data.
\item Give an interpretation of your correlation coefficient.
\item Calculate the equation of the regression line of $c$ on $h$ in the form $c = a + b h$.
\item Estimate the level of confidence of a person of height 180 cm .
\item State the range of values of $h$ for which estimates of $c$ are reliable.
\end{enumerate}

\hfill \mbox{\textit{Edexcel S1 2004 Q2 [18]}}