Edexcel S1 2010 January — Question 6 18 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Year2010
SessionJanuary
Marks18
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicBivariate data
TypeCalculate summary statistics (Sxx, Syy, Sxy)
DifficultyModerate -0.8 This is a standard S1 bivariate data question requiring routine application of formulas for Stt, Spp, Stp, correlation coefficient, and regression line. All summations are provided, making it purely computational with no problem-solving or interpretation challenges beyond basic recall of standard procedures.
Spec2.02c Scatter diagrams and regression lines5.08a Pearson correlation: calculate pmcc5.09a Dependent/independent variables5.09b Least squares regression: concepts

The blood pressures, \(p\) mmHg, and the ages, \(t\) years, of 7 hospital patients are shown in the table below.
PatientABCDEFG
\(t\)42744835562660
\(p\)981301208818280135
[\(\sum t = 341\), \(\sum p = 833\), \(\sum t^2 = 18181\), \(\sum p^2 = 106397\), \(\sum tp = 42948\)]
  1. Find \(S_{tt}\), \(S_{pp}\) and \(S_t\) for these data. [4]
  2. Calculate the product moment correlation coefficient for these data. [3]
  3. Interpret the correlation coefficient. [1]
  4. On the graph paper on page 17, draw the scatter diagram of blood pressure against age for these 7 patients. [2]
  5. Find the equation of the regression line of \(p\) on \(t\). [4]
  6. Plot your regression line on your scatter diagram. [2]
  7. Use your regression line to estimate the blood pressure of a 40 year old patient. [2]

Part (a)
Answer: \(S_{pp} = 106397 - \frac{833^2}{7} = 7270\)
\(S_{tp} = 42948 - \frac{341 \times 833}{7} = 2369\), \(S_{tt} = 18181 - \frac{341^2}{7} = 1569.42857...\) or \(\frac{10986}{7}\)
AnswerMarks
Marks: M1 A1 A1 A1(4)
Guidance:
- M1 for at least one correct expression
- 1st A1 for \(S_{pp} = 7270\)
- 2nd A1 for \(S_{tp} = 2369\) or 2370
- 3rd A1 for \(S_{tt} = \) awrt 1570
Part (b)
Answer: \(r = \frac{2369}{\sqrt{7270 \times 1569.42857...}} = 0.7013375\) awrt (0.701)
AnswerMarks
Marks: M1 A1ft A1(3)
Guidance:
- M1 for attempt at correct formula and at least one correct value (or correct ft) M0 for \(\frac{42948}{\sqrt{106397 \times 18181}}\)
- A1ft All values correct or correct ft. Allow for an answer of 0.7 or 0.70
- Answer only: awrt 0.701 is 3/3, answer of 0.7 or 0.70 is 2/3
Part (c)
Answer: (Pmcc shows positive correlation.) Older patients have higher blood pressure
AnswerMarks
Marks: B1(1)
Guidance:
- B1 for comment in context that interprets the fact that correlation is positive, as in scheme. Must mention age and blood pressure in words, not just "r" and "p"
Part (d) + (f)
Answer: (d) Points plotted correctly on graph: -1 each error or omission (within one square of correct position). * see diagram below for correct points
(f) Line drawn with correct intercept, and gradient
AnswerMarks
Marks: B2 B1ft B1 B1ft B1(3) for (d); (2+2) for (f)
Guidance:
- Record 1 point incorrect as B1B0 on open. [NB overlay for (60, 135) is slightly wrong]
Part (e)
Answer: \(b = \frac{2369}{1569.42857...} = 1.509466...\)
\(a = \frac{833}{7} - b \times \frac{341}{7} = 45.467413...\)
\(p = 45.5 + 1.5l r\)
AnswerMarks
Marks: M1 A1 M1 M1 A1(4)
Guidance:
- 1st M1 for use of the correct formula for \(b\), fit their values from (a)
- 1st A1 allow 1.5 or better
- 2nd M1 for use of \(\bar{y} - b\bar{x}\) with their values
- 2nd A1 for full equation with \(a = \) awrt 45.5 and \(b = \) awrt 1.51. Must be \(p\) in terms of \(t\), not \(x\) and \(y\)
Part (g)
Answer: \(t = 40\), \(p = 105.84...\) from equation or graph.
AnswerMarks
Marks: M1 A1(2)
Guidance:
- M1 for clear use of their equation with \(t = 40\) or correct value from their graph
- A1 for awrt 106. Correct answer only (2/2) otherwise look for evidence on graph to award M1
## Part (a)
**Answer:** $S_{pp} = 106397 - \frac{833^2}{7} = 7270$

$S_{tp} = 42948 - \frac{341 \times 833}{7} = 2369$, $S_{tt} = 18181 - \frac{341^2}{7} = 1569.42857...$ or $\frac{10986}{7}$

**Marks:** M1 A1 A1 A1 | (4)

**Guidance:**
- M1 for at least one correct expression
- 1st A1 for $S_{pp} = 7270$
- 2nd A1 for $S_{tp} = 2369$ or 2370
- 3rd A1 for $S_{tt} = $ awrt 1570

## Part (b)
**Answer:** $r = \frac{2369}{\sqrt{7270 \times 1569.42857...}} = 0.7013375$ awrt (0.701)

**Marks:** M1 A1ft A1 | (3)

**Guidance:**
- M1 for attempt at correct formula and at least one correct value (or correct ft) M0 for $\frac{42948}{\sqrt{106397 \times 18181}}$
- A1ft All values correct or correct ft. Allow for an answer of 0.7 or 0.70
- Answer only: awrt 0.701 is 3/3, answer of 0.7 or 0.70 is 2/3

## Part (c)
**Answer:** (Pmcc shows positive correlation.) Older patients have higher blood pressure

**Marks:** B1 | (1)

**Guidance:**
- B1 for comment in context that interprets the fact that correlation is positive, as in scheme. Must mention age and blood pressure in words, not just "r" and "p"

## Part (d) + (f)
**Answer:** (d) Points plotted correctly on graph: -1 each error or omission (within one square of correct position). * see diagram below for correct points

(f) Line drawn with correct intercept, and gradient

**Marks:** B2 B1ft B1 B1ft B1 | (3) for (d); (2+2) for (f)

**Guidance:**
- Record 1 point incorrect as B1B0 on open. [NB overlay for (60, 135) is slightly wrong]

## Part (e)
**Answer:** $b = \frac{2369}{1569.42857...} = 1.509466...$

$a = \frac{833}{7} - b \times \frac{341}{7} = 45.467413...$

$p = 45.5 + 1.5l r$

**Marks:** M1 A1 M1 M1 A1 | (4)

**Guidance:**
- 1st M1 for use of the correct formula for $b$, fit their values from (a)
- 1st A1 allow 1.5 or better
- 2nd M1 for use of $\bar{y} - b\bar{x}$ with their values
- 2nd A1 for full equation with $a = $ awrt 45.5 and $b = $ awrt 1.51. Must be $p$ in terms of $t$, not $x$ and $y$

## Part (g)
**Answer:** $t = 40$, $p = 105.84...$ from equation or graph.

**Marks:** M1 A1 | (2)

**Guidance:**
- M1 for clear use of their equation with $t = 40$ or correct value from their graph
- A1 for awrt 106. Correct answer only (2/2) otherwise look for evidence on graph to award M1

---
The blood pressures, $p$ mmHg, and the ages, $t$ years, of 7 hospital patients are shown in the table below.

\begin{tabular}{|c|c|c|c|c|c|c|c|}
\hline
Patient & A & B & C & D & E & F & G \\
\hline
$t$ & 42 & 74 & 48 & 35 & 56 & 26 & 60 \\
\hline
$p$ & 98 & 130 & 120 & 88 & 182 & 80 & 135 \\
\hline
\end{tabular}

[$\sum t = 341$, $\sum p = 833$, $\sum t^2 = 18181$, $\sum p^2 = 106397$, $\sum tp = 42948$]

\begin{enumerate}[label=(\alph*)]
\item Find $S_{tt}$, $S_{pp}$ and $S_t$ for these data. [4]

\item Calculate the product moment correlation coefficient for these data. [3]

\item Interpret the correlation coefficient. [1]

\item On the graph paper on page 17, draw the scatter diagram of blood pressure against age for these 7 patients. [2]

\item Find the equation of the regression line of $p$ on $t$. [4]

\item Plot your regression line on your scatter diagram. [2]

\item Use your regression line to estimate the blood pressure of a 40 year old patient. [2]
\end{enumerate}

\hfill \mbox{\textit{Edexcel S1 2010 Q6 [18]}}