AQA S1 2006 January — Question 5 11 marks

Exam BoardAQA
ModuleS1 (Statistics 1)
Year2006
SessionJanuary
Marks11
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicBivariate data
TypeDraw scatter diagram from data
DifficultyEasy -1.2 This is a routine S1 statistics question requiring basic scatter diagram plotting and interpretation. Part (a) is pure data plotting, (b) requires simple visual interpretation, (c) identifies outliers by inspection, and only (d)(i) involves calculation (PMCC formula with 8 data points). All techniques are standard textbook exercises with no problem-solving insight required, making it easier than average A-level maths.
Spec2.02c Scatter diagrams and regression lines2.02d Informal interpretation of correlation5.08a Pearson correlation: calculate pmcc5.08d Hypothesis test: Pearson correlation

5 [Figure 1, printed on the insert, is provided for use in this question.]
The table shows the times, in seconds, taken by a random sample of 10 boys from a junior swimming club to swim 50 metres freestyle and 50 metres backstroke.
BoyABCDEFGHIJ
Freestyle ( \(\boldsymbol { x }\) seconds)30.232.825.131.831.235.632.438.036.134.1
Backstroke ( \(y\) seconds)33.535.437.427.234.738.237.741.442.338.4
  1. On Figure 1, complete the scatter diagram for these data.
  2. Hence:
    1. give two distinct comments on what your scatter diagram reveals;
    2. state, without calculation, which of the following 3 values is most likely to be the value of the product moment correlation coefficient for the data in your scatter diagram. $$0.912 \quad 0.088 \quad 0.462$$
  3. In the sample of 10 boys, one boy is a junior-champion freestyle swimmer and one boy is a junior-champion backstroke swimmer. Identify the two most likely boys.
  4. Removing the data for the two boys whom you identified in part (c):
    1. calculate the value of the product moment correlation coefficient for the remaining 8 pairs of values of \(x\) and \(y\);
    2. comment, in context, on the value that you obtain.

AnswerMarks Guidance
5(a)
Scatter DiagramB2, B1, B1 4 labelled points plotted, 3 labelled points plotted, 4 unlabelled points plotted
2
5(b)(i)
Positive/linear correlation/relationship except for two unusual values/resultsB1, B1 OE, OE
2
5(b)(ii)
0.462B1 CAO; accept 3rd/final/last value
1
5(c)
C and D: C is likely freestyle champion, D is likely backstroke championB1, B1 CAO, Style identified
or C is likely freestyle champion, D is likely backstroke championB1, B1
2
5(d)(i)
\(r = 0.912\) to \(0.913\)B3 AWFW
or \(r = 0.91\) to \(0.92\) or \(0.46\) to \(0.47\)B2 AWFW
or \(r = 0.9\)B1 AWRT
Attempt at \(\Sigma x\), \(\Sigma x^2\), \(\Sigma y\), \(\Sigma y^2\), \(\Sigma xy\)M1 270.4, 9188.46, 301.6, 11437.84, 10246.53
or Attempt at \(S_{xx}\), \(S_{yy}\), \(S_{xy}\)M1 48.94, 67.52, 52.45
Attempt at a correct formula for \(r\)m1
\(r = 0.912\) to \(0.913\)A1 AWFW
3
5(d)(ii)
Boys are faster/slower at both strokes or Boys are equally good at both strokesB1 OE; do not accept freestyle times are proportional to backstroke times
1
Total for Q5: 11
| **5(a)** |
|----------|
| Scatter Diagram | B2, B1, B1 | 4 labelled points plotted, 3 labelled points plotted, 4 unlabelled points plotted |
| | | 2 |

| **5(b)(i)** |
|----------|
| Positive/linear correlation/relationship except for two unusual values/results | B1, B1 | OE, OE |
| | | 2 |

| **5(b)(ii)** |
|----------|
| 0.462 | B1 | CAO; accept 3rd/final/last value |
| | | 1 |

| **5(c)** |
|----------|
| C and D: C is likely freestyle champion, D is likely backstroke champion | B1, B1 | CAO, Style identified |
| or C is likely freestyle champion, D is likely backstroke champion | B1, B1 | |
| | | 2 |

| **5(d)(i)** |
|----------|
| $r = 0.912$ to $0.913$ | B3 | AWFW |
| or $r = 0.91$ to $0.92$ or $0.46$ to $0.47$ | B2 | AWFW |
| or $r = 0.9$ | B1 | AWRT |
| Attempt at $\Sigma x$, $\Sigma x^2$, $\Sigma y$, $\Sigma y^2$, $\Sigma xy$ | M1 | 270.4, 9188.46, 301.6, 11437.84, 10246.53 |
| or Attempt at $S_{xx}$, $S_{yy}$, $S_{xy}$ | M1 | 48.94, 67.52, 52.45 |
| Attempt at a correct formula for $r$ | m1 | |
| $r = 0.912$ to $0.913$ | A1 | AWFW |
| | | 3 |

| **5(d)(ii)** |
|----------|
| Boys are faster/slower at both strokes or Boys are equally good at both strokes | B1 | OE; do not accept freestyle times are proportional to backstroke times |
| | | 1 |

**Total for Q5: 11**

---
5 [Figure 1, printed on the insert, is provided for use in this question.]\\
The table shows the times, in seconds, taken by a random sample of 10 boys from a junior swimming club to swim 50 metres freestyle and 50 metres backstroke.

\begin{center}
\begin{tabular}{|l|l|l|l|l|l|l|l|l|l|l|}
\hline
Boy & A & B & C & D & E & F & G & H & I & J \\
\hline
Freestyle ( $\boldsymbol { x }$ seconds) & 30.2 & 32.8 & 25.1 & 31.8 & 31.2 & 35.6 & 32.4 & 38.0 & 36.1 & 34.1 \\
\hline
Backstroke ( $y$ seconds) & 33.5 & 35.4 & 37.4 & 27.2 & 34.7 & 38.2 & 37.7 & 41.4 & 42.3 & 38.4 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item On Figure 1, complete the scatter diagram for these data.
\item Hence:
\begin{enumerate}[label=(\roman*)]
\item give two distinct comments on what your scatter diagram reveals;
\item state, without calculation, which of the following 3 values is most likely to be the value of the product moment correlation coefficient for the data in your scatter diagram.

$$0.912 \quad 0.088 \quad 0.462$$
\end{enumerate}\item In the sample of 10 boys, one boy is a junior-champion freestyle swimmer and one boy is a junior-champion backstroke swimmer.

Identify the two most likely boys.
\item Removing the data for the two boys whom you identified in part (c):
\begin{enumerate}[label=(\roman*)]
\item calculate the value of the product moment correlation coefficient for the remaining 8 pairs of values of $x$ and $y$;
\item comment, in context, on the value that you obtain.
\end{enumerate}\end{enumerate}

\hfill \mbox{\textit{AQA S1 2006 Q5 [11]}}