AQA S1 2013 January — Question 4 12 marks

Exam BoardAQA
ModuleS1 (Statistics 1)
Year2013
SessionJanuary
Marks12
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicBivariate data
TypeCalculate regression line equation
DifficultyModerate -0.3 This is a standard S1 correlation question requiring calculation of PMCC using given formulas, interpretation, and visual analysis of a scatter diagram. While it involves multiple parts and Simpson's Paradox (two groups showing different correlations), the calculations are routine and the conceptual demands are typical for AS-level statistics. Slightly easier than average due to straightforward computational steps.
Spec2.05f Pearson correlation coefficient5.08a Pearson correlation: calculate pmcc5.08c Pearson: measure of straight-line fit

4 Ashok is a work-experience student with an organisation that offers two separate professional examination papers, I and II. For each of a random sample of 12 students, A to L , he records the mark, \(x\) per cent, achieved on Paper I, and the mark, \(y\) per cent, achieved on Paper II.
\cline { 2 - 13 } \multicolumn{1}{c|}{}\(\mathbf { A }\)\(\mathbf { B }\)\(\mathbf { C }\)\(\mathbf { D }\)\(\mathbf { E }\)\(\mathbf { F }\)\(\mathbf { G }\)\(\mathbf { H }\)\(\mathbf { I }\)\(\mathbf { J }\)\(\mathbf { K }\)\(\mathbf { L }\)
\(\boldsymbol { x }\)344653626772605470718285
\(\boldsymbol { y }\)616672788881496054444936
    1. Calculate the value of the product moment correlation coefficient, \(r\), between \(x\) and \(y\).
    2. Interpret your value of \(r\) in the context of this question.
    1. Give two possible advantages of plotting data on a graph before calculating the value of a product moment correlation coefficient.
    2. Complete the plotting of Ashok's data on the scatter diagram on page 5.
    3. State what is now revealed by the scatter diagram.
  1. Ashok subsequently discovers that students A to F have a more scientific background than students G to L. With reference to your scatter diagram, estimate the value of the product moment correlation coefficient for each of the two groups of students. You are not expected to calculate the two values.
    \cline { 2 - 7 } \multicolumn{1}{c|}{}\(\mathbf { G }\)\(\mathbf { H }\)\(\mathbf { I }\)\(\mathbf { J }\)\(\mathbf { K }\)\(\mathbf { L }\)
    \(\boldsymbol { x }\)605470718285
    \(\boldsymbol { y }\)496054444936
    \section*{Examination Marks}
    \includegraphics[max width=\textwidth, alt={}]{68830a6a-5479-4e5c-a845-a6536ab51cee-5_1616_1634_836_189}

Question 4:
Part (a)(i):
AnswerMarks Guidance
AnswerMarks Guidance
\(r = -0.326\) to \(-0.325\)B3 AWFW \((-0.32569)\)
\(r = -0.33\) to \(-0.32\)(B2) AWFW
\(r = -0.4\) to \(-0.2\)(B1) AWFW
\(r = 0.2\) to \(0.4\)(B1) AWFW
Attempt at \(\sum x\), \(\sum x^2\), \(\sum y\), \(\sum y^2\), \(\sum xy\) or attempt at \(S_{xx}\), \(S_{yy}\), \(S_{xy}\)M1 Values: 756, 50004, 738, 48200, 45652 (all 5 attempted); or 2376, 2813, \(-842\) (all 3 attempted)
Attempt substitution into correct formula for \(r\)m1
\(r = -0.326\) to \(-0.325\)A1 AWFW
Total3
Part (a)(ii):
AnswerMarks Guidance
AnswerMarks Guidance
Some/little/slight/(fairly/quite) weak / (fairly/quite) moderate negative (linear) correlation/relationship/association/link *(but not 'trend')* between marks/percentages in the two examination papersBdep1 Dependent on \(-0.4 \leq r \leq -0.2\); OE; must qualify strength and state negative; Bdep0 for 'low', 'small', 'poor', 'unlikely', 'medium', 'average', or adjective 'very'
Context markB1 Context; providing \(-1 < r < 1\)
Total2
Part (b)(i):
AnswerMarks Guidance
AnswerMarks Guidance
Identifying linear/non-linear/multiple/no patterns *(allow 'trend')* or Identifying outliers/anomalies or Estimating/gives idea of value or sign of \(r\)B2,1 OE; only one mark from each set; B0 for reference to checking calculated value
Total2
Part (b)(ii):
AnswerMarks Guidance
AnswerMarks Guidance
Graph with 6 labelled points correctB2 Correct \(\Rightarrow\) within circle of radius equal to distance between 2 grid lines
5 or 4 labelled points correct(B1) Deduct 1 mark for any unlabelled or incorrectly labelled point
Total2
Part (b)(iii):
AnswerMarks Guidance
AnswerMarks Guidance
Two separate correlations/relationships/lines/associations/links/sets of data *(but not 'trends')*B1 OE; eg A to F and G to L
Total1
Part (c):
AnswerMarks Guidance
AnswerMarks Guidance
A to F: \((+)0.7\) to \((+)0.99\)B1 AWFW; allow calculation \((0.937)\); if not labelled, assume order A to F then G to L
G to L: \(-0.9\) to \(-0.5\)B1 AWFW; allow calculation \((-0.757)\)
Total2
# Question 4:

## Part (a)(i):
| Answer | Marks | Guidance |
|--------|-------|----------|
| $r = -0.326$ to $-0.325$ | B3 | AWFW $(-0.32569)$ |
| $r = -0.33$ to $-0.32$ | (B2) | AWFW |
| $r = -0.4$ to $-0.2$ | (B1) | AWFW |
| $r = 0.2$ to $0.4$ | (B1) | AWFW |
| Attempt at $\sum x$, $\sum x^2$, $\sum y$, $\sum y^2$, $\sum xy$ **or** attempt at $S_{xx}$, $S_{yy}$, $S_{xy}$ | M1 | Values: 756, 50004, 738, 48200, 45652 (all 5 attempted); or 2376, 2813, $-842$ (all 3 attempted) |
| Attempt substitution into correct formula for $r$ | m1 | |
| $r = -0.326$ to $-0.325$ | A1 | AWFW |
| **Total** | **3** | |

## Part (a)(ii):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Some/little/slight/(fairly/quite) weak / (fairly/quite) moderate **negative** (linear) correlation/relationship/association/link *(but not 'trend')* between **marks/percentages** in the two examination papers | Bdep1 | Dependent on $-0.4 \leq r \leq -0.2$; OE; must qualify strength and state negative; Bdep0 for 'low', 'small', 'poor', 'unlikely', 'medium', 'average', or adjective 'very' |
| Context mark | B1 | Context; providing $-1 < r < 1$ |
| **Total** | **2** | |

## Part (b)(i):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Identifying linear/non-linear/multiple/no patterns *(allow 'trend')* **or** Identifying outliers/anomalies **or** Estimating/gives idea of value or sign of $r$ | B2,1 | OE; only one mark from each set; B0 for reference to **checking** calculated value |
| **Total** | **2** | |

## Part (b)(ii):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Graph with 6 labelled points correct | B2 | Correct $\Rightarrow$ within circle of radius equal to distance between 2 grid lines |
| 5 or 4 labelled points correct | (B1) | Deduct 1 mark for any unlabelled or incorrectly labelled point |
| **Total** | **2** | |

## Part (b)(iii):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Two separate correlations/relationships/lines/associations/links/sets of data *(but not 'trends')* | B1 | OE; eg A to F and G to L |
| **Total** | **1** | |

## Part (c):
| Answer | Marks | Guidance |
|--------|-------|----------|
| A to F: $(+)0.7$ to $(+)0.99$ | B1 | AWFW; allow calculation $(0.937)$; if not labelled, assume order A to F then G to L |
| G to L: $-0.9$ to $-0.5$ | B1 | AWFW; allow calculation $(-0.757)$ |
| **Total** | **2** | |

---
4 Ashok is a work-experience student with an organisation that offers two separate professional examination papers, I and II.

For each of a random sample of 12 students, A to L , he records the mark, $x$ per cent, achieved on Paper I, and the mark, $y$ per cent, achieved on Paper II.

\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | c | c | c | c | c | c | }
\cline { 2 - 13 }
\multicolumn{1}{c|}{} & $\mathbf { A }$ & $\mathbf { B }$ & $\mathbf { C }$ & $\mathbf { D }$ & $\mathbf { E }$ & $\mathbf { F }$ & $\mathbf { G }$ & $\mathbf { H }$ & $\mathbf { I }$ & $\mathbf { J }$ & $\mathbf { K }$ & $\mathbf { L }$ \\
\hline
$\boldsymbol { x }$ & 34 & 46 & 53 & 62 & 67 & 72 & 60 & 54 & 70 & 71 & 82 & 85 \\
\hline
$\boldsymbol { y }$ & 61 & 66 & 72 & 78 & 88 & 81 & 49 & 60 & 54 & 44 & 49 & 36 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item \begin{enumerate}[label=(\roman*)]
\item Calculate the value of the product moment correlation coefficient, $r$, between $x$ and $y$.
\item Interpret your value of $r$ in the context of this question.
\end{enumerate}\item \begin{enumerate}[label=(\roman*)]
\item Give two possible advantages of plotting data on a graph before calculating the value of a product moment correlation coefficient.
\item Complete the plotting of Ashok's data on the scatter diagram on page 5.
\item State what is now revealed by the scatter diagram.
\end{enumerate}\item Ashok subsequently discovers that students A to F have a more scientific background than students G to L.

With reference to your scatter diagram, estimate the value of the product moment correlation coefficient for each of the two groups of students. You are not expected to calculate the two values.

\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | }
\cline { 2 - 7 }
\multicolumn{1}{c|}{} & $\mathbf { G }$ & $\mathbf { H }$ & $\mathbf { I }$ & $\mathbf { J }$ & $\mathbf { K }$ & $\mathbf { L }$ \\
\hline
$\boldsymbol { x }$ & 60 & 54 & 70 & 71 & 82 & 85 \\
\hline
$\boldsymbol { y }$ & 49 & 60 & 54 & 44 & 49 & 36 \\
\hline
\end{tabular}
\end{center}

\section*{Examination Marks}
\begin{center}
\includegraphics[max width=\textwidth, alt={}]{68830a6a-5479-4e5c-a845-a6536ab51cee-5_1616_1634_836_189}
\end{center}
\end{enumerate}

\hfill \mbox{\textit{AQA S1 2013 Q4 [12]}}