| Exam Board | AQA |
|---|---|
| Module | S1 (Statistics 1) |
| Year | 2013 |
| Session | January |
| Marks | 12 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Bivariate data |
| Type | Calculate regression line equation |
| Difficulty | Moderate -0.3 This is a standard S1 correlation question requiring calculation of PMCC using given formulas, interpretation, and visual analysis of a scatter diagram. While it involves multiple parts and Simpson's Paradox (two groups showing different correlations), the calculations are routine and the conceptual demands are typical for AS-level statistics. Slightly easier than average due to straightforward computational steps. |
| Spec | 2.05f Pearson correlation coefficient5.08a Pearson correlation: calculate pmcc5.08c Pearson: measure of straight-line fit |
| \cline { 2 - 13 } \multicolumn{1}{c|}{} | \(\mathbf { A }\) | \(\mathbf { B }\) | \(\mathbf { C }\) | \(\mathbf { D }\) | \(\mathbf { E }\) | \(\mathbf { F }\) | \(\mathbf { G }\) | \(\mathbf { H }\) | \(\mathbf { I }\) | \(\mathbf { J }\) | \(\mathbf { K }\) | \(\mathbf { L }\) |
| \(\boldsymbol { x }\) | 34 | 46 | 53 | 62 | 67 | 72 | 60 | 54 | 70 | 71 | 82 | 85 |
| \(\boldsymbol { y }\) | 61 | 66 | 72 | 78 | 88 | 81 | 49 | 60 | 54 | 44 | 49 | 36 |
| \cline { 2 - 7 } \multicolumn{1}{c|}{} | \(\mathbf { G }\) | \(\mathbf { H }\) | \(\mathbf { I }\) | \(\mathbf { J }\) | \(\mathbf { K }\) | \(\mathbf { L }\) |
| \(\boldsymbol { x }\) | 60 | 54 | 70 | 71 | 82 | 85 |
| \(\boldsymbol { y }\) | 49 | 60 | 54 | 44 | 49 | 36 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| \(r = -0.326\) to \(-0.325\) | B3 | AWFW \((-0.32569)\) |
| \(r = -0.33\) to \(-0.32\) | (B2) | AWFW |
| \(r = -0.4\) to \(-0.2\) | (B1) | AWFW |
| \(r = 0.2\) to \(0.4\) | (B1) | AWFW |
| Attempt at \(\sum x\), \(\sum x^2\), \(\sum y\), \(\sum y^2\), \(\sum xy\) or attempt at \(S_{xx}\), \(S_{yy}\), \(S_{xy}\) | M1 | Values: 756, 50004, 738, 48200, 45652 (all 5 attempted); or 2376, 2813, \(-842\) (all 3 attempted) |
| Attempt substitution into correct formula for \(r\) | m1 | |
| \(r = -0.326\) to \(-0.325\) | A1 | AWFW |
| Total | 3 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| Some/little/slight/(fairly/quite) weak / (fairly/quite) moderate negative (linear) correlation/relationship/association/link *(but not 'trend')* between marks/percentages in the two examination papers | Bdep1 | Dependent on \(-0.4 \leq r \leq -0.2\); OE; must qualify strength and state negative; Bdep0 for 'low', 'small', 'poor', 'unlikely', 'medium', 'average', or adjective 'very' |
| Context mark | B1 | Context; providing \(-1 < r < 1\) |
| Total | 2 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| Identifying linear/non-linear/multiple/no patterns *(allow 'trend')* or Identifying outliers/anomalies or Estimating/gives idea of value or sign of \(r\) | B2,1 | OE; only one mark from each set; B0 for reference to checking calculated value |
| Total | 2 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| Graph with 6 labelled points correct | B2 | Correct \(\Rightarrow\) within circle of radius equal to distance between 2 grid lines |
| 5 or 4 labelled points correct | (B1) | Deduct 1 mark for any unlabelled or incorrectly labelled point |
| Total | 2 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| Two separate correlations/relationships/lines/associations/links/sets of data *(but not 'trends')* | B1 | OE; eg A to F and G to L |
| Total | 1 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer | Marks | Guidance |
| A to F: \((+)0.7\) to \((+)0.99\) | B1 | AWFW; allow calculation \((0.937)\); if not labelled, assume order A to F then G to L |
| G to L: \(-0.9\) to \(-0.5\) | B1 | AWFW; allow calculation \((-0.757)\) |
| Total | 2 |
# Question 4:
## Part (a)(i):
| Answer | Marks | Guidance |
|--------|-------|----------|
| $r = -0.326$ to $-0.325$ | B3 | AWFW $(-0.32569)$ |
| $r = -0.33$ to $-0.32$ | (B2) | AWFW |
| $r = -0.4$ to $-0.2$ | (B1) | AWFW |
| $r = 0.2$ to $0.4$ | (B1) | AWFW |
| Attempt at $\sum x$, $\sum x^2$, $\sum y$, $\sum y^2$, $\sum xy$ **or** attempt at $S_{xx}$, $S_{yy}$, $S_{xy}$ | M1 | Values: 756, 50004, 738, 48200, 45652 (all 5 attempted); or 2376, 2813, $-842$ (all 3 attempted) |
| Attempt substitution into correct formula for $r$ | m1 | |
| $r = -0.326$ to $-0.325$ | A1 | AWFW |
| **Total** | **3** | |
## Part (a)(ii):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Some/little/slight/(fairly/quite) weak / (fairly/quite) moderate **negative** (linear) correlation/relationship/association/link *(but not 'trend')* between **marks/percentages** in the two examination papers | Bdep1 | Dependent on $-0.4 \leq r \leq -0.2$; OE; must qualify strength and state negative; Bdep0 for 'low', 'small', 'poor', 'unlikely', 'medium', 'average', or adjective 'very' |
| Context mark | B1 | Context; providing $-1 < r < 1$ |
| **Total** | **2** | |
## Part (b)(i):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Identifying linear/non-linear/multiple/no patterns *(allow 'trend')* **or** Identifying outliers/anomalies **or** Estimating/gives idea of value or sign of $r$ | B2,1 | OE; only one mark from each set; B0 for reference to **checking** calculated value |
| **Total** | **2** | |
## Part (b)(ii):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Graph with 6 labelled points correct | B2 | Correct $\Rightarrow$ within circle of radius equal to distance between 2 grid lines |
| 5 or 4 labelled points correct | (B1) | Deduct 1 mark for any unlabelled or incorrectly labelled point |
| **Total** | **2** | |
## Part (b)(iii):
| Answer | Marks | Guidance |
|--------|-------|----------|
| Two separate correlations/relationships/lines/associations/links/sets of data *(but not 'trends')* | B1 | OE; eg A to F and G to L |
| **Total** | **1** | |
## Part (c):
| Answer | Marks | Guidance |
|--------|-------|----------|
| A to F: $(+)0.7$ to $(+)0.99$ | B1 | AWFW; allow calculation $(0.937)$; if not labelled, assume order A to F then G to L |
| G to L: $-0.9$ to $-0.5$ | B1 | AWFW; allow calculation $(-0.757)$ |
| **Total** | **2** | |
---
4 Ashok is a work-experience student with an organisation that offers two separate professional examination papers, I and II.
For each of a random sample of 12 students, A to L , he records the mark, $x$ per cent, achieved on Paper I, and the mark, $y$ per cent, achieved on Paper II.
\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | c | c | c | c | c | c | }
\cline { 2 - 13 }
\multicolumn{1}{c|}{} & $\mathbf { A }$ & $\mathbf { B }$ & $\mathbf { C }$ & $\mathbf { D }$ & $\mathbf { E }$ & $\mathbf { F }$ & $\mathbf { G }$ & $\mathbf { H }$ & $\mathbf { I }$ & $\mathbf { J }$ & $\mathbf { K }$ & $\mathbf { L }$ \\
\hline
$\boldsymbol { x }$ & 34 & 46 & 53 & 62 & 67 & 72 & 60 & 54 & 70 & 71 & 82 & 85 \\
\hline
$\boldsymbol { y }$ & 61 & 66 & 72 & 78 & 88 & 81 & 49 & 60 & 54 & 44 & 49 & 36 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item \begin{enumerate}[label=(\roman*)]
\item Calculate the value of the product moment correlation coefficient, $r$, between $x$ and $y$.
\item Interpret your value of $r$ in the context of this question.
\end{enumerate}\item \begin{enumerate}[label=(\roman*)]
\item Give two possible advantages of plotting data on a graph before calculating the value of a product moment correlation coefficient.
\item Complete the plotting of Ashok's data on the scatter diagram on page 5.
\item State what is now revealed by the scatter diagram.
\end{enumerate}\item Ashok subsequently discovers that students A to F have a more scientific background than students G to L.
With reference to your scatter diagram, estimate the value of the product moment correlation coefficient for each of the two groups of students. You are not expected to calculate the two values.
\begin{center}
\begin{tabular}{ | c | c | c | c | c | c | c | }
\cline { 2 - 7 }
\multicolumn{1}{c|}{} & $\mathbf { G }$ & $\mathbf { H }$ & $\mathbf { I }$ & $\mathbf { J }$ & $\mathbf { K }$ & $\mathbf { L }$ \\
\hline
$\boldsymbol { x }$ & 60 & 54 & 70 & 71 & 82 & 85 \\
\hline
$\boldsymbol { y }$ & 49 & 60 & 54 & 44 & 49 & 36 \\
\hline
\end{tabular}
\end{center}
\section*{Examination Marks}
\begin{center}
\includegraphics[max width=\textwidth, alt={}]{68830a6a-5479-4e5c-a845-a6536ab51cee-5_1616_1634_836_189}
\end{center}
\end{enumerate}
\hfill \mbox{\textit{AQA S1 2013 Q4 [12]}}