Edexcel S3 2013 June — Question 3 13 marks

Exam BoardEdexcel
ModuleS3 (Statistics 3)
Year2013
SessionJune
Marks13
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicHypothesis test of Spearman’s rank correlation coefficien
TypeHypothesis test for positive correlation
DifficultyStandard +0.3 This is a standard S3 hypothesis testing question requiring calculation of Spearman's rank correlation coefficient (routine ranking and formula application), followed by straightforward hypothesis tests using critical value tables. The multi-part structure and comparison of two correlation methods adds length but not conceptual difficulty—all steps follow textbook procedures with no novel problem-solving required.
Spec5.08a Pearson correlation: calculate pmcc5.08d Hypothesis test: Pearson correlation5.08e Spearman rank correlation5.08f Hypothesis test: Spearman rank

3. The table below shows the population and the number of council employees for different towns and villages.
Town or villagePopulationNumber of council employees
A21110
B3562
C104712
D246321
E489216
F647925
G657167
H657345
I984548
\(J\)1478434
  1. Find, to 3 decimal places, Spearman's rank correlation coefficient between the population and the number of council employees.
  2. Use your value of Spearman's rank correlation coefficient to test for evidence of a positive correlation between the population and the number of council employees. Use a \(2.5 \%\) significance level. State your hypotheses clearly. It is suggested that a product moment correlation coefficient would be a more suitable calculation in this case. The product moment correlation coefficient for these data is 0.627 to 3 decimal places.
  3. Use the value of the product moment correlation coefficient to test for evidence of a positive correlation between the population and the number of council employees. Use a \(2.5 \%\) significance level.
  4. Interpret and comment on your results from part(b) and part(c).

AnswerMarks Guidance
TownA B
Pop1 2
Empl2 1
\(\d\ \)
\(d^2\)1 1
M1
\(\sum d^2 = 22\)M1A1
\(r_s = 1 - \frac{6 \times 22}{10 \times 99} = \frac{143}{165} = 0.866\)dM1
awrt 0.867A1 (5)
(b) \(H_0: \rho = 0\); \(H_1: \rho > 0\)B1 B1
CV = 0.6485B1
in critical region / significant/ reject \(H_0\)M1
evidence of positive correlation between population and no. of employeesA1 (4)
(c) CV = 0.6319B1
[not in critical region / not significant/ do not reject \(H_0\)]B1
No evidence of positive correlation (2)
(d) No evidence to suggest that as pop' increased the no. of employees increased linearly. Villages ranked highly for pop' were also ranked highly for the no. of employees.B1 B1 (2)
ALT Alternate for part (d) if different conclusions in part (b) and part (c): Data probably not (bivariate) normal therefore Spearman's coefficient is more suitable than the product moment correlation coefficient. [Total 13]
Notes:
(a) 1st M1 for an attempt to rank no of employees against the populations
2nd M1 for attempting \(\sum d^2\) (must be using ranks) ft their ranks
1st A1 for 22
3rd dM1 dep on 1st M1 for use of the correct formula with their \(\sum d^2\). If ans. is not correct an expr' is required.
(b) 1st B1 for both hypotheses in terms of \(\rho\), \(H_1\) must be one tail and compatible with their ranking
AnswerMarks Guidance
M1 for a correct statement relating their \(r_s\) (\(r_s < 1\)) with their cv but cv must be such that \(
A1 for a correct contextualised comment that is rejecting \(H_0\). Must mention "population" and "no. of employees" and "positive correlation". Follow through their \(r_s\) and their cv (provided it is \(cv < 1\)). Use of "association" is A0
(c) 1st B1 for 0.6319 2nd B1 does not require context just no positive correlation mentioned
(d) 1st B1 for a comment relating to pmcc
(i) no linear relationship or (ii) pmcc requires (joint) normal distribution
2nd B1 for a second comment relating to Spearman's
(i) there is a (non-linear) relationship between ranks.. or (ii) data not (joint) normal so Spearman's is better
| Town | A | B | C | D | E | F | G | H | I | J |
|------|---|---|---|---|---|---|---|---|---|---|
| Pop | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| Empl | 2 | 1 | 3 | 5 | 4 | 6 | 10 | 8 | 9 | 7 |
| $\|d\|$ | 1 | 1 | 0 | 1 | 1 | 0 | 3 | 0 | 0 | 3 |
| $d^2$ | 1 | 1 | 0 | 1 | 1 | 0 | 9 | 0 | 0 | 9 |

| M1 |

$\sum d^2 = 22$ | M1A1 |

$r_s = 1 - \frac{6 \times 22}{10 \times 99} = \frac{143}{165} = 0.866$ | dM1 |

awrt **0.867** | A1 | (5)

**(b)** $H_0: \rho = 0$; $H_1: \rho > 0$ | B1 B1 |

CV = 0.6485 | B1 |

in critical region / significant/ reject $H_0$ | M1 |

evidence of positive correlation between population and no. of employees | A1 | (4)

**(c)** CV = 0.6319 | B1 |

[not in critical region / not significant/ do not reject $H_0$] | B1 |
No evidence of positive correlation | | (2)

**(d)** No evidence to suggest that as pop' increased the no. of employees increased linearly. Villages ranked highly for pop' were also ranked highly for the no. of employees. | B1 B1 | (2)

**ALT** Alternate for part (d) if different conclusions in part (b) and part (c): Data probably not (bivariate) normal therefore Spearman's coefficient is more suitable than the product moment correlation coefficient. | | **[Total 13]**

**Notes:**

**(a)** 1st M1 for an attempt to rank no of employees against the populations
2nd M1 for attempting $\sum d^2$ (must be using ranks) ft their ranks
1st A1 for 22
3rd dM1 dep on 1st M1 for use of the correct formula with their $\sum d^2$. If ans. is not correct an expr' is required.

**(b)** 1st B1 for both hypotheses in terms of $\rho$, $H_1$ must be one tail and compatible with their ranking
M1 for a correct statement relating their $r_s$ ($|r_s| < 1$) with their cv but cv must be such that $|cv| < 1$. Use of "association" is A0
A1 for a correct contextualised comment that is rejecting $H_0$. Must mention "population" and "no. of employees" and "positive correlation". Follow through their $r_s$ and their cv (provided it is $|cv| < 1$). Use of "association" is A0

**(c)** 1st B1 for 0.6319 2nd B1 does not require context just no positive correlation mentioned

**(d)** 1st B1 for a comment relating to pmcc
(i) no linear relationship or (ii) pmcc requires (joint) normal distribution
2nd B1 for a second comment relating to Spearman's
(i) there is a (non-linear) relationship between ranks.. or (ii) data not (joint) normal so Spearman's is better

---
3. The table below shows the population and the number of council employees for different towns and villages.

\begin{center}
\begin{tabular}{|l|l|l|}
\hline
Town or village & Population & Number of council employees \\
\hline
A & 211 & 10 \\
\hline
B & 356 & 2 \\
\hline
C & 1047 & 12 \\
\hline
D & 2463 & 21 \\
\hline
E & 4892 & 16 \\
\hline
F & 6479 & 25 \\
\hline
G & 6571 & 67 \\
\hline
H & 6573 & 45 \\
\hline
I & 9845 & 48 \\
\hline
$J$ & 14784 & 34 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item Find, to 3 decimal places, Spearman's rank correlation coefficient between the population and the number of council employees.
\item Use your value of Spearman's rank correlation coefficient to test for evidence of a positive correlation between the population and the number of council employees. Use a $2.5 \%$ significance level. State your hypotheses clearly.

It is suggested that a product moment correlation coefficient would be a more suitable calculation in this case. The product moment correlation coefficient for these data is 0.627 to 3 decimal places.
\item Use the value of the product moment correlation coefficient to test for evidence of a positive correlation between the population and the number of council employees. Use a $2.5 \%$ significance level.
\item Interpret and comment on your results from part(b) and part(c).
\end{enumerate}

\hfill \mbox{\textit{Edexcel S3 2013 Q3 [13]}}