| Exam Board | AQA |
|---|---|
| Module | Further AS Paper 2 Statistics (Further AS Paper 2 Statistics) |
| Session | Specimen |
| Marks | 9 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Chi-squared test of independence |
| Type | Expected frequencies partially provided |
| Difficulty | Standard +0.3 This is a standard chi-squared test of independence with straightforward calculations. Part (a) requires simple expected frequency calculations using row×column totals divided by grand total. Part (b)(i) involves comparing a given test statistic to critical values from tables. Part (b)(ii) asks for contextual interpretation by comparing observed vs expected values. All steps are routine applications of the chi-squared test procedure taught in Further Statistics, with no novel problem-solving required. |
| Spec | 5.06a Chi-squared: contingency tables |
| \multirow{2}{*}{Table 1} | Yield | ||||
| Low | Medium | High | Total | ||
| \multirow{4}{*}{Breed} | A | 4 | 5 | 12 | 21 |
| B | 10 | 6 | 4 | 20 | |
| C | 8 | 17 | 7 | 32 | |
| D | 5 | 20 | 7 | 32 | |
| Total | 27 | 48 | 30 | 105 | |
| \multirow[t]{2}{*}{Table 2} | Yield | |||
| Low | Medium | High | ||
| \multirow{4}{*}{Breed} | A | 6 | ||
| B | 5.14 | 9.14 | 5.71 | |
| C | ||||
| D | 8.23 | 14.63 | 9.14 | |
| Answer | Marks | Guidance |
|---|---|---|
| Part 1: Expected values e.g. Low, Breed A \(= \frac{21\times27}{105} = 5.4\); one correct value | M1 | |
| Part 2: Full table: A: 5.4, 9.6, 6; B: 5.14, 9.14, 5.71; C: 8.23, 14.63, 9.14; D: 8.23, 14.63, 9.14 | A1 | A0 if integers given |
| Answer | Marks | Guidance |
|---|---|---|
| Part 1: \(H_0\): Milk yield is independent of breed; \(H_1\): Milk yield is not independent of breed; 1 tail 1% | B1 | |
| Part 2: \(\chi^2\) critical value for 6 df \(= 16.81\) (or \(p = 0.00275\)) | B1 | |
| Part 3: \(ts = \sum\frac{(O-E)^2}{E} = 19.4\); compare with cv | R1 | |
| Part 4: \(19.4 > 16.81\) so reject \(H_0\) | E1 | |
| Part 5: Evidence to suggest milk yield is not independent of breed | E1 | Conclusion should not be definite |
| Answer | Marks | Guidance |
|---|---|---|
| Part 1: Consider \(\frac{(O-E)^2}{E}\) or \((O-E)\) to identify largest sources; largest are Breed A/High and Breed B/Low | E1 | Do not allow if no reference to why source selected |
| Part 2: Far more than expected Breed A cows observed to have high milk yield, or far more than expected Breed B cows observed to have low milk yield | E1dep |
## Question 7(a):
**Part 1:** Expected values e.g. Low, Breed A $= \frac{21\times27}{105} = 5.4$; one correct value | M1 |
**Part 2:** Full table: A: 5.4, 9.6, 6; B: 5.14, 9.14, 5.71; C: 8.23, 14.63, 9.14; D: 8.23, 14.63, 9.14 | A1 | A0 if integers given
---
## Question 7(b)(i):
**Part 1:** $H_0$: Milk yield is independent of breed; $H_1$: Milk yield is not independent of breed; 1 tail 1% | B1 |
**Part 2:** $\chi^2$ critical value for 6 df $= 16.81$ (or $p = 0.00275$) | B1 |
**Part 3:** $ts = \sum\frac{(O-E)^2}{E} = 19.4$; compare with cv | R1 |
**Part 4:** $19.4 > 16.81$ so reject $H_0$ | E1 |
**Part 5:** Evidence to suggest milk yield is not independent of breed | E1 | Conclusion should not be definite
---
## Question 7(b)(ii):
**Part 1:** Consider $\frac{(O-E)^2}{E}$ or $(O-E)$ to identify largest sources; largest are Breed A/High and Breed B/Low | E1 | Do not allow if no reference to why source selected
**Part 2:** Far more than expected Breed A cows observed to have high milk yield, or far more than expected Breed B cows observed to have low milk yield | E1dep |
---
7 A dairy industry researcher, Robyn, decided to investigate the milk yield, classified as low, medium or high, obtained from four different breeds of cow, $\mathrm { A } , \mathrm { B } , \mathrm { C }$ and D .
The milk yield of a sample of 105 cows was monitored and the results are summarised in contingency Table 1.
\begin{center}
\begin{tabular}{|l|l|l|l|l|l|}
\hline
\multicolumn{2}{|c|}{\multirow{2}{*}{Table 1}} & \multicolumn{4}{|c|}{Yield} \\
\hline
& & Low & Medium & High & Total \\
\hline
\multirow{4}{*}{Breed} & A & 4 & 5 & 12 & 21 \\
\hline
& B & 10 & 6 & 4 & 20 \\
\hline
& C & 8 & 17 & 7 & 32 \\
\hline
& D & 5 & 20 & 7 & 32 \\
\hline
& Total & 27 & 48 & 30 & 105 \\
\hline
\end{tabular}
\end{center}
The sample of cows may be regarded as random.\\
Robyn decides to carry out a $\chi ^ { 2 }$-test for association between milk yield and breed using the information given in Table 1.
7
\begin{enumerate}[label=(\alph*)]
\item Contingency Table 2 gives some of the expected frequencies for this test.\\
Complete Table 2 with the missing expected values.
\begin{center}
\begin{tabular}{|l|l|l|l|l|}
\hline
\multicolumn{2}{|l|}{\multirow[t]{2}{*}{Table 2}} & \multicolumn{3}{|c|}{Yield} \\
\hline
& & Low & Medium & High \\
\hline
\multirow{4}{*}{Breed} & A & & & 6 \\
\hline
& B & 5.14 & 9.14 & 5.71 \\
\hline
& C & & & \\
\hline
& D & 8.23 & 14.63 & 9.14 \\
\hline
\end{tabular}
\end{center}
7
\item (i) For Robyn's test, the test statistic $\sum \frac { ( O - E ) ^ { 2 } } { E } = 19.4$ correct to three significant figures.\\
Use this information to carry out Robyn's test, using the $1 \%$ level of significance.\\
7 (b) (ii) By considering the observed frequencies given in Table 1 with the expected frequencies in Table 2, interpret, in context, the association, if any, between milk yield and breed.
\end{enumerate}
\hfill \mbox{\textit{AQA Further AS Paper 2 Statistics Q7 [9]}}