AQA Further Paper 3 Statistics 2023 June — Question 5 8 marks

Exam BoardAQA
ModuleFurther Paper 3 Statistics (Further Paper 3 Statistics)
Year2023
SessionJune
Marks8
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicChi-squared test of independence
TypeCell combining required
DifficultyStandard +0.3 This is a straightforward chi-squared test application with standard procedures: part (a) tests understanding of degrees of freedom formula (11-1)(11-1)=100, reduced by 10 for combining cells; part (b) is routine hypothesis testing comparing 124.8 to critical value; part (c) simply requires identifying the largest contribution. All parts are direct recall/application of standard chi-squared procedures with no problem-solving or novel insight required.
Spec5.06a Chi-squared: contingency tables

5 A school management team oversees 11 different schools.
The school management team allows each student in the schools to choose one enrichment activity from 11 possible activities. The school management team count the number of students in each school choosing each of the possible activities. They then conduct a \(\chi ^ { 2 }\)-test for association with the data they have gathered. 5
  1. Exactly one of the calculated expected frequencies for the \(\chi ^ { 2 }\)-test is less than 5
    Explain why the number of degrees of freedom for the test is 90
    5
  2. The school management team claims that there is an association between the school a student attends and the activity they choose. The test statistic is 124.8 Test the claim using the \(1 \%\) level of significance.
    5
  3. During the hypothesis test, the value of \(\frac { ( O - E ) ^ { 2 } } { E }\), where \(O\) is the observed frequency and \(E\) is the expected frequency, was calculated for each group of students. The values for four groups of students are shown in the table below.
    Group\(\frac { ( O - E ) ^ { 2 } } { E }\)
    Attends school 3 and chose activity 10.01
    Attends school 8 and chose activity 318.5
    Attends school 8 and chose activity 724.2
    Attends school 11 and chose activity 749.0
    State, with a reason, which of the four groups of students represents the strongest source of association.

Question 5(a):
AnswerMarks Guidance
Answer/WorkingMark Guidance
Two rows or two columns need to be mergedE1 Deduces test requires two rows or two columns to be merged
So \(\text{dof} = (11-1)(10-1) = 90\)E1 Completes reasoned argument showing merging results in dof \(= (11-1)(10-1) = 90\)
Subtotal: 2 marks
Question 5(b):
AnswerMarks Guidance
Answer/WorkingMark Guidance
\(H_0\): There is no association between school and activity chosen; \(H_1\): There is an association between school and activity chosenB1 States both hypotheses using correct language; variables needed in at least null hypothesis
\(\chi^2\) cv for 90 dof \(= 124.116\); or corresponding probability AWRT 0.009B1 AWRT 124.1
\(124.8 > 124.116\)R1 Evaluates \(\chi^2\) test statistic by correctly comparing critical value with test statistic
Reject \(H_0\)E1F Infers \(H_0\) rejected; FT their comparison using \(\chi^2\) model
Sufficient evidence to suggest there is an association between school and activity chosenE1F Conclusion in context; must not be definite; FT consistent with their conclusion
Subtotal: 5 marks
Question 5(c):
AnswerMarks Guidance
Answer/WorkingMark Guidance
Students attending school 11 choosing activity 7 as they have the highest value of \(\frac{(O-E)^2}{E}\)B1 Correctly identifies strongest source of association as group with highest \(\frac{(O-E)^2}{E}\); condone "the highest value"
Subtotal: 1 mark
Question 5 total: 8 marks
## Question 5(a):

| Answer/Working | Mark | Guidance |
|---|---|---|
| Two rows or two columns need to be merged | E1 | Deduces test requires two rows or two columns to be merged |
| So $\text{dof} = (11-1)(10-1) = 90$ | E1 | Completes reasoned argument showing merging results in dof $= (11-1)(10-1) = 90$ |

**Subtotal: 2 marks**

---

## Question 5(b):

| Answer/Working | Mark | Guidance |
|---|---|---|
| $H_0$: There is no association between school and activity chosen; $H_1$: There is an association between school and activity chosen | B1 | States both hypotheses using correct language; variables needed in at least null hypothesis |
| $\chi^2$ cv for 90 dof $= 124.116$; or corresponding probability AWRT 0.009 | B1 | AWRT 124.1 |
| $124.8 > 124.116$ | R1 | Evaluates $\chi^2$ test statistic by correctly comparing critical value with test statistic |
| Reject $H_0$ | E1F | Infers $H_0$ rejected; FT their comparison using $\chi^2$ model |
| Sufficient evidence to suggest there is an association between school and activity chosen | E1F | Conclusion in context; must not be definite; FT consistent with their conclusion |

**Subtotal: 5 marks**

---

## Question 5(c):

| Answer/Working | Mark | Guidance |
|---|---|---|
| Students attending school 11 choosing activity 7 as they have the highest value of $\frac{(O-E)^2}{E}$ | B1 | Correctly identifies strongest source of association as group with highest $\frac{(O-E)^2}{E}$; condone "the highest value" |

**Subtotal: 1 mark**

**Question 5 total: 8 marks**

---
5 A school management team oversees 11 different schools.\\
The school management team allows each student in the schools to choose one enrichment activity from 11 possible activities.

The school management team count the number of students in each school choosing each of the possible activities. They then conduct a $\chi ^ { 2 }$-test for association with the data they have gathered.

5
\begin{enumerate}[label=(\alph*)]
\item Exactly one of the calculated expected frequencies for the $\chi ^ { 2 }$-test is less than 5\\
Explain why the number of degrees of freedom for the test is 90\\

5
\item The school management team claims that there is an association between the school a student attends and the activity they choose.

The test statistic is 124.8

Test the claim using the $1 \%$ level of significance.\\

5
\item During the hypothesis test, the value of $\frac { ( O - E ) ^ { 2 } } { E }$, where $O$ is the observed frequency and $E$ is the expected frequency, was calculated for each group of students.

The values for four groups of students are shown in the table below.

\begin{center}
\begin{tabular}{|l|l|}
\hline
Group & $\frac { ( O - E ) ^ { 2 } } { E }$ \\
\hline
Attends school 3 and chose activity 1 & 0.01 \\
\hline
Attends school 8 and chose activity 3 & 18.5 \\
\hline
Attends school 8 and chose activity 7 & 24.2 \\
\hline
Attends school 11 and chose activity 7 & 49.0 \\
\hline
\end{tabular}
\end{center}

State, with a reason, which of the four groups of students represents the strongest source of association.
\end{enumerate}

\hfill \mbox{\textit{AQA Further Paper 3 Statistics 2023 Q5 [8]}}