| Exam Board | Edexcel |
|---|---|
| Module | S3 (Statistics 3) |
| Year | 2004 |
| Session | June |
| Marks | 12 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Chi-squared test of independence |
| Type | Percentages given, table construction required |
| Difficulty | Standard +0.3 This is a straightforward chi-squared test of independence with clearly presented data requiring construction of a 2×3 contingency table, calculation of expected frequencies, and application of the standard test procedure. While it requires careful organization of the given percentages into frequencies, it follows a completely standard template with no conceptual challenges beyond routine S3 material. |
| Spec | 5.06a Chi-squared: contingency tables |
| Answer | Marks | Guidance |
|---|---|---|
| Answer/Working | Marks | Guidance |
| Never, Sometimes, Regularly, Totals; Males: 30, 132, 78, 240; Females: 26, 143, 91, 260 | M1, A1 | convert % to freq; A1 (26, 91, 30, 132) |
| 56, 275, 169, 500 | B1, B1 | |
| \(H_0\): No association (independent) between gender and exercise; \(H_1\): association (not independent) between gender and exercise | M1, A1 | |
| Expected Values: Males: 26.88, 132, 81.12, 240; Females: 29.12, 143, 87.88, 260; 56, 275, 169, 500 | A1∨ | |
| \(\alpha = 0.05\) \(\nu = 2\) ; \(CV \chi^2 > 5.991\) | ||
| \(\sum \frac{(O-E)^2}{E}\) OR \(\sum \frac{O^2}{E} - N = 0.9271\) | answers in range 0.90 – 0.95 | |
| Not in critical region – no evidence of association between gender and exercise |
| Answer | Marks | Guidance |
|---|---|---|
| Answer/Working | Marks | Guidance |
| \(X \sim B(3, 1/6)\) | B1 | bino |
| 3, 1/6 | B1 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer/Working | Marks | Guidance |
| Probability distribution: | prob – must show working and use B(3,p) or may be implied by correct answer | |
| X = 0: \(\left(\frac{5}{6}\right)^3\) = 144.68 | M1, M1 | expected |
| X = 1: \(3 \times \frac{5}{6} \left(\frac{1}{6}\right)\) = 86.81 | ||
| X = 2: \(3 \times \frac{5}{6}\left(\frac{1}{6}\right)^2\) = 17.36 | B2 (-1 ee) | awrt 145.86,8,17.4,1.15/1.16 |
| X = 3: \(\left(\frac{1}{6}\right)^3\) = 1.15 (1.16) | ||
| \(H_0\): Binomial model is a good fit; \(H_1\): Binomial model is not a good fit | B1 | both, no ditto |
| Amalgamate 3 with another group | M1 | |
| \(\alpha = 0.01\) \(\nu = 2\) ; \(CR \chi^2 > 9.210\) | B1; B1∨ | |
| \(\sum \frac{(O-E)^2}{E}\) OR \(\sum \frac{O^2}{E} - N = 8.6894\ldots\) | M1, A1 | answers in range 8.67 – 8.70 or |
| Evidence that Binomial is a good model. | A1∨ |
| Answer | Marks | Guidance |
|---|---|---|
| Answer/Working | Marks | Guidance |
| Estimate p; Degrees of freedom reduced by 1 | B1, B1 | |
| Special case: Use of B(3, 0.192) in part (b) | ||
| Expected frequencies: 131.8785, 94.01242, 22.339, 1.769 | M1, M1, B0 | |
| \(H_0\): Binomial model is a good fit; \(H_1\): Binomial model is not a good fit | B1 | both, no ditto |
| Amalgamate 3 with another group | M1 | |
| \(\alpha = 0.01\) \(\nu = 1\) ; \(CR \chi^2 > 6.635\) | B1; B1∨ | |
| \(\sum \frac{(O-E)^2}{E}\) OR \(\sum \frac{O^2}{E} - N\) in range 5.45 -5.50 | M1, A1 | |
| Evidence that Binomial is a good model. | A1∨ |
| Answer | Marks | Guidance |
|---|---|---|
| Answer/Working | Marks | Guidance |
| \(E(D) = E(A) - 3E(B) + 4E(C) = 20\) | M1, A1 | |
| \(\text{Var}(D) = \text{Var}(A) + 9\text{Var}(B) + 16\text{Var}(C)\) | M1, M1 | Use of a²Var X; Adding 3 Var ie 4 +… |
| \(= 341\) | A1 | |
| \(P(D < 44) = P\left(z < \frac{44-20}{\sqrt{341}}\right)\) | M1, A1∨ | standardising their mean and sd |
| \(= P(z < 1.30)\) | A1 | awrt 1.30 |
| \(= 0.9032\) | A1 |
| Answer | Marks | Guidance |
|---|---|---|
| Answer/Working | Marks | Guidance |
| \(E(X) = 20\) | B1 | |
| \(\text{Var}(X) = \text{Var}(A) + 3\text{Var}(B) + 16\text{Var}(C)\) | M1, M1 | + and 16; 3 Var (B) |
| \(= 287\) | A1 | |
| \(P(X > 0) = P\left(z > \frac{-20}{\sqrt{287}}\right)\) | M1 | standardising their mean and sd |
| \(= P(z > -1.18)\) | A1 | awrt -1.18 |
| \(= 0.8810\) | A1 |
| Answer/Working | Marks | Guidance |
|---|---|---|
| Never, Sometimes, Regularly, Totals; Males: 30, 132, 78, 240; Females: 26, 143, 91, 260 | M1, A1 | convert % to freq; A1 (26, 91, 30, 132) |
| 56, 275, 169, 500 | B1, B1 | |
| $H_0$: No association (independent) between gender and exercise; $H_1$: association (not independent) between gender and exercise | M1, A1 | |
| Expected Values: Males: 26.88, 132, 81.12, 240; Females: 29.12, 143, 87.88, 260; 56, 275, 169, 500 | A1∨ | |
| $\alpha = 0.05$ $\nu = 2$ ; $CV \chi^2 > 5.991$ | | |
| $\sum \frac{(O-E)^2}{E}$ OR $\sum \frac{O^2}{E} - N = 0.9271$ | | answers in range 0.90 – 0.95 |
| Not in critical region – no evidence of association between gender and exercise | | |
**Total: 12 marks**
## Question 6a
| Answer/Working | Marks | Guidance |
|---|---|---|
| $X \sim B(3, 1/6)$ | B1 | bino |
| 3, 1/6 | B1 | |
**Total: 2 marks**
## Question 6b
| Answer/Working | Marks | Guidance |
|---|---|---|
| Probability distribution: | | prob – must show working and use B(3,p) or may be implied by correct answer |
| X = 0: $\left(\frac{5}{6}\right)^3$ = 144.68 | M1, M1 | expected |
| X = 1: $3 \times \frac{5}{6} \left(\frac{1}{6}\right)$ = 86.81 | | |
| X = 2: $3 \times \frac{5}{6}\left(\frac{1}{6}\right)^2$ = 17.36 | B2 (-1 ee) | awrt 145.86,8,17.4,1.15/1.16 |
| X = 3: $\left(\frac{1}{6}\right)^3$ = 1.15 (1.16) | | |
| $H_0$: Binomial model is a good fit; $H_1$: Binomial model is not a good fit | B1 | both, no ditto |
| Amalgamate 3 with another group | M1 | |
| $\alpha = 0.01$ $\nu = 2$ ; $CR \chi^2 > 9.210$ | B1; B1∨ | |
| $\sum \frac{(O-E)^2}{E}$ OR $\sum \frac{O^2}{E} - N = 8.6894\ldots$ | M1, A1 | answers in range 8.67 – 8.70 or |
| Evidence that Binomial is a good model. | A1∨ | |
**Total: 11 marks**
## Question 6c
| Answer/Working | Marks | Guidance |
|---|---|---|
| Estimate p; Degrees of freedom reduced by 1 | B1, B1 | |
| Special case: Use of B(3, 0.192) in part (b) | | |
| Expected frequencies: 131.8785, 94.01242, 22.339, 1.769 | M1, M1, B0 | |
| $H_0$: Binomial model is a good fit; $H_1$: Binomial model is not a good fit | B1 | both, no ditto |
| Amalgamate 3 with another group | M1 | |
| $\alpha = 0.01$ $\nu = 1$ ; $CR \chi^2 > 6.635$ | B1; B1∨ | |
| $\sum \frac{(O-E)^2}{E}$ OR $\sum \frac{O^2}{E} - N$ in range 5.45 -5.50 | M1, A1 | |
| Evidence that Binomial is a good model. | A1∨ | |
**Total: 11 marks**
## Question 7a
| Answer/Working | Marks | Guidance |
|---|---|---|
| $E(D) = E(A) - 3E(B) + 4E(C) = 20$ | M1, A1 | |
| $\text{Var}(D) = \text{Var}(A) + 9\text{Var}(B) + 16\text{Var}(C)$ | M1, M1 | Use of a²Var X; Adding 3 Var ie 4 +… |
| $= 341$ | A1 | |
| $P(D < 44) = P\left(z < \frac{44-20}{\sqrt{341}}\right)$ | M1, A1∨ | standardising their mean and sd |
| $= P(z < 1.30)$ | A1 | awrt 1.30 |
| $= 0.9032$ | A1 | |
**Total: 9 marks**
## Question 7b
| Answer/Working | Marks | Guidance |
|---|---|---|
| $E(X) = 20$ | B1 | |
| $\text{Var}(X) = \text{Var}(A) + 3\text{Var}(B) + 16\text{Var}(C)$ | M1, M1 | + and 16; 3 Var (B) |
| $= 287$ | A1 | |
| $P(X > 0) = P\left(z > \frac{-20}{\sqrt{287}}\right)$ | M1 | standardising their mean and sd |
| $= P(z > -1.18)$ | A1 | awrt -1.18 |
| $= 0.8810$ | A1 | |
**Total: 7 marks**
5. A random sample of 500 adults completed a questionnaire on how often they took part in some form of exercise. They gave a response of 'never', 'sometimes' or 'regularly'. Of those asked, $52 \%$ were females of whom $10 \%$ never exercised and $35 \%$ exercised regularly. Of the males, $12.5 \%$ never exercised and $55 \%$ sometimes exercised.
Test, at the $5 \%$ level of significance, whether or not there is any association between gender and the amount of exercise. State your hypotheses clearly.
\hfill \mbox{\textit{Edexcel S3 2004 Q5 [12]}}