Edexcel S3 2014 June — Question 6 14 marks

Exam BoardEdexcel
ModuleS3 (Statistics 3)
Year2014
SessionJune
Marks14
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicChi-squared goodness of fit
TypeChi-squared goodness of fit: Binomial
DifficultyStandard +0.3 This is a standard chi-squared goodness of fit test for a binomial distribution with straightforward calculations. Part (a) requires simple weighted mean calculation, part (b) uses binomial probability formula (routine for S3), part (c) is a textbook chi-squared test with cell combination, and part (d) requires basic interpretation. All steps are procedural with no novel insight required, making it slightly easier than average for S3 level.
Spec5.06b Fit prescribed distribution: chi-squared test5.06c Fit other distributions: discrete and continuous

6. Eight tasks were given to each of 125 randomly selected job applicants. The number of tasks failed by each applicant is recorded. The results are as follows
Number of tasks failed by an applicant0123456 or more
Frequency22145421230
  1. Show that the probability of a randomly selected task, from this sample, being failed is 0.3 An employer believes that a binomial distribution might provide a good model for the number of tasks, out of 8, that an applicant fails. He uses a binomial distribution, with the estimated probability 0.3 of a task being failed. The calculated expected frequencies are as follows
    Number of tasks failed by an applicant0123456 or more
    Expected frequency7.2124.7137.06\(r\)17.025.83\(s\)
  2. Find the value of \(r\) and the value of \(s\) giving your answers to 2 decimal places.
  3. Test, at the \(5 \%\) level of significance, whether or not a binomial distribution is a suitable model for these data. State your hypotheses and show your working clearly. The employer believes that all applicants have the same probability of failing each task.
  4. Use your result from part(c) to comment on this belief.

Question 6:
Part (a)
AnswerMarks Guidance
AnswerMark Guidance
\(\hat{p} = \frac{0(2)+1(21)+2(45)+3(42)+4(12)+5(3)}{8(2+21+45+42+12+3)}\) or \(8(125)\) \(\left\{=\frac{300}{1000}\right\} = 0.3\)M1 Must show clearly how to get either 300 or 1000
Answer is given. See notes.A1 cso Showing how to get both 300 and 1000 and reaching \(p=0.3\)
Part (b)
AnswerMarks Guidance
AnswerMark Guidance
\(r = 125 \times {}^8C_3(0.3)^3(0.7)^5 \{= 31.76523...\}\) (formula) or \(r = 125 \times (0.8059-0.5518)\{=31.7625\}\) (tables)M1 For any correct method (or correct expression) for finding either \(r\) or \(s\)
\(s = 125-(7.21+24.71+37.06+\text{their } r+17.02+5.83)\{=1.40477...\text{ or }1.4075\}\) or \(s=125\times(1-0.9887)\{=1.4125\}\)
\(r = 31.76523\) or \(31.7625\) or \(31.7575\)A1 \(r=\) awrt 31.77 or \(r=\) awrt 31.76
\(s = 1.40477\) or \(1.4075\) or \(1.4125\)A1 \(s=1.4\) or awrt 1.40 or \(s=\) awrt 1.41
Part (c)
AnswerMarks Guidance
AnswerMark Guidance
Pooling 5 failed tasks and \(\geqslant 6\) ONLY1st M1 For an attempt to pool 5 failed tasks and \(\geqslant 6\) failed tasks ONLY. Give 1st M0 for pooling 0 failed tasks and 1 failed task
At least 2 correct expressions/values for test statistic (to awrt 2 d.p. or truncated 2 d.p.)2nd M1
\(X^2 = \sum\frac{(O-E)^2}{E}\) or \(\sum\frac{O^2}{E}-125 :=\) awrt 13.3dM1 Dependent on 2nd M1. For applying either \(\sum\frac{(O-E)^2}{E}\) or \(\sum\frac{O^2}{E}-125\)
awrt 13.31st A1 See notes
\(\nu = 6-1-1=4\)1st B1 ft For their evaluated \(n-1-1\), i.e. realising they must subtract 2 from their \(n\)
\(\chi^2_4(0.05)=9.488 \Rightarrow\) CR: \(X^2 \geqslant 9.488\)2nd B1 A correct ft for their \(\chi^2_k(0.05)\), where \(k=n-1-1\) from their \(n\)
\(H_0\): Binomial distribution is a good (or suitable) model (or fit). \(H_1\): Binomial distribution is not a suitable model.3rd B1 Must have both hypotheses and mention Binomial at least once. Inclusion of 0.3 for \(p\) in hypotheses is B0 but condone in conclusion
[In the CR/significant/Reject \(H_0\)] Binomial distribution is not a suitable model.Final A1 Dependent on 2nd and 3rd Method marks only. A correct conclusion (context not required) which is based on their \(X^2\)-value and their \(\chi^2\)-critical value
Part (d)
AnswerMarks Guidance
AnswerMark Guidance
Following from a correct conclusion in part (c), a comment conveying either: \(p\) is not constant or employer's belief is not justifiedB1
# Question 6:

## Part (a)

| Answer | Mark | Guidance |
|--------|------|----------|
| $\hat{p} = \frac{0(2)+1(21)+2(45)+3(42)+4(12)+5(3)}{8(2+21+45+42+12+3)}$ or $8(125)$ $\left\{=\frac{300}{1000}\right\} = 0.3$ | M1 | Must show clearly how to get either 300 or 1000 |
| Answer is given. See notes. | A1 cso | Showing how to get both 300 and 1000 and reaching $p=0.3$ |

## Part (b)

| Answer | Mark | Guidance |
|--------|------|----------|
| $r = 125 \times {}^8C_3(0.3)^3(0.7)^5 \{= 31.76523...\}$ (formula) or $r = 125 \times (0.8059-0.5518)\{=31.7625\}$ (tables) | M1 | For any correct method (or correct expression) for finding either $r$ or $s$ |
| $s = 125-(7.21+24.71+37.06+\text{their } r+17.02+5.83)\{=1.40477...\text{ or }1.4075\}$ or $s=125\times(1-0.9887)\{=1.4125\}$ | | |
| $r = 31.76523$ or $31.7625$ or $31.7575$ | A1 | $r=$ awrt 31.77 or $r=$ awrt 31.76 |
| $s = 1.40477$ or $1.4075$ or $1.4125$ | A1 | $s=1.4$ or awrt 1.40 or $s=$ awrt 1.41 |

## Part (c)

| Answer | Mark | Guidance |
|--------|------|----------|
| Pooling 5 failed tasks and $\geqslant 6$ ONLY | 1st M1 | For an attempt to pool 5 failed tasks and $\geqslant 6$ failed tasks ONLY. Give 1st M0 for pooling 0 failed tasks and 1 failed task |
| At least 2 correct expressions/values for test statistic (to awrt 2 d.p. or truncated 2 d.p.) | 2nd M1 | |
| $X^2 = \sum\frac{(O-E)^2}{E}$ or $\sum\frac{O^2}{E}-125 :=$ awrt 13.3 | dM1 | Dependent on 2nd M1. For applying either $\sum\frac{(O-E)^2}{E}$ or $\sum\frac{O^2}{E}-125$ |
| awrt 13.3 | 1st A1 | See notes |
| $\nu = 6-1-1=4$ | 1st B1 ft | For their evaluated $n-1-1$, i.e. realising they must subtract 2 from their $n$ |
| $\chi^2_4(0.05)=9.488 \Rightarrow$ CR: $X^2 \geqslant 9.488$ | 2nd B1 | A correct ft for their $\chi^2_k(0.05)$, where $k=n-1-1$ from their $n$ |
| $H_0$: Binomial distribution is a good (or suitable) model (or fit). $H_1$: Binomial distribution is not a suitable model. | 3rd B1 | Must have both hypotheses and mention Binomial at least once. Inclusion of 0.3 for $p$ in hypotheses is B0 but condone in conclusion |
| [In the CR/significant/Reject $H_0$] Binomial distribution is not a suitable model. | Final A1 | Dependent on 2nd and 3rd Method marks only. A correct conclusion (context not required) which is based on **their** $X^2$-value and **their** $\chi^2$-critical value |

## Part (d)

| Answer | Mark | Guidance |
|--------|------|----------|
| Following from a correct conclusion in part (c), a comment conveying **either**: $p$ is not constant **or** employer's belief is not justified | B1 | |

---
6. Eight tasks were given to each of 125 randomly selected job applicants. The number of tasks failed by each applicant is recorded.

The results are as follows

\begin{center}
\begin{tabular}{|l|l|l|l|l|l|l|l|}
\hline
Number of tasks failed by an applicant & 0 & 1 & 2 & 3 & 4 & 5 & 6 or more \\
\hline
Frequency & 2 & 21 & 45 & 42 & 12 & 3 & 0 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\alph*)]
\item Show that the probability of a randomly selected task, from this sample, being failed is 0.3

An employer believes that a binomial distribution might provide a good model for the number of tasks, out of 8, that an applicant fails.

He uses a binomial distribution, with the estimated probability 0.3 of a task being failed. The calculated expected frequencies are as follows

\begin{center}
\begin{tabular}{|l|l|l|l|l|l|l|l|}
\hline
Number of tasks failed by an applicant & 0 & 1 & 2 & 3 & 4 & 5 & 6 or more \\
\hline
Expected frequency & 7.21 & 24.71 & 37.06 & $r$ & 17.02 & 5.83 & $s$ \\
\hline
\end{tabular}
\end{center}
\item Find the value of $r$ and the value of $s$ giving your answers to 2 decimal places.
\item Test, at the $5 \%$ level of significance, whether or not a binomial distribution is a suitable model for these data. State your hypotheses and show your working clearly.

The employer believes that all applicants have the same probability of failing each task.
\item Use your result from part(c) to comment on this belief.
\end{enumerate}

\hfill \mbox{\textit{Edexcel S3 2014 Q6 [14]}}