Edexcel FS1 AS 2024 June — Question 4 15 marks

Exam BoardEdexcel
ModuleFS1 AS (Further Statistics 1 AS)
Year2024
SessionJune
Marks15
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicChi-squared goodness of fit
TypeChi-squared goodness of fit: Binomial
DifficultyStandard +0.3 This is a standard chi-squared goodness of fit test for a binomial distribution with straightforward parts: stating assumptions, estimating p from the mean, finding missing expected frequencies by subtraction, and performing the test. The only mild complication is understanding why df = 3 (combining cells and losing 1 df for estimating p), but this is routine FS1 material requiring no novel insight.
Spec5.02b Expectation and variance: discrete random variables5.02c Linear coding: effects on mean and variance5.06b Fit prescribed distribution: chi-squared test5.06c Fit other distributions: discrete and continuous

  1. Robin shoots 8 arrows at a target each day for 100 days.
The number of times he hits the target each day is summarised in the table below.
Number of hits012345678
Frequency1103034174202
Misha believes that these data can be modelled by a binomial distribution.
  1. State, in context, two assumptions that are implied by the use of this model.
  2. Find an estimate for the proportion of arrows Robin shoots that hit the target. Misha calculates expected frequencies, to 2 decimal places, as follows.
    Number of hits012345678
    Expected frequency2.8112.67\(r\)28.0519.73\(s\)2.500.400.03
  3. Find the value of \(r\) and the value of \(s\) Misha correctly used a suitable test to assess her belief.
    1. Explain why she used a test with 3 degrees of freedom.
    2. Complete the test using a \(5 \%\) level of significance. You should clearly state your hypotheses, test statistic, critical value and conclusion.

Question 4:
Part 4(a):
AnswerMarks Guidance
AnswerMark Guidance
e.g. The probability of an arrow hitting the target is constantB1 1st B1 for one suitable comment mentioning "arrows"/"shots"/"hit"/"target", or for both comments not in context
e.g. The arrows are shot independentlyB1 (2) 2nd B1 for both suitable comments which must mention "arrows"/"shots"/"hit"/"target" at least once
Part 4(b):
AnswerMarks Guidance
AnswerMark Guidance
\(\hat{p} = \dfrac{0\times1 + 1\times10 + 2\times30 + 3\times34 + 4\times17 + 5\times4 + 6\times2 + 7\times0 + 8\times2}{100 \times 8}\)M1 For correct attempt at mean or \(\hat{p}\) (allow at least 3 correct non-zero products seen)
\(= \left[\dfrac{2.88}{8}\right] = \mathbf{0.36}\)A1 (2) For 0.36 (sight of 2.88 scores M1 and A mark when divided by 8)
Part 4(c):
AnswerMarks Guidance
AnswerMark Guidance
\([\text{Let}\ X \sim B(8,\ \text{awrt "0.36")}]\) \(P(X=2) = 0.24936\ldots\) or \(P(X=5) = 0.08876\ldots\)M1 For sight or use of a correct binomial model to find either prob (ft their 0.36)
\((E(2)) = \text{awrt}\ \mathbf{24.94}\ (24.93\text{--}24.95)\) or \((E(5)) = \text{awrt}\ \mathbf{8.88}\ (8.87\text{--}8.89)\)A1 For either correct expected frequency
For using Sum of Expected frequencies \(= 100\) (accept awrt 100.01)B1ft (3) For two positive values of \(r\) and \(s\) such that \(r + s = 33.82\) or \(33.81\)
Part 4(d)(i):
AnswerMarks Guidance
AnswerMark Guidance
Need to pool some values to make all \(E_i > 5\) [needn't specify]B1 For mention of the need to pool since \(E_i < 5\) or to get \(E_i > 5\)
Two constraints AND \(p\) was estimated ["2 constraints" \(\Leftarrow 5 - 2 = 3\)]B1 (2) For mention of 2 constraints because \(p\) is estimated/calculated from data
Part 4(d)(ii):
AnswerMarks Guidance
AnswerMark Guidance
\(H_0\): Binomial is a good model for these data; \(H_1\): Binomial is not a suitable model for these dataB1 For both hypotheses correct
Pooled expected frequencies: 0 or 1: 15.48; 2: "24.94"; 3: 28.05; 4: 19.73; \(\geqslant 5\): "11.80"M1 For correct pooling \((0\ \&\ 1)\) and \((\ldots 5)\) (ft their \(r\) and \(s\)); may be implied by awrt 5.18
\(\dfrac{(O_i - E_i)^2}{E_i}\): \(\dfrac{(11-15.48)^2}{15.48}=1.30\); \(\dfrac{(30-\text{"24.94"})^2}{24.94}=1.03\); \(\dfrac{(34-28.05)^2}{28.05}=1.26\); \(\dfrac{(17-19.73)^2}{19.73}=0.38\); \(\dfrac{(8-\text{"11.80"})^2}{11.80}=1.22\)M1 For correct use of test statistic (at least one correct calc to at least 2 sf from 3rd or 4th row)
Test statistic \(= 5.188692\ldots\ \text{awrt}\ \mathbf{5.19}\) (allow awrt 5.18)A1 For awrt 5.19 (accept 5.18, 5.20)
CV: \(\chi^2_3(5\%) = 7.815\)B1 For 7.815 (or better); NB \(p\)-value \(= 0.1586\ldots\) and is B0
[Do not reject \(H_0\)] Data is compatible with Misha's belief/Binomial model is suitableA1 (6) dep on both M marks; e.g. \(B(8, 0.36)\) is a suitable model is A1
# Question 4:

## Part 4(a):
| Answer | Mark | Guidance |
|--------|------|----------|
| e.g. The **probability** of an arrow **hitting** the **target** is **constant** | B1 | 1st B1 for **one** suitable comment mentioning "arrows"/"shots"/"hit"/"target", or for **both** comments not in context |
| e.g. The **arrows** are shot **independently** | B1 (2) | 2nd B1 for **both** suitable comments which must mention "arrows"/"shots"/"hit"/"target" at least once |

## Part 4(b):
| Answer | Mark | Guidance |
|--------|------|----------|
| $\hat{p} = \dfrac{0\times1 + 1\times10 + 2\times30 + 3\times34 + 4\times17 + 5\times4 + 6\times2 + 7\times0 + 8\times2}{100 \times 8}$ | M1 | For correct attempt at mean or $\hat{p}$ (allow at least 3 correct non-zero products seen) |
| $= \left[\dfrac{2.88}{8}\right] = \mathbf{0.36}$ | A1 (2) | For 0.36 (sight of 2.88 scores M1 and A mark when divided by 8) |

## Part 4(c):
| Answer | Mark | Guidance |
|--------|------|----------|
| $[\text{Let}\ X \sim B(8,\ \text{awrt "0.36")}]$ $P(X=2) = 0.24936\ldots$ or $P(X=5) = 0.08876\ldots$ | M1 | For sight or use of a correct binomial model to find either prob (ft their 0.36) |
| $(E(2)) = \text{awrt}\ \mathbf{24.94}\ (24.93\text{--}24.95)$ or $(E(5)) = \text{awrt}\ \mathbf{8.88}\ (8.87\text{--}8.89)$ | A1 | For **either** correct expected frequency |
| For using Sum of Expected frequencies $= 100$ (accept awrt 100.01) | B1ft (3) | For two positive values of $r$ and $s$ such that $r + s = 33.82$ or $33.81$ |

## Part 4(d)(i):
| Answer | Mark | Guidance |
|--------|------|----------|
| Need to pool some values to make all $E_i > 5$ [needn't specify] | B1 | For mention of the need to pool since $E_i < 5$ or to get $E_i > 5$ |
| Two constraints **AND** $p$ was estimated ["2 constraints" $\Leftarrow 5 - 2 = 3$] | B1 (2) | For mention of 2 constraints because $p$ is estimated/calculated from data |

## Part 4(d)(ii):
| Answer | Mark | Guidance |
|--------|------|----------|
| $H_0$: Binomial is a good model for these data; $H_1$: Binomial is not a suitable model for these data | B1 | For both hypotheses correct |
| Pooled expected frequencies: 0 or 1: 15.48; 2: "24.94"; 3: 28.05; 4: 19.73; $\geqslant 5$: "11.80" | M1 | For correct pooling $(0\ \&\ 1)$ and $(\ldots 5)$ (ft their $r$ and $s$); may be implied by awrt 5.18 |
| $\dfrac{(O_i - E_i)^2}{E_i}$: $\dfrac{(11-15.48)^2}{15.48}=1.30$; $\dfrac{(30-\text{"24.94"})^2}{24.94}=1.03$; $\dfrac{(34-28.05)^2}{28.05}=1.26$; $\dfrac{(17-19.73)^2}{19.73}=0.38$; $\dfrac{(8-\text{"11.80"})^2}{11.80}=1.22$ | M1 | For correct use of test statistic (at least one correct calc to at least 2 sf from 3rd or 4th row) |
| Test statistic $= 5.188692\ldots\ \text{awrt}\ \mathbf{5.19}$ (allow awrt 5.18) | A1 | For awrt 5.19 (accept 5.18, 5.20) |
| CV: $\chi^2_3(5\%) = 7.815$ | B1 | For 7.815 (or better); NB $p$-value $= 0.1586\ldots$ and is B0 |
| [Do not reject $H_0$] Data is compatible with Misha's belief/Binomial model is suitable | A1 (6) | dep on both M marks; e.g. $B(8, 0.36)$ is a suitable model is A1 |
\begin{enumerate}
  \item Robin shoots 8 arrows at a target each day for 100 days.
\end{enumerate}

The number of times he hits the target each day is summarised in the table below.

\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | c | c | c | }
\hline
Number of hits & 0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\
\hline
Frequency & 1 & 10 & 30 & 34 & 17 & 4 & 2 & 0 & 2 \\
\hline
\end{tabular}
\end{center}

Misha believes that these data can be modelled by a binomial distribution.\\
(a) State, in context, two assumptions that are implied by the use of this model.\\
(b) Find an estimate for the proportion of arrows Robin shoots that hit the target.

Misha calculates expected frequencies, to 2 decimal places, as follows.

\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | c | c | c | }
\hline
Number of hits & 0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\
\hline
Expected frequency & 2.81 & 12.67 & $r$ & 28.05 & 19.73 & $s$ & 2.50 & 0.40 & 0.03 \\
\hline
\end{tabular}
\end{center}

(c) Find the value of $r$ and the value of $s$

Misha correctly used a suitable test to assess her belief.\\
(d) (i) Explain why she used a test with 3 degrees of freedom.\\
(ii) Complete the test using a $5 \%$ level of significance. You should clearly state your hypotheses, test statistic, critical value and conclusion.

\hfill \mbox{\textit{Edexcel FS1 AS 2024 Q4 [15]}}