OCR MEI S3 2011 June — Question 2 18 marks

Exam BoardOCR MEI
ModuleS3 (Statistics 3)
Year2011
SessionJune
Marks18
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicChi-squared goodness of fit
TypeChi-squared goodness of fit: Poisson
DifficultyStandard +0.3 Part (i) is a standard chi-squared goodness of fit test for a Poisson distribution with most calculations already provided—students just need to complete the table, calculate the test statistic, find critical value, and conclude. Part (ii) is a routine Wilcoxon signed-rank test application. Both are textbook exercises requiring careful execution but no novel insight, making this slightly easier than average for S3 level.
Spec5.02i Poisson distribution: random events model5.02j Poisson formula: P(X=x) = e^(-lambda)*lambda^x/x!5.02k Calculate Poisson probabilities5.06c Fit other distributions: discrete and continuous

2 Scientists researching into the chemical composition of dust in space collect specimens using a specially designed spacecraft. The craft collects the particles of dust in trays that are made up of a large array of cells containing aerogel. The aerogel traps the particles that penetrate into the cells.
  1. For a random sample of 100 cells, the number of particles of dust in each cell was counted, giving the following results.
    Number of particles0123456789\(10 +\)
    Frequency4710201715109530
    It is thought that the number of particles collected in each cell can be modelled using the distribution Poisson(4.2) since 4.2 is the sample mean for these data. Some of the calculations for a \(\chi ^ { 2 }\) test are shown below. The cells for 8,9 and \(10 +\) particles have been combined.
    Number of particles
    Observed frequency
    Expected frequency
    Contribution to \(X ^ { 2 }\)
    567\(8 +\)
    151098
    16.3311.446.866.39
    0.10830.18130.66760.4056
    Complete the calculations and carry out the test using a \(10 \%\) significance level to see whether the number of particles per cell may be modelled in this way.
  2. The diameters of the dust particles are believed to be distributed symmetrically about a median of 15 micrometres \(( \mu \mathrm { m } )\). For a random sample of 20 particles, the sum of the signed ranks of the diameters of the particles smaller than \(15 \mu \mathrm {~m} \left( W _ { - } \right)\)is found to be 53 . Test at the \(5 \%\) level of significance whether the median diameter appears to be more than \(15 \mu \mathrm {~m}\).

Question 2:
Part (i)
AnswerMarks Guidance
AnswerMark Guidance
Probabilities correct to 3 d.p. or betterM1
\(\times 100\) for expected frequenciesM1
All expected frequencies correctA1
Merge first two cellsM1
Calculation of \(X^2\)M1
\(X^2 = 1.3128 + 0.7843 + 0.1183 + 0.3063 + 0.1083 + 0.1813 + 0.6676 + 0.4056 = 3.884(5)\)A1 c.a.o. For ungrouped cells \(X^2 = 6.816\)
\(H_0\): The Poisson model fits the dataB1 Ignore any reference to the parameter
\(H_1\): The Poisson model does not fit the dataB1 Do not accept "data fit model"
Refer to \(\chi^2_6\)M1 Allow correct df (= cells \(-2\)) from wrongly grouped table
Upper 10% point is 10.64A1 No ft from here if wrong. \(\chi^2_7 = 12.02\), \(P(X^2 > 3.884) = 0.6924\)
Not significantA1 ft only c's test statistic
Evidence suggests model fits the dataA1 Do not accept "data fit model"
Part (ii)
AnswerMarks Guidance
AnswerMark Guidance
\(H_0: m = 15\), \(H_1: m > 15\) where \(m\) is the population median diameter (in \(\mu\)m)B1 B1 Both. Accept in words. Must include "population"
Given \(W_- = 53\) (\(\therefore W_+ = 157\))
Refer to Wilcoxon paired/single sample statistic for \(n = 20\)M1 No ft from here if wrong
Lower 5% point is 60 (or upper is 150 if \(W_+\) used)A1 1-tail test. No ft from here if wrong
Result is significantA1 ft only c's test statistic
Evidence suggests median diameter appears to be more than 15 \(\mu\)mA1 ft only c's test statistic. Context must include "average" o.e.
# Question 2:

## Part (i)
| Answer | Mark | Guidance |
|--------|------|----------|
| Probabilities correct to 3 d.p. or better | M1 | |
| $\times 100$ for expected frequencies | M1 | |
| All expected frequencies correct | A1 | |
| Merge first two cells | M1 | |
| Calculation of $X^2$ | M1 | |
| $X^2 = 1.3128 + 0.7843 + 0.1183 + 0.3063 + 0.1083 + 0.1813 + 0.6676 + 0.4056 = 3.884(5)$ | A1 | c.a.o. For ungrouped cells $X^2 = 6.816$ |
| $H_0$: The Poisson model fits the data | B1 | Ignore any reference to the parameter |
| $H_1$: The Poisson model does not fit the data | B1 | Do not accept "data fit model" |
| Refer to $\chi^2_6$ | M1 | Allow correct df (= cells $-2$) from wrongly grouped table |
| Upper 10% point is 10.64 | A1 | No ft from here if wrong. $\chi^2_7 = 12.02$, $P(X^2 > 3.884) = 0.6924$ |
| Not significant | A1 | ft only c's test statistic |
| Evidence suggests model fits the data | A1 | Do not accept "data fit model" |

## Part (ii)
| Answer | Mark | Guidance |
|--------|------|----------|
| $H_0: m = 15$, $H_1: m > 15$ where $m$ is the population median diameter (in $\mu$m) | B1 B1 | Both. Accept in words. Must include "population" |
| Given $W_- = 53$ ($\therefore W_+ = 157$) | | |
| Refer to Wilcoxon paired/single sample statistic for $n = 20$ | M1 | No ft from here if wrong |
| Lower 5% point is 60 (or upper is 150 if $W_+$ used) | A1 | 1-tail test. No ft from here if wrong |
| Result is significant | A1 | ft only c's test statistic |
| Evidence suggests median diameter appears to be more than 15 $\mu$m | A1 | ft only c's test statistic. Context must include "average" o.e. |

---
2 Scientists researching into the chemical composition of dust in space collect specimens using a specially designed spacecraft. The craft collects the particles of dust in trays that are made up of a large array of cells containing aerogel. The aerogel traps the particles that penetrate into the cells.\\
(i) For a random sample of 100 cells, the number of particles of dust in each cell was counted, giving the following results.

\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | c | c | c | c | c | }
\hline
Number of particles & 0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 & $10 +$ \\
\hline
Frequency & 4 & 7 & 10 & 20 & 17 & 15 & 10 & 9 & 5 & 3 & 0 \\
\hline
\end{tabular}
\end{center}

It is thought that the number of particles collected in each cell can be modelled using the distribution Poisson(4.2) since 4.2 is the sample mean for these data.

Some of the calculations for a $\chi ^ { 2 }$ test are shown below. The cells for 8,9 and $10 +$ particles have been combined.

\begin{center}
\begin{tabular}{ | l | }
\hline
Number of particles \\
\hline
Observed frequency \\
\hline
Expected frequency \\
\hline
Contribution to $X ^ { 2 }$ \\
\hline
\end{tabular}
\end{center}

\begin{center}
\begin{tabular}{ | c | c | c | c | }
\hline
5 & 6 & 7 & $8 +$ \\
\hline
15 & 10 & 9 & 8 \\
\hline
16.33 & 11.44 & 6.86 & 6.39 \\
\hline
0.1083 & 0.1813 & 0.6676 & 0.4056 \\
\hline
\end{tabular}
\end{center}

Complete the calculations and carry out the test using a $10 \%$ significance level to see whether the number of particles per cell may be modelled in this way.\\
(ii) The diameters of the dust particles are believed to be distributed symmetrically about a median of 15 micrometres $( \mu \mathrm { m } )$. For a random sample of 20 particles, the sum of the signed ranks of the diameters of the particles smaller than $15 \mu \mathrm {~m} \left( W _ { - } \right)$is found to be 53 . Test at the $5 \%$ level of significance whether the median diameter appears to be more than $15 \mu \mathrm {~m}$.

\hfill \mbox{\textit{OCR MEI S3 2011 Q2 [18]}}