OCR MEI S3 2010 June — Question 3 18 marks

Exam BoardOCR MEI
ModuleS3 (Statistics 3)
Year2010
SessionJune
Marks18
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicWilcoxon tests
TypeWilcoxon matched-pairs signed-rank test
DifficultyStandard +0.3 This is a standard S3 question testing two routine non-parametric methods (Wilcoxon signed rank test and chi-squared goodness of fit). Part (a) requires straightforward application of the Wilcoxon test with clear data, and part (b) is an incomplete chi-squared test. Both are textbook applications with no novel insight required, making this slightly easier than average for A-level statistics.
Spec5.06a Chi-squared: contingency tables5.07b Sign test: and Wilcoxon signed-rank

  1. In order to prevent and/or control the spread of infectious diseases, the Government has various vaccination programmes. One such programme requires people to receive a booster injection at the age of 18. It is felt that the proportion of people receiving this booster could be increased and a publicity campaign is undertaken for this purpose. In order to assess the effectiveness of this campaign, health authorities across the country are asked to report the percentage of 18-year-olds receiving the booster before and after the campaign. The results for a randomly chosen sample of 9 authorities are as follows.
    AuthorityABCDEFGHI
    Before769888818684839380
    After829793778395919589
    This sample is to be tested to see whether the campaign appears to have been successful in raising the percentage receiving the booster.
    1. Explain why the use of paired data is appropriate in this context. [1]
    2. Carry out an appropriate Wilcoxon signed rank test using these data, at the 5\% significance level. [10]
  2. Benford's Law predicts the following probability distribution for the first significant digit in some large data sets.
    Digit123456789
    Probability0.3010.1760.1250.0970.0790.0670.0580.0510.046
    On one particular day, the first significant digits of the stock market prices of the shares of a random sample of 200 companies gave the following results.
    Digit123456789
    Frequency55342716151712159
    Test at the 10\% level of significance whether Benford's Law provides a reasonable model in the context of share prices. [7]

Part (a)(i)
Use paired data in order to eliminate differences between authorities.
AnswerMarks
B1[1]
Part (a)(ii)
\(H_0: m = 0\)
\(H_1: m > 0\)
where \(m\) is the population median difference.
AnswerMarks Guidance
B1Both. Accept hypotheses in words.
B1Adequate definition of \(m\) to include "population".
Diff (After − Before)6 −1
Rank of \(\\text{diff}\ \)
\(W = 1 + 3 + 4 + 8\) (or \(2 + 5 + 6 + 7 + 8 + 9 = 37\))
Refer to tables of Wilcoxon paired (single sample) statistic for \(n = 9\).
Lower 5% point is 8 (or upper is 37 if \(W_-\) used).
Result is significant.
Evidence suggests the percentage has been raised (on the whole).
AnswerMarks
M1For differences. ZERO in this section if differences not used.
M1For ranks.
A1FT from here if ranks wrong
B1
M1No ft from here if wrong.
A1i.e. a 1-tail test. No ft from here if wrong.
A1If only c's test statistic.
A1If only c's test statistic.
[10]
Part (b)
\(H_0:\) Stock market prices can be modelled by Benford's Law.
\(H_1:\) Stock market prices can not be modelled by Benford's Law.
AnswerMarks Guidance
Prob0.301 0.176
Exp f60.2 35.2
Obs f55 34
M1Probs \(\times\) 200 for expected frequencies. All correct.
M1Calculation of \(\chi^2\).
\(\chi^2 = 0.44917 + 0.04091 + 0.16 + 0.59588 + 0.04051 + 0.96716 + 0.01379 + 2.25882 + 0.00435 = 4.5305(9)\)
Refer to \(\chi_8^2\).
AnswerMarks
A1c.a.o.
M1Allow correct df (= cells − 1) from wrongly grouped table and fit. Otherwise, no ft if wrong.
Upper 5% point is 13.36.
Not significant.
Suggests Benford's Law provides a reasonable model in the context of share prices.
AnswerMarks
A1\(P(\chi^2 > 4.53059) = 0.80636.\) No ft from here if wrong.
A1If only c's test statistic.
A1If only c's test statistic.
[7]
## Part (a)(i)
Use paired data in order to eliminate differences between authorities.

| B1 | [1]

## Part (a)(ii)
$H_0: m = 0$
$H_1: m > 0$

where $m$ is the population median difference.

| B1 | Both. Accept hypotheses in words. |
| B1 | Adequate definition of $m$ to include "population". |

| Diff (After − Before) | 6 | −1 | 5 | −4 | −3 | 11 | 8 | 2 | 9 |
|---|---|---|---|---|---|---|---|---|---|
| Rank of $\|\text{diff}\|$ | 6 | 1 | 5 | 4 | 3 | 9 | 7 | 2 | 8 |

$W = 1 + 3 + 4 + 8$ (or $2 + 5 + 6 + 7 + 8 + 9 = 37$)

Refer to tables of Wilcoxon paired (single sample) statistic for $n = 9$.
Lower 5% point is 8 (or upper is 37 if $W_-$ used).
Result is significant.
Evidence suggests the percentage has been raised (on the whole).

| M1 | For differences. ZERO in this section if differences not used. |
| M1 | For ranks. |
| A1 | FT from here if ranks wrong |
| B1 | |
| M1 | No ft from here if wrong. |
| A1 | i.e. a 1-tail test. No ft from here if wrong. |
| A1 | If only c's test statistic. |
| A1 | If only c's test statistic. |

[10]

## Part (b)
$H_0:$ Stock market prices can be modelled by Benford's Law.
$H_1:$ Stock market prices can not be modelled by Benford's Law.

| Prob | 0.301 | 0.176 | 0.125 | 0.097 | 0.079 | 0.067 | 0.058 | 0.051 | 0.046 |
|---|---|---|---|---|---|---|---|---|---|
| Exp f | 60.2 | 35.2 | 25.0 | 19.4 | 15.8 | 13.4 | 11.6 | 10.2 | 9.2 |
| Obs f | 55 | 34 | 27 | 16 | 15 | 17 | 12 | 15 | 9 |

| M1 | Probs $\times$ 200 for expected frequencies. All correct. |
| M1 | Calculation of $\chi^2$. |

$\chi^2 = 0.44917 + 0.04091 + 0.16 + 0.59588 + 0.04051 + 0.96716 + 0.01379 + 2.25882 + 0.00435 = 4.5305(9)$

Refer to $\chi_8^2$.

| A1 | c.a.o. |
| M1 | Allow correct df (= cells − 1) from wrongly grouped table and fit. Otherwise, no ft if wrong. |

Upper 5% point is 13.36.
Not significant.
Suggests Benford's Law provides a reasonable model in the context of share prices.

| A1 | $P(\chi^2 > 4.53059) = 0.80636.$ No ft from here if wrong. |
| A1 | If only c's test statistic. |
| A1 | If only c's test statistic. |

[7]

---
\begin{enumerate}[label=(\alph*)]
\item In order to prevent and/or control the spread of infectious diseases, the Government has various vaccination programmes. One such programme requires people to receive a booster injection at the age of 18. It is felt that the proportion of people receiving this booster could be increased and a publicity campaign is undertaken for this purpose. In order to assess the effectiveness of this campaign, health authorities across the country are asked to report the percentage of 18-year-olds receiving the booster before and after the campaign. The results for a randomly chosen sample of 9 authorities are as follows.

\begin{tabular}{|l|c|c|c|c|c|c|c|c|c|}
\hline
Authority & A & B & C & D & E & F & G & H & I \\
\hline
Before & 76 & 98 & 88 & 81 & 86 & 84 & 83 & 93 & 80 \\
\hline
After & 82 & 97 & 93 & 77 & 83 & 95 & 91 & 95 & 89 \\
\hline
\end{tabular}

This sample is to be tested to see whether the campaign appears to have been successful in raising the percentage receiving the booster.

\begin{enumerate}[label=(\roman*)]
\item Explain why the use of paired data is appropriate in this context. [1]

\item Carry out an appropriate Wilcoxon signed rank test using these data, at the 5\% significance level. [10]
\end{enumerate}

\item Benford's Law predicts the following probability distribution for the first significant digit in some large data sets.

\begin{tabular}{|l|c|c|c|c|c|c|c|c|c|}
\hline
Digit & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 \\
\hline
Probability & 0.301 & 0.176 & 0.125 & 0.097 & 0.079 & 0.067 & 0.058 & 0.051 & 0.046 \\
\hline
\end{tabular}

On one particular day, the first significant digits of the stock market prices of the shares of a random sample of 200 companies gave the following results.

\begin{tabular}{|l|c|c|c|c|c|c|c|c|c|}
\hline
Digit & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 \\
\hline
Frequency & 55 & 34 & 27 & 16 & 15 & 17 & 12 & 15 & 9 \\
\hline
\end{tabular}

Test at the 10\% level of significance whether Benford's Law provides a reasonable model in the context of share prices. [7]
\end{enumerate}

\hfill \mbox{\textit{OCR MEI S3 2010 Q3 [18]}}