OCR MEI S3 2011 January — Question 2 18 marks

Exam BoardOCR MEI
ModuleS3 (Statistics 3)
Year2011
SessionJanuary
Marks18
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicWilcoxon tests
TypeWilcoxon signed-rank test (single sample)
DifficultyStandard +0.3 This is a straightforward application of stratified sampling calculations and a standard Wilcoxon signed-rank test. Part (a) requires simple proportional allocation (multiply 2000 by each stratum's proportion), and part (b) follows the routine procedure of finding differences, ranking absolute values, summing ranks, and comparing to critical values. No novel insight or complex reasoning required—slightly easier than average due to clear structure and standard methodology.
Spec2.01c Sampling techniques: simple random, opportunity, etc5.06a Chi-squared: contingency tables5.07b Sign test: and Wilcoxon signed-rank

2
    1. What is stratified sampling? Why would it be used?
    2. A local authority official wishes to conduct a survey of households in the borough. He decides to select a stratified sample of 2000 households using Council Tax property bands as the strata. At the time of the survey there are 79368 households in the borough. The table shows the numbers of households in the different tax bands.
      Tax bandA - BC - DE - FG - H
      Number of households322983321197394120
      Calculate the number of households that the official should choose from each stratum in order to obtain his sample of 2000 households so that each stratum is represented proportionally.
    1. What assumption needs to be made when using a Wilcoxon single sample test?
    2. As part of an investigation into trends in local authority spending, one of the categories of expenditure considered was 'Highways and the Environment'. For a random sample of 10 local authorities, the percentages of their total expenditure spent on Highways and the Environment in 1999 and then in 2009 are shown in the table.
      Local authorityABCDEFGHIJ
      19999.608.408.679.329.899.357.918.089.618.55
      20098.948.427.878.4110.1710.118.319.769.549.67
      Use a Wilcoxon test, with a significance level of \(10 \%\), to determine whether there appears to be any change to the average percentage of total expenditure spent on Highways and the Environment between 1999 and 2009.

Question 2:
Part (a)(i)
AnswerMarks Guidance
AnswerMark Guidance
There are identifiable subgroups or strata that might exhibit different characteristicsE1
Each stratum is randomly sampledE1
Use it to obtain a representative sampleE1
Can get information on the individual strataE1 Total: 4
Part (a)(ii)
AnswerMarks Guidance
AnswerMark Guidance
For each stratum \(\ldots \times \frac{2000}{79368}\) giving 813.9, 836.9, 245.4, 103.8 so 814, 837, 245, 104M1, A1 All correct. Total: 2
Part (b)(i)
AnswerMarks Guidance
AnswerMark Guidance
The population (or underlying distribution) is assumed to be symmetrical about its medianE2 E2, 1, 0. Award E1 for 2 out of 3 of the key features. Total: 2
Part (b)(ii)
AnswerMarks Guidance
AnswerMark Guidance
\(H_0: m = 0\), \(H_1: m \neq 0\) where \(m\) is the population median difference for the percentagesB1, B1 Both hypotheses in words, only must include "population". For adequate verbal definition
Differences table with ranks as shownM1 For differences. ZERO (out of 8) in this section if paired differences not used
Ranks calculatedM1, A1 For ranks; ft from here if ranks wrong
\(W_- = 2 + 5 + 7 + 8 = 22\)B1 (or \(W_+ = 1+3+4+6+9+10 = 33\))
Refer to tables of Wilcoxon paired/single sample statistic for \(n = 10\)M1 No ft from here if wrong
Lower (or upper if 33 used) 5% tail is 10 (or 45 if 33 used)A1 i.e. a 2-tail test. No ft from here if wrong
Result is not significantA1 ft only c's test statistic
No evidence to suggest a change in spending on averageA1 ft only c's test statistic. Conclusion in context to include "on average" o.e. Total: 10
# Question 2:

## Part (a)(i)
| Answer | Mark | Guidance |
|--------|------|----------|
| There are identifiable subgroups or strata that might exhibit different characteristics | E1 | |
| Each stratum is randomly sampled | E1 | |
| Use it to obtain a representative sample | E1 | |
| Can get information on the individual strata | E1 | Total: 4 |

## Part (a)(ii)
| Answer | Mark | Guidance |
|--------|------|----------|
| For each stratum $\ldots \times \frac{2000}{79368}$ giving 813.9, 836.9, 245.4, 103.8 so 814, 837, 245, 104 | M1, A1 | All correct. Total: 2 |

## Part (b)(i)
| Answer | Mark | Guidance |
|--------|------|----------|
| The population (or underlying distribution) is assumed to be symmetrical about its median | E2 | E2, 1, 0. Award E1 for 2 out of 3 of the key features. Total: 2 |

## Part (b)(ii)
| Answer | Mark | Guidance |
|--------|------|----------|
| $H_0: m = 0$, $H_1: m \neq 0$ where $m$ is the population median difference for the percentages | B1, B1 | Both hypotheses in words, only must include "population". For adequate verbal definition |
| Differences table with ranks as shown | M1 | For differences. ZERO (out of 8) in this section if paired differences not used |
| Ranks calculated | M1, A1 | For ranks; ft from here if ranks wrong |
| $W_- = 2 + 5 + 7 + 8 = 22$ | B1 | (or $W_+ = 1+3+4+6+9+10 = 33$) |
| Refer to tables of Wilcoxon paired/single sample statistic for $n = 10$ | M1 | No ft from here if wrong |
| Lower (or upper if 33 used) 5% tail is 10 (or 45 if 33 used) | A1 | i.e. a 2-tail test. No ft from here if wrong |
| Result is not significant | A1 | ft only c's test statistic |
| No evidence to suggest a change in spending on average | A1 | ft only c's test statistic. Conclusion in context to include "on average" o.e. Total: 10 |

---
2
\begin{enumerate}[label=(\alph*)]
\item \begin{enumerate}[label=(\roman*)]
\item What is stratified sampling? Why would it be used?
\item A local authority official wishes to conduct a survey of households in the borough. He decides to select a stratified sample of 2000 households using Council Tax property bands as the strata. At the time of the survey there are 79368 households in the borough. The table shows the numbers of households in the different tax bands.

\begin{center}
\begin{tabular}{ | l | c | c | c | c | }
\hline
Tax band & A - B & C - D & E - F & G - H \\
\hline
Number of households & 32298 & 33211 & 9739 & 4120 \\
\hline
\end{tabular}
\end{center}

Calculate the number of households that the official should choose from each stratum in order to obtain his sample of 2000 households so that each stratum is represented proportionally.
\end{enumerate}\item \begin{enumerate}[label=(\roman*)]
\item What assumption needs to be made when using a Wilcoxon single sample test?
\item As part of an investigation into trends in local authority spending, one of the categories of expenditure considered was 'Highways and the Environment'. For a random sample of 10 local authorities, the percentages of their total expenditure spent on Highways and the Environment in 1999 and then in 2009 are shown in the table.

\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | c | c | c | c | }
\hline
Local authority & A & B & C & D & E & F & G & H & I & J \\
\hline
1999 & 9.60 & 8.40 & 8.67 & 9.32 & 9.89 & 9.35 & 7.91 & 8.08 & 9.61 & 8.55 \\
\hline
2009 & 8.94 & 8.42 & 7.87 & 8.41 & 10.17 & 10.11 & 8.31 & 9.76 & 9.54 & 9.67 \\
\hline
\end{tabular}
\end{center}

Use a Wilcoxon test, with a significance level of $10 \%$, to determine whether there appears to be any change to the average percentage of total expenditure spent on Highways and the Environment between 1999 and 2009.
\end{enumerate}\end{enumerate}

\hfill \mbox{\textit{OCR MEI S3 2011 Q2 [18]}}