OCR S3 (Statistics 3) 2012 January

Question 1
View details
1 In a test of association of two factors, \(A\) and \(B\), a \(2 \times 2\) contingency table yielded 5.63 for the value of \(\chi ^ { 2 }\) with Yates’ correction.
  1. State the null hypothesis and alternative hypothesis for the test.
  2. State how Yates' correction is applied, and whether it increases or decreases the value of \(\chi ^ { 2 }\).
  3. Carry out the test at the \(2 \frac { 1 } { 2 } \%\) significance level.
Question 2
View details
2 An investigation in 2007 into the incidence of tuberculosis (TB) in badgers in a certain area found that 42 out of a random sample of 190 badgers tested positive for TB.
In 2010, 48 out of a random sample of 150 badgers tested positive for TB.
  1. Assuming that the population proportions of badgers with TB are the same in 2007 and 2010, obtain the best estimate of this proportion.
  2. Carry out a test at the \(2 \frac { 1 } { 2 } \%\) significance level of whether the population proportion of badgers with TB increased from 2007 to 2010.
Question 5
View details
5 A statistician suggested that the weekly sales \(X\) thousand litres at a petrol station could be modelled by the following probability density function. $$f ( x ) = \begin{cases} \frac { 1 } { 40 } ( 2 x + 3 ) & 0 \leqslant x < 5
0 & \text { otherwise } \end{cases}$$
  1. Show that, using this model, \(\mathrm { P } ( a \leqslant X < a + 1 ) = \frac { a + 2 } { 20 }\) for \(0 \leqslant a \leqslant 4\). Sales in 100 randomly chosen weeks gave the following grouped frequency table.
    \(x\)\(0 \leqslant x < 1\)\(1 \leqslant x < 2\)\(2 \leqslant x < 3\)\(3 \leqslant x < 4\)\(4 \leqslant x < 5\)
    Frequency1612183024
  2. Carry out a goodness of fit test at the \(10 \%\) significance level of whether \(\mathrm { f } ( x )\) fits the data.
Question 6
View details
6 The continuous random variable \(Y\) has probability density function given by $$f ( y ) = \begin{cases} - \frac { 1 } { 4 } y & - 2 \leqslant y < 0
\frac { 1 } { 4 } y & 0 \leqslant y \leqslant 2
0 & \text { otherwise. } \end{cases}$$ Find
  1. the interquartile range of \(Y\),
  2. \(\operatorname { Var } ( Y )\),
  3. \(\mathrm { E } ( | Y | )\).
Question 7
View details
7 The manufacturer's specification for batteries used in a certain electronic game is that the mean lifetime should be 32 hours. The manufacturer tests a random sample of 10 batteries made in Factory \(A\), and the lifetimes ( \(x\) hours) are summarised by $$n = 10 , \sum x = 289.0 \text { and } \sum x ^ { 2 } = 8586.19 .$$ It may be assumed that the population of lifetimes has a normal distribution.
  1. Carry out a one-tail test at the \(5 \%\) significance level of whether the specification is being met.
  2. Justify the use of a one-tail test in this context. Batteries made with the same specification are also made in Factory \(B\). The lifetimes of these batteries are also normally distributed. A random sample of 12 batteries from this factory was tested. The lifetimes are summarised by $$n = 12 , \sum x = 363.0 \text { and } \sum x ^ { 2 } = 11290.95 \text {. }$$
  3. (a) State what further assumption must be made in order to test whether there is any difference in the mean lifetimes of batteries made at the two factories.
    Use the data to comment on whether this assumption is reasonable.
    (b) Carry out the test at the \(10 \%\) significance level.