OCR MEI S3 (Statistics 3) 2011 January

Question 1
View details
1 Each month the amount of electricity, measured in kilowatt-hours ( kWh ), used by a particular household is Normally distributed with mean 406 and standard deviation 12.
  1. Find the probability that, in a randomly chosen month, less than 420 kWh is used. The charge for electricity used is 14.6 pence per kWh .
  2. Write down the distribution of the total charge for the amount of electricity used in any one month. Hence find the probability that, in a randomly chosen month, the total charge is more than \(\pounds 60\).
  3. The household receives a bill every three months. Assume that successive months may be regarded as independent of each other. Find the value of \(b\) such that the probability that a randomly chosen bill is less than \(\pounds b\) is 0.99 . In a different household, the amount of electricity used per month was Normally distributed with mean 432 kWh . This household buys a new washing machine that is claimed to be cheaper to run than the old one. Over the next six months the amounts of electricity used, in kWh , are as follows. $$\begin{array} { l l l l l l } 404 & 433 & 420 & 423 & 413 & 440 \end{array}$$
  4. Treating this as a random sample, carry out an appropriate test, with a \(5 \%\) significance level, to see if there is any evidence to suggest that the amount of electricity used per month by this household has decreased on average.
Question 2
View details
2
    1. What is stratified sampling? Why would it be used?
    2. A local authority official wishes to conduct a survey of households in the borough. He decides to select a stratified sample of 2000 households using Council Tax property bands as the strata. At the time of the survey there are 79368 households in the borough. The table shows the numbers of households in the different tax bands.
      Tax bandA - BC - DE - FG - H
      Number of households322983321197394120
      Calculate the number of households that the official should choose from each stratum in order to obtain his sample of 2000 households so that each stratum is represented proportionally.
    1. What assumption needs to be made when using a Wilcoxon single sample test?
    2. As part of an investigation into trends in local authority spending, one of the categories of expenditure considered was 'Highways and the Environment'. For a random sample of 10 local authorities, the percentages of their total expenditure spent on Highways and the Environment in 1999 and then in 2009 are shown in the table.
      Local authorityABCDEFGHIJ
      19999.608.408.679.329.899.357.918.089.618.55
      20098.948.427.878.4110.1710.118.319.769.549.67
      Use a Wilcoxon test, with a significance level of \(10 \%\), to determine whether there appears to be any change to the average percentage of total expenditure spent on Highways and the Environment between 1999 and 2009.
Question 3
View details
3 The masses, in kilograms, of a random sample of 100 chickens on sale in a large supermarket were recorded as follows.
Mass \(( m \mathrm {~kg} )\)\(m < 1.6\)\(1.6 \leqslant m < 1.8\)\(1.8 \leqslant m < 2.0\)\(2.0 \leqslant m < 2.2\)\(2.2 \leqslant m < 2.4\)\(2.4 \leqslant m < 2.6\)\(2.6 \leqslant m\)
Frequency2830421152
  1. Assuming that the first and last classes are the same width as the other classes, calculate an estimate of the sample mean and show that the corresponding estimate of the sample standard deviation is 0.2227 kg . A Normal distribution using the mean and standard deviation found in part (i) is to be fitted to these data. The expected frequencies for the classes are as follows.
    Mass \(( m \mathrm {~kg} )\)\(m < 1.6\)\(1.6 \leqslant m < 1.8\)\(1.8 \leqslant m < 2.0\)\(2.0 \leqslant m < 2.2\)\(2.2 \leqslant m < 2.4\)\(2.4 \leqslant m < 2.6\)\(2.6 \leqslant m\)
    Expected
    frequency
    2.1710.92\(f\)33.8519.225.130.68
  2. Use the Normal distribution to find \(f\).
  3. Carry out a goodness of fit test of this Normal model using a significance level of 5\%.
  4. Discuss the outcome of the test with reference to the contributions to the test statistic and to the possibility of other significance levels.
Question 4
View details
4 A timber supplier cuts wooden fence posts from felled trees. The posts are of length \(( k + X ) \mathrm { cm }\) where \(k\) is a constant and \(X\) is a random variable which has probability density function $$f ( x ) = \begin{cases} 1 + x & - 1 \leqslant x < 0
1 - x & 0 \leqslant x \leqslant 1
0 & \text { elsewhere } \end{cases}$$
  1. Sketch \(\mathrm { f } ( x )\).
  2. Write down the value of \(\mathrm { E } ( X )\) and find \(\operatorname { Var } ( X )\).
  3. Write down, in terms of \(k\), the approximate distribution of \(\bar { L }\), the mean length of a random sample of 50 fence posts. Justify your choice of distribution.
  4. In a particular sample of 50 posts, the mean length is 90.06 cm . Find a \(95 \%\) confidence interval for the true mean length of the fence posts.
  5. Explain whether it is reasonable to suppose that \(k = 90\).