OCR S2 (Statistics 2) 2013 June

Question 1
View details
1 It is required to select a random sample of 30 pupils from a school with 853 pupils. A student suggests the following method.
"Give each pupil sequentially a three-digit number from 001 to 853 . Use a calculator to generate random three-digit numbers from 0.000 to 0.999 inclusive, multiply the answer by 853 , add 1 and round off to the nearest whole number. Select the corresponding pupil, and repeat as necessary".
  1. Determine which pupil would be picked for each of the following calculator outputs: $$0.103 , \quad 0.104 , \quad 0.105 , \quad 0.106 , \quad 0.107$$
  2. Use your answers to part (i) to show that this method is biased, and suggest an improvement.
Question 2
View details
2 The number of neutrinos that pass through a certain region in one second is a random variable with the distribution \(\operatorname { Po } \left( 5 \times 10 ^ { 4 } \right)\). Use a suitable approximation to calculate the probability that the number of neutrinos passing through the region in 40 seconds is less than \(1.999 \times 10 ^ { 6 }\).
Question 3
View details
3 The mean of a sample of 80 independent observations of a continuous random variable \(Y\) is denoted by \(\bar { Y }\). It is given that \(\mathrm { P } ( \bar { Y } \leqslant 157.18 ) = 0.1\) and \(\mathrm { P } ( \bar { Y } \geqslant 164.76 ) = 0.7\).
  1. Calculate \(\mathrm { E } ( Y )\) and the standard deviation of \(Y\).
  2. State
    (a) where in your calculations you have used the Central Limit Theorem,
    (b) why it was necessary to use the Central Limit Theorem,
    (c) why it was possible to use the Central Limit Theorem.
Question 4
View details
4 The number of floods in a certain river plain is known to have a Poisson distribution. It is known that up until 10 years ago the mean number of floods per year was 0.32 . During the last 10 years there were 6 floods. Test at the \(1 \%\) significance level whether there is evidence of an increase in the mean number of floods per year.
Question 5
View details
5 Two random variables \(S\) and \(T\) have probability density functions given by $$\begin{aligned} & f _ { S } ( x ) = \begin{cases} \frac { 3 } { a ^ { 3 } } ( x - a ) ^ { 2 } & 0 \leqslant x \leqslant a
0 & \text { otherwise } \end{cases}
& f _ { T } ( x ) = \begin{cases} c & 0 \leqslant x \leqslant a
0 & \text { otherwise } \end{cases} \end{aligned}$$ where \(a\) and \(c\) are constants.
  1. On a single diagram sketch both probability density functions.
  2. Calculate the mean of \(S\), in terms of \(a\).
  3. Use your diagram to explain which of \(S\) or \(T\) has the bigger variance. (Answers obtained by calculation will score no marks.)
Question 6
View details
6 The random variable \(X\) denotes the yield, in kilograms per acre, of a certain crop. Under the standard treatment it is known that \(\mathrm { E } ( X ) = 38.4\). Under a new treatment, the yields of 50 randomly chosen regions can be summarised as $$n = 50 , \quad \sum x = 1834.0 , \quad \sum x ^ { 2 } = 70027.37 .$$ Test at the \(1 \%\) level whether there has been a change in the mean crop yield.
Question 7
View details
7 Past experience shows that \(35 \%\) of the senior pupils in a large school know the regulations about bringing cars to school. The head teacher addresses this subject in an assembly, and afterwards a random sample of 120 senior pupils is selected. In this sample it is found that 50 of these pupils know the regulations. Use a suitable approximation to test, at the \(10 \%\) significance level, whether there is evidence that the proportion of senior pupils who know the regulations has increased. Justify your approximation.
Question 8
View details
8 The random variable \(R\) has the distribution \(\mathrm { B } ( 14 , p )\). A test is carried out at the \(\alpha \%\) significance level of the null hypothesis \(\mathrm { H } _ { 0 } : p = 0.25\), against \(\mathrm { H } _ { 1 } : p > 0.25\).
  1. Given that \(\alpha\) is as close to 5 as possible, find the probability of a Type II error when the true value of \(p\) is 0.4 .
  2. State what happens to the probability of a Type II error as
    (a) \(p\) increases from 0.4,
    (b) \(\alpha\) increases, giving a reason.
Question 9
View details
9 The managers of a car breakdown recovery service are discussing whether the number of breakdowns per day can be modelled by a Poisson distribution. They agree that breakdowns occur randomly. Manager \(A\) says, "it must be assumed that breakdowns occur at a constant rate throughout the day".
  1. Give an improved version of Manager \(A\) 's statement, and explain why the improvement is necessary.
  2. Explain whether you think your improved statement is likely to hold in this context. Assume now that the number \(B\) of breakdowns per day can be modelled by the distribution \(\operatorname { Po } ( \lambda )\).
  3. Given that \(\lambda = 9.0\) and \(\mathrm { P } \left( B > B _ { 0 } \right) < 0.1\), use tables to find the smallest possible value of \(B _ { 0 }\), and state the corresponding value of \(\mathrm { P } \left( B > B _ { 0 } \right)\).
  4. Given that \(\mathrm { P } ( B = 2 ) = 0.0072\), show that \(\lambda\) satisfies an equation of the form \(\lambda = 0.12 \mathrm { e } ^ { k \lambda }\), for a value of \(k\) to be stated. Evaluate the expression \(0.12 \mathrm { e } ^ { k \lambda }\) for \(\lambda = 8.5\) and \(\lambda = 8.6\), giving your answers correct to 4 decimal places. What can be deduced about a possible value of \(\lambda\) ?