CAIE S2 (Statistics 2) 2011 June

Question 1
View details
1 On average, 2 people in every 10000 in the UK have a particular gene. A random sample of 6000 people in the UK is chosen. The random variable \(X\) denotes the number of people in the sample who have the gene. Use an approximating distribution to calculate the probability that there will be more than 2 people in the sample who have the gene.
Question 2
View details
2
  1. The time taken by a worker to complete a task was recorded for a random sample of 50 workers. The sample mean was 41.2 minutes and an unbiased estimate of the population variance was 32.6 minutes \({ } ^ { 2 }\). Find a \(95 \%\) confidence interval for the mean time taken to complete the task.
  2. The probability that an \(\alpha \%\) confidence interval includes only values that are lower than the population mean is \(\frac { 1 } { 16 }\). Find the value of \(\alpha\).
Question 3
View details
3 Past experience has shown that the heights of a certain variety of rose bush have been normally distributed with mean 85.0 cm . A new fertiliser is used and it is hoped that this will increase the heights. In order to test whether this is the case, a botanist records the heights, \(x \mathrm {~cm}\), of a large random sample of \(n\) rose bushes and calculates that \(\bar { x } = 85.7\) and \(s = 4.8\), where \(\bar { x }\) is the sample mean and \(s ^ { 2 }\) is an unbiased estimate of the population variance. The botanist then carries out an appropriate hypothesis test.
  1. The test statistic, \(z\), has a value of 1.786 correct to 3 decimal places. Calculate the value of \(n\).
  2. Using this value of the test statistic, carry out the test at the \(5 \%\) significance level.
Question 4
View details
4

  1. \includegraphics[max width=\textwidth, alt={}, center]{7c9a87ac-69c6-4850-82aa-8235bba581e8-2_611_712_1466_358}
    \includegraphics[max width=\textwidth, alt={}, center]{7c9a87ac-69c6-4850-82aa-8235bba581e8-2_618_716_1464_1155} The diagrams show the graphs of two functions, \(g\) and \(h\). For each of the functions \(g\) and \(h\), give a reason why it cannot be a probability density function.
  2. The distance, in kilometres, travelled in a given time by a cyclist is represented by the continuous random variable \(X\) with probability density function given by $$f ( x ) = \begin{cases} \frac { 30 } { x ^ { 2 } } & 10 \leqslant x \leqslant 15
    0 & \text { otherwise } \end{cases}$$
    1. Show that \(\mathrm { E } ( X ) = 30 \ln 1.5\).
    2. Find the median of \(X\). Find also the probability that \(X\) lies between the median and the mean.
Question 5
View details
5 Cans of drink are packed in boxes, each containing 4 cans. The weights of these cans are normally distributed with mean 510 g and standard deviation 14 g . The weights of the boxes, when empty, are independently normally distributed with mean 200 g and standard deviation 8 g .
  1. Find the probability that the total weight of a full box of cans is between 2200 g and 2300 g .
  2. Two cans of drink are chosen at random. Find the probability that they differ in weight by more than 20 g .
Question 6
View details
6 The number of injuries per month at a certain factory has a Poisson distribution. In the past the mean was 2.1 injuries per month. New safety procedures are put in place and the management wishes to use the next 3 months to test, at the \(2 \%\) significance level, whether there are now fewer injuries than before, on average.
  1. Find the critical region for the test.
  2. Find the probability of a Type I error.
  3. During the next 3 months there are a total of 3 injuries. Carry out the test.
  4. Assuming that the mean remains 2.1 , calculate an estimate of the probability that there will be fewer than 20 injuries during the next 12 months.