AQA S3 (Statistics 3) 2014 June

Question 1 2 marks
View details
1 A hotel's management is concerned about the quality of the free pens that it provides in its meeting rooms. The hotel's assistant manager tests a random sample of 200 such pens and finds that 23 of them fail to write immediately.
  1. Calculate an approximate \(\mathbf { 9 6 \% }\) confidence interval for the proportion of pens that fail to write immediately.
  2. The supplier of the pens to the hotel claims that at most 2 in 50 pens fail to write immediately. Comment, with numerical justification, on the supplier's claim.
    [0pt] [2 marks] QUESTION
    PART Answer space for question 1
Question 2 6 marks
View details
2 Each household within a district council's area has two types of wheelie-bin: a black one for general refuse and a green one for garden refuse. Each type of bin is emptied by the council fortnightly. The weight, in kilograms, of refuse emptied from a black bin can be modelled by the random variable \(B \sim \mathrm {~N} \left( \mu _ { B } , 0.5625 \right)\). The weight, in kilograms, of refuse emptied from a green bin can be modelled by the random variable \(G \sim \mathrm {~N} \left( \mu _ { G } , 0.9025 \right)\). The mean weight of refuse emptied from a random sample of 20 black bins was 21.35 kg . The mean weight of refuse emptied from an independent random sample of 15 green bins was 21.90 kg . Test, at the \(5 \%\) level of significance, the hypothesis that \(\mu _ { B } = \mu _ { G }\).
[0pt] [6 marks]
Question 3 12 marks
View details
3 An investigation was carried out into the type of vehicle being driven when its driver was caught speeding. The investigation was restricted to drivers who were caught speeding when driving vehicles with at least 4 wheels. An analysis of the results showed that \(65 \%\) were driving cars ( C ), \(20 \%\) were driving vans (V) and 15\% were driving lorries (L). Of those driving cars, \(30 \%\) were caught by fixed speed cameras (F), 55\% were caught by mobile speed cameras (M) and 15\% were caught by average speed cameras (A). Of those driving vans, \(35 \%\) were caught by fixed speed cameras (F), \(45 \%\) were caught by mobile speed cameras (M) and 20\% were caught by average speed cameras (A). Of those driving lorries, \(10 \%\) were caught by fixed speed cameras \(( \mathrm { F } )\), \(65 \%\) were caught by mobile speed cameras (M) and \(25 \%\) were caught by average speed cameras (A).
  1. Represent this information by a tree diagram on which are shown labels and percentages or probabilities.
  2. Hence, or otherwise, calculate the probability that a driver, selected at random from those caught speeding:
    1. was driving either a car or a lorry and was caught by a mobile speed camera;
    2. was driving a lorry, given that the driver was caught by an average speed camera;
    3. was not caught by a fixed speed camera, given that the driver was not driving a car.
      [0pt] [8 marks]
  3. Three drivers were selected at random from those caught speeding by fixed speed cameras. Calculate the probability that they were driving three different types of vehicle.
    [0pt] [4 marks]
Question 4 8 marks
View details
4 A sample of 50 male Eastern Grey kangaroos had a mean weight of 42.6 kg and a standard deviation of 6.2 kg . A sample of 50 male Western Grey kangaroos had a mean weight of 39.7 kg and a standard deviation of 5.3 kg .
  1. Construct a 98\% confidence interval for the difference between the mean weight of male Eastern Grey kangaroos and that of male Western Grey kangaroos.
    [0pt] [5 marks]
    1. What assumption about the selection of each of the two samples was it necessary to make in order that the confidence interval constructed in part (a) was valid?
      [0pt] [1 mark]
    2. Why was it not necessary to assume anything about the distributions of the weights of male kangaroos in order that the confidence interval constructed in part (a) was valid?
      [0pt] [2 marks]
Question 5 4 marks
View details
5 The numbers of daily morning operations, \(X\), and daily afternoon operations, \(Y\), in an operating theatre of a small private hospital can be modelled by the following bivariate probability distribution.
\multirow{2}{*}{}Number of morning operations ( \(\boldsymbol { X }\) )
23456\(\mathbf { P } ( \boldsymbol { Y } = \boldsymbol { y } )\)
\multirow{3}{*}{Number of afternoon operations ( \(\boldsymbol { Y }\) )}30.000.050.200.200.050.50
40.000.150.100.050.000.30
50.050.050.100.000.000.20
\(\mathrm { P } ( \boldsymbol { X } = \boldsymbol { x } )\)0.050.250.400.250.051.00
    1. State why \(\mathrm { E } ( X ) = 4\) and show that \(\operatorname { Var } ( X ) = 0.9\).
    2. Given that $$\mathrm { E } ( Y ) = 3.7 , \operatorname { Var } ( Y ) = 0.61 \text { and } \mathrm { E } ( X Y ) = 14.4$$ calculate values for \(\operatorname { Cov } ( X , Y )\) and \(\rho _ { X Y }\).
  1. Calculate values for the mean and the variance of:
    1. \(T = X + Y\);
    2. \(\quad D = X - Y\).
      [0pt] [4 marks]
Question 6 5 marks
View details
6 Population \(A\) has a normal distribution with unknown mean \(\mu _ { A }\) and a variance of 18.8.
Population \(B\) has a normal distribution with unknown mean \(\mu _ { B }\) but with the same variance as Population \(A\). The random variables \(\bar { X } _ { A }\) and \(\bar { X } _ { B }\) denote the means of independent samples, each of size \(n\), from populations \(A\) and \(B\) respectively.
  1. Find an expression, in terms of \(n\), for \(\operatorname { Var } \left( \bar { X } _ { A } - \bar { X } _ { B } \right)\).
  2. Given that the width of a \(99 \%\) confidence interval for \(\mu _ { A } - \mu _ { B }\) is to be at most 5 , calculate the minimum value for \(n\).
    [0pt] [5 marks]
Question 7 4 marks
View details
7
  1. The random variable \(X\) has a Poisson distribution with parameter \(\lambda\).
    1. Prove, from first principles, that \(\mathrm { E } ( X ) = \lambda\).
    2. Given that \(\mathrm { E } \left( X ^ { 2 } - X \right) = \lambda ^ { 2 }\), deduce that \(\operatorname { Var } ( X ) = \lambda\).
  2. The number of faults in a 100-metre ball of nylon string may be modelled by a Poisson distribution with parameter \(\lambda\).
    1. An analysis of one ball of string, selected at random, showed 15 faults. Using an exact test, investigate the claim that \(\lambda > 10\). Use the \(5 \%\) level of significance.
    2. A subsequent analysis of a random sample of 20 balls of string showed a total of 241 faults.
      (A) Using an approximate test, re-investigate the claim that \(\lambda > 10\). Use the \(5 \%\) level of significance.
      (B) Determine the critical value of the total number of faults for the test in part (b)(ii)(A).
      (C) Given that, in fact, \(\lambda = 12\), estimate the probability of a Type II error for a test of the claim that \(\lambda > 10\) based upon a random sample of 20 balls of string and using the \(5 \%\) level of significance.
      [0pt] [4 marks] \includegraphics[max width=\textwidth, alt={}, center]{d5852425-9340-4aae-82da-e3bf6772a0de-22_2490_1728_219_141}
      \includegraphics[max width=\textwidth, alt={}, center]{d5852425-9340-4aae-82da-e3bf6772a0de-23_2490_1719_217_150}
      \includegraphics[max width=\textwidth, alt={}, center]{d5852425-9340-4aae-82da-e3bf6772a0de-24_2489_1728_221_141}