OCR S2 (Statistics 2) 2009 January

Question 1
View details
1 A newspaper article consists of 800 words. For each word, the probability that it is misprinted is 0.005 , independently of all other words. Use a suitable approximation to find the probability that the total number of misprinted words in the article is no more than 6 . Give a reason to justify your approximation.
Question 2
View details
2 The continuous random variable \(Y\) has the distribution \(\mathrm { N } \left( 23.0,5.0 ^ { 2 } \right)\). The mean of \(n\) observations of \(Y\) is denoted by \(\bar { Y }\). It is given that \(\mathrm { P } ( \bar { Y } > 23.625 ) = 0.0228\). Find the value of \(n\).
Question 3
View details
3 The number of incidents of radio interference per hour experienced by a certain listener is modelled by a random variable with distribution \(\operatorname { Po } ( 0.42 )\).
  1. Find the probability that the number of incidents of interference in one randomly chosen hour is
    (a) 0 ,
    (b) exactly 1 .
  2. Find the probability that the number of incidents in a randomly chosen 5-hour period is greater than 3.
  3. One hundred hours of listening are monitored and the numbers of 1 -hour periods in which 0,1 , \(2 , \ldots\) incidents of interference are experienced are noted. A bar chart is drawn to represent the results. Without any further calculations, sketch the shape that you would expect for the bar chart. (There is no need to use an exact numerical scale on the frequency axis.)
Question 4
View details
4 A television company believes that the proportion of adults who watched a certain programme is 0.14 . Out of a random sample of 22 adults, it is found that 2 watched the programme.
  1. Carry out a significance test, at the \(10 \%\) level, to determine, on the basis of this sample, whether the television company is overestimating the proportion of adults who watched the programme.
  2. The sample was selected randomly. State what properties of this method of sampling are needed to justify the use of the distribution used in your test.
Question 5
View details
5 The continuous random variables \(S\) and \(T\) have probability density functions as follows. $$\begin{array} { l l } S : & \mathrm { f } ( x ) = \begin{cases} \frac { 1 } { 4 } & - 2 \leqslant x \leqslant 2
0 & \text { otherwise } \end{cases}
T : & \mathrm { g } ( x ) = \begin{cases} \frac { 5 } { 64 } x ^ { 4 } & - 2 \leqslant x \leqslant 2
0 & \text { otherwise } \end{cases} \end{array}$$
  1. Sketch, on the same axes, the graphs of f and g .
  2. Describe in everyday terms the difference between the distributions of the random variables \(S\) and \(T\). (Answers that comment only on the shapes of the graphs will receive no credit.)
  3. Calculate the variance of \(T\).
Question 6
View details
6 The weight of a plastic box manufactured by a company is \(W\) grams, where \(W \sim \mathrm {~N} ( \mu , 20.25 )\). A significance test of the null hypothesis \(\mathrm { H } _ { 0 } : \mu = 50.0\), against the alternative hypothesis \(\mathrm { H } _ { 1 } : \mu \neq 50.0\), is carried out at the \(5 \%\) significance level, based on a sample of size \(n\).
  1. Given that \(n = 81\),
    (a) find the critical region for the test, in terms of the sample mean \(\bar { W }\),
    (b) find the probability that the test results in a Type II error when \(\mu = 50.2\).
  2. State how the probability of this Type II error would change if \(n\) were greater than 81 .
Question 7
View details
7 A motorist records the time taken, \(T\) minutes, to drive a particular stretch of road on each of 64 occasions. Her results are summarised by $$\Sigma t = 876.8 , \quad \Sigma t ^ { 2 } = 12657.28$$
  1. Test, at the \(5 \%\) significance level, whether the mean time for the motorist to drive the stretch of road is greater than 13.1 minutes.
  2. Explain whether it is necessary to use the Central Limit Theorem in your test.
Question 8
View details
8 A sales office employs 21 representatives. Each day, for each representative, the probability that he or she achieves a sale is 0.7 , independently of other representatives. The total number of representatives who achieve a sale on any one day is denoted by \(K\).
  1. Using a suitable approximation (which should be justified), find \(\mathrm { P } ( K \geqslant 16 )\).
  2. Using a suitable approximation (which should be justified), find the probability that the mean of 36 observations of \(K\) is less than or equal to 14.0 . 4