OCR S3 (Statistics 3) 2011 January

Question 1
View details
1 A random variable has a normal distribution with unknown mean \(\mu\) and known standard deviation 0.19 . In order to estimate \(\mu\) a random sample of five observations of the random variable was taken. The values were as follows. $$\begin{array} { l l l l l } 5.44 & 4.93 & 5.12 & 5.36 & 5.40 \end{array}$$ Using these five values, calculate,
  1. an estimate of \(\mu\),
  2. a 95\% confidence interval for \(\mu\).
Question 2
View details
2 In a Year 8 internal examination in a large school the Geography marks, \(G\), and Mathematics marks, \(M\), had means and standard deviations as follows.
MeanStandard deviation
\(G\)36.426.87
\(M\)42.6510.25
Assuming that \(G\) and \(M\) have independent normal distributions, find the probability that a randomly chosen Geography candidate scores at least 10 marks more than a randomly chosen Mathematics candidate. Do not use a continuity correction.
Question 3
View details
3 The continuous random variable \(T\) has probability density function given by $$\mathrm { f } ( t ) = \begin{cases} 0 & t < 0 ,
\frac { a } { \mathrm { e } } & 0 \leqslant t < 2 ,
a \mathrm { e } ^ { - \frac { 1 } { 2 } t } & t \geqslant 2 , \end{cases}$$ where \(a\) is a positive constant.
  1. Show that \(a = \frac { 1 } { 4 } \mathrm { e }\).
  2. Find the upper quartile of \(T\).
Question 4
View details
4 A study in 1981 investigated the effect of water fluoridation on children's dental health. In a town with fluoridation, 61 out of a random sample of 107 children showed signs of increased tooth decay after six months. In a town without fluoridation the corresponding number was 106 out of a random sample of 143 children. The population proportions of children with increased tooth decay are denoted by \(p _ { 1 }\) and \(p _ { 2 }\) for the towns with fluoridation and without fluoridation respectively. A test is carried out of the null hypothesis \(p _ { 1 } = p _ { 2 }\) against the alternative hypothesis \(p _ { 1 } < p _ { 2 }\). Find the smallest significance level at which the null hypothesis is rejected.
Question 5
View details
5 An experiment with hybrid corn resulted in yellow kernels and purple kernels. Of a random sample of 90 kernels, 18 were yellow and 72 were purple.
  1. Calculate an approximate \(90 \%\) confidence interval for the proportion of yellow kernels produced in all such experiments.
  2. Deduce an approximate \(90 \%\) confidence interval for the proportion of purple kernels produced in all such experiments.
  3. Explain what is meant by a \(90 \%\) confidence interval for a population proportion.
  4. Mendel's theory of inheritance predicts that \(25 \%\) of all such kernels will be yellow. State, giving a reason, whether or not your calculations support the theory.
Question 6
View details
6 The continuous random variable \(X\) has (cumulative) distribution function given by $$\mathrm { F } ( x ) = \begin{cases} 0 & x < \frac { 1 } { 2 }
\frac { 2 x - 1 } { x + 1 } & \frac { 1 } { 2 } \leqslant x \leqslant 2
1 & x > 2 . \end{cases}$$
  1. Given that \(Y = \frac { 1 } { X }\), find the (cumulative) distribution function of \(Y\), and deduce that \(Y\) and \(X\) have identical distributions.
  2. Find \(\mathrm { E } ( X + 1 )\) and deduce the value of \(\mathrm { E } \left( \frac { 1 } { X } \right)\).
Question 7
View details
7
  1. When should Yates' correction be applied when carrying out a \(\chi ^ { 2 }\) test? Two vaccines against typhoid fever, \(A\) and \(B\), were tested on a total of 700 people in Nepal during a particular year. The vaccines were allocated randomly and whether or not typhoid had developed was noted during the following year. The results are shown in the table.
    \multirow{2}{*}{}Vaccines
    \cline { 2 - 3 }\(A\)\(B\)
    Developed typhoid194
    Did not develop typhoid310367
  2. Carry out a suitable \(\chi ^ { 2 }\) test at the \(1 \%\) significance level to determine whether the outcome depends on the vaccine used. Comment on the result.
Question 8
View details
8
  1. State circumstances under which it would be necessary to calculate a pooled estimate of variance when carrying out a two-sample hypothesis test.
  2. An investigation into whether passive smoking affects lung capacity considered a random sample of 20 children whose parents did not smoke and a random sample of 22 children whose parents did smoke. None of the children themselves smoked. The lung capacity, in litres, of each child was measured and the results are summarised as follows. For the children whose parents did not smoke: \(n _ { 1 } = 20 , \Sigma x _ { 1 } = 42.4\) and \(\Sigma x _ { 1 } ^ { 2 } = 90.43\).
    For the children whose parents did smoke: \(\quad n _ { 2 } = 22 , \Sigma x _ { 2 } = 42.5\) and \(\Sigma x _ { 2 } ^ { 2 } = 82.93\).
    The means of the two populations are denoted by \(\mu _ { 1 }\) and \(\mu _ { 2 }\) respectively.
    (a) State conditions for which a \(t\)-test would be appropriate for testing whether \(\mu _ { 1 }\) exceeds \(\mu _ { 2 }\).
    (b) Assuming the conditions are valid, carry out the test at the \(1 \%\) significance level and comment on the result.
    (c) Calculate a 99\% confidence interval for \(\mu _ { 1 } - \mu _ { 2 }\).