OCR S3 (Statistics 3) 2012 January

Question 1 6 marks
In a test of association of two factors, \(A\) and \(B\), a \(2 \times 2\) contingency table yielded \(5.63\) for the value of \(\chi^2\) with Yates' correction.
  1. State the null hypothesis and alternative hypothesis for the test. [1]
  2. State how Yates' correction is applied, and whether it increases or decreases the value of \(\chi^2\). [2]
  3. Carry out the test at the \(2\frac{1}{2}\%\) significance level. [3]
Question 2 7 marks
An investigation in 2007 into the incidence of tuberculosis (TB) in badgers in a certain area found that 42 out of a random sample of 190 badgers tested positive for TB. In 2010, 48 out of a random sample of 150 badgers tested positive for TB.
  1. Assuming that the population proportions of badgers with TB are the same in 2007 and 2010, obtain the best estimate of this proportion. [1]
  2. Carry out a test at the \(2\frac{1}{2}\%\) significance level of whether the population proportion of badgers with TB increased from 2007 to 2010. [6]
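The arithmetic for Question 2 can be checked numerically. The following is a minimal sketch, not taken from the mark scheme, assuming the usual pooled two-proportion z-test with a normal approximation; the variable names are illustrative only.

```python
from math import sqrt
from statistics import NormalDist

# Sample data from the question.
x_2007, n_2007 = 42, 190
x_2010, n_2010 = 48, 150

# Pooled estimate of the common proportion under H0 (part 1).
p_pooled = (x_2007 + x_2010) / (n_2007 + n_2010)

# Standard error of the difference in sample proportions under H0.
se = sqrt(p_pooled * (1 - p_pooled) * (1 / n_2007 + 1 / n_2010))

# Test statistic for H1: the 2010 proportion exceeds the 2007 proportion.
z = (x_2010 / n_2010 - x_2007 / n_2007) / se

# Upper 2.5% critical value of the standard normal distribution.
z_crit = NormalDist().inv_cdf(0.975)
reject_h0 = z > z_crit

print(f"pooled p = {p_pooled:.4f}, z = {z:.3f}, critical = {z_crit:.3f}")
print("reject H0" if reject_h0 else "do not reject H0")
```

Under these assumptions the statistic falls just above the critical value, so the sample gives evidence at this level that the proportion increased.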
Question 3 8 marks
The continuous random variable \(U\) has a normal distribution with unknown mean \(\mu\) and known variance 1. A random sample of four observations of \(U\) gave the values \(3.9, 2.1, 4.6\) and \(1.4\).
  1. Calculate a \(90\%\) confidence interval for \(\mu\). [3]
  2. The probability that the sum of four random observations of \(U\) is less than 11 is denoted by \(p\). For each of the end points of the confidence interval in part (i) calculate the corresponding value of \(p\). [5]
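As a hedged numerical check of Question 3, the sketch below builds the interval from the sample mean with known variance 1, then evaluates \(p\) at each end point using the fact that the sum of four independent observations is N\((4\mu, 4)\). The helper name `prob_sum_below` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean

observations = [3.9, 2.1, 4.6, 1.4]
n = len(observations)
sigma = 1.0  # known standard deviation (variance 1)

# 90% confidence interval: x_bar +/- z * sigma / sqrt(n), with z the 95th percentile.
x_bar = mean(observations)
half_width = NormalDist().inv_cdf(0.95) * sigma / sqrt(n)
ci = (x_bar - half_width, x_bar + half_width)

def prob_sum_below(mu, threshold=11.0):
    """P(sum of n independent observations < threshold) when the mean is mu."""
    return NormalDist(n * mu, sigma * sqrt(n)).cdf(threshold)

# Value of p at the lower and upper end points of the interval.
p_values = [prob_sum_below(mu) for mu in ci]

print(f"CI = ({ci[0]:.3f}, {ci[1]:.3f})")
print(f"p at lower end = {p_values[0]:.4f}, p at upper end = {p_values[1]:.4f}")
```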
Question 4 10 marks
\(X\) is a continuous random variable with the distribution N\((48.5, 12.5^2)\). The values of \(X\) are transformed to standardised values of \(Y\), using the equation \(Y = aX + b\), where \(a\) and \(b\) are constants with \(a > 0\).
  1. Find values of \(a\) and \(b\) for which the mean and standard deviation of \(Y\) are 40 and 10 respectively. [4]
  2. State the distribution of \(Y\). [1]
Two randomly chosen standardised values are denoted by \(Y_1\) and \(Y_2\).
  3. Calculate the probability that \(Y_2\) is at least 10 greater than \(Y_1\). [5]
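A short sketch of the calculations in Question 4 follows. It assumes that \(Y_1\) and \(Y_2\) are independent, which the phrase "randomly chosen" suggests but the question does not state explicitly, so that \(Y_2 - Y_1\) has mean 0 and variance \(2 \times 10^2\).

```python
from math import sqrt
from statistics import NormalDist

mean_x, sd_x = 48.5, 12.5
target_mean, target_sd = 40.0, 10.0

# For Y = aX + b with a > 0: sd(Y) = a * sd(X) and E(Y) = a * E(X) + b.
a = target_sd / sd_x
b = target_mean - a * mean_x

# Assumption: Y1 and Y2 are independent, so Y2 - Y1 ~ N(0, 2 * target_sd**2).
difference = NormalDist(0.0, sqrt(2) * target_sd)
p_at_least_10 = 1 - difference.cdf(10.0)

print(f"a = {a:.2f}, b = {b:.2f}, P(Y2 - Y1 >= 10) = {p_at_least_10:.4f}")
```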
Question 5 10 marks
A statistician suggested that the weekly sales \(X\) thousand litres at a petrol station could be modelled by the following probability density function. $$\text{f}(x) = \begin{cases} \frac{1}{40}(2x + 3) & 0 \leqslant x < 5, \\ 0 & \text{otherwise.} \end{cases}$$
  1. Show that, using this model, P\((a < X < a + 1) = \frac{a + 2}{20}\) for \(0 \leqslant a < 4\). [3]
Sales in 100 randomly chosen weeks gave the following grouped frequency table.
| \(x\) | \(0 \leqslant x < 1\) | \(1 \leqslant x < 2\) | \(2 \leqslant x < 3\) | \(3 \leqslant x < 4\) | \(4 \leqslant x < 5\) |
|---|---|---|---|---|---|
| Frequency | 16 | 12 | 18 | 30 | 24 |
  2. Carry out a goodness of fit test at the \(10\%\) significance level of whether f\((x)\) fits the data. [7]
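The goodness of fit statistic in Question 5 can be sketched as below. The expected frequencies come from the result \(\mathrm{P}(a < X < a + 1) = \frac{a + 2}{20}\) stated in the question; the critical value 7.779 is the tabulated upper 10% point of \(\chi^2\) with 4 degrees of freedom, hard-coded here on the assumption that no parameters were estimated from the data.

```python
observed = [16, 12, 18, 30, 24]
total_weeks = sum(observed)

# Expected frequency for the class a <= x < a + 1 is total * (a + 2) / 20.
expected = [total_weeks * (a + 2) / 20 for a in range(5)]

# Pearson chi-squared statistic: sum of (O - E)^2 / E over the five classes.
chi_squared = sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Assumption: 5 classes, no estimated parameters, so 4 degrees of freedom.
critical_value = 7.779  # upper 10% point of chi-squared(4), from tables
reject_h0 = chi_squared > critical_value

print(f"expected = {expected}")
print(f"chi-squared = {chi_squared:.2f}, critical = {critical_value}")
print("reject H0" if reject_h0 else "do not reject H0")
```

On these figures the statistic is below the critical value, so the model is not rejected at the 10% level.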
Question 6 13 marks
The continuous random variable \(Y\) has probability density function given by $$\text{f}(y) = \begin{cases} -\frac{1}{4}y & -2 < y < 0, \\ \frac{1}{4}y & 0 \leqslant y \leqslant 2, \\ 0 & \text{otherwise.} \end{cases}$$ Find
  1. the interquartile range of \(Y\), [4]
  2. Var\((Y)\), [5]
  3. E\((|Y|)\). [4]
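The three answers to Question 6 can be checked by numerical integration. This is only a sketch: it uses a simple midpoint rule (the helper `integrate` is hypothetical) and compares the results with the closed forms obtained by hand, namely quartiles at \(\pm\sqrt{2}\), Var\((Y) = 2\) and E\((|Y|) = \frac{4}{3}\).

```python
from math import sqrt

def pdf(y):
    """Density from the question: |y| / 4 on [-2, 2], zero elsewhere."""
    return abs(y) / 4 if -2 <= y <= 2 else 0.0

def integrate(func, lower, upper, steps=20000):
    """Midpoint-rule approximation of the integral of func over [lower, upper]."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (i + 0.5) * width) for i in range(steps))

# Sanity check: the density integrates to 1.
total_probability = integrate(pdf, -2, 2)

# By symmetry the quartiles are -sqrt(2) and +sqrt(2), since F(q) = 1/2 + q^2/8 for q >= 0.
upper_quartile = sqrt(2)
cdf_at_upper_quartile = integrate(pdf, -2, upper_quartile)
interquartile_range = 2 * upper_quartile

# E(Y) = 0 by symmetry, so Var(Y) = E(Y^2).
variance = integrate(lambda y: y * y * pdf(y), -2, 2)

# E(|Y|) is the integral of |y| f(y).
mean_abs = integrate(lambda y: abs(y) * pdf(y), -2, 2)

print(f"IQR = {interquartile_range:.4f}, Var(Y) = {variance:.4f}, E|Y| = {mean_abs:.4f}")
```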
Question 7 18 marks
The manufacturer's specification for batteries used in a certain electronic game is that the mean lifetime should be 32 hours. The manufacturer tests a random sample of 10 batteries made in Factory A, and the lifetimes (\(x\) hours) are summarised by \(n = 10\), \(\sum x = 289.0\) and \(\sum x^2 = 8586.19\). It may be assumed that the population of lifetimes has a normal distribution.
  1. Carry out a one-tail test at the \(5\%\) significance level of whether the specification is being met. [7]
  2. Justify the use of a one-tail test in this context. [1]
Batteries made with the same specification are also made in Factory B. The lifetimes of these batteries are also normally distributed. A random sample of 12 batteries from this factory was tested. The lifetimes are summarised by \(n = 12\), \(\sum x = 363.0\) and \(\sum x^2 = 11290.95\).
  3. State what further assumption must be made in order to test whether there is any difference in the mean lifetimes of batteries made at the two factories. Use the data to comment on whether this assumption is reasonable. [3]
  4. Carry out the test at the \(10\%\) significance level. [7]
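The summary statistics in Question 7 lend themselves to a quick numerical check. The sketch below assumes a one-sample t-test against \(\mu = 32\) with \(\text{H}_1: \mu < 32\) for Factory A, and a pooled two-sample t-test (equal population variances) for the comparison of the two factories. The critical values 1.833 (\(t_9\), 5% one-tail) and 1.725 (\(t_{20}\), 5% in each tail) are hard-coded from tables, and the helper `summary` is hypothetical.

```python
from math import sqrt

def summary(n, sum_x, sum_x2):
    """Return (sample mean, unbiased variance estimate) from n, sum x and sum x^2."""
    mean = sum_x / n
    variance = (sum_x2 - sum_x ** 2 / n) / (n - 1)
    return mean, variance

n_a, n_b = 10, 12
mean_a, var_a = summary(n_a, 289.0, 8586.19)
mean_b, var_b = summary(n_b, 363.0, 11290.95)

# One-sample t statistic for Factory A against the specified mean of 32 hours.
t_a = (mean_a - 32) / sqrt(var_a / n_a)
reject_a = t_a < -1.833  # lower 5% point of t with 9 degrees of freedom

# Pooled variance estimate, assuming equal population variances.
pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2)

# Two-sample t statistic for a difference in means (two-tailed, 10% level).
t_ab = (mean_a - mean_b) / sqrt(pooled_var * (1 / n_a + 1 / n_b))
reject_ab = abs(t_ab) > 1.725  # 5% point in each tail, t with 20 degrees of freedom

print(f"Factory A: mean = {mean_a:.2f}, s^2 = {var_a:.2f}, t = {t_a:.3f}, reject H0: {reject_a}")
print(f"Factory B: mean = {mean_b:.2f}, s^2 = {var_b:.2f}")
print(f"Pooled s^2 = {pooled_var:.4f}, t = {t_ab:.3f}, reject H0: {reject_ab}")
```

The two sample variances come out close to each other, which is the kind of evidence the equal-variance comment in part 3 can draw on.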