OCR MEI S3 (Statistics 3) 2006 January

Mark scheme PDF ↗

Question 1 18 marks
View details
A railway company is investigating operations at a junction where delays often occur. Delays (in minutes) are modelled by the random variable \(T\) with the following cumulative distribution function. $$F(t) = \begin{cases} 0 & t \leq 0 \\ 1 - e^{-\frac{1}{t}} & t > 0 \end{cases}$$
  1. Find the median delay and the 90th percentile delay. [5]
  2. Derive the probability density function of \(T\). Hence use calculus to find the mean delay. [5]
  3. Find the probability that a delay lasts longer than the mean delay. [2]
You are given that the variance of \(T\) is 9.
  1. Let \(\overline{T}\) denote the mean of a random sample of 30 delays. Write down an approximation to the distribution of \(\overline{T}\). [3]
  2. A random sample of 30 delays is found to have mean 4.2 minutes. Does this cast any doubt on the modelling? [3]
Question 2 18 marks
View details
Geoffrey is a university lecturer. He has to prepare five questions for an examination. He knows by experience that it takes about 3 hours to prepare a question, and he models the time (in minutes) taken to prepare one by the Normally distributed random variable \(X\) with mean 180 and standard deviation 12, independently for all questions.
  1. One morning, Geoffrey has a gap of 2 hours 50 minutes (170 minutes) between other activities. Find the probability that he can prepare a question in this time. [3]
  2. One weekend, Geoffrey can devote 14 hours to preparing the complete examination paper. Find the probability that he can prepare all five questions in this time. [3]
A colleague, Helen, has to check the questions.
  1. She models the time (in minutes) to check a question by the Normally distributed random variable \(Y\) with mean 50 and standard deviation 6, independently for all questions and independently of \(X\). Find the probability that the total time for Geoffrey to prepare a question and Helen to check it exceeds 4 hours. [3]
  2. When working under pressure of deadlines, Helen models the time to check a question in a different way. She uses the Normally distributed random variable \(\frac{1}{2}X\), where \(X\) is as above. Find the length of time, as given by this model, which Helen needs to ensure that, with probability 0.9, she has time to check a question. [4]
Ian, an educational researcher, suggests that a better model for the time taken to prepare a question would be a constant \(k\) representing "thinking time" plus a random variable \(T\) representing the time required to write the question itself, independently for all questions.
  1. Taking \(k\) as 45 and \(T\) as Normally distributed with mean 120 and standard deviation 10 (all units are minutes), find the probability according to Ian's model that a question can be prepared in less than 2 hours 30 minutes. [2]
Juliet, an administrator, proposes that the examination should be reduced in time and shorter questions should be used.
  1. Juliet suggests that Ian's model should be used for the time taken to prepare such shorter questions but with \(k = 30\) and \(T\) replaced by \(\frac{2}{3}T\). Find the probability as given by this model that a question can be prepared in less than \(1\frac{1}{4}\) hours. [3]
Question 3 18 marks
View details
A production line has two machines, A and B, for delivering liquid soap into bottles. Each machine is set to deliver a nominal amount of 250 ml, but it is not expected that they will work to a high level of accuracy. In particular, it is known that the ambient temperature affects the rate of flow of the liquid and leads to variation in the amounts delivered. The operators think that machine B tends to deliver a somewhat greater amount than machine A, no matter what the ambient temperature. This is being investigated by an experiment. A random sample of 10 results from the experiment is shown below. Each column of data is for a different ambient temperature.
Ambient temperature\(T_1\)\(T_2\)\(T_3\)\(T_4\)\(T_5\)\(T_6\)\(T_7\)\(T_8\)\(T_9\)\(T_{10}\)
Amount delivered by machine A246.2251.6252.0246.6258.4251.0247.5247.1248.1253.4
Amount delivered by machine B248.3252.6252.8247.2258.8250.0247.2247.9249.0254.5
  1. Use an appropriate \(t\) test to examine, at the 5\% level of significance, whether the mean amount delivered by machine B may be taken as being greater than that delivered by machine A, stating carefully your null and alternative hypotheses and the required distributional assumption. [11]
  2. Using the data for machine A in the table above, provide a two-sided 95\% confidence interval for the mean amount delivered by this machine, stating the required distributional assumption. Explain whether you would conclude that the machine appears to be working correctly in terms of the nominal amount as set. [7]
Question 4 18 marks
View details
Quality control inspectors in a factory are investigating the lengths of glass tubes that will be used to make laboratory equipment.
  1. Data on the observed lengths of a random sample of 200 glass tubes from one batch are available in the form of a frequency distribution as follows.
    Length \(x\) (mm)Observed frequency
    \(x \leq 298\)1
    \(298 < x \leq 300\)30
    \(300 < x \leq 301\)62
    \(301 < x \leq 302\)70
    \(302 < x \leq 304\)34
    \(x > 304\)3
    The sample mean and standard deviation are 301.08 and 1.2655 respectively. The corresponding expected frequencies for the Normal distribution with parameters estimated by the sample statistics are
    Length \(x\) (mm)Expected frequency
    \(x \leq 298\)1.49
    \(298 < x \leq 300\)37.85
    \(300 < x \leq 301\)55.62
    \(301 < x \leq 302\)58.32
    \(302 < x \leq 304\)44.62
    \(x > 304\)2.10
    Examine the goodness of fit of a Normal distribution, using a 5\% significance level. [7]
  2. It is thought that the lengths of tubes in another batch have an underlying distribution similar to that for the batch in part (i) but possibly with different location and dispersion parameters. A random sample of 10 tubes from this batch gives the following lengths (in mm). 301.3 \quad 301.4 \quad 299.6 \quad 302.2 \quad 300.3 \quad 303.2 \quad 302.6 \quad 301.8 \quad 300.9 \quad 300.8
    1. Discuss briefly whether it would be appropriate to use a \(t\) test to examine a hypothesis about the population mean length for this batch. [2]
    2. Use a Wilcoxon test to examine at the 10\% significance level whether the population median length for this batch is 301 mm. [9]