OCR MEI S2 (Statistics 2) 2013 January

Question 1
View details
1 A manufacturer of playground safety tiles is testing a new type of tile. Tiles of various thicknesses are tested to estimate the maximum height at which people would be unlikely to sustain injury if they fell onto a tile. The results of the test are as follows.
Thickness \(( t \mathrm {~mm} )\)20406080100
Maximum height \(( h \mathrm {~m} )\)0.721.091.621.972.34
  1. Draw a scatter diagram to illustrate these data.
  2. State which of the two variables is the independent variable, giving a reason for your answer.
  3. Calculate the equation of the regression line of maximum height on thickness.
  4. Use the equation of the regression line to calculate estimates of the maximum height for thicknesses of
    (A) 70 mm ,
    (B) 120 mm . Comment on the reliability of each of these estimates.
  5. Calculate the value of the residual for the data point at which \(t = 40\).
  6. In a further experiment, the manufacturer tests a tile with a thickness of 200 mm and finds that the corresponding maximum height is 2.96 m . What can be said about the relationship between tile thickness and maximum height?
Question 2
View details
2 John is observing butterflies being blown across a fence in a strong wind. He uses the Poisson distribution with mean 2.1 to model the number of butterflies he observes in one minute.
  1. Find the probability that John observes
    (A) no butterflies in a minute,
    (B) at least 2 butterflies in a minute,
    (C) between 5 and 10 butterflies inclusive in a period of 5 minutes.
  2. Use a suitable approximating distribution to find the probability that John observes at least 130 butterflies in a period of 1 hour. In fact some of the butterflies John observes being blown across the fence are being blown in pairs.
  3. Explain why this invalidates one of the assumptions required for a Poisson distribution to be a suitable model. John decides to revise his model for the number of butterflies he observes in one minute. In this new model, the number of pairs of butterflies is modelled by the Poisson distribution with mean 0.2 , and the number of single butterflies is modelled by an independent Poisson distribution with mean 1.7.
  4. Find the probability that John observes no more than 3 butterflies altogether in a period of one minute.
Question 3
View details
3 The amount of data, \(X\) megabytes, arriving at an internet server per second during the afternoon is modelled by the Normal distribution with mean 435 and standard deviation 30.
  1. Find
    (A) \(\mathrm { P } ( X < 450 )\),
    (B) \(\mathrm { P } ( 400 < X < 450 )\).
  2. Find the probability that, during 5 randomly selected seconds, the amounts of data arriving are all between 400 and 450 megabytes. The amount of data, \(Y\) megabytes, arriving at the server during the evening is modelled by the Normal distribution with mean \(\mu\) and standard deviation \(\sigma\).
  3. Given that \(\mathrm { P } ( Y < 350 ) = 0.2\) and \(\mathrm { P } ( Y > 390 ) = 0.1\), find the values of \(\mu\) and \(\sigma\).
  4. Find values of \(a\) and \(b\) for which \(\mathrm { P } ( a < Y < b ) = 0.95\).
Question 4
View details
4
  1. A random sample of 60 students studying mathematics was selected. Their grades in the Core 1 module are summarised in the table below, classified according to whether they worked less than 5 hours per week or at least 5 hours per week. Test, at the \(5 \%\) significance level, whether there is any association between grade and hours worked.
    Hours worked
    \cline { 3 - 4 } \multicolumn{2}{|c|}{}Less than 5At least 5
    \multirow{2}{*}{Grade}A or B2011
    \cline { 2 - 4 }C or lower1316
  2. At a canning factory, cans are filled with tomato purée. The machine which fills the cans is set so that the volume of tomato purée in a can, measured in millilitres, is Normally distributed with mean 420 and standard deviation 3.5. After the machine is recalibrated, a quality control officer wishes to check whether the mean is still 420 millilitres. A random sample of 10 cans of tomato purée is selected and the volumes, measured in millilitres, are as follows. $$\begin{array} { l l l l l l l l l l } 417.2 & 422.6 & 414.3 & 419.6 & 420.4 & 410.0 & 418.3 & 416.9 & 418.9 & 419.7 \end{array}$$ Carry out a test at the \(1 \%\) significance level to investigate whether the mean is still 420 millilitres. You should assume that the volumes are Normally distributed with unchanged standard deviation.