OCR S3 (Statistics 3) Specimen

Question 1
View details
1 A car repair firm receives call-outs both as a result of breakdowns and also as a result of accidents. On weekdays (Monday to Friday), call-outs resulting from breakdowns occur at random, at an average rate of 6 per 5 -day week; call-outs resulting from accidents occur at random, at an average rate of 2 per 5 -day week. The two types of call-out occur independently of each other. Find the probability that the total number of call-outs received by the firm on one randomly chosen weekday is more than 3 .
Question 2 7 marks
View details
2 Boxes of matches contain 50 matches. Full boxes have mean mass 20.0 grams and standard deviation 0.4 grams. Empty boxes have mean mass 12.5 grams and standard deviation 0.2 grams. Stating any assumptions that you need to make, calculate the mean and standard deviation of the mass of a match. [7]
Question 3
View details
3 A random sample of 80 precision-engineered cylindrical components is checked as part of a quality control process. The diameters of the cylinders should be 25.00 cm . Accurate measurements of the diameters, \(x \mathrm {~cm}\), for the sample are summarised by $$\Sigma ( x - 25 ) = 0.44 , \quad \Sigma ( x - 25 ) ^ { 2 } = 0.2287 .$$
  1. Calculate a \(99 \%\) confidence interval for the population mean diameter of the components.
  2. For the calculation in part (i) to be valid, is it necessary to assume that component diameters are normally distributed? Justify your answer.
Question 4
View details
4 The lengths of time, in seconds, between vehicles passing a fixed observation point on a road were recorded at a time when traffic was flowing freely. The frequency distribution in Table 1 is a summary of the data from 100 observations. \begin{table}[h]
Time interval \(( x\) seconds \()\)\(0 < x \leqslant 5\)\(5 < x \leqslant 10\)\(10 < x \leqslant 20\)\(20 < x \leqslant 40\)\(40 < x\)
Observed frequency49222072
\captionsetup{labelformat=empty} \caption{Table 1}
\end{table} It is thought that the distribution of times might be modelled by the continuous random variable \(X\) with probability density function given by $$f ( x ) = \begin{cases} 0.1 e ^ { - 0.1 x } & x > 0
0 & \text { otherwise } \end{cases}$$ Using this model, the expected frequencies (correct to 2 decimal places) for the given time intervals are shown in Table 2. \begin{table}[h]
Time interval \(( x\) seconds \()\)\(0 < x \leqslant 5\)\(5 < x \leqslant 10\)\(10 < x \leqslant 20\)\(20 < x \leqslant 40\)\(40 < x\)
Expected frequency39.3523.8723.2511.701.83
\captionsetup{labelformat=empty} \caption{Table 2}
\end{table}
  1. Show how the expected frequency of 23.87, corresponding to the interval \(5 < x \leqslant 10\), is obtained.
  2. Test, at the 10\% significance level, the goodness of fit of the model to the data.
Question 5
View details
5 The continuous random variable \(X\) has a triangular distribution with probability density function given by $$f ( x ) = \left\{ \begin{array} { l r } 1 + x & - 1 \leqslant x \leqslant 0
1 - x & 0 \leqslant x \leqslant 1
0 & \text { otherwise } \end{array} \right.$$
  1. Show that, for \(0 \leqslant a \leqslant 1\), $$\mathrm { P } ( | X | \leqslant a ) = 2 a - a ^ { 2 } .$$ The random variable \(Y\) is given by \(Y = X ^ { 2 }\).
  2. Express \(\mathrm { P } ( Y \leqslant y )\) in terms of \(y\), for \(0 \leqslant y \leqslant 1\), and hence show that the probability density function of \(Y\) is given by $$g ( y ) = \frac { 1 } { \sqrt { } y } - 1 , \quad \text { for } 0 < y \leqslant 1 .$$
  3. Use the probability density function of \(Y\) to find \(\mathrm { E } ( Y )\), and show how the value of \(\mathrm { E } ( Y )\) may also be obtained directly using the probability density function of \(X\).
  4. Find \(\mathrm { E } ( \sqrt { } Y )\).
Question 6
View details
6 Certain types of food are now sold in metric units. A random sample of 1000 shoppers was asked whether they were in favour of the change to metric units or not. The results, classified according to age, were as shown in the table.
\cline { 2 - 4 } \multicolumn{1}{c|}{}Age of shopper
\cline { 2 - 4 } \multicolumn{1}{c|}{}Under 3535 and overTotal
In favour of change187161348
Not in favour of change283369652
Total4705301000
  1. Use a \(\chi ^ { 2 }\) test to show that there is very strong evidence that shoppers' views about changing to metric units are not independent of their ages.
  2. The data may also be regarded as consisting of two random samples of shoppers; one sample consists of 470 shoppers aged under 35 , of whom 187 were in favour of change, and the second sample consists of 530 shoppers aged 35 or over, of whom 161 were in favour of change. Determine whether a test for equality of population proportions supports the conclusion in part (i).
Question 7
View details
7 A factory manager wished to compare two methods of assembling a new component, to determine which method could be carried out more quickly, on average, by the workforce. A random sample of 12 workers was taken, and each worker tried out each of the methods of assembly. The times taken, in seconds, are shown in the table.
Worker\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)\(K\)\(L\)
Time in seconds for Method 1483847596241505258544960
Time in seconds for Method 2474038555742424062474751
  1. (a) Carry out an appropriate \(t\)-test, using a \(2 \%\) significance level, to test whether there is any difference in the times for the two methods of assembly.
    (b) State an assumption needed in carrying out this test.
    (c) Calculate a \(95 \%\) confidence interval for the population mean time difference for the two methods of assembly.
  2. Instead of using the same 12 workers to try both methods, the factory manager could have used two independent random samples of workers, allocating Method 1 to the members of one sample and Method 2 to the members of the other sample.
    (a) State one disadvantage of a procedure based on two independent random samples.
    (b) State any assumptions that would need to be made to carry out a \(t\)-test based on two independent random samples.