OCR S3 (Statistics 3) 2012 June

Question 1
View details
1 A machine fills packets of flour whose nominal weights are 500 g . Each of a random sample of 100 packets was weighed and 14 packets weighed less than 500 g . The population proportion of packets that weigh less than 500 g is denoted by \(p\).
  1. Calculate an approximate \(95 \%\) confidence interval for \(p\).
  2. The weights of the packets, in grams, are normally distributed with mean \(\mu\) and variance 50 . Assuming that \(p = 0.14\), calculate the value of \(\mu\).
Question 2
View details
2 Four pairs of randomly chosen twins were each given identical puzzles to solve. The times taken (in seconds) are shown in the following table.
Twin pair1234
Time for first-born46384449
Time for second-born40413746
Stating any necessary assumption, test at the \(10 \%\) significance level whether there is a difference between the population mean times of first-born and second-born twins.
Question 3
View details
3 A charity raises money by sending letters asking for donations. Because of recent poor responses, the charity’s fund-raiser, Anna, decides to alter the letter’s appearance and designs two possible alternatives, one colourful and the other plain. She believes that the colourful letter will be more successful. Anna sends 60 colourful letters and 40 plain letters to 100 people randomly chosen from the charity's database. There were 39 positive responses to the colourful letter and 12 positive responses to the plain letter. The population proportions of positive responses to the colourful and plain letters are denoted by \(p _ { C }\) and \(p _ { P }\) respectively. Test the null hypothesis \(p _ { C } - p _ { P } = 0.15\) against the alternative hypothesis \(p _ { C } - p _ { P } > 0.15\) at the \(2 \frac { 1 } { 2 } \%\) significance level and state what Anna could report to her manager.
Question 4
View details
4 The time interval, \(T\) minutes, between consecutive stoppages of a particular grinding machine is regularly measured. \(T\) is normally distributed with mean \(\mu\).
24 randomly chosen values of \(T\) are summarised by $$\sum _ { i = 1 } ^ { 24 } t _ { i } = 348.0 \text { and } \sum _ { i = 1 } ^ { 24 } t _ { i } ^ { 2 } = 5195.5 .$$
  1. Calculate a symmetric \(95 \%\) confidence interval for \(\mu\).
  2. For the machine to be working acceptably, \(\mu\) should be at least 15.0 . Using a test at the 10\% significance level, decide whether the machine is working acceptably.
Question 5
View details
5 The discrete random variables \(X\) and \(Y\) are independent with \(X \sim \mathrm {~B} \left( 32 , \frac { 1 } { 2 } \right)\) and \(Y \sim \operatorname { Po } ( 28 )\).
  1. Find the values of \(\mathrm { E } ( Y - X )\) and \(\operatorname { Var } ( Y - X )\).
  2. State, with justification, an approximate distribution for \(Y - X\).
  3. Hence find \(\mathrm { P } ( | Y - X | \geqslant 3 )\).
Question 6
View details
6
\includegraphics[max width=\textwidth, alt={}, center]{054e0081-afce-4a87-93f5-650dad40b313-3_508_611_262_719} The diagram shows the probability density function f of the continuous random variable \(T\), given by $$f ( t ) = \begin{cases} a t & 0 \leqslant t \leqslant 1
a & 1 < t \leqslant 4
0 & \text { otherwise } \end{cases}$$ where \(a\) is a constant.
  1. Find the value of \(a\).
  2. Obtain the cumulative distribution function of \(T\).
  3. Find the cumulative distribution of \(Y\), where \(Y = T ^ { \frac { 1 } { 2 } }\), and hence find the probability density function of \(Y\).
Question 7
View details
7 A study was carried out into whether patients suffering from a certain respiratory disorder would benefit from particular treatments. Each of 90 patients who agreed to take part was given one of three treatments \(A\), \(B\) or \(C\) as shown in the table.
Treatment\(A\)\(B\)\(C\)
Number in group312534
  1. It is claimed that each patient was equally likely to have been given any of the treatments. Test at the \(5 \%\) significance level whether the numbers given each treatment are consistent with this claim.
  2. After 3 months the numbers of patients showing improvement for treatments \(A , B\) and \(C\) were 14, 18 and 25 respectively. By setting up a \(2 \times 3\) contingency table, test whether the outcome is dependent on the treatment. Use a \(5 \%\) significance level.
  3. If one of the treatments is abandoned, explain briefly which it should be. \section*{THERE ARE NO QUESTIONS WRITTEN ON THIS PAGE}