OCR S2 (Statistics 2) 2006 June

Question 1
View details
1 Calculate the variance of the continuous random variable with probability density function given by $$f ( x ) = \begin{cases} \frac { 3 } { 37 } x ^ { 2 } & 3 \leqslant x \leqslant 4
0 & \text { otherwise } \end{cases}$$
Question 2
View details
2
  1. The random variable \(R\) has the distribution \(\mathrm { B } ( 6 , p )\). A random observation of \(R\) is found to be 6. Carry out a \(5 \%\) significance test of the null hypothesis \(\mathrm { H } _ { 0 } : p = 0.45\) against the alternative hypothesis \(\mathrm { H } _ { 1 } : p \neq 0.45\), showing all necessary details of your calculation.
  2. The random variable \(S\) has the distribution \(\mathrm { B } ( n , p ) . \mathrm { H } _ { 0 }\) and \(\mathrm { H } _ { 1 }\) are as in part (i). A random observation of \(S\) is found to be 1 . Use tables to find the largest value of \(n\) for which \(\mathrm { H } _ { 0 }\) is not rejected. Show the values of any relevant probabilities.
Question 3
View details
3 The continuous random variable \(T\) has mean \(\mu\) and standard deviation \(\sigma\). It is known that \(\mathrm { P } ( T < 140 ) = 0.01\) and \(\mathrm { P } ( T < 300 ) = 0.8\).
  1. Assuming that \(T\) is normally distributed, calculate the values of \(\mu\) and \(\sigma\). In fact, \(T\) represents the time, in minutes, taken by a randomly chosen runner in a public marathon, in which about \(10 \%\) of runners took longer than 400 minutes.
  2. State with a reason whether the mean of \(T\) would be higher than, equal to, or lower than the value calculated in part (i).
Question 4
View details
4
  1. Explain briefly what is meant by a random sample. Random numbers are used to select, with replacement, a sample of size \(n\) from a population numbered 000, 001, 002, ..., 799.
  2. If \(n = 6\), find the probability that exactly 4 of the selected sample have numbers less than 500 .
  3. If \(n = 60\), use a suitable approximation to calculate the probability that at least 40 of the selected sample have numbers less than 500 .
Question 5
View details
5 An airline has 300 seats available on a flight to Australia. It is known from experience that on average only \(99 \%\) of those who have booked seats actually arrive to take the flight, the remaining \(1 \%\) being called 'no-shows'. The airline therefore sells more than 300 seats. If more than 300 passengers then arrive, the flight is over-booked. Assume that the number of no-show passengers can be modelled by a binomial distribution.
  1. If the airline sells 303 seats, state a suitable distribution for the number of no-show passengers, and state a suitable approximation to this distribution, giving the values of any parameters. Using the distribution and approximation in part (i),
  2. show that the probability that the flight is over-booked is 0.4165 , correct to 4 decimal places,
  3. find the largest number of seats that can be sold for the probability that the flight is over-booked to be less than 0.2.
Question 6
View details
6 Customers arrive at a post office at a constant average rate of 0.4 per minute.
  1. State an assumption needed to model the number of customers arriving in a given time interval by a Poisson distribution. Assuming that the use of a Poisson distribution is justified,
  2. find the probability that more than 2 customers arrive in a randomly chosen 1 -minute interval,
  3. use a suitable approximation to calculate the probability that more than 55 customers arrive in a given two-hour interval,
  4. calculate the smallest time for which the probability that no customers arrive in that time is less than 0.02 , giving your answer to the nearest second.
Question 7
View details
7 Three independent researchers, \(A , B\) and \(C\), carry out significance tests on the power consumption of a manufacturer's domestic heaters. The power consumption, \(X\) watts, is a normally distributed random variable with mean \(\mu\) and standard deviation 60. Each researcher tests the null hypothesis \(\mathrm { H } _ { 0 } : \mu = 4000\) against the alternative hypothesis \(\mathrm { H } _ { 1 } : \mu > 4000\). Researcher \(A\) uses a sample of size 50 and a significance level of \(5 \%\).
  1. Find the critical region for this test, giving your answer correct to 4 significant figures. In fact the value of \(\mu\) is 4020 .
  2. Calculate the probability that Researcher \(A\) makes a Type II error.
  3. Researcher \(B\) uses a sample bigger than 50 and a significance level of \(5 \%\). Explain whether the probability that Researcher \(B\) makes a Type II error is less than, equal to, or greater than your answer to part (ii).
  4. Researcher \(C\) uses a sample of size 50 and a significance level bigger than \(5 \%\). Explain whether the probability that Researcher \(C\) makes a Type II error is less than, equal to, or greater than your answer to part (ii).
  5. State with a reason whether it is necessary to use the Central Limit Theorem at any point in this question.