Edexcel S4 (Statistics 4) 2014 June

Question 1
View details
  1. A production line is designed to fill bottles with oil. The amount of oil placed in a bottle is normally distributed and the mean is set to 100 ml .
The amount of oil, \(x \mathrm { ml }\), in each of 8 randomly selected bottles is recorded, and the following statistics are obtained. $$\bar { x } = 92.875 \quad s = 8.3055$$ Malcolm believes that the mean amount of oil placed in a bottle is less than 100 ml .
Stating your hypotheses clearly, test, at the \(5 \%\) significance level, whether or not Malcolm's belief is supported.
Question 2
View details
2. (a) Define
  1. a Type I error,
  2. a Type II error. Rolls of material, manufactured by a machine, contain defects at a mean rate of 6 per roll. The machine is modified. A single roll is selected at random and a test is carried out to see whether or not the mean number of defects per roll has decreased. The significance level is chosen to be as close as possible to \(5 \%\).
    (b) Calculate the probability of a Type I error for this test.
    (c) Given that the true mean number of defects per roll of material made by the machine is now 4, calculate the probability of a Type II error.
Question 3
View details
3. A large number of chicks were fed a special diet for 10 days. A random sample of 9 of these chicks is taken and the weight gained, \(x\) grams, by each chick is recorded. The results are summarised below. $$\sum x = 181 \quad \sum x ^ { 2 } = 3913$$ You may assume that the weights gained by the chicks are normally distributed.
Calculate a 95\% confidence interval for
    1. the mean of the weights gained by the chicks,
    2. the variance of the weights gained by the chicks. A chick which gains less than \(16 g\) has to be given extra feed.
  1. Using appropriate confidence limits from part (a), find the lowest estimate of the proportion of chicks that need extra feed.
Question 4
View details
  1. A random sample of 8 people were given a new drug designed to help people sleep.
In a two-week period the drug was given for one week and a placebo (a tablet that contained no drug) was given for one week. In the first week 4 people, selected at random, were given the drug and the other 4 people were given the placebo. Those who were given the drug in the first week were given the placebo in the second week. Those who were given the placebo in the first week were given the drug in the second week. The mean numbers of hours of sleep per night for each of the people are shown in the table.
Person\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Hours of sleep with drug10.87.28.76.89.410.911.17.6
Hours of sleep with placebo10.06.59.05.68.78.09.86.8
  1. State one assumption that needs to be made in order to carry out a paired \(t\)-test.
  2. Stating your hypotheses clearly, test, at the \(1 \%\) level of significance, whether or not the drug increases the mean number of hours of sleep per night by more than 10 minutes. State the critical value for this test.
Question 5
View details
5. A statistician believes a coin is biased and the probability, \(p\), of getting a head when the coin is tossed is less than 0.5 The statistician decides to test this by tossing the coin 10 times and recording the number, \(X\), of heads. He sets up the hypotheses \(\mathrm { H } _ { 0 } : p = 0.5\) and \(\mathrm { H } _ { 1 } : p < 0.5\) and rejects the null hypothesis if \(x < 3\)
  1. Find the size of the test.
  2. Show that the power function of this test is $$( 1 - p ) ^ { 8 } \left( 36 p ^ { 2 } + 8 p + 1 \right)$$ Table 1 gives values, to 2 decimal places, of the power function for the statistician's test. \begin{table}[h] \end{table} Table 1
  3. On the axes below draw the graph of the power function for the statistician's test.
  4. Find the range of values of \(p\) for which the probability of accepting the coin as unbiased, when in fact it is biased, is less than or equal to 0.4
    \includegraphics[max width=\textwidth, alt={}, center]{1d84c9fc-be67-45be-b439-3111c48ff1cb-09_1143_1209_945_402}
Question 6
View details
  1. (a) Explain what is meant by the sampling distribution of an estimator \(T\) of the population parameter \(\theta\).
    (b) Explain what you understand by the statement that \(T\) is a biased estimator of \(\theta\).
A population has mean \(\mu\) and variance \(\sigma ^ { 2 }\)
A random sample \(X _ { 1 } , X _ { 2 } , \ldots , X _ { 10 }\) is taken from this population.
(c) Calculate the bias of each of the following estimators of \(\mu\). $$\begin{aligned} & \hat { \mu } _ { 1 } = \frac { X _ { 3 } + X _ { 5 } + X _ { 7 } } { 3 }
& \hat { \mu } _ { 2 } = \frac { 5 X _ { 1 } + 2 X _ { 2 } + X _ { 9 } } { 6 }
& \hat { \mu } _ { 3 } = \frac { 3 X _ { 10 } - X _ { 1 } } { 3 } \end{aligned}$$ (d) Find the variance of each of these three estimators.
(e) State, giving a reason, which of these three estimators for \(\mu\) is
  1. the best estimator,
  2. the worst estimator.
Question 7
View details
7. Two groups of students take the same examination. A random sample of students is taken from each of the groups. The marks of the 9 students from Group 1 are as follows $$\begin{array} { l l l l l l l l l } 30 & 29 & 35 & 27 & 23 & 33 & 33 & 35 & 28 \end{array}$$ The marks, \(x\), of the 7 students from Group 2 gave the following statistics $$\bar { x } = 31.29 \quad s ^ { 2 } = 12.9$$ A test is to be carried out to see whether or not there is a difference between the mean marks of the two groups of students. You may assume that the samples are taken from normally distributed populations and that they are independent.
  1. State one other assumption that must be made in order to apply this test and show that this assumption is reasonable by testing it at a \(10 \%\) level of significance. State your hypotheses clearly.
  2. Stating your hypotheses clearly, test, using a significance level of \(5 \%\), whether or not there is a difference between the mean marks of the two groups of students.