Edexcel S4 (Statistics 4) 2009 June

Question 1
View details
  1. A company manufactures bolts with a mean diameter of 5 mm . The company wishes to check that the diameter of the bolts has not decreased. A random sample of 10 bolts is taken and the diameters, \(x \mathrm {~mm}\), of the bolts are measured. The results are summarised below.
$$\sum x = 49.1 \quad \sum x ^ { 2 } = 241.2$$ Using a \(1 \%\) level of significance, test whether or not the mean diameter of the bolts is less than 5 mm .
(You may assume that the diameter of the bolts follows a normal distribution.)
Question 2
View details
2. An emission-control device is tested to see if it reduces \(\mathrm { CO } _ { 2 }\) emissions from cars. The emissions from 6 randomly selected cars are measured with and without the device. The results are as follows.
CarA\(B\)CDE\(F\)
Emissions without device151.4164.3168.5148.2139.4151.2
Emissions with device148.9162.7166.9150.1140.0146.7
  1. State an assumption that needs to be made in order to carry out a \(t\)-test in this case.
  2. State why a paired \(t\)-test is suitable for use with these data.
  3. Using a \(5 \%\) level of significance, test whether or not there is evidence that the device reduces \(\mathrm { CO } _ { 2 }\) emissions from cars.
  4. Explain, in context, what a type II error would be in this case.
Question 3
View details
  1. Define, in terms of \(\mathrm { H } _ { 0 }\) and/or \(\mathrm { H } _ { 1 }\),
    1. the size of a hypothesis test,
    2. the power of a hypothesis test.
    The probability of getting a head when a coin is tossed is denoted by \(p\).
    This coin is tossed 12 times in order to test the hypotheses \(\mathrm { H } _ { 0 } : p = 0.5\) against \(\mathrm { H } _ { 1 } : p \neq 0.5\), using a 5\% level of significance.
  2. Find the largest critical region for this test, such that the probability in each tail is less than 2.5\%.
  3. Given that \(p = 0.4\)
    1. find the probability of a type II error when using this test,
    2. find the power of this test.
  4. Suggest two ways in which the power of the test can be increased.
Question 4
View details
  1. A farmer set up a trial to assess whether adding water to dry feed increases the milk yield of his cows. He randomly selected 22 cows. Thirteen of the cows were given dry feed and the other 9 cows were given the feed with water added. The milk yields, in litres per day, were recorded with the following results.
\cline { 2 - 4 } \multicolumn{1}{c|}{}Sample sizeMean\(s ^ { 2 }\)
Dry feed1325.542.45
Feed with water added927.941.02
You may assume that the milk yield from cows given the dry feed and the milk yield from cows given the feed with water added are from independent normal distributions.
  1. Test, at the \(10 \%\) level of significance, whether or not the variances of the populations from which the samples are drawn are the same. State your hypotheses clearly.
  2. Calculate a \(95 \%\) confidence interval for the difference between the two mean milk yields.
  3. Explain the importance of the test in part (a) to the calculation in part (b).
Question 5
View details
  1. A machine fills jars with jam. The weight of jam in each jar is normally distributed. To check the machine is working properly the contents of a random sample of 15 jars are weighed in grams. Unbiased estimates of the mean and variance are obtained as
$$\hat { \mu } = 560 \quad s ^ { 2 } = 25.2$$ Calculate a 95\% confidence interval for,
  1. the mean weight of jam,
  2. the variance of the weight of jam. A weight of more than 565 g is regarded as too high and suggests the machine is not working properly.
  3. Use appropriate confidence limits from parts (a) and (b) to find the highest estimate of the proportion of jars that weigh too much.
Question 6
View details
6. A continuous uniform distribution on the interval \([ 0 , k ]\) has mean \(\frac { k } { 2 }\) and variance \(\frac { k ^ { 2 } } { 12 }\). A random sample of three independent variables \(X _ { 1 } , X _ { 2 }\) and \(X _ { 3 }\) is taken from this distribution.
  1. Show that \(\frac { 2 } { 3 } X _ { 1 } + \frac { 1 } { 2 } X _ { 2 } + \frac { 5 } { 6 } X _ { 3 }\) is an unbiased estimator for \(k\). An unbiased estimator for \(k\) is given by \(\hat { k } = a X _ { 1 } + b X _ { 2 }\) where \(a\) and \(b\) are constants.
  2. Show that \(\operatorname { Var } ( \hat { k } ) = \left( a ^ { 2 } - 2 a + 2 \right) \frac { k ^ { 2 } } { 6 }\)
  3. Hence determine the value of \(a\) and the value of \(b\) for which \(\hat { k }\) has minimum variance, and calculate this minimum variance.