Edexcel S4 (Statistics 4) 2018 June

Question 1
View details
  1. A machine fills packets with almonds. The weight, in grams, of almonds in a packet is modelled by \(\mathrm { N } \left( \mu , \sigma ^ { 2 } \right)\). To check that the machine is working properly, a random sample of 10 packets is selected and unbiased estimates for \(\mu\) and \(\sigma ^ { 2 }\) are
$$\bar { x } = 202 \quad \text { and } \quad s ^ { 2 } = 3.6$$ Stating your hypotheses clearly, test, at the \(1 \%\) level of significance, whether or not the mean weight of almonds in a packet is more than 200 g .
Question 2
View details
  1. Jeremiah currently uses a Fruity model of juicer. He agrees to trial a new model of juicer, Zesty. The amounts of juice extracted, \(x \mathrm { ml }\), from each of 9 randomly selected oranges, using the Zesty are summarised as
$$\sum x = 468 \quad \sum x ^ { 2 } = 24560$$ Given that the amounts of juice extracted follow a normal distribution,
  1. calculate a 95\% confidence interval for
    1. the mean amount of juice extracted from an orange using the Zesty,
    2. the standard deviation of the amount of juice extracted from an orange using the Zesty. Jeremiah knows that, for his Fruity, the mean amount of juice extracted from an orange is 38 ml and the standard deviation of juice extracted from an orange is 5 ml . He decides that he will replace his Fruity with a Zesty if both
      • the mean for the Zesty is more than \(20 \%\) higher than the mean for his Fruity and
  2. the standard deviation for the Zesty is less than 5.5 ml .
  3. Using your answers to part (a), explain whether or not Jeremiah should replace his Fruity with the Zesty.
Question 3
View details
  1. A random sample of 8 students is selected from a school database.
Each student's reaction time is measured at the start of the school day and again at the end of the school day. The reaction times, in milliseconds, are recorded below.
StudentA\(B\)CD\(E\)\(F\)G\(H\)
Reaction time at the start of the school day10.87.28.76.89.410.911.17.6
Reaction time at the end of the school day106.18.85.78.78.19.86.8
  1. State one assumption that needs to be made in order to carry out a paired \(t\)-test.
    (1) The random variable \(R\) is the reaction time at the start of the school day minus the reaction time at the end of the school day. The mean of \(R\) is \(\mu\). John uses a paired \(t\)-test to test the hypotheses $$\mathrm { H } _ { 0 } : \mu = m \quad \mathrm { H } _ { 1 } : \mu \neq m$$ Given that \(\mathrm { H } _ { 0 }\) is rejected at the 5\% level of significance but accepted at the 1\% level of significance,
  2. find the ranges of possible values for \(m\).
Question 4
View details
  1. A glue supplier claims that Goglue is stronger than Tackfast. A company is presently using Tackfast but agrees to change to Goglue if, at the 5\% significance level,
  • the standard deviation of the force required for Goglue to fail is not greater than the standard deviation of the force required for Tackfast to fail and
  • the mean force required for Goglue to fail is more than 4 newtons greater than the mean force for Tackfast to fail.
A series of trials is carried out, using Goglue and Tackfast, and the glues are tested to destruction. The force, \(x\) newtons, at which each glue fails is recorded.
Sample size \(( n )\)Sample mean \(( \bar { x } )\)Standard deviation \(( s )\)
Tackfast \(( T )\)65.270.31
Goglue \(( G )\)510.120.66
It can be assumed that the force at which each glue fails is normally distributed.
  1. Test, at the \(5 \%\) level of significance, whether or not there is evidence that the standard deviation of the force required for Goglue to fail is greater than the standard deviation of the force required for the Tackfast to fail. State your hypotheses clearly. The supplier claims that the mean force required for its Goglue to fail is more than 4 newtons greater than the mean force required for Tackfast to fail.
  2. Stating your hypotheses clearly and using a \(5 \%\) level of significance, test the supplier's claim.
  3. Show that, at the \(5 \%\) level of significance, the supplier's claim will be accepted if \(\bar { X } _ { G } - \bar { X } _ { T } > 4.55\), where \(\bar { X } _ { G }\) and \(\bar { X } _ { T }\) are the mean forces required for Goglue to fail and Tackfast to fail respectively. Later, it was found that an error had been made when recording the results for Goglue. This resulted in all the forces recorded for Goglue being 0.5 newtons more than they should have been. The results for Tackfast were correct.
  4. Explain whether or not this information affects the decision about which glue the supplier decides to use.
Question 5
View details
  1. A machine makes posts. The length of a post is normally distributed with unknown mean \(\mu\) and standard deviation 4 cm .
A random sample of size \(n\) is taken to test, at the \(5 \%\) significance level, the hypotheses $$\mathrm { H } _ { 0 } : \mu = 150 \quad \mathrm { H } _ { 1 } : \mu > 150$$
  1. State the probability of a Type I error for this test. The manufacturer requires the probability of a Type II error to be less than 0.1 when the actual value of \(\mu\) is 152
  2. Calculate the minimum value of \(n\).
Question 6
View details
  1. The continuous random variable \(X\) has probability density function \(\mathrm { f } ( x )\)
$$f ( x ) = \left\{ \begin{array} { c c } \frac { x } { 2 \theta ^ { 2 } } & 0 \leqslant x \leqslant 2 \theta
0 & \text { otherwise } \end{array} \right.$$ where \(\theta\) is a constant.
  1. Use integration to show that \(\mathrm { E } \left( X ^ { N } \right) = \frac { 2 ^ { N + 1 } } { N + 2 } \theta ^ { N }\)
  2. Hence
    1. write down an expression for \(\mathrm { E } ( X )\) in terms of \(\theta\)
    2. find \(\operatorname { Var } ( X )\) in terms of \(\theta\) A random sample \(X _ { 1 } , X _ { 2 } , \ldots , X _ { n }\) where \(n \geqslant 2\) is taken to estimate the value of \(\theta\) The random variable \(S _ { 1 } = q \bar { X }\) is an unbiased estimator of \(\theta\)
  3. Write down the value of \(q\) and show that \(S _ { 1 }\) is a consistent estimator of \(\theta\) The continuous random variable \(Y\) is independent of \(X\) and is uniformly distributed over the interval \(\left[ 0 , \frac { 2 \theta } { 3 } \right]\), where \(\theta\) is the same unknown constant as in \(\mathrm { f } ( x )\). The random variable \(S _ { 2 } = a X + b Y\) is an unbiased estimator of \(\theta\) and is based on one observation of \(X\) and one observation of \(Y\).
  4. Find the value of \(a\) and the value of \(b\) for which \(S _ { 2 }\) has minimum variance.
  5. Show that the minimum variance of \(S _ { 2 }\) is \(\frac { \theta ^ { 2 } } { 11 }\)
  6. Explain which of \(S _ { 1 }\) or \(S _ { 2 }\) is the better estimator for \(\theta\)