Edexcel FS2 (Further Statistics 2) 2019 June

Question 1
View details
1 A machine is set to fill pots with yoghurt such that the mean weight of yoghurt in a pot is 505 grams. To check that the machine is working properly, a random sample of 8 pots is selected. The weight of yoghurt, in grams, in each pot is as follows $$\begin{array} { l l l l l l l l } 508 & 510 & 500 & 500 & 498 & 503 & 508 & 505 \end{array}$$ Given that the weights of the yoghurt delivered by the machine follow a normal distribution with standard deviation 5.4 grams,
  1. find a \(95 \%\) confidence interval for the mean weight, \(\mu\) grams, of yoghurt in a pot. Give your answers to 2 decimal places.
  2. Comment on whether or not the machine is working properly, giving a reason for your answer.
  3. State the probability that a \(95 \%\) confidence interval for \(\mu\) will not contain \(\mu\) grams.
  4. Without carrying out any further calculations, explain the changes, if any, that would need to be made in calculating the confidence interval in part (a) if the standard deviation was unknown. Give a reason for your answer.
    You may assume that the weights of the yoghurt delivered by the machine still follow a normal distribution.
Question 2
View details
2 A large field of wheat is split into 8 plots of equal area. Each plot is treated with a different amount of fertiliser, \(f\) grams \(/ \mathrm { m } ^ { 2 }\). The yield of wheat, \(w\) tonnes, from each plot is recorded. The results are summarised below. $$\sum f = 28 \quad \sum w = 303 \quad \sum w ^ { 2 } = 13447 \quad \mathrm {~S} _ { f f } = 42 \quad \mathrm {~S} _ { f w } = 269.5$$
  1. Calculate the product moment correlation coefficient between \(f\) and \(w\)
  2. Interpret the value of your product moment correlation coefficient.
  3. Find the equation of the regression line of \(w\) on \(f\) in the form \(w = a + b f\)
  4. Using your equation, estimate the decrease in yield when the amount of fertiliser decreases by 0.5 grams \(/ \mathrm { m } ^ { 2 }\) The residuals of the data recorded are calculated and plotted on the graph below.
    \includegraphics[max width=\textwidth, alt={}, center]{67df73d4-6ce4-45f7-8a69-aa94292ea814-04_1232_1294_1169_301}
  5. With reference to this graph, comment on the suitability of the model you found in part (c).
  6. Suggest how you might be able to refine your model.
Question 3
View details
3 Yin grows two varieties of potato, plant \(A\) and plant \(B\). A random sample of each variety of potato is taken and the yield, \(x \mathrm {~kg}\), produced by each plant is measured. The following statistics are obtained from the data.
Number of plants\(\sum x\)\(\sum x ^ { 2 }\)
\(A\)25194.71637.37
\(B\)26227.52031.19
  1. Stating your hypotheses clearly, test, at the \(10 \%\) significance level, whether or not the variances of the yields of the two varieties of potato are the same.
  2. State an assumption you have made in order to carry out the test in part (a).
Question 4
View details
4 The continuous random variable \(X\) has cumulative distribution function given by $$\mathrm { F } ( x ) = \left\{ \begin{array} { c c } 0 & x \leqslant 0
k \left( x ^ { 3 } - \frac { 3 } { 8 } x ^ { 4 } \right) & 0 < x \leqslant 2
1 & x > 2 \end{array} \right.$$ where \(k\) is a constant.
  1. Show that \(k = \frac { 1 } { 2 }\)
  2. Showing your working clearly, use calculus to find
    1. \(\mathrm { E } ( X )\)
    2. the mode of \(X\)
  3. Describe, giving a reason, the skewness of the distribution of \(X\)
Question 5
View details
5 Alexa believes that students are equally likely to achieve the same percentage score on each of two tests, paper I and paper II. She randomly selects 8 students and gives them each paper I and paper II. The percentage scores for each paper are recorded. The following paired data are collected.
Student\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Paper I (\%)7070848064656590
Paper II (\%)6476727468645876
Test, at the \(1 \%\) significance level, whether or not there is evidence to support Alexa's belief. State your hypotheses clearly and show your working.
Question 6
View details
6 A company manufactures bolts. The diameter of the bolts follows a normal distribution with a mean diameter of 5 mm . Stan believes that the mean diameter of the bolts is less than 5 mm . He takes a random sample of 10 bolts and measures their diameters. He calculates some statistics but spills ink on his work before completing them. The only information he has left is as follows
\includegraphics[max width=\textwidth, alt={}, center]{67df73d4-6ce4-45f7-8a69-aa94292ea814-16_394_1150_527_456} Stating your hypotheses clearly, test, at the \(5 \%\) level of significance, whether or not Stan's belief is supported.
Question 7
View details
7 A manufacturer makes two versions of a toy. One version is made out of wood and the other is made out of plastic. The weights, \(W \mathrm {~kg}\), of the wooden toys are normally distributed with mean 2.5 kg and standard deviation 0.7 kg . The weights, \(X \mathrm {~kg}\), of the plastic toys are normally distributed with mean 1.27 kg and standard deviation 0.4 kg . The random variables \(W\) and \(X\) are independent.
  1. Find the probability that the weight of a randomly chosen wooden toy is more than double the weight of a randomly chosen plastic toy. The manufacturer packs \(n\) of these wooden toys and \(2 n\) of these plastic toys into the same container. The maximum weight the container can hold is 252 kg . The probability of the contents of this container being overweight is 0.2119 to 4 decimal places.
  2. Calculate the value of \(n\).
Question 8
View details
8 Nine athletes, \(A , B , C , D , E , F , G , H\) and \(I\), competed in both the 100 m sprint and the long jump. After the two events the positions of each athlete were recorded and Spearman's rank correlation coefficient was calculated and found to be 0.85
  1. Stating your hypotheses clearly, test whether or not there is evidence to suggest that the higher an athlete's position is in the 100 m sprint, the higher their position is in the long jump. Use a \(5 \%\) level of significance. The piece of paper the positions were recorded on was mislaid. Although some of the athletes agreed their positions, there was some disagreement between athletes \(B , C\) and \(D\) over their long jump results. The table shows the results that are agreed to be correct.
    Athlete\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)
    Position in 100 m sprint467928315
    Position in long jump549312
    Given that there were no tied ranks,
  2. find the correct positions of athletes \(B , C\) and \(D\) in the long jump. You must show your working clearly and give reasons for your answers.
  3. Without recalculating the coefficient, explain how Spearman's rank correlation coefficient would change if athlete \(H\) was disqualified from both the 100 m sprint and the long jump.