Edexcel S1 (Statistics 1) 2016 June

Question 1
View details
  1. A biologist is studying the behaviour of bees in a hive. Once a bee has located a source of food, it returns to the hive and performs a dance to indicate to the other bees how far away the source of the food is. The dance consists of a series of wiggles. The biologist records the distance, \(d\) metres, of the food source from the hive and the average number of wiggles, \(w\), in the dance.
Distance, \(\boldsymbol { d } \mathbf { m }\)305080100150400500650
Average number
of wiggles, \(\boldsymbol { w }\)
0.7251.2101.7752.2503.5186.3828.1859.555
[You may use \(\sum w = 33.6 \sum d w = 13833 \mathrm {~S} _ { d d } = 394600 \mathrm {~S} _ { w w } = 80.481\) (to 3 decimal places)]
  1. Show that \(\mathrm { S } _ { d w } = 5601\)
  2. State, giving a reason, which is the response variable.
  3. Calculate the product moment correlation coefficient for these data.
  4. Calculate the equation of the regression line of \(w\) on \(d\), giving your answer in the form \(w = a + b d\) A new source of food is located 350 m from the hive.
    1. Use your regression equation to estimate the average number of wiggles in the corresponding dance.
    2. Comment, giving a reason, on the reliability of your estimate.
Question 2
View details
2. The discrete random variable \(X\) has the following probability distribution, where \(p\) and \(q\) are constants.
\(x\)- 2- 1\(\frac { 1 } { 2 }\)\(\frac { 3 } { 2 }\)2
\(\mathrm { P } ( X = x )\)\(p\)\(q\)0.20.3\(p\)
  1. Write down an equation in \(p\) and \(q\) Given that \(\mathrm { E } ( X ) = 0.4\)
  2. find the value of \(q\)
  3. hence find the value of \(p\) Given also that \(\mathrm { E } \left( X ^ { 2 } \right) = 2.275\)
  4. find \(\operatorname { Var } ( X )\) Sarah and Rebecca play a game.
    A computer selects a single value of \(X\) using the probability distribution above.
    Sarah's score is given by the random variable \(S = X\) and Rebecca's score is given by the random variable \(R = \frac { 1 } { X }\)
  5. Find \(\mathrm { E } ( R )\) Sarah and Rebecca work out their scores and the person with the higher score is the winner. If the scores are the same, the game is a draw.
  6. Find the probability that
    1. Sarah is the winner,
    2. Rebecca is the winner.
Question 3
View details
3. Before going on holiday to Seapron, Tania records the weekly rainfall ( \(x \mathrm {~mm}\) ) at Seapron for 8 weeks during the summer. Her results are summarised as $$\sum x = 86.8 \quad \sum x ^ { 2 } = 985.88$$
  1. Find the standard deviation, \(\sigma _ { x }\), for these data.
    (3) Tania also records the number of hours of sunshine ( \(y\) hours) per week at Seapron for these 8 weeks and obtains the following $$\bar { y } = 58 \quad \sigma _ { y } = 9.461 \text { (correct to } 4 \text { significant figures) } \quad \sum x y = 4900.5$$
  2. Show that \(\mathrm { S } _ { y y } = 716\) (correct to 3 significant figures)
  3. Find \(\mathrm { S } _ { x y }\)
  4. Calculate the product moment correlation coefficient, \(r\), for these data. During Tania's week-long holiday at Seapron there are 14 mm of rain and 70 hours of sunshine.
  5. State, giving a reason, what the effect of adding this information to the above data would be on the value of the product moment correlation coefficient.
Question 4
View details
4. The Venn diagram shows the probabilities of customer bookings at Harry’s hotel.
\(R\) is the event that a customer books a room
\(B\) is the event that a customer books breakfast
\(D\) is the event that a customer books dinner
\(u\) and \(t\) are probabilities.
\includegraphics[max width=\textwidth, alt={}, center]{e3b92a5b-c0ad-4176-9b05-cb07a44aa265-08_604_1047_696_450}
  1. Write down the probability that a customer books breakfast but does not book a room. Given that the events \(B\) and \(D\) are independent
  2. find the value of \(t\)
  3. hence find the value of \(u\)
  4. Find
    1. \(\quad\) P( \(D \mid R \cap B\) )
    2. \(\mathrm { P } \left( D \mid R \cap B ^ { \prime } \right)\) A coach load of 77 customers arrive at Harry’s hotel. Of these 77 customers 40 have booked a room and breakfast 37 have booked a room without breakfast
  5. Estimate how many of these 77 customers will book dinner.
Question 5
View details
5. A midwife records the weights, in kg , of a sample of 50 babies born at a hospital. Her results are given in the table below.
Weight ( \(\boldsymbol { w } \mathbf { ~ k g }\) )Frequency (f)Weight midpoint (x)
\(0 \leqslant w < 2\)11
\(2 \leqslant w < 3\)82.5
\(3 \leqslant w < 3.5\)173.25
\(3.5 \leqslant w < 4\)173.75
\(4 \leqslant w < 5\)74.5
[You may use \(\sum \mathrm { f } x ^ { 2 } = 611.375\) ] A histogram has been drawn to represent these data. The bar representing the weight \(2 \leqslant w < 3\) has a width of 1 cm and a height of 4 cm .
  1. Calculate the width and height of the bar representing a weight of \(3 \leqslant w < 3.5\)
  2. Use linear interpolation to estimate the median weight of these babies.
    1. Show that an estimate of the mean weight of these babies is 3.43 kg .
    2. Find an estimate of the standard deviation of the weights of these babies. Shyam decides to model the weights of babies born at the hospital, by the random variable \(W\), where \(W \sim \mathrm {~N} \left( 3.43,0.65 ^ { 2 } \right)\)
  3. Find \(\mathrm { P } ( W < 3 )\)
  4. With reference to your answers to (b), (c)(i) and (d) comment on Shyam's decision. A newborn baby weighing 3.43 kg is born at the hospital.
  5. Without carrying out any further calculations, state, giving a reason, what effect the addition of this newborn baby to the sample would have on your estimate of the
    1. mean,
    2. standard deviation.
Question 6
View details
6. The time, in minutes, taken by men to run a marathon is modelled by a normal distribution with mean 240 minutes and standard deviation 40 minutes.
  1. Find the proportion of men that take longer than 300 minutes to run a marathon.
    (3) Nathaniel is preparing to run a marathon. He aims to finish in the first 20\% of male runners.
  2. Using the above model estimate the longest time that Nathaniel can take to run the marathon and achieve his aim.
    (3) The time, \(W\) minutes, taken by women to run a marathon is modelled by a normal distribution with mean \(\mu\) minutes. Given that \(\mathrm { P } ( W < \mu + 30 ) = 0.82\)
  3. find \(\mathrm { P } ( W < \mu - 30 \mid W < \mu )\)