Edexcel S1 (Statistics 1) 2016 January

Question 1
View details
  1. The discrete random variable \(X\) has the probability distribution given in the table below.
\(x\)- 21346
\(\mathrm { P } ( X = x )\)\(\frac { 1 } { 4 }\)\(\frac { 1 } { 6 }\)\(\frac { 1 } { 3 }\)\(\frac { 1 } { 12 }\)\(\frac { 1 } { 6 }\)
  1. Write down the value of \(\mathrm { F } ( 5 )\)
  2. Find \(\mathrm { E } ( X )\)
  3. Find \(\operatorname { Var } ( X )\) The random variable \(Y = 7 - 2 X\)
  4. Find
    1. \(\mathrm { E } ( Y )\)
    2. \(\operatorname { Var } ( Y )\)
    3. \(\mathrm { P } ( Y > X )\) \includegraphics[max width=\textwidth, alt={}, center]{70137e9a-0a6b-48b5-8dd4-c436cb063351-03_2261_47_313_37}
Question 2
View details
2. \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{70137e9a-0a6b-48b5-8dd4-c436cb063351-04_284_1244_260_388} \captionsetup{labelformat=empty} \caption{Figure 1}
\end{figure} Figure 1 shows part of a box and whisker plot for the marks in an examination with a large number of candidates. Part of the lower whisker has been torn off.
  1. Given that \(75 \%\) of the candidates passed the examination, state the lowest mark for the award of a pass.
  2. Given that the top \(25 \%\) of the candidates achieved a merit grade, state the lowest mark for the award of a merit grade. An outlier is defined as any value greater than \(c\) or any value less than \(d\) where $$\begin{aligned} & c = Q _ { 3 } + 1.5 \left( Q _ { 3 } - Q _ { 1 } \right)
    & d = Q _ { 1 } - 1.5 \left( Q _ { 3 } - Q _ { 1 } \right) \end{aligned}$$
  3. Find the value of \(c\) and the value of \(d\).
  4. Write down the 3 highest marks scored in the examination. The 3 lowest marks in the examination were 5, 10 and 15
  5. On the diagram on page 7, complete the box and whisker plot. Three candidates are selected at random from those who took this examination.
  6. Find the probability that all 3 of these candidates passed the examination but only 2 achieved a merit grade.
    \includegraphics[max width=\textwidth, alt={}, center]{70137e9a-0a6b-48b5-8dd4-c436cb063351-05_285_1628_2343_166} Turn over for a spare diagram if you need to redraw your plot.
Question 3
View details
3. A publisher collects information about the amount spent on advertising, \(\pounds x\), and the sales, \(y\) books, for some of her publications. She collects information for a random sample of 8 textbooks and codes the data using \(v = \frac { x + 50 } { 200 }\) and \(s = \frac { y } { 1000 }\) to give
\(v\)0.608.104.300.401.606.402.505.10
\(s\)1.846.735.951.302.457.464.826.25
[You may use: \(\sum v = 29 \sum s = 36.8 \sum s ^ { 2 } = 209.72 \sum v s = 177.311 \quad \mathrm {~S} _ { v v } = 55.275\) ]
  1. Find \(\mathrm { S } _ { v s }\) and \(\mathrm { S } _ { s s }\)
  2. Calculate the product moment correlation coefficient for these data. The publisher believes that a linear regression model may be appropriate to describe these data.
  3. State, giving a reason, whether or not your answer to part (b) supports the publisher's belief.
  4. Find the equation of the regression line of \(s\) on \(v\), giving your answer in the form \(s = a + b v\)
  5. Hence find the equation of the regression line of \(y\) on \(x\) for the sample of textbooks, giving your answer in the form \(y = c + d x\) The publisher calculated the regression line for a sample of novels and obtained the equation $$y = 3100 + 1.2 x$$ She wants to increase the sales of books by spending more money on advertising.
  6. State, giving your reasons, whether the publisher should spend more money on advertising textbooks or novels.
Question 4
View details
4. A training agency awards a certificate to each student who passes a test while completing a course.
Students failing the test will attempt the test again up to 3 more times, and, if they pass the test, will be awarded a certificate.
The probability of passing the test at the first attempt is 0.7 , but the probability of passing reduces by 0.2 at each attempt.
  1. Complete the tree diagram below to show this information.
    \includegraphics[max width=\textwidth, alt={}, center]{70137e9a-0a6b-48b5-8dd4-c436cb063351-08_545_1244_639_340} A student who completed the course is selected at random.
  2. Find the probability that the student was awarded a certificate.
  3. Given that the student was awarded a certificate, find the probability that the student passed on the first or second attempt. The training agency decides to alter the test taken by the students while completing the course, but will not allow more than 2 attempts. The agency requires the probability of passing the test at the first attempt to be \(p\), and the probability of passing the test at the second attempt to be ( \(p - 0.2\) ). The percentage of students who complete the course and are awarded a certificate is to be \(95 \%\)
  4. Show that \(p\) satisfies the equation $$p ^ { 2 } - 2.2 p + 1.15 = 0$$
  5. Hence find the value of \(p\), giving your answer to 3 decimal places.
    \includegraphics[max width=\textwidth, alt={}, center]{70137e9a-0a6b-48b5-8dd4-c436cb063351-09_2261_47_313_37}
Question 5
View details
5. Rosie keeps bees. The amount of honey, in kg, produced by a hive of Rosie's bees in a season, is modelled by a normal distribution with a mean of 22 kg and a standard deviation of 10 kg .
  1. Find the probability that a hive of Rosie's bees produces less than 18 kg of honey in a season. The local bee keepers’ club awards a certificate to every hive that produces more than 39 kg of honey in a season, and a medal to every hive that produces more than 50 kg in a season. Given that one of Rosie's bee hives is awarded a certificate
  2. find the probability that this hive is also awarded a medal.
    (5) Sam also keeps bees. The amount of honey, in kg, produced by a hive of Sam's bees in a season, is modelled by a normal distribution with mean \(\mu \mathrm { kg }\) and standard deviation \(\sigma \mathrm { kg }\). The probability that a hive of Sam’s bees produces less than 28 kg of honey in a season is 0.8413 Only 20\% of Sam's bee hives produce less than 18 kg of honey in a season.
  3. Find the value of \(\mu\) and the value of \(\sigma\). Give your answers to 2 decimal places.
    (6)
    \includegraphics[max width=\textwidth, alt={}, center]{70137e9a-0a6b-48b5-8dd4-c436cb063351-11_2261_47_313_37}
Question 6
View details
6. Yujie is investigating the weights of 10 young rabbits. She records the weight, \(x\) grams, of each rabbit and the results are summarised below. $$\sum x = 8360 \quad \text { and } \quad \sum ( x - \bar { x } ) ^ { 2 } = 63840$$
  1. Calculate the mean and the standard deviation of the weights of these rabbits. Given that the median weight of these rabbits is 815 grams,
  2. describe, giving a reason, the skewness of these data. Two more rabbits weighing 776 grams and 896 grams are added to make a group of 12 rabbits.
  3. State, giving a reason, how the inclusion of these two rabbits would affect the mean.
  4. By considering the change in \(\sum ( x - \bar { x } ) ^ { 2 }\), state what effect the inclusion of these two rabbits would have on the standard deviation.
    END