Edexcel S1 (Statistics 1) 2023 June

Mark scheme PDF ↗

Question 1 8 marks
View details
The histogram shows the distances, in km, that 274 people travel to work. \includegraphics{figure_1} Given that 60 of these people travel between 10km and 20km to work, estimate
  1. the number of people who travel between 22km and 45km to work, [3]
  2. the median distance travelled to work by these 274 people, [2]
  3. the mean distance travelled to work by these 274 people. [3]
Question 2 13 marks
View details
Two students, Olive and Shan, collect data on the weight, \(w\) grams, and the tail length, \(t\) cm, of 15 mice. Olive summarised the data as follows \(S_tt = 5.3173\) \quad \(\sum w^2 = 6089.12\) \quad \(\sum tw = 2304.53\) \quad \(\sum w = 297.8\) \quad \(\sum t = 114.8\)
  1. Calculate the value of \(S_{ww}\) and the value of \(S_{tw}\) [3]
  2. Calculate the value of the product moment correlation coefficient between \(w\) and \(t\) [2]
  3. Show that the equation of the regression line of \(w\) on \(t\) can be written as $$w = -16.7 + 4.77t$$ [3]
  4. Give an interpretation of the gradient of the regression line. [1]
  5. Explain why it would not be appropriate to use the regression line in part (c) to estimate the weight of a mouse with a tail length of 2cm. [2]
Shan decided to code the data using \(x = t - 6\) and \(y = \frac{w}{2} - 5\)
  1. Write down the value of the product moment correlation coefficient between \(x\) and \(y\) [1]
  2. Write down an equation of the regression line of \(y\) on \(x\) You do not need to simplify your equation. [1]
Question 3 9 marks
View details
Jim records the length, \(l\) mm, of 81 salmon. The data are coded using \(x = l - 600\) and the following summary statistics are obtained. $$n = 81 \quad \sum x = 3711 \quad \sum x^2 = 475181$$
  1. Find the mean length of these salmon. [3]
  2. Find the variance of the lengths of these salmon. [2]
The weight, \(w\) grams, of each of the 81 salmon is recorded to the nearest gram. The recorded results for the 81 salmon are summarised in the box plot below. \includegraphics{figure_2}
  1. Find the maximum number of salmon that have weights in the interval $$4600 < w \leqslant 7700$$ [1]
Raj says that the box plot is incorrect as Jim has not included outliers. For these data an outlier is defined as a value that is more than \(1.5 \times\) IQR above the upper quartile \quad or \quad \(1.5 \times\) IQR below the lower quartile
  1. Show that there are no outliers. [3]
Question 4 9 marks
View details
A bag contains a large number of coloured counters. Each counter is labelled A, B or C 30% of the counters are labelled A 45% of the counters are labelled B The rest of the counters are labelled C It is known that 2% of the counters labelled A are red 4% of the counters labelled B are red 6% of the counters labelled C are red One counter is selected at random from the bag.
  1. Complete the tree diagram on the opposite page to illustrate this information. [2]
  2. Calculate the probability that the counter is labelled A and is not red. [2]
  3. Calculate the probability that the counter is red. [2]
  4. Given that the counter is red, find the probability that it is labelled C [3]
\includegraphics{figure_3}
Question 5 13 marks
View details
A discrete random variable \(Y\) has probability function $$\mathrm{P}(Y = y) = \begin{cases} k(3 - y) & y = 1, 2 \\ k(y^2 - 8) & y = 3, 4, 5 \\ k & y = 6 \\ 0 & \text{otherwise} \end{cases}$$ where \(k\) is a constant.
  1. Show that \(k = \frac{1}{30}\) [2]
Find the exact value of
  1. P\((1 < Y \leqslant 4)\) [2]
  2. E\((Y)\) [2]
The random variable \(X = 15 - 2Y\)
  1. Calculate P\((Y \geqslant X)\) [3]
  2. Calculate Var\((X)\) [4]
Question 6 9 marks
View details
Three events \(A\), \(B\) and \(C\) are such that $$\mathrm{P}(A) = 0.1 \quad \mathrm{P}(B|A) = 0.3 \quad \mathrm{P}(A \cup B) = 0.25 \quad \mathrm{P}(C) = 0.5$$ Given that \(A\) and \(C\) are mutually exclusive
  1. find P\((A \cup C)\) [1]
  2. Show that P\((B) = 0.18\) [3]
Given also that \(B\) and \(C\) are independent,
  1. draw a Venn diagram to represent the events \(A\), \(B\) and \(C\) and the probabilities associated with each region. [5]
Question 7 14 marks
View details
A machine squeezes apples to extract their juice. The volume of juice, \(J\) ml, extracted from 1 kg of apples is modelled by a normal distribution with mean \(\mu\) and standard deviation \(\sigma\) Given that \(\mu = 500\) and \(\sigma = 25\) use standardisation to
    1. show that P\((J > 510) = 0.3446\) [2]
    2. calculate the value of \(d\) such that P\((J > d) = 0.9192\) [3]
Zen randomly selects 5 bags each containing 1 kg of apples and records the volume of juice extracted from each bag of apples.
  1. Calculate the probability that each of the 5 bags of apples produce less than 510ml of juice. [2]
Following adjustments to the machine, the volume of juice, \(R\) ml, extracted from 1 kg of apples is such that \(\mu = 520\) and \(\sigma = k\) Given that P\((R < r) = 0.15\) and P\((R > 3r - 800) = 0.005\)
  1. find the value of \(r\) and the value of \(k\) [7]