Edexcel S1 (Statistics 1) 2020 June

Question 1
View details
  1. The discrete random variable \(X\) takes the values \(- 1,2,3,4\) and 7 only.
Given that $$\mathrm { P } ( X = x ) = \frac { 8 - x } { k } \text { for } x = - 1,2,3,4 \text { and } 7$$ find the value of \(\mathrm { E } ( X )\)
Question 2
View details
  1. In a school canteen, students can choose from a main course of meat ( \(M\) ), fish ( \(F\) ) or vegetarian ( \(V\) ). They can then choose a drink of either water ( \(W\) ) or juice ( \(J\) ).
The partially completed tree diagram, where \(p\) and \(q\) are probabilities, shows the probabilities of these choices for a randomly selected student. \section*{Drink} \begin{figure}[h]
\captionsetup{labelformat=empty} \caption{Main course} \includegraphics[alt={},max width=\textwidth]{81d5e460-9559-4d25-aa08-6440559aec83-04_783_1013_593_463}
\end{figure}
  1. Complete the tree diagram, giving your answers in terms of \(p\) and \(q\) where appropriate.
  2. Find an expression, in terms of \(p\) and \(q\), for the probability that a randomly selected student chooses water to drink. The events "choosing a vegetarian main course" and "choosing water to drink" are independent.
  3. Find a linear equation in terms of \(p\) and \(q\). A student who has chosen juice to drink is selected at random. The probability that they chose fish for their main course is \(\frac { 7 } { 30 }\)
  4. Find the value of \(p\) and the value of \(q\). The canteen manager claims that students who choose water to drink are most likely to choose a fish main course.
  5. State, showing your working clearly, whether or not the manager's claim is correct.
Question 3
View details
3. The distance achieved in a long jump competition by students at a school is normally Students who achieve a distance greater than 4.3 metres receive a medal.
  1. Find the proportion of students who receive medals. The school wishes to give a certificate of achievement or a medal to the \(80 \%\) of students who achieve a distance of at least \(d\) metres.
  2. Find the value of \(d\). Of those who received medals, the \(\frac { 1 } { 3 }\) who jump the furthest will receive gold medals.
  3. Find the shortest distance, \(g\) metres, that must be achieved to receive a gold medal. A journalist from the local newspaper interviews a randomly selected group of 3 medal winners.
  4. Find the exact probability that there is at least one gold medal winner in the group.
    \includegraphics[max width=\textwidth, alt={}, center]{81d5e460-9559-4d25-aa08-6440559aec83-08_79_1153_233_251} Students who achieve a distance greater than 4.3 metres receive a medal.
  5. Find the proportion of students who receive medals.
    VIXV SIHIANI III IM IONOOVIAV SIHI NI JYHAM ION OOVI4V SIHI NI JLIYM ION OO
Question 4
View details
4. A group of students took some tests. A teacher is analysing the average mark for each student. Each student obtained a different average mark. For these average marks, the lower quartile is 24 , the median is 30 and the interquartile range (IQR) is 10
The three lowest average marks are 8, 10 and 15.5 and the three highest average marks are 45, 52.5 and 56 The teacher defines an outlier to be a value that is either
more than \(1.5 \times\) IQR below the lower quartile or
more than \(1.5 \times\) IQR above the upper quartile
  1. Determine any outliers in these data.
  2. On the grid below draw a box plot for these data, indicating clearly any outliers.
    \includegraphics[max width=\textwidth, alt={}, center]{81d5e460-9559-4d25-aa08-6440559aec83-12_350_1223_1128_370}
  3. Use the quartiles to describe the skewness of these data. Give a reason for your answer. Two more students also took the tests. Their average marks, which were both less than 45, are added to the data and the box plot redrawn. The median and the upper quartile are the same but the lower quartile is now 26
  4. Redraw the box plot on the grid below.
    (3)
    \includegraphics[max width=\textwidth, alt={}, center]{81d5e460-9559-4d25-aa08-6440559aec83-12_350_1221_2106_367}
  5. Give ranges of values within which each of these students' average marks must lie. Turn over for spare grids if you need to redraw your answer for part (b) or part (d).
    VIXV SIHIANI III IM IONOOVIAV SIHI NI JYHAM ION OOVI4V SIHI NI JLIYM ION OO
    \begin{figure}[h]
    \captionsetup{labelformat=empty} \caption{Copy of grid for part (b)} \includegraphics[alt={},max width=\textwidth]{81d5e460-9559-4d25-aa08-6440559aec83-15_356_1226_1726_367}
    \end{figure} \begin{figure}[h]
    \captionsetup{labelformat=empty} \caption{Copy of grid for part (d)} \includegraphics[alt={},max width=\textwidth]{81d5e460-9559-4d25-aa08-6440559aec83-15_353_1226_2240_367}
    \end{figure}
Question 5
View details
  1. A large company rents shops in different parts of the country. A random sample of 10 shops was taken and the floor area, \(x\) in \(10 \mathrm {~m} ^ { 2 }\), and the annual rent, \(y\) in thousands of dollars, were recorded.
    The data are summarised by the following statistics
$$\sum x = 900 \quad \sum x ^ { 2 } = 84818 \quad \sum y = 183 \quad \sum y ^ { 2 } = 3434$$ and the regression line of \(y\) on \(x\) has equation \(y = 6.066 + 0.136 x\)
  1. Use the regression line to estimate the annual rent in dollars for a shop with a floor area of \(800 \mathrm {~m} ^ { 2 }\)
  2. Find \(\mathrm { S } _ { y y }\) and \(\mathrm { S } _ { x x }\)
  3. Find the product moment correlation coefficient between \(y\) and \(x\). An 11th shop is added to the sample. The floor area is \(900 \mathrm {~m} ^ { 2 }\) and the annual rent is 15000 dollars.
  4. Use the formula \(\mathrm { S } _ { x y } = \sum ( x - \bar { x } ) ( y - \bar { y } )\) to show that the value of \(\mathrm { S } _ { x y }\) for the 11 shops will be the same as it was for the original 10 shops.
  5. Find the new equation of the regression line of \(y\) on \(x\) for the 11 shops. The company is considering renting a larger shop with area of \(3000 \mathrm {~m} ^ { 2 }\)
  6. Comment on the suitability of using the new regression line to estimate the annual rent. Give a reason for your answer.
Question 6
View details
6. The random variable \(A\) represents the score when a spinner is spun. The probability distribution for \(A\) is given in the following table.
\(a\)1457
\(\mathrm { P } ( A = a )\)0.400.200.250.15
  1. Show that \(\mathrm { E } ( A ) = 3.5\)
  2. Find \(\operatorname { Var } ( A )\) The random variable \(B\) represents the score on a 4 -sided die. The probability distribution for \(B\) is given in the following table where \(k\) is a positive integer.
    \(b\)134\(k\)
    \(\mathrm { P } ( B = b )\)0.250.250.250.25
  3. Write down the name of the probability distribution of \(B\).
  4. Given that \(\mathrm { E } ( B ) = \mathrm { E } ( A )\) state, giving a reason, the value of \(k\). The random variable \(X \sim \mathrm {~N} \left( \mu , \sigma ^ { 2 } \right)\) Sam and Tim are playing a game with the spinner and the die. They each spin the spinner once to obtain their value of \(A\) and each roll the die once to obtain their value of \(B\).
    Their value of \(A\) is taken as their value of \(\mu\) and their value of \(B\) is taken as their value of \(\sigma\). The person with the larger value of \(\mathrm { P } ( X > 3.5 )\) is the winner.
  5. Given that Sam obtained values of \(a = 4\) and \(b = 3\) and Tim obtained \(b = 4\) find, giving a reason, the probability that Tim wins.
  6. Find the largest value of \(\mathrm { P } ( X > 3.5 )\) achievable in this game.
  7. Find the probability of achieving this value. \includegraphics[max width=\textwidth, alt={}, center]{81d5e460-9559-4d25-aa08-6440559aec83-21_2255_50_314_34}