Edexcel S1 (Statistics 1) Specimen

Question 1
View details
  1. Gary compared the total attendance, \(x\), at home matches and the total number of goals, \(y\), scored at home during a season for each of 12 football teams playing in a league. He correctly calculated:
$$S _ { x x } = 1022500 \quad S _ { y y } = 130.9 \quad S _ { x y } = 8825$$
  1. Calculate the product moment correlation coefficient for these data.
  2. Interpret the value of the correlation coefficient. Helen was given the same data to analyse. In view of the large numbers involved she decided to divide the attendance figures by 100 . She then calculated the product moment correlation coefficient between \(\frac { x } { 100 }\) and \(y\).
  3. Write down the value Helen should have obtained.
Question 2
View details
2. An experiment consists of selecting a ball from a bag and spinning a coin. The bag contains 5 red balls and 7 blue balls. A ball is selected at random from the bag, its colour is noted and then the ball is returned to the bag. When a red ball is selected, a biased coin with probability \(\frac { 2 } { 3 }\) of landing heads is spun.
When a blue ball is selected a fair coin is spun.
  1. Complete the tree diagram below to show the possible outcomes and associated probabilities.
    \includegraphics[max width=\textwidth, alt={}, center]{61983561-79f7-4883-8ae7-ab1f4955d444-04_785_385_744_568}
    \includegraphics[max width=\textwidth, alt={}, center]{61983561-79f7-4883-8ae7-ab1f4955d444-04_1054_483_760_954} Shivani selects a ball and spins the appropriate coin.
  2. Find the probability that she obtains a head. Given that Tom selected a ball at random and obtained a head when he spun the appropriate coin,
  3. find the probability that Tom selected a red ball. Shivani and Tom each repeat this experiment.
  4. Find the probability that the colour of the ball Shivani selects is the same as the colour of the ball Tom selects.
Question 3
View details
  1. The discrete random variable \(X\) has probability distribution given by
\(x\)- 10123
\(\mathrm { P } ( X = x )\)\(\frac { 1 } { 5 }\)\(a\)\(\frac { 1 } { 10 }\)\(a\)\(\frac { 1 } { 5 }\)
where \(a\) is a constant.
  1. Find the value of \(a\).
  2. Write down \(\mathrm { E } ( X )\).
  3. Find \(\operatorname { Var } ( X )\). The random variable \(Y = 6 - 2 X\)
  4. Find \(\operatorname { Var } ( Y )\).
  5. Calculate \(\mathrm { P } ( X \geqslant Y )\).
Question 4
View details
  1. The Venn diagram in Figure 1 shows the number of students in a class who read any of 3 popular magazines \(A , B\) and \(C\).
\begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{61983561-79f7-4883-8ae7-ab1f4955d444-12_396_912_411_523} \captionsetup{labelformat=empty} \caption{Figure 1}
\end{figure} One of these students is selected at random.
  1. Show that the probability that the student reads more than one magazine is \(\frac { 1 } { 6 }\).
  2. Find the probability that the student reads \(A\) or \(B\) (or both).
  3. Write down the probability that the student reads both \(A\) and \(C\). Given that the student reads at least one of the magazines,
  4. find the probability that the student reads \(C\).
  5. Determine whether or not reading magazine \(B\) and reading magazine \(C\) are statistically independent.
Question 5
View details
  1. A teacher selects a random sample of 56 students and records, to the nearest hour, the time spent watching television in a particular week.
Hours\(1 - 10\)\(11 - 20\)\(21 - 25\)\(26 - 30\)\(31 - 40\)\(41 - 59\)
Frequency615111383
Mid-point5.515.52850
  1. Find the mid-points of the 21-25 hour and 31-40 hour groups. A histogram was drawn to represent these data. The 11-20 group was represented by a bar of width 4 cm and height 6 cm .
  2. Find the width and height of the 26-30 group.
  3. Estimate the mean and standard deviation of the time spent watching television by these students.
  4. Use linear interpolation to estimate the median length of time spent watching television by these students. The teacher estimated the lower quartile and the upper quartile of the time spent watching television to be 15.8 and 29.3 respectively.
  5. State, giving a reason, the skewness of these data.
Question 6
View details
  1. A travel agent sells flights to different destinations from Beerow airport. The distance \(d\), measured in 100 km , of the destination from the airport and the fare \(\pounds f\) are recorded for a random sample of 6 destinations.
Destination\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)
\(d\)2.24.06.02.58.05.0
\(f\)182025233228
$$\text { [You may use } \sum d ^ { 2 } = 152.09 \quad \sum f ^ { 2 } = 3686 \quad \sum f d = 723.1 \text { ] }$$
  1. Using the axes below, complete a scatter diagram to illustrate this information.
  2. Explain why a linear regression model may be appropriate to describe the relationship between \(f\) and \(d\).
  3. Calculate \(S _ { d d }\) and \(S _ { f d }\)
  4. Calculate the equation of the regression line of \(f\) on \(d\) giving your answer in the form \(f = a + b d\).
  5. Give an interpretation of the value of \(b\). Jane is planning her holiday and wishes to fly from Beerow airport to a destination \(t \mathrm {~km}\) away. A rival travel agent charges 5 p per km.
  6. Find the range of values of \(t\) for which the first travel agent is cheaper than the rival.
    \includegraphics[max width=\textwidth, alt={}, center]{61983561-79f7-4883-8ae7-ab1f4955d444-20_967_1630_1722_164}
Question 7
View details
  1. The distances travelled to work, \(D \mathrm {~km}\), by the employees at a large company are normally distributed with \(D \sim \mathrm {~N} \left( 30,8 ^ { 2 } \right)\).
    1. Find the probability that a randomly selected employee has a journey to work of more than 20 km .
    2. Find the upper quartile, \(Q _ { 3 }\), of \(D\).
    3. Write down the lower quartile, \(Q _ { 1 }\), of \(D\).
    An outlier is defined as any value of \(D\) such that \(D < h\) or \(D > k\) where $$h = Q _ { 1 } - 1.5 \times \left( Q _ { 3 } - Q _ { 1 } \right) \quad \text { and } \quad k = Q _ { 3 } + 1.5 \times \left( Q _ { 3 } - Q _ { 1 } \right)$$
  2. Find the value of \(h\) and the value of \(k\). An employee is selected at random.
  3. Find the probability that the distance travelled to work by this employee is an outlier.
    END