Edexcel S1 (Statistics 1) 2017 June

Question 1
View details
  1. Nina weighed a random sample of 50 carrots from her shop and recorded the weight, in grams to the nearest gram, for each carrot. The results are summarised below.
Weight of carrotFrequency (f)Weight midpoint \(( \boldsymbol { x }\) grams \()\)
\(45 - 54\)549.5
\(55 - 59\)1057
\(60 - 64\)2262
\(65 - 74\)1369.5
$$\text { (You may use } \sum \mathrm { f } x ^ { 2 } = 192102.5 \text { ) }$$
  1. Use linear interpolation to estimate the median weight of these carrots.
  2. Find an estimate for the mean weight of these carrots.
  3. Find an estimate for the standard deviation of the weights of these carrots. A carrot is selected at random from Nina's shop.
  4. Estimate the probability that the weight of this carrot is more than 70 grams.
Question 2
View details
2. The box plot shows the times, \(t\) minutes, it takes a group of office workers to travel to work.
\includegraphics[max width=\textwidth, alt={}, center]{7d45bacd-20ac-49b4-8f3f-613edf3739f9-04_365_1237_351_356}
  1. Find the range of the times.
  2. Find the interquartile range of the times.
  3. Using the quartiles, describe the skewness of these data. Give a reason for your answer. Chetna believes that house prices will be higher if the time to travel to work is shorter. She asks a random sample of these office workers for their house prices \(\pounds x\), where \(x\) is measured in thousands, and obtains the following statistics $$\mathrm { S } _ { x x } = 5514 \quad \mathrm {~S} _ { x t } = 10 \quad \mathrm {~S} _ { t t } = 1145.6$$
  4. Calculate the product moment correlation coefficient between \(x\) and \(t\).
  5. State, giving a reason, whether or not your correlation coefficient supports Chetna's belief. Adam and Betty are part of the group of office workers and they have both moved house. Adam's time to travel to work changes from 32 minutes to 36 minutes. Betty's time to travel to work changes from 38 minutes to 58 minutes. Outliers are defined as values that are more than 1.5 times the interquartile range above the upper quartile.
  6. Showing all necessary calculations, determine how the box plot of times to travel to work will change and draw a new box plot on the grid on page 5. \includegraphics[max width=\textwidth, alt={}, center]{7d45bacd-20ac-49b4-8f3f-613edf3739f9-05_499_1413_2122_180}
Question 3
View details
  1. At a school athletics day, the distances, in metres, achieved by students in the long jump are modelled by the normal distribution with mean 3.3 m and standard deviation 0.6 m
    1. Find an estimate for the proportion of students who jump less than 2.5 m
    The long jump competition consists of 2 jumps. All the students can take part in the first jump and the \(40 \%\) who jump the greatest distance in their first jump qualify for the second jump.
  2. Find an estimate for the minimum distance achieved in the first jump in order to qualify for the second jump.
    Give your answer correct to 4 significant figures.
  3. Find an estimate for the median distance achieved in the first jump by those who qualify for the second jump. The distance of the second jump is independent of the distance of the first jump and is modelled with the same normal distribution. Students who jump a distance greater than 4.1 m in their second jump receive a certificate. At the start of the long jump competition, a student is selected at random.
  4. Find the probability that this student will receive a certificate.
Question 4
View details
4.The partially completed tree diagram,where \(p\) and \(q\) are probabilities,gives information about Andrew's journey to work each day.
\includegraphics[max width=\textwidth, alt={}, center]{7d45bacd-20ac-49b4-8f3f-613edf3739f9-12_661_794_395_511}
\(R\) represents the event that it is raining
W represents the event that Andrew walks to work
\(B\) represents the event that Andrew takes the bus to work
\(C\) represents the event that Andrew cycles to work Given that \(\mathrm { P } ( B ) = 0.26\)
(a)find the value of \(p\) Given also that \(\mathrm { P } \left( R ^ { \prime } \mid W \right) = 0.175\)
(b)find the value of \(q\)
(c)Find the probability that Andrew cycles to work. Given that Andrew did not cycle to work on Friday,
(d)find the probability that it was raining on Friday.
Question 5
View details
  1. Tomas is studying the relationship between temperature and hours of sunshine in Seapron. He records the midday temperature, \(t ^ { \circ } \mathrm { C }\), and the hours of sunshine, \(s\) hours, for a random sample of 9 days in October. He calculated the following statistics
$$\sum s = 15 \quad \sum s ^ { 2 } = 44.22 \quad \sum t = 127 \quad \mathrm {~S} _ { t t } = 10.89$$
  1. Calculate \(\mathrm { S } _ { s s }\) Tomas calculated the product moment correlation coefficient between \(s\) and \(t\) to be 0.832 correct to 3 decimal places.
  2. State, giving a reason, whether or not this correlation coefficient supports the use of a linear regression model to describe the relationship between midday temperature and hours of sunshine.
  3. State, giving a reason, why the hours of sunshine would be the explanatory variable in a linear regression model between midday temperature and hours of sunshine.
  4. Find \(\mathrm { S } _ { s t }\)
  5. Calculate a suitable linear regression equation to model the relationship between midday temperature and hours of sunshine.
  6. Calculate the standard deviation of \(s\) Tomas uses this model to estimate the midday temperature in Seapron for a day in October with 5 hours of sunshine.
  7. State the value of Tomas' estimate. Given that the values of \(s\) are all within 2 standard deviations of the mean,
  8. comment, giving your reason, on the reliability of this estimate.
Question 6
View details
  1. A biased coin has probability 0.4 of showing a head. In an experiment, the coin is spun until a head appears. If a head has not appeared after 4 spins, the coin is not spun again. The random variable \(X\) represents the number of times the coin is spun.
For example, \(X = 3\) if the first two spins do not show a head but the third spin does show a head. The coin would not then be spun a fourth time since the coin has already shown a head.
  1. Show that \(\mathrm { P } ( X = 3 ) = 0.144\) The table gives some values for the probability distribution of \(X\)
    \(x\)1234
    \(\mathrm { P } ( X = x )\)0.240.144
    1. Write down the value of \(\mathrm { P } ( X = 1 )\)
    2. Find \(\mathrm { P } ( X = 4 )\)
  2. Find \(\mathrm { E } ( X )\)
  3. Find \(\operatorname { Var } ( X )\) The random variable \(H\) represents the number of heads obtained when the coin is spun in the experiment.
  4. Explain why \(H\) can only take the values 0 and 1 and find the probability distribution of \(H\).
  5. Write down the value of
    1. \(\mathrm { P } ( \{ X = 3 \} \cap \{ H = 0 \} )\)
    2. \(\mathrm { P } ( \{ X = 4 \} \cap \{ H = 0 \} )\) The random variable \(S = X + H\)
  6. Find the probability distribution of \(S\)