Edexcel S1 (Statistics 1) 2013 June

Question 1
View details
  1. Sammy is studying the number of units of gas, \(g\), and the number of units of electricity, \(e\), used in her house each week. A random sample of 10 weeks use was recorded and the data for each week were coded so that \(x = \frac { g - 60 } { 4 }\) and \(y = \frac { e } { 10 }\). The results for the coded data are summarised below
$$\sum x = 48.0 \quad \sum y = 58.0 \quad \mathrm {~S} _ { x x } = 312.1 \quad \mathrm {~S} _ { y y } = 2.10 \quad \mathrm {~S} _ { x y } = 18.35$$
  1. Find the equation of the regression line of \(y\) on \(x\) in the form \(y = a + b x\). Give the values of \(a\) and \(b\) correct to 3 significant figures.
  2. Hence find the equation of the regression line of \(e\) on \(g\) in the form \(e = c + d g\). Give the values of \(c\) and \(d\) correct to 2 significant figures.
  3. Use your regression equation to estimate the number of units of electricity used in a week when 100 units of gas were used.
    (a)Find the probability distribution of \(X\) .
    (b)Write down the value of \(\mathrm { F } ( 1.8 )\) .
    (a)Find the probability distribution of \(X\) .勤
Question 2
View details
2.The discrete random variable \(X\) takes the values 1,2 and 3 and has cum
function \(\mathrm { F } ( x )\) given by
\(x\)123
\(\mathrm {~F} ( x )\)0.40.651
\includegraphics[max width=\textwidth, alt={}, center]{4cf4f2d7-d912-4b65-a666-caa37009661a-04_24_37_182_2010}
Question 3
View details
3. An agriculturalist is studying the yields, \(y \mathrm {~kg}\), from tomato plants. The data from a random sample of 70 tomato plants are summarised below.
Yield ( \(y \mathrm {~kg}\) )Frequency (f)Yield midpoint ( \(x \mathrm {~kg}\) )
\(0 \leqslant y < 5\)162.5
\(5 \leqslant y < 10\)247.5
\(10 \leqslant y < 15\)1412.5
\(15 \leqslant y < 25\)1220
\(25 \leqslant y < 35\)430
$$\text { (You may use } \sum \mathrm { f } x = 755 \text { and } \sum \mathrm { f } x ^ { 2 } = 12037.5 \text { ) }$$ A histogram has been drawn to represent these data. The bar representing the yield \(5 \leqslant y < 10\) has a width of 1.5 cm and a height of 8 cm .
  1. Calculate the width and the height of the bar representing the yield \(15 \leqslant y < 25\)
  2. Use linear interpolation to estimate the median yield of the tomato plants.
  3. Estimate the mean and the standard deviation of the yields of the tomato plants.
  4. Describe, giving a reason, the skewness of the data.
  5. Estimate the number of tomato plants in the sample that have a yield of more than 1 standard deviation above the mean.
Question 4
View details
  1. The time, in minutes, taken to fly from London to Malaga has a normal distribution with mean 150 minutes and standard deviation 10 minutes.
    1. Find the probability that the next flight from London to Malaga takes less than 145 minutes.
    The time taken to fly from London to Berlin has a normal distribution with mean 100 minutes and standard deviation \(d\) minutes. Given that \(15 \%\) of the flights from London to Berlin take longer than 115 minutes,
  2. find the value of the standard deviation \(d\). The time, \(X\) minutes, taken to fly from London to another city has a normal distribution with mean \(\mu\) minutes. Given that \(\mathrm { P } ( X < \mu - 15 ) = 0.35\)
  3. find \(\mathrm { P } ( X > \mu + 15 \mid X > \mu - 15 )\).
Question 5
View details
5. A researcher believes that parents with a short family name tended to give their children a long first name. A random sample of 10 children was selected and the number of letters in their family name, \(x\), and the number of letters in their first name, \(y\), were recorded. The data are summarised as: $$\sum x = 60 , \quad \sum y = 61 , \quad \sum y ^ { 2 } = 393 , \quad \sum x y = 382 , \quad \mathrm {~S} _ { x x } = 28$$
  1. Find \(\mathrm { S } _ { y y }\) and \(\mathrm { S } _ { x y }\)
  2. Calculate the product moment correlation coefficient, \(r\), between \(x\) and \(y\).
  3. State, giving a reason, whether or not these data support the researcher's belief. The researcher decides to add a child with family name "Turner" to the sample.
  4. Using the definition \(\mathrm { S } _ { x x } = \sum ( x - \bar { x } ) ^ { 2 }\), state the new value of \(\mathrm { S } _ { x x }\) giving a reason for your answer. Given that the addition of the child with family name "Turner" to the sample leads to an increase in \(\mathrm { S } _ { y y }\)
  5. use the definition \(\mathrm { S } _ { x y } = \sum ( x - \bar { x } ) ( y - \bar { y } )\) to determine whether or not the value of \(r\) will increase, decrease or stay the same. Give a reason for your answer.
Question 6
View details
6. \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{4cf4f2d7-d912-4b65-a666-caa37009661a-11_606_1131_210_411} \captionsetup{labelformat=empty} \caption{Figure 1}
\end{figure} The Venn diagram in Figure 1 shows three events \(A , B\) and \(C\) and the probabilities associated with each region of \(B\). The constants \(p , q\) and \(r\) each represent probabilities associated with the three separate regions outside \(B\). The events \(A\) and \(B\) are independent.
  1. Find the value of \(p\). Given that \(\mathrm { P } ( B \mid C ) = \frac { 5 } { 11 }\)
  2. find the value of \(q\) and the value of \(r\).
  3. Find \(\mathrm { P } ( A \cup C \mid B )\).
Question 7
View details
7. The score \(S\) when a spinner is spun has the following probability distribution.
\(s\)01245
\(\mathrm { P } ( S = s )\)0.20.20.10.30.2
  1. Find \(\mathrm { E } ( S )\).
  2. Show that \(\mathrm { E } \left( S ^ { 2 } \right) = 10.4\)
  3. Hence find \(\operatorname { Var } ( S )\).
  4. Find
    1. \(\mathrm { E } ( 5 S - 3 )\),
    2. \(\operatorname { Var } ( 5 S - 3 )\).
  5. Find \(\mathrm { P } ( 5 S - 3 > S + 3 )\) The spinner is spun twice.
    The score from the first spin is \(S _ { 1 }\) and the score from the second spin is \(S _ { 2 }\)
    The random variables \(S _ { 1 }\) and \(S _ { 2 }\) are independent and the random variable \(X = S _ { 1 } \times S _ { 2 }\)
  6. Show that \(\mathrm { P } \left( \left\{ S _ { 1 } = 1 \right\} \cap X < 5 \right) = 0.16\)
  7. Find \(\mathrm { P } ( X < 5 )\).