Edexcel S1 (Statistics 1) 2011 June

Question 1
View details
  1. On a particular day the height above sea level, \(x\) metres, and the mid-day temperature, \(y ^ { \circ } \mathrm { C }\), were recorded in 8 north European towns. These data are summarised below
$$\mathrm { S } _ { x x } = 3535237.5 \quad \sum y = 181 \quad \sum y ^ { 2 } = 4305 \quad \mathrm {~S} _ { x y } = - 23726.25$$
  1. Find \(\mathrm { S } _ { y y }\)
  2. Calculate, to 3 significant figures, the product moment correlation coefficient for these data.
  3. Give an interpretation of your coefficient. A student thought that the calculations would be simpler if the height above sea level, \(h\), was measured in kilometres and used the variable \(h = \frac { x } { 1000 }\) instead of \(x\).
  4. Write down the value of \(\mathrm { S } _ { h h }\)
  5. Write down the value of the correlation coefficient between \(h\) and \(y\).
Question 2
View details
  1. The random variable \(X \sim \mathrm {~N} \left( \mu , 5 ^ { 2 } \right)\) and \(\mathrm { P } ( X < 23 ) = 0.9192\)
    1. Find the value of \(\mu\).
    2. Write down the value of \(\mathrm { P } ( \mu < X < 23 )\).
    3. The discrete random variable \(Y\) has probability distribution
    \(y\)1234
    \(\mathrm { P } ( Y = y )\)\(a\)\(b\)0.3\(c\)
    where \(a , b\) and \(c\) are constants. The cumulative distribution function \(\mathrm { F } ( y )\) of \(Y\) is given in the following table
    \(y\)1234
    \(\mathrm {~F} ( y )\)0.10.5\(d\)1.0
    where \(d\) is a constant.
  2. Find the value of \(a\), the value of \(b\), the value of \(c\) and the value of \(d\).
  3. Find \(\mathrm { P } ( 3 Y + 2 \geqslant 8 )\).
Question 4
View details
4. Past records show that the times, in seconds, taken to run 100 m by children at a school can be modelled by a normal distribution with a mean of 16.12 and a standard deviation of 1.60 A child from the school is selected at random.
  1. Find the probability that this child runs 100 m in less than 15 s . On sports day the school awards certificates to the fastest \(30 \%\) of the children in the 100 m race.
  2. Estimate, to 2 decimal places, the slowest time taken to run 100 m for which a child will be awarded a certificate.
Question 5
View details
5. A class of students had a sudoku competition. The time taken for each student to complete the sudoku was recorded to the nearest minute and the results are summarised in the table below.
TimeMid-point, \(x\)Frequency, f
2-852
9-127
13-15145
16-18178
19-2220.54
23-3026.54
$$\text { (You may use } \sum \mathrm { f } x ^ { 2 } = 8603.75 \text { ) }$$
  1. Write down the mid-point for the 9-12 interval.
  2. Use linear interpolation to estimate the median time taken by the students.
  3. Estimate the mean and standard deviation of the times taken by the students. The teacher suggested that a normal distribution could be used to model the times taken by the students to complete the sudoku.
  4. Give a reason to support the use of a normal distribution in this case. On another occasion the teacher calculated the quartiles for the times taken by the students to complete a different sudoku and found $$Q _ { 1 } = 8.5 \quad Q _ { 2 } = 13.0 \quad Q _ { 3 } = 21.0$$
  5. Describe, giving a reason, the skewness of the times on this occasion.
Question 6
View details
  1. Jake and Kamil are sometimes late for school.
The events \(J\) and \(K\) are defined as follows
\(J =\) the event that Jake is late for school
\(K =\) the event that Kamil is late for school
\(\mathrm { P } ( J ) = 0.25 , \mathrm { P } ( J \cap K ) = 0.15\) and \(\mathrm { P } \left( J ^ { \prime } \cap K ^ { \prime } \right) = 0.7\) On a randomly selected day, find the probability that
  1. at least one of Jake or Kamil are late for school,
  2. Kamil is late for school. Given that Jake is late for school,
  3. find the probability that Kamil is late. The teacher suspects that Jake being late for school and Kamil being late for school are linked in some way.
  4. Determine whether or not \(J\) and \(K\) are statistically independent.
  5. Comment on the teacher's suspicion in the light of your calculation in (d).
Question 7
View details
  1. A teacher took a random sample of 8 children from a class. For each child the teacher recorded the length of their left foot, \(f \mathrm {~cm}\), and their height, \(h \mathrm {~cm}\). The results are given in the table below.
\(f\)2326232227242021
\(h\)135144134136140134130132
(You may use \(\sum f = 186 \quad \sum h = 1085 \quad \mathrm {~S} _ { f f } = 39.5 \quad \mathrm {~S} _ { h h } = 139.875 \quad \sum f h = 25291\) )
  1. Calculate \(\mathrm { S } _ { f h }\)
  2. Find the equation of the regression line of \(h\) on \(f\) in the form \(h = a + b f\). Give the value of \(a\) and the value of \(b\) correct to 3 significant figures.
  3. Use your equation to estimate the height of a child with a left foot length of 25 cm .
  4. Comment on the reliability of your estimate in (c), giving a reason for your answer. The left foot length of the teacher is 25 cm .
  5. Give a reason why the equation in (b) should not be used to estimate the teacher's height.
Question 8
View details
  1. A spinner is designed so that the score \(S\) is given by the following probability distribution.
\(s\)01245
\(\mathrm { P } ( S = s )\)\(p\)0.250.250.200.20
  1. Find the value of \(p\).
  2. Find \(\mathrm { E } ( S )\).
  3. Show that \(\mathrm { E } \left( S ^ { 2 } \right) = 9.45\)
  4. Find \(\operatorname { Var } ( S )\). Tom and Jess play a game with this spinner. The spinner is spun repeatedly and \(S\) counters are awarded on the outcome of each spin. If \(S\) is even then Tom receives the counters and if \(S\) is odd then Jess receives them. The first player to collect 10 or more counters is the winner.
  5. Find the probability that Jess wins after 2 spins.
  6. Find the probability that Tom wins after exactly 3 spins.
  7. Find the probability that Jess wins after exactly 3 spins.