Edexcel S1 (Statistics 1) 2015 June

Question 1
View details
  1. Each of 60 students was asked to draw a \(20 ^ { \circ }\) angle without using a protractor. The size of each angle drawn was measured. The results are summarised in the box plot below.
    \includegraphics[max width=\textwidth, alt={}, center]{9626e3ce-35d6-41b5-a0bd-1185f38b9e36-02_371_1040_340_461}
    1. Find the range for these data.
    2. Find the interquartile range for these data.
    The students were then asked to draw a \(70 ^ { \circ }\) angle.
    The results are summarised in the table below.
    Angle, \(\boldsymbol { a }\), (degrees)Number of students
    \(55 \leqslant a < 60\)6
    \(60 \leqslant a < 65\)15
    \(65 \leqslant a < 70\)13
    \(70 \leqslant a < 75\)11
    \(75 \leqslant a < 80\)8
    \(80 \leqslant a < 85\)7
  2. Use linear interpolation to estimate the size of the median angle drawn. Give your answer to 1 decimal place.
  3. Show that the lower quartile is \(63 ^ { \circ }\) For these data, the upper quartile is \(75 ^ { \circ }\), the minimum is \(55 ^ { \circ }\) and the maximum is \(84 ^ { \circ }\) An outlier is an observation that falls either more than \(1.5 \times\) (interquartile range) above the upper quartile or more than \(1.5 \times\) (interquartile range) below the lower quartile.
    1. Show that there are no outliers for these data.
    2. Draw a box plot for these data on the grid on page 3.
  4. State which angle the students were more accurate at drawing. Give reasons for your answer.
    (3) \includegraphics[max width=\textwidth, alt={}, center]{9626e3ce-35d6-41b5-a0bd-1185f38b9e36-03_378_1059_2067_447}
Question 2
View details
2. An estate agent recorded the price per square metre, \(p \pounds / \mathrm { m } ^ { 2 }\), for 7 two-bedroom houses. He then coded the data using the coding \(q = \frac { p - a } { b }\), where \(a\) and \(b\) are positive constants. His results are shown in the table below.
\(p\)1840184818301824181918341850
\(q\)4.04.83.02.41.93.45.0
  1. Find the value of \(a\) and the value of \(b\) The estate agent also recorded the distance, \(d \mathrm {~km}\), of each house from the nearest train station. The results are summarised below. $$\mathrm { S } _ { d d } = 1.02 \quad \mathrm {~S} _ { q q } = 8.22 \quad \mathrm {~S} _ { d q } = - 2.17$$
  2. Calculate the product moment correlation coefficient between \(d\) and \(q\)
  3. Write down the value of the product moment correlation coefficient between \(d\) and \(p\) The estate agent records the price and size of 2 additional two-bedroom houses, \(H\) and \(J\).
    HousePrice \(( \pounds )\)Size \(\left( \mathrm { m } ^ { 2 } \right)\)
    \(H\)15640085
    \(J\)17290095
  4. Suggest which house is most likely to be closer to a train station. Justify your answer.
Question 3
View details
  1. A college has 80 students in Year 12.
20 students study Biology
28 students study Chemistry
30 students study Physics
7 students study both Biology and Chemistry
11 students study both Chemistry and Physics
5 students study both Physics and Biology
3 students study all 3 of these subjects
  1. Draw a Venn diagram to represent this information. A Year 12 student at the college is selected at random.
  2. Find the probability that the student studies Chemistry but not Biology or Physics.
  3. Find the probability that the student studies Chemistry or Physics or both. Given that the student studies Chemistry or Physics or both,
  4. find the probability that the student does not study Biology.
  5. Determine whether studying Biology and studying Chemistry are statistically independent.
Question 4
View details
  1. Statistical models can provide a cheap and quick way to describe a real world situation.
    1. Give two other reasons why statistical models are used.
    A scientist wants to develop a model to describe the relationship between the average daily temperature, \(x ^ { \circ } \mathrm { C }\), and her household's daily energy consumption, \(y \mathrm { kWh }\), in winter. A random sample of the average daily temperature and her household's daily energy consumption are taken from 10 winter days and shown in the table.
    \(x\)- 0.4- 0.20.30.81.11.41.82.12.52.6
    \(y\)28302625262726242221
    $$\text { [You may use } \sum x ^ { 2 } = 24.76 \quad \sum y = 255 \quad \sum x y = 283.8 \quad \mathrm {~S} _ { x x } = 10.36 \text { ] }$$
  2. Find \(\mathrm { S } _ { x y }\) for these data.
  3. Find the equation of the regression line of \(y\) on \(x\) in the form \(y = a + b x\) Give the value of \(a\) and the value of \(b\) to 3 significant figures.
  4. Give an interpretation of the value of \(a\)
  5. Estimate her household's daily energy consumption when the average daily temperature is \(2 ^ { \circ } \mathrm { C }\) The scientist wants to use the linear regression model to predict her household's energy consumption in the summer.
  6. Discuss the reliability of using this model to predict her household's energy consumption in the summer.
Question 5
View details
  1. In a quiz, a team gains 10 points for every question it answers correctly and loses 5 points for every question it does not answer correctly. The probability of answering a question correctly is 0.6 for each question. One round of the quiz consists of 3 questions.
The discrete random variable \(X\) represents the total number of points scored in one round. The table shows the incomplete probability distribution of \(X\)
\(x\)30150- 15
\(\mathrm { P } ( X = x )\)0.2160.064
  1. Show that the probability of scoring 15 points in a round is 0.432
  2. Find the probability of scoring 0 points in a round.
  3. Find the probability of scoring a total of 30 points in 2 rounds.
  4. Find \(\mathrm { E } ( X )\)
  5. Find \(\operatorname { Var } ( X )\) In a bonus round of 3 questions, a team gains 20 points for every question it answers correctly and loses 5 points for every question it does not answer correctly.
  6. Find the expected number of points scored in the bonus round.
Question 6
View details
  1. The random variable \(Z \sim \mathrm {~N} ( 0,1 )\)
    \(A\) is the event \(Z > 1.1\)
    \(B\) is the event \(Z > - 1.9\)
    \(C\) is the event \(- 1.5 < Z < 1.5\)
    1. Find
      1. \(\mathrm { P } ( A )\)
      2. \(\mathrm { P } ( B )\)
      3. \(\mathrm { P } ( C )\)
      4. \(\mathrm { P } ( A \cup C )\)
    The random variable \(X\) has a normal distribution with mean 21 and standard deviation 5
  2. Find the value of \(w\) such that \(\mathrm { P } ( X > w \mid X > 28 ) = 0.625\)