Edexcel S1 (Statistics 1) 2002 January

Mark scheme PDF ↗

Question 1 4 marks
View details
  1. Explain briefly what you understand by
    1. a statistical experiment, [1]
    2. an event. [1]
  2. State one advantage and one disadvantage of a statistical model. [2]
Question 2 7 marks
View details
A meteorologist measured the number of hours of sunshine, to the nearest hour, each day for 100 days. The results are summarised in the table below.
Hours of sunshineDays
116
2-432
5-628
712
89
9-112
121
  1. On graph paper, draw a histogram to represent these data. [5]
  2. Calculate an estimate of the number of days that had between 6 and 9 hours of sunshine. [2]
Question 3 7 marks
View details
A discrete random variable \(X\) has the probability function shown in the table below.
\(x\)012
P(\(X = x\))\(\frac{1}{3}\)\(a\)\(\frac{2}{3} - a\)
  1. Given that E(\(X\)) = \(\frac{2}{3}\), find \(a\). [3]
  2. Find the exact value of Var (\(X\)). [3]
  3. Find the exact value of P(\(X \leq 15\)). [1]
Question 4 10 marks
View details
A contractor bids for two building projects. He estimates that the probability of winning the first project is 0.5, the probability of winning the second is 0.3 and the probability of winning both projects is 0.2.
  1. Find the probability that he does not win either project. [3]
  2. Find the probability that he wins exactly one project. [2]
  3. Given that he does not win the first project, find the probability that he wins the second. [2]
  4. By calculation, determine whether or not winning the first contract and winning the second contract are independent events. [3]
Question 5 11 marks
View details
The duration of the pregnancy of a certain breed of cow is normally distributed with mean \(\mu\) days and standard deviation \(\sigma\) days. Only 2.5\% of all pregnancies are shorter than 235 days and 15\% are longer than 286 days.
  1. Show that \(\mu - 235 = 1.96\sigma\). [2]
  2. Obtain a second equation in \(\mu\) and \(\sigma\). [3]
  3. Find the value of \(\mu\) and the value of \(\sigma\). [4]
  4. Find the values between which the middle 68.3\% of pregnancies lie. [2]
Question 6 17 marks
View details
Hospital records show the number of babies born in a year. The number of babies delivered by 15 male doctors is summarised by the stem and leaf diagram below. Babies \quad (4|5 means 45) \quad Totals 0 \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad (0) 1|9 \quad \quad \quad \quad \quad \quad \quad \quad \quad (1) 2|1 6 7 7 \quad \quad \quad \quad \quad \quad (4) 3|2 2 3 4 8 \quad \quad \quad \quad \quad (5) 4|5 \quad \quad \quad \quad \quad \quad \quad \quad \quad (1) 5|1 \quad \quad \quad \quad \quad \quad \quad \quad \quad (1) 6|0 \quad \quad \quad \quad \quad \quad \quad \quad \quad (1) 7 \quad \quad \quad \quad \quad \quad \quad \quad \quad \quad (0) 8|6 7 \quad \quad \quad \quad \quad \quad \quad \quad (2)
  1. Find the median and inter-quartile range of these data. [3]
  2. Given that there are no outliers, draw a box plot on graph paper to represent these data. Start your scale at the origin. [4]
  3. Calculate the mean and standard deviation of these data. [5]
The records also contain the number of babies delivered by 10 female doctors. 34 \quad 30 \quad 20 \quad 15 \quad 6 32 \quad 26 \quad 19 \quad 11 \quad 4 The quartiles are 11, 19.5 and 30.
  1. Using the same scale as in part (b) and on the same graph paper draw a box plot for the data for the 10 female doctors. [3]
  2. Compare and contrast the box plots for the data for male and female doctors. [2]
Question 7 19 marks
View details
A number of people were asked to guess the calorific content of 10 foods. The mean \(s\) of the guesses for each food and the true calorific content \(t\) are given in the table below.
Food\(t\)\(s\)
Packet of biscuits170420
1 potato90160
1 apple80110
Crisp breads1070
Chocolate bar260360
1 slice white bread75135
1 slice brown bread60115
Portion of beef curry270350
Portion of rice pudding165390
Half a pint of milk160200
[You may assume that \(\Sigma t = 1340\), \(\Sigma s = 2310\), \(\Sigma ts = 396775\), \(\Sigma t^2 = 246050\), \(\Sigma s^2 = 694650\).]
  1. Draw a scatter diagram, indicating clearly which is the explanatory (independent) and which is the response (dependent) variable. [3]
  2. Calculate, to 3 significant figures, the product moment correlation coefficient for the above data. [7]
  3. State, with a reason, whether or not the value of the product moment correlation coefficient changes if all the guesses are 50 calories higher than the values in the table. [2]
The mean of the guesses for the portion of rice pudding and for the packet of biscuits are outside the linear relation of the other eight foods.
  1. Find the equation of the regression line of \(s\) on \(t\) excluding the values for rice pudding and biscuits. [3]
[You may now assume that \(S_{tt} = 72587\), \(S_{st} = 63671.875\), \(\bar{t} = 125.625\), \(\bar{s} = 187.5\).]
  1. Draw the regression line on your scatter diagram. [2]
  2. State, with a reason, what the effect would be on the regression line of including the values for a portion of rice pudding and a packet of biscuits. [2]