Edexcel S1 (Statistics 1) 2012 June

Question 1
View details
  1. A discrete random variable \(X\) has the probability function
$$\mathrm { P } ( X = x ) = \begin{cases} k ( 1 - x ) ^ { 2 } & x = - 1,0,1 \text { and } 2
0 & \text { otherwise } \end{cases}$$
  1. Show that \(k = \frac { 1 } { 6 }\)
  2. Find \(\mathrm { E } ( X )\)
  3. Show that \(\mathrm { E } \left( X ^ { 2 } \right) = \frac { 4 } { 3 }\)
  4. Find \(\operatorname { Var } ( 1 - 3 X )\)
Question 2
View details
2. A bank reviews its customer records at the end of each month to find out how many customers have become unemployed, \(u\), and how many have had their house repossessed, \(h\), during that month. The bank codes the data using variables \(x = \frac { u - 100 } { 3 }\) and \(y = \frac { h - 20 } { 7 }\) The results for the 12 months of 2009 are summarised below. $$\sum x = 477 \quad S _ { x x } = 5606.25 \quad \sum y = 480 \quad S _ { y y } = 4244 \quad \sum x y = 23070$$
  1. Calculate the value of the product moment correlation coefficient for \(x\) and \(y\).
  2. Write down the product moment correlation coefficient for \(u\) and \(h\). The bank claims that an increase in unemployment among its customers is associated with an increase in house repossessions.
  3. State, with a reason, whether or not the bank's claim is supported by these data.
Question 3
View details
3. A scientist is researching whether or not birds of prey exposed to pollutants lay eggs with thinner shells. He collects a random sample of egg shells from each of 6 different nests and tests for pollutant level, \(p\), and measures the thinning of the shell, \(t\). The results are shown in the table below.
\(p\)3830251512
\(t\)1391056
[You may use \(\sum p ^ { 2 } = 1967\) and \(\sum p t = 694\) ]
  1. Draw a scatter diagram on the axes on page 7 to represent these data.
  2. Explain why a linear regression model may be appropriate to describe the relationship between \(p\) and \(t\).
  3. Calculate the value of \(S _ { p t }\) and the value of \(S _ { p p }\).
  4. Find the equation of the regression line of \(t\) on \(p\), giving your answer in the form \(t = a + b p\).
  5. Plot the point ( \(\bar { p } , \bar { t }\) ) and draw the regression line on your scatter diagram. The scientist reviews similar studies and finds that pollutant levels above 16 are likely to result in the death of a chick soon after hatching.
  6. Estimate the minimum thinning of the shell that is likely to result in the death of a chick. \includegraphics[max width=\textwidth, alt={}, center]{0593544d-392d-465b-b922-c9cb1435abb5-05_1257_1568_301_173}
Question 4
View details
4. \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{0593544d-392d-465b-b922-c9cb1435abb5-06_611_1127_237_447} \captionsetup{labelformat=empty} \caption{Figure 1}
\end{figure} Figure 1 shows how 25 people travelled to work.
Their travel to work is represented by the events $$\begin{array} { l l } B & \text { bicycle }
T & \text { train }
W & \text { walk } \end{array}$$
  1. Write down 2 of these events that are mutually exclusive. Give a reason for your answer.
  2. Determine whether or not \(B\) and \(T\) are independent events. One person is chosen at random.
    Find the probability that this person
  3. walks to work,
  4. travels to work by bicycle and train.
  5. Given that this person travels to work by bicycle, find the probability that they will also take the train.
Question 5
View details
5. \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{0593544d-392d-465b-b922-c9cb1435abb5-08_1031_1239_116_354} \captionsetup{labelformat=empty} \caption{Figure 2}
\end{figure} A policeman records the speed of the traffic on a busy road with a 30 mph speed limit. He records the speeds of a sample of 450 cars. The histogram in Figure 2 represents the results.
  1. Calculate the number of cars that were exceeding the speed limit by at least 5 mph in the sample.
  2. Estimate the value of the mean speed of the cars in the sample.
  3. Estimate, to 1 decimal place, the value of the median speed of the cars in the sample.
  4. Comment on the shape of the distribution. Give a reason for your answer.
  5. State, with a reason, whether the estimate of the mean or the median is a better representation of the average speed of the traffic on the road.
Question 6
View details
  1. The heights of an adult female population are normally distributed with mean 162 cm and standard deviation 7.5 cm .
    1. Find the probability that a randomly chosen adult female is taller than 150 cm .
      (3)
    Sarah is a young girl. She visits her doctor and is told that she is at the 60th percentile for height.
  2. Assuming that Sarah remains at the 60th percentile, estimate her height as an adult. The heights of an adult male population are normally distributed with standard deviation 9.0 cm . Given that \(90 \%\) of adult males are taller than the mean height of adult females,
  3. find the mean height of an adult male.
Question 7
View details
  1. A manufacturer carried out a survey of the defects in their soft toys. It is found that the probability of a toy having poor stitching is 0.03 and that a toy with poor stitching has a probability of 0.7 of splitting open. A toy without poor stitching has a probability of 0.02 of splitting open.
    1. Draw a tree diagram to represent this information.
    2. Find the probability that a randomly chosen soft toy has exactly one of the two defects, poor stitching or splitting open.
      (3)
    The manufacturer also finds that soft toys can become faded with probability 0.05 and that this defect is independent of poor stitching or splitting open. A soft toy is chosen at random.
  2. Find the probability that the soft toy has none of these 3 defects.
  3. Find the probability that the soft toy has exactly one of these 3 defects.