AQA S1 (Statistics 1) 2008 January

Question 1
View details
1 In large-scale tree-felling operations, a machine cuts down trees, strips off the branches and then cuts the trunks into logs of length \(X\) metres for transporting to a sawmill. It may be assumed that values of \(X\) are normally distributed with mean \(\mu\) and standard deviation 0.16 , where \(\mu\) can be set to a specific value.
  1. Given that \(\mu\) is set to 3.3 , determine:
    1. \(\mathrm { P } ( X < 3.5 )\);
    2. \(\mathrm { P } ( X > 3.0 )\);
    3. \(\mathrm { P } ( 3.0 < X < 3.5 )\).
  2. The sawmill now requires a batch of logs such that there is a probability of 0.025 that any given log will have a length less than 3.1 metres. Determine, to two decimal places, the new value of \(\mu\).
Question 2
View details
2 The head and body length, \(x\) millimetres, and tail length, \(y\) millimetres, of each of a sample of 20 adult dormice were measured. The following statistics are derived from the results. $$S _ { x x } = 1280.55 \quad S _ { y y } = 281.8 \quad S _ { x y } = 416.3$$
  1. Calculate the value of the product moment correlation coefficient between \(x\) and \(y\).
  2. Interpret your value in the context of this question.
  3. Write down the value of the product moment correlation coefficient if the measurements had been recorded in centimetres.
  4. Give a reason why it is not generally advisable to calculate the value of the product moment correlation coefficient without first viewing a scatter diagram of the data. Illustrate your answer with a sketch.
Question 3
View details
3 The height, in metres, of adult male African elephants may be assumed to be normally distributed with mean \(\mu\) and standard deviation 0.20 . The heights of a sample of 12 such elephants were measured with the following results, in metres. $$\begin{array} { l l l l l l l l l l l l } 3.37 & 3.45 & 2.93 & 3.42 & 3.49 & 3.67 & 2.96 & 3.57 & 3.36 & 2.89 & 3.22 & 2.91 \end{array}$$
  1. Stating a necessary assumption, construct a \(98 \%\) confidence interval for \(\mu\). (6 marks)
  2. The mean height of adult male Asian elephants is known to be 2.90 metres. Using your confidence interval, state, with a reason, what can be concluded about the mean heights of adult males in these two types of elephant.
Question 4
View details
4 [Figure 1, printed on the insert, is provided for use in this question.]
Roseen is a self-employed decorator who wishes to estimate the times that it will take her to decorate bedrooms based upon their floor areas. She records the floor area, \(x \mathrm {~m} ^ { 2 }\), and the decorating time, \(y\) hours, for each of 10 bedrooms she has recently decorated.
\(\boldsymbol { x }\)11.022.07.521.013.016.514.016.018.520.5
\(\boldsymbol { y }\)15.035.016.023.524.017.514.527.522.534.5
  1. On Figure 1, plot a scatter diagram of these data.
  2. Calculate the equation of the least squares regression line of \(y\) on \(x\).
  3. Draw your regression line on Figure 1.
    1. Use your regression equation to estimate the time that Roseen will take to decorate a bedroom with a floor area of \(15 \mathrm {~m} ^ { 2 }\).
    2. Making reference to Figure 1, comment on the likely reliability of your estimate in part (d)(i).
Question 5
View details
5 A health club has a number of facilities which include a gym and a sauna. Andrew and his wife, Heidi, visit the health club together on Tuesday evenings. On any visit, Andrew uses either the gym or the sauna or both, but no other facilities. The probability that he uses the gym, \(\mathrm { P } ( G )\), is 0.70 . The probability that he uses the sauna, \(\mathrm { P } ( S )\), is 0.55 . The probability that he uses both the gym and the sauna is 0.25 .
  1. Calculate the probability that, on a particular visit:
    1. he does not use the gym;
    2. he uses the gym but not the sauna;
    3. he uses either the gym or the sauna but not both.
  2. Assuming that Andrew's decision on what facility to use is independent from visit to visit, calculate the probability that, during a month in which there are exactly four Tuesdays, he does not use the gym.
  3. The probability that Heidi uses the gym when Andrew uses the gym is 0.6 , but is only 0.1 when he does not use the gym. Calculate the probability that, on a particular visit, Heidi uses the gym.
  4. On any visit, Heidi uses exactly one of the club's facilities. The probability that she uses the sauna is 0.35 .
    Calculate the probability that, on a particular visit, she uses a facility other than the gym or the sauna.
Question 6
View details
6 For each of the Premiership football seasons 2004/05 and 2005/06, a count is made of the number of goals scored in each of the 380 matches. The results are shown in the table.
\multirow{2}{*}{Number of goals scored in a match}Number of matches
2004/052005/06
03032
17982
29995
36878
46048
52430
6119
766
820
910
Total380380
  1. For the number of goals scored in a match during the 2004/05 season:
    1. determine the median and the interquartile range;
    2. calculate the mean and the standard deviation.
  2. Two statistics students, Jole and Katie, independently analyse the data on the number of goals scored in a match during the 2005/06 season.
    • Jole determines correctly that the median is 2 and that the interquartile range is also 2.
    • Katie calculates correctly, to two decimal places, that the mean is 2.48 and that the standard deviation is 1.59 .
      1. Use your answers from part (a), together with Jole's and Katie's results, to compare briefly the two seasons with regard to the average and the spread of the number of goals scored in a match.
      2. Jole claims that Katie's results must be wrong as \(95 \%\) of values always lie within 2 standard deviations of the mean and \(( 2.48 - 2 \times 1.59 ) < 0\) which is nonsense.
    Explain why Jole's claim is incorrect. (You are not expected to confirm Katie's results.)
Question 7
View details
7 A travel agency in Tunisia offers customers a 3-day tour into the Sahara desert by either coach or minibus.
  1. The agency accepts bookings from 50 customers for seats on the coach. The probability that a customer, who has booked a seat on the coach, will not turn up to claim the seat is 0.08 , and may be assumed to be independent of the behaviour of other customers. Determine the probability that, of the customers who have booked a seat on the coach:
    1. two or more will not turn up;
    2. three or more will not turn up.
  2. The agency accepts bookings from 15 customers for seats on the minibus. The probability that a customer, who has booked a seat on the minibus, will not turn up to claim the seat is 0.025 , and may be assumed to be independent of the behaviour of other customers. Calculate the probability that, of the customers who have booked a seat on the minibus:
    1. all will turn up;
    2. one or more will not turn up.
  3. The coach has 48 seats and the minibus has 14 seats. If 14 or fewer customers who have booked seats on the minibus turn up, they will be allocated a seat on the minibus. If all 15 customers who have booked seats on the minibus turn up, one will be allocated a seat on the coach. This will leave only 47 seats available for the 50 customers who have booked seats on the coach. Use your results from parts (a) and (b) to calculate the probability that there will be seats available on the coach for all those who turn up having booked such seats.
    (4 marks)