Edexcel S3 (Statistics 3) 2009 June

Question 1
View details
  1. A telephone directory contains 50000 names. A researcher wishes to select a systematic sample of 100 names from the directory.
    1. Explain in detail how the researcher should obtain such a sample.
    2. Give one advantage and one disadvantage of
      1. quota sampling,
      2. systematic sampling.
    3. The heights of a random sample of 10 imported orchids are measured. The mean height of the sample is found to be 20.1 cm . The heights of the orchids are normally distributed.
    Given that the population standard deviation is 0.5 cm ,
  2. estimate limits between which \(95 \%\) of the heights of the orchids lie,
  3. find a 98\% confidence interval for the mean height of the orchids. A grower claims that the mean height of this type of orchid is 19.5 cm .
  4. Comment on the grower's claim. Give a reason for your answer.
Question 3
View details
3. A doctor is interested in the relationship between a person's Body Mass Index (BMI) and their level of fitness. She believes that a lower BMI leads to a greater level of fitness. She randomly selects 10 female 18 year-olds and calculates each individual's BMI. The females then run a race and the doctor records their finishing positions. The results are shown in the table.
Individual\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)
BMI17.421.418.924.419.420.122.618.425.828.1
Finishing position35196410278
  1. Calculate Spearman's rank correlation coefficient for these data.
  2. Stating your hypotheses clearly and using a one tailed test with a \(5 \%\) level of significance, interpret your rank correlation coefficient.
  3. Give a reason to support the use of the rank correlation coefficient rather than the product moment correlation coefficient with these data.
Question 4
View details
4. A sample of size 8 is to be taken from a population that is normally distributed with mean 55 and standard deviation 3. Find the probability that the sample mean will be greater than 57.
Question 5
View details
5. The number of goals scored by a football team is recorded for 100 games. The results are summarised in Table 1 below. \begin{table}[h]
Number of goalsFrequency
040
133
214
38
45
\captionsetup{labelformat=empty} \caption{Table 1}
\end{table}
  1. Calculate the mean number of goals scored per game. The manager claimed that the number of goals scored per match follows a Poisson distribution. He used the answer in part (a) to calculate the expected frequencies given in Table 2. \begin{table}[h]
    Number of goalsExpected Frequency
    034.994
    1\(r\)
    2\(s\)
    36.752
    \(\geqslant 4\)2.221
    \captionsetup{labelformat=empty} \caption{Table 2}
    \end{table}
  2. Find the value of \(r\) and the value of \(s\) giving your answers to 3 decimal places.
  3. Stating your hypotheses clearly, use a \(5 \%\) level of significance to test the manager's claim.
Question 6
View details
  1. The lengths of a random sample of 120 limpets taken from the upper shore of a beach had a mean of 4.97 cm and a standard deviation of 0.42 cm . The lengths of a second random sample of 150 limpets taken from the lower shore of the same beach had a mean of 5.05 cm and a standard deviation of 0.67 cm .
    1. Test, using a \(5 \%\) level of significance, whether or not the mean length of limpets from the upper shore is less than the mean length of limpets from the lower shore. State your hypotheses clearly.
    2. State two assumptions you made in carrying out the test in part (a).
    3. A company produces climbing ropes. The lengths of the climbing ropes are normally distributed. A random sample of 5 ropes is taken and the length, in metres, of each rope is measured. The results are given below.
      119.9
      120.3
      120.1
      120.4
      120.2
    4. Calculate unbiased estimates for the mean and the variance of the lengths of the climbing ropes produced by the company.
    The lengths of climbing rope are known to have a standard deviation of 0.2 m . The company wants to make sure that there is a probability of at least 0.90 that the estimate of the population mean, based on a random sample size of \(n\), lies within 0.05 m of its true value.
  2. Find the minimum sample size required.
Question 8
View details
  1. The random variable \(A\) is defined as
$$A = 4 X - 3 Y$$ where \(X \sim \mathrm {~N} \left( 30,3 ^ { 2 } \right) , Y \sim \mathrm {~N} \left( 20,2 ^ { 2 } \right)\) and \(X\) and \(Y\) are independent. Find
  1. \(\mathrm { E } ( A )\),
  2. \(\operatorname { Var } ( A )\). The random variables \(Y _ { 1 } , Y _ { 2 } , Y _ { 3 }\) and \(Y _ { 4 }\) are independent and each has the same distribution as \(Y\). The random variable \(B\) is defined as $$B = \sum _ { i = 1 } ^ { 4 } Y _ { i }$$
  3. Find \(\mathrm { P } ( B > A )\).