Edexcel S1 (Statistics 1) 2002 November

Question 1
View details
  1. (a) Explain briefly why statistical models are used when attempting to solve real-world problems.
    (b) Write down the name of the distribution you would recommend as a suitable model for each of the following situations.
    1. The weight of marmalade in a jar.
    2. The number on the uppermost face of a fair die after it has been rolled.
      (2)
    3. There are 125 sixth-form students in a college, of whom 60 are studying only arts subjects, 40 only science subjects and the rest a mixture of both.
    Three students are selected at random, without replacement.
    Find the probability that
    (a) all three students are studying only arts subjects,
    (b) exactly one of the three students is studying only science subjects.
Question 3
View details
3. The events \(A\) and \(B\) are independent such that \(\mathrm { P } ( A ) = 0.25\) and \(\mathrm { P } ( B ) = 0.30\). Find
  1. \(\mathrm { P } ( A \cap B )\),
  2. \(\mathrm { P } ( A \cup B )\),
  3. \(\mathrm { P } \left( A B ^ { \prime } \right)\).
Question 4
View details
4. Strips of metal are cut to length \(L \mathrm {~cm}\), where \(L \sim \mathrm {~N} \left( \mu , 0.5 ^ { 2 } \right)\).
  1. Given that \(2.5 \%\) of the cut lengths exceed 50.98 cm , show that \(\mu = 50\).
  2. Find \(\mathrm { P } ( 49.25 < L < 50.75 )\). Those strips with length either less than 49.25 cm or greater than 50.75 cm cannot be used.
    Two strips of metal are selected at random.
  3. Find the probability that both strips cannot be used.
Question 5
View details
5. An agricultural researcher collected data, in appropriate units, on the annual rainfall \(x\) and the annual yield of wheat \(y\) at 8 randomly selected places. The data were coded using \(s = x - 6\) and \(t = y - 20\) and the following summations were obtained. $$\Sigma s = 48.5 , \quad \Sigma t = 65.0 , \quad \Sigma s ^ { 2 } = 402.11 , \quad \Sigma t ^ { 2 } = 701.80 , \quad \Sigma s t = 523.23$$
  1. Find the equation of the regression line of \(t\) on \(s\) in the form \(t = p + q s\).
  2. Find the equation of the regression line of \(y\) on \(x\) in the form \(y = a + b x\), giving \(a\) and \(b\) to 3 decimal places. The value of the product moment correlation coefficient between \(s\) and \(t\) is 0.943 , to 3 decimal places.
  3. Write down the value of the product moment correlation coefficient between \(x\) and \(y\). Give a justification for your answer.
Question 6
View details
6. The discrete random variable \(X\) has the following probability distribution.
\(x\)- 2- 1012
\(\mathrm { P } ( X = x )\)\(\alpha\)0.20.10.2\(\beta\)
  1. Given that \(\mathrm { E } ( X ) = - 0.2\), find the value of \(\alpha\) and the value of \(\beta\).
  2. Write down \(\mathrm { F } ( 0.8 )\).
  3. Evaluate \(\operatorname { Var } ( X )\). Find the value of
  4. \(\mathrm { E } ( 3 X - 2 )\),
  5. \(\operatorname { Var } ( 2 X + 6 )\).
Question 7
View details
7. The following stem and leaf diagram shows the aptitude scores \(x\) obtained by all the applicants for a particular job.
Aptitude score31 means 31
3129(3)
424689(5)
51335679(7)
60133356889(10)
71222455568889(14)
801235889(8)
9012(3)
  1. Write down the modal aptitude score.
  2. Find the three quartiles for these data. Outliers can be defined to be outside the limits \(\mathrm { Q } _ { 1 } - 1.0 \left( \mathrm { Q } _ { 3 } - \mathrm { Q } _ { 1 } \right)\) and \(\mathrm { Q } _ { 3 } + 1.0 \left( \mathrm { Q } _ { 3 } - \mathrm { Q } _ { 1 } \right)\).
  3. On a graph paper, draw a box plot to represent these data. For these data, \(\Sigma x = 3363\) and \(\Sigma x ^ { 2 } = 238305\).
  4. Calculate, to 2 decimal places, the mean and the standard deviation for these data.
  5. Use two different methods to show that these data are negatively skewed.