OCR S1 (Statistics 1) 2010 January

Question 1
View details
1 Andy makes repeated attempts to thread a needle. The number of attempts up to and including his first success is denoted by \(X\).
  1. State two conditions necessary for \(X\) to have a geometric distribution.
  2. Assuming that \(X\) has the distribution \(\operatorname { Geo } ( 0.3 )\), find
    (a) \(\mathrm { P } ( X = 5 )\),
    (b) \(\mathrm { P } ( X > 5 )\).
  3. Suggest a reason why one of the conditions you have given in part (i) might not be satisfied in this context. 240 people were asked to guess the length of a certain road. Each person gave their guess, \(l \mathrm {~km}\), correct to the nearest kilometre. The results are summarised below.
    \(l\)\(10 - 12\)\(13 - 15\)\(16 - 20\)\(21 - 30\)
    Frequency113206
  4. (a) Use appropriate formulae to calculate estimates of the mean and standard deviation of \(l\).
    (b) Explain why your answers are only estimates.
  5. A histogram is to be drawn to illustrate the data. Calculate the frequency density of the block for the 16-20 class.
  6. Explain which class contains the median value of \(l\).
  7. Later, the person whose guess was between 10 km and 12 km changed his guess to between 13 km and 15 km . Without calculation state whether the following will increase, decrease or remain the same:
    (a) the mean of \(l\),
    (b) the standard deviation of \(l\).
Question 3
View details
3 The heights, \(h \mathrm {~m}\), and weights, \(m \mathrm {~kg}\), of five men were measured. The results are plotted on the diagram.
\includegraphics[max width=\textwidth, alt={}, center]{5c25d6cf-2c23-4b49-88fb-e4abe6c281e4-3_738_956_386_593} The results are summarised as follows. $$n = 5 \quad \Sigma h = 9.02 \quad \Sigma m = 377.7 \quad \Sigma h ^ { 2 } = 16.382 \quad \Sigma m ^ { 2 } = 28558.67 \quad \Sigma h m = 681.612$$
  1. Use the summarised data to calculate the value of the product moment correlation coefficient, \(r\).
  2. Comment on your value of \(r\) in relation to the diagram.
  3. It was decided to re-calculate the value of \(r\) after converting the heights to feet and the masses to pounds. State what effect, if any, this will have on the value of \(r\).
  4. One of the men had height 1.63 m and mass 78.4 kg . The data for this man were removed and the value of \(r\) was re-calculated using the original data for the remaining four men. State in general terms what effect, if any, this will have on the value of \(r\).
Question 4
View details
4 A certain four-sided die is biased. The score, \(X\), on each throw is a random variable with probability distribution as shown in the table. Throws of the die are independent.
\(x\)0123
\(\mathrm { P } ( X = x )\)\(\frac { 1 } { 2 }\)\(\frac { 1 } { 4 }\)\(\frac { 1 } { 8 }\)\(\frac { 1 } { 8 }\)
  1. Calculate \(\mathrm { E } ( X )\) and \(\operatorname { Var } ( X )\). The die is thrown 10 times.
  2. Find the probability that there are not more than 4 throws on which the score is 1 .
  3. Find the probability that there are exactly 4 throws on which the score is 2 .
Question 5
View details
5 A washing-up bowl contains 6 spoons, 5 forks and 3 knives. Three of these 14 items are removed at random, without replacement. Find the probability that
  1. all three items are of different kinds,
  2. all three items are of the same kind.
Question 6
View details
6
  1. A student calculated the values of the product moment correlation coefficient, \(r\), and Spearman's rank correlation coefficient, \(r _ { s }\), for two sets of bivariate data, \(A\) and \(B\). His results are given below. $$\begin{array} { l l } A : & r = 0.9 \text { and } r _ { s } = 1
    B : & r = 1 \quad \text { and } r _ { s } = 0.9 \end{array}$$ With the aid of a diagram where appropriate, explain why the student's results for \(A\) could both be correct but his results for \(B\) cannot both be correct.
  2. An old research paper has been partially destroyed. The surviving part of the paper contains the following incomplete information about some bivariate data from an experiment.
    \includegraphics[max width=\textwidth, alt={}, center]{5c25d6cf-2c23-4b49-88fb-e4abe6c281e4-4_339_1200_1117_511} Calculate the missing constant at the end of the equation of the second regression line.
Question 7
View details
7 The table shows the numbers of male and female members of a vintage car club who own either a Jaguar or a Bentley. No member owns both makes of car.
\cline { 2 - 3 } \multicolumn{1}{c|}{}MaleFemale
Jaguar2515
Bentley128
One member is chosen at random from these 60 members.
  1. Given that this member is male, find the probability that he owns a Jaguar. Now two members are chosen at random from the 60 members. They are chosen one at a time, without replacement.
  2. Given that the first one of these members is female, find the probability that both own Jaguars.
Question 8
View details
8 The five letters of the word NEVER are arranged in random order in a straight line.
  1. How many different orders of the letters are possible?
  2. In how many of the possible orders are the two Es next to each other?
  3. Find the probability that the first two letters in the order include exactly one letter E.
    \(9 R\) and \(S\) are independent random variables each having the distribution \(\operatorname { Geo } ( p )\).
  4. Find \(\mathrm { P } ( R = 1\) and \(S = 1 )\) in terms of \(p\).
  5. Show that \(\mathrm { P } ( R = 3\) and \(S = 3 ) = p ^ { 2 } q ^ { 4 }\), where \(q = 1 - p\).
  6. Use the formula for the sum to infinity of a geometric series to show that $$\mathrm { P } ( R = S ) = \frac { p } { 2 - p }$$