OCR Further Statistics AS (Further Statistics AS) 2024 June

Question 1
View details
1 The random variable \(W\) can take values 1,2 or 3 and has a discrete uniform distribution.
  1. Write down the value of \(\mathrm { E } ( 2 W )\).
  2. Find the value of \(\operatorname { Var } ( 2 W )\).
  3. Determine the value of the constant \(k\) for which \(\mathrm { E } ( 2 \mathrm {~W} + \mathrm { k } ) = \operatorname { Var } ( 2 \mathrm {~W} + \mathrm { k } )\). The random variable \(S\) has the probability distribution shown in the following table.
    \(S\)23456
    \(P ( S = S )\)\(\frac { 2 } { 9 }\)\(\frac { 1 } { 9 }\)\(\frac { 1 } { 3 }\)\(\frac { 1 } { 9 }\)\(\frac { 2 } { 9 }\)
  4. Calculate \(\operatorname { Var } ( S )\).
Question 2
View details
2 For a random sample of 160 employees of a large company, the principal method of transport for getting to work, arranged according to grade of employee, is shown in the table.
GradeWalk or cyclePrivate motorised transportPublic transport
A9136
B164341
C11813
A test is carried out at the \(5 \%\) significance level of whether there is association between grade of employee and method of transport.
  1. State appropriate hypotheses for the test. The contributions to the test statistic are shown in the following table, correct to 3 decimal places.
    GradeWalk or cyclePrivate motorised transportPublic transport
    A1.1570.2891.929
    B1.8780.2250.327
    C2.0061.8000.083
  2. Show how the value 0.225 is obtained.
  3. Complete the test, stating the conclusion.
  4. Which combination of grade of employee and method of transport most strongly suggests association? Justify your answer.
Question 3
View details
3 The ages, \(x\) years, and the reaction time, \(t\) seconds, in an experiment carried out on a sample of 15 volunteers are summarised as follows.
\(n = 15 \quad \sum x = 762 \quad \sum t = 8.7 \quad \sum x ^ { 2 } = 44204 \quad \sum t ^ { 2 } = 5.65 \quad \sum x t = 490.1\)
  1. Calculate the value of the product moment correlation coefficient between \(x\) and \(t\).
  2. Calculate the equation of the line of regression of \(t\) on \(x\). Give your answer in the form \(\mathrm { t } = \mathrm { a } + \mathrm { bx }\) where \(a\) and \(b\) are constants to be determined.
  3. Explain the relevance of the quantity \(\sum ( t - a - b x ) ^ { 2 }\) to your answer to part (b).
  4. Estimate the reaction time, in seconds, for a volunteer aged 42. It is subsequently decided to measure the reaction time in tenths of a second rather than in seconds (so, for example, a time of 0.6 seconds would now be recorded as 6 ).
    1. State what effect, if any, this change would have on your answer to part (a).
    2. State what effect, if any, this change would have on your answer to part (b). It is known that the sample of 15 volunteers consisted almost entirely of students and retired people.
  5. Using this information, and the value of the product moment correlation coefficient, comment on the reliability of your estimate in part (d).
Question 4
View details
  1. Find the probability that 4 telephone calls are received in a randomly chosen one-minute period.
  2. A sample of 10 independent observations of \(X\) is obtained. Find the expected number of these 10 observations that are in the interval \(2 < X < 8\). It is also known that
    \(P ( X + Y = 4 ) = \frac { 27 } { 8 } P ( X = 2 ) \times P ( Y = 2 )\).
  3. Determine the possible values of \(\mathrm { E } ( Y )\).
  4. Explain where in your solution to part (c) you have used the assumption that telephone calls and e-mails are received independently of one another.
Question 5
View details
5 In a fashion competition, two judges gave marks to a large number of contestants. The value of Spearman's rank correlation coefficient, \(\mathrm { r } _ { \mathrm { s } }\), between the marks given to 7 randomly chosen contestants is \(\frac { 27 } { 28 }\).
  1. An excerpt from the table of critical values of \(\mathrm { r } _ { \mathrm { s } }\) is shown below. \section*{Critical values of Spearman's rank correlation coefficient}
    1-tail test5\%2.5\%1\%0.5\%
    2-tail test10\%5\%2\%1\%
    \multirow{3}{*}{\(n\)}60.82860.88570.94291.0000
    70.71430.78570.89290.9286
    80.64290.73810.83330.8810
    Test whether there is evidence, at the 1\% significance level, that the judges agree with each another. The marks given by the two judges to the 7 randomly chosen contestants were as follows, where \(x\) is an integer.
    ContestantABCD\(E\)\(F\)G
    Judge 164656778798086
    Judge 2616378808190\(x\)
  2. Use the value \(\mathrm { r } _ { \mathrm { s } } = \frac { 27 } { 28 }\) to determine the range of possible values of \(x\).
  3. Give a reason why it might be preferable to use the product moment correlation coefficient rather than Spearman's rank correlation coefficient in this context.
Question 6
View details
6 Anika walks along a street that contains parked cars. The number of cars that Anika passes, up to and including the first car that is white, is denoted by \(X\).
  1. State two assumptions needed for \(X\) to be well modelled by a geometric distribution. Assume now that \(X\) can be well modelled by the distribution \(\operatorname { Geo } ( p )\), where \(0 < p < 1\).
  2. For \(p = 0.1\), find \(\mathrm { P } ( X > 6 )\). The number of cars that Anika passes, up to but not including the first car that is white, is denoted by \(Y\).
  3. For a general value of \(p\), determine a simplified expression for \(\mathrm { E } ( Y ) \div \operatorname { Var } ( Y )\), in terms of \(p\). Ben walks along a different street that also contains parked cars. The number of cars that Ben passes, up to and including the first white car on which the last digit of the number plate is even is denoted by \(Z\). It may be assumed that \(Z\) can be well modelled by the distribution \(\operatorname { Geo } \left( \frac { 1 } { 2 } p \right)\), where \(p\) is the parameter of the distribution of \(X\). It is given that \(\mathrm { P } ( \mathrm { Z } = 3 ) = \mathrm { kP } ( \mathrm { X } = 3 )\), where \(k\) is a positive constant.
  4. Determine the range of possible values of \(k\).