OCR Further Statistics AS (Further Statistics AS) 2023 June

Question 1
View details
1 A radar device is used to detect flaws in motorway roads before they become dangerous. The number of flaws in a 1 km stretch of motorway is denoted by \(X\). It may be assumed that flaws occur randomly.
  1. State two further assumptions that are necessary for \(X\) to be well modelled by a Poisson distribution. Assume now that \(X\) can be modelled by distribution \(\operatorname { Po } ( 5.7 )\).
  2. Determine the probability that in a randomly chosen stretch of motorway, of length 1 km , there are between 8 and 11 flaws, inclusive.
  3. Determine the probability that in two randomly chosen, non-overlapping, stretches of motorway, each of length 5 km , there are at least 30 flaws in one stretch and fewer than 30 flaws in the other stretch.
Question 2
View details
2 A music lover has 30 CDs arranged in a random order in a line on a shelf. Of these CDs, 7 are classed as Baroque, 10 as Classical and 13 as Romantic.
  1. Determine the probability that all 7 Baroque CDs are next to each other.
  2. Determine the probability that, of the 10 CDs furthest to the left on the shelf, at least 6 are Baroque.
Question 3
View details
3 An insurance company collected data concerning the age, \(x\) years, of policy holders and the average size of claim, \(\pounds y\) thousand. The data is summarised as follows.
\(n = 32 \quad \sum x = 1340 \quad \sum y = 612 \quad \sum x ^ { 2 } = 64282 \quad \sum y ^ { 2 } = 13418 \quad \sum x y = 27794\)
  1. Find the variance of \(x\).
  2. Find the equation of the regression line of \(y\) on \(x\).
  3. Hence estimate the expected size of claim from a policy holder of age 48. Tom is aged 48. He claims that the range of the data probably does not include people of his age because the mean age for the data is 41.875 , and 48 is not close to this.
  4. Use your answer to part (a) to determine how likely it is that Tom's claim is correct.
  5. Comment on the reliability of your estimate in part (c). You should refer to the value of the product-moment correlation coefficient for the data, which is 0.579 correct to 3 significant figures.
Question 4
View details
4 A discrete random variable \(W\) has the probability distribution shown in the following table, in which \(a\) and \(b\) are constants.
\(w\)585960616263
\(\mathrm { P } ( W = w )\)\(a\)\(b\)0.20.20.10.1
It is given that \(\mathrm { E } ( W - 60 ) = 0.15\). Determine the value of \(\operatorname { Var } ( 4 W - 60 )\).
Question 5
View details
5 A psychologist investigates the relationship between 'openness' and 'creativity' in adults. Each member of a random sample of 15 adults is given two tests, one on openness and one on creativity. Each test has a maximum score of 75 . The results are given in the table.
AdultABCDEFGHIJKLMNO
Openness, \(x\)393429204035203655314143333033
Creativity, \(y\)593417294946455460384635435634
\(n = 15 \quad \sum x = 519 \quad \sum y = 645 \quad \sum x ^ { 2 } = 19033 \quad \sum y ^ { 2 } = 29751 \quad \sum x y = 23034\)
  1. Use Pearson's product-moment correlation coefficient to test, at the \(5 \%\) significance level, whether there is positive association between openness and creativity.
  2. State what the value of Pearson's product-moment correlation coefficient shows about a scatter diagram illustrating the data.
  3. A student suggests that there is a way to obtain a more accurate measure of the correlation. Before carrying out the test it would be better to standardise the test scores so that they have the same mean and variance. Explain whether you agree with this suggestion.
Question 6
View details
6 A machine is used to toss a coin repeatedly. Rosa believes that the outcome of each toss made by the machine is not independent of the previous toss. Rosa gets the machine to toss a coin 6 times and record the number of heads, \(X\), obtained. After recording the number of heads obtained, Rosa resets the machine and gets it to toss the coin 6 more times. Rosa again records the number of heads obtained and she repeats this procedure until she has recorded 88 independent values of \(X\).
  1. The sample mean and sample variance of \(X\) are 3.35 and 3.392 respectively. Explain what these results suggest about the validity of a binomial model \(\mathrm { B } ( 6 , p )\) for the data. Rosa uses a computer spreadsheet to work out the probabilities for a more sophisticated model in which the outcome of each toss is dependent on the outcome of the previous toss. Her model suggests that the probabilities \(\mathrm { P } ( X = x )\), for \(x = 0,1,2,3,4,5,6\), are approximately in the ratio \(5 : 6 : 7 : 8 : 7 : 6 : 5\). She carries out a \(\chi ^ { 2 }\) test to investigate whether this model is a good fit for the data. The following table shows the full results of the experiments, together with some of the calculations needed for the test.
    \(x\)0123456Total
    Observed frequency710161515111488
    Expected frequency
    Contribution to \(\chi ^ { 2 }\) statistic0.90.33330.28570.06250.0714
  2. In the Printed Answer Booklet, complete the table.
  3. Carry out the test, using a 10\% significance level.
  4. Rosa says that the results definitely show that one of the two proposed models is correct. Comment on this statement.
Question 7
View details
7 A town council is planning to introduce a new set of parking regulations. An interviewer contacts randomly chosen people in the town and asks them whether they are in favour of the proposal. The first person who is not in favour of the regulation is the \(R\) th person interviewed. It can be assumed that the probability that any randomly chosen person is not in favour of the proposal is a constant \(p\), and that \(p\) does not equal 0 or 1 . Assume first that \(\mathrm { E } ( R ) = 10\).
  1. Determine \(\mathrm { P } ( R \geqslant 14 )\). Now, without the assumption that \(\mathrm { E } ( R ) = 10\), consider a general value of \(p\).
    It is given that \(\mathrm { P } ( R = 3 ) - 0.4 \times \mathrm { P } ( R = 2 ) - a \times \mathrm { P } ( R = 1 ) = 0\), where \(a\) is a positive constant.
  2. Determine the range of possible values of \(a\).