OCR MEI Further Statistics A AS (Further Statistics A AS) 2022 June

Question 1
View details
1 A fair five-sided spinner has sectors labelled 1, 2, 3, 4, 5. In a game at a stall at a charity event, the spinner is spun twice. The random variable \(X\) represents the lower of the two scores. The probability distribution of \(X\) is given by the formula
\(\mathrm { P } ( \mathrm { X } = \mathrm { r } ) = \mathrm { k } ( 11 - 2 \mathrm { r } )\) for \(r = 1,2,3,4,5\),
where \(k\) is a constant.
  1. Complete the copy of this table in the Printed Answer Booklet.
    \(r\)12345
    \(\mathrm { P } ( X = r )\)\(7 k\)\(3 k\)
  2. Determine the value of \(k\).
  3. Find each of the following.
    • \(\mathrm { E } ( X )\)
    • \(\operatorname { Var } ( X )\)
    • The stall-holder charges a player \(C\) pence to play the game, and then pays the player \(50 X\) pence, where \(X\) is the player's score.
    Given that the average profit that the stall-holder makes on one game is 25 pence, find the value of \(C\).
Question 2
View details
2 On a car assembly line, a robot is used for a particular task.
  1. State the conditions under which a Poisson distribution is an appropriate model for the number of breakdowns of the robot in a week. It is given that the average number of breakdowns of the robot in a week is 1.7 . For the remainder of this question, you should assume that a Poisson distribution is an appropriate model for the number of breakdowns of the robot in a week.
    1. Find the probability that the number of breakdowns of the robot in a week is exactly 4.
    2. Determine the probability that the number of breakdowns of the robot in a week is at least 2 .
  2. Determine the probability that the number of breakdowns of the robot in 52 weeks is less than 100.
Question 3
View details
3 A biology student is doing an experiment in which plants are inoculated with a particular microorganism in an attempt to help them grow. She is investigating whether there is any association between the percentage of roots which have been colonised by the microorganism and the dry weight of the plant shoots. After the plants have grown for a few weeks, the student takes a random sample of 10 plants and measures the percentage of roots which have been colonised by the microorganism and the dry weight of the plant shoots. The spreadsheet output shows the data, together with a scatter diagram to illustrate the data.
\includegraphics[max width=\textwidth, alt={}, center]{8f1e0c68-a334-4657-823e-386ab0994c02-3_722_1648_635_244}
  1. The student decides that a test based on Pearson’s product moment correlation coefficient may not be valid. Explain why she comes to this conclusion.
  2. Calculate the value of Spearman’s rank correlation coefficient.
  3. Carry out a test based on this coefficient, at the \(5 \%\) significance level, to investigate whether there is any association between percentage colonisation and shoot dry weight.
Question 4
View details
4 A random number generator generates integers between 1 and 50 inclusive, with each number having an equal probability of being generated.
  1. State the probability distribution of the numbers generated.
  2. Determine the probability that a number generated is within one standard deviation of the mean.
Question 5
View details
5 A researcher is investigating whether there is any relationship between the overall performance of a student at GCSE and their grade in A Level Mathematics. Their A Level Mathematics grade is classified as A* or A, B, C or lower, and their overall performance at GCSE is classified as Low, Middle, High. Data are collected for a sample of 80 students in a particular area. The researcher carries out a chi-squared test. The screenshot below shows part of a spreadsheet used to analyse the data. Some values in the spreadsheet have been deliberately omitted.
1ABCDE
\multirow{2}{*}{
}Observed frequency
A* or ABC or lowerTotals
3Low613928
4Middle106824
5High1510328
6Totals31292080
7
8\multirow{2}{*}{}
9A* or ABC or lower
10Low10.85
11Middle9.30
12High10.85
13\multirow[b]{2}{*}{Contribution to the test statistic}
14
15A* or ABC or lower
16Low2.16800.80020.5714
17Middle0.05270.83790.6667
18High1.5873
2.2857
2.2857
19
  1. State what needs to be known about the sample for the test to be valid. For the remainder of this question, you should assume that the test is valid.
  2. Determine the missing values in each of the following cells.
    • C11
    • C18
    • In this question you must show detailed reasoning.
    Carry out a hypothesis test at the \(10 \%\) significance level to investigate whether there is any association between level of performance at GCSE and A Level Mathematics grade.
  3. Discuss briefly what the data suggest about A Level Mathematics grade for different levels of performance at GCSE.
  4. State one disadvantage of using a 10\% significance level rather than a 5\% significance level in a hypothesis test.
Question 6
View details
6 Tom has read in a newspaper that you can tell the air temperature by counting how often a cricket chirps in a period of 20 seconds. (A cricket is a type of insect.) He wants to know exactly how the temperature can be predicted. On 8 randomly selected days, when Tom can hear crickets chirping, he records the number of chirps, \(x\), made by a cricket in a 20-second interval, and also the temperature, \(y ^ { \circ } \mathrm { C }\), at that time. The data are summarised as follows.
\(n = 8 \quad \sum x = 268 \quad \sum y = 141.9 \quad \sum x ^ { 2 } = 9618 \quad \sum y ^ { 2 } = 2630.55 \quad \sum \mathrm { xy } = 5009.1\)
These data are illustrated below.
\includegraphics[max width=\textwidth, alt={}, center]{8f1e0c68-a334-4657-823e-386ab0994c02-5_661_1035_699_242}
  1. Determine the equation of the regression line of \(y\) on \(x\). Give your answer in the form \(\mathrm { y } = \mathrm { ax } + \mathrm { b }\), giving the values of \(a\) and \(b\) correct to \(\mathbf { 3 }\) significant figures.
  2. Use the equation of the regression line to predict the temperature for the following values of \(x\).
    • 35
    • 10
    • Comment on the reliability of your predictions in part (b).
    • State the coordinates of the point of intersection of the line whose equation you have calculated with the regression line of \(x\) on \(y\).
Question 7
View details
7 On average one in five packets of a breakfast cereal contains a voucher for a discount on the next packet bought. Whether or not a packet contains a voucher is independent of other packets, and can only be determined by opening the packet.
  1. State the distribution of the number of packets that need to be opened in order to find one which contains a voucher.
  2. Determine the probability that exactly 4 packets have to be opened in order to find one which contains a voucher.
  3. Determine the probability that exactly 10 packets have to be opened in order to find two which contain a voucher.
  4. I have \(n\) packets, and I open them one by one until I find a voucher or until all the packets are open. Given that the probability that I find a voucher is greater than 0.99 , determine the least possible value of \(n\).