OCR Further Statistics (Further Statistics) 2022 June

Question 1
View details
1 A researcher wishes to find people who say that they support a specific plan. Each day the researcher interviews people at random, one after the other, until they find one person who says that they support this plan. The researcher does not then interview any more people that day. The total number of people interviewed on any one day is denoted by \(R\).
  1. Assume that in fact \(1 \%\) of the population would say that they support the plan.
    1. State an appropriate distribution with which to model \(R\), giving the value(s) of any parameter(s).
    2. Find \(\mathrm { P } ( 50 < R \leqslant 150 )\). The researcher incorrectly believes that the variance of a random variable \(X\) with any discrete probability distribution is given by the formula \([ \mathrm { E } ( X ) ] ^ { 2 } - \mathrm { E } ( X )\).
  2. Show that, for the type of distribution stated in part (a), they will obtain the correct value of the variance, regardless of the value(s) of the parameter(s).
Question 2
View details
2 The directors of a large company believe that there are more computer failures in the Head Office when temperatures are higher. They obtain data for the Head Office for the maximum temperature, \(T ^ { \circ } \mathrm { C }\), and the number of computer failures, \(X\), on each of 12 randomly chosen days.
  1. State which of the following words can be applied to \(T\). Dependent Independent Controlled Response The data is summarised as follows.
    \(n = 12 \quad \sum t = 261 \quad \sum x = 41 \quad \sum t ^ { 2 } = 5869 \quad \sum x ^ { 2 } = 311 \quad \sum \mathrm { tx } = 1021\)
  2. Calculate the value of the product moment correlation coefficient \(r\).
  3. The directors wish to investigate their belief using a significance test at the \(1 \%\) level.
    1. Explain why a 1-tail test is appropriate in this situation.
    2. Carry out the test.
  4. One of the directors prefers the temperatures to be given in Fahrenheit ( \({ } ^ { \circ } \mathrm { F }\) ), rather than Centigrade ( \({ } ^ { \circ } \mathrm { C }\) ). The relationship between F and C is \(\mathrm { F } = \frac { 9 } { 5 } \mathrm { C } + 32\).
    State the value of \(r\) that would result from using temperatures in Fahrenheit in the calculation.
Question 3
View details
3 In this question you must show detailed reasoning. A discrete random variable \(V\) has the following probability distribution, where \(p\) and \(q\) are constants.
\(v\)0123
\(\mathrm { P } ( \mathrm { V } = \mathrm { v } )\)\(p\)\(q\)0.120.2
It is given that \(\mathrm { E } ( V ) = \operatorname { Var } ( V )\). Determine the value of \(p\) and the value of \(q\).
Question 4
View details
4 The manager of a car breakdown service uses the distribution \(\operatorname { Po } ( 2.7 )\) to model the number of punctures, \(R\), in a 24-hour period in a given rural area. The manager knows that, for this model to be valid, punctures must occur randomly and independently of one another.
  1. State a further assumption needed for the Poisson model to be valid.
  2. State the value of the standard deviation of \(R\).
  3. Use the model to calculate the probability that, in a randomly chosen period of 168 hours, at least 22 punctures occur. The manager uses the distribution \(\operatorname { Po } ( 0.8 )\) to model the number of flat batteries in a 24 -hour period in the same rural area, and he assumes that instances of flat batteries are independent of punctures. A day begins and ends at midnight, and a "bad" day is a day on which there are more than 6 instances, in total, of punctures and flat batteries.
  4. Assume first that both the manager's models are correct. Calculate the probability that a randomly chosen day is a "bad" day.
  5. It is found that 12 of the next 100 days are "bad" days. Comment on whether this casts doubt on the validity of the manager's models.
Question 5
View details
5 A company uses two drivers for deliveries.
Driver \(A\) charges a fixed rate of \(\pounds 80\) per day plus \(\pounds 2\) per mile travelled on that day. Driver \(B\) charges a fixed rate of \(\pounds 120\) per day plus \(\pounds 1.50\) per mile travelled on that day.
On each working day the total distance, in miles, travelled by each driver is a random variable with the distribution \(\mathrm { N } ( 83,360 )\).
  1. Find the probability that driver \(A\) charges the company less than \(\pounds 235.00\) for a randomly chosen day’s deliveries.
  2. Find the probability that the total charge to the company of three randomly chosen days' deliveries by driver \(A\) is at least \(\pounds 300\) more than the total charge of two randomly chosen days' deliveries by driver \(B\).
Question 6
View details
6 The random variable \(X\) was assumed to have a normal distribution with mean \(\mu\). Using a random sample of size 128, a significance test was carried out using the following hypotheses.
\(\mathrm { H } _ { 0 } : \mu = 30\)
\(\mathrm { H } _ { 1 } : \mu > 30\)
It was found that \(\sum x = 3929.6\) and \(\sum x ^ { 2 } = 123483.52\). The conclusion of the test was to reject the null hypothesis.
  1. Determine the range of possible values of the significance level of the test.
  2. It was subsequently found that \(X\) was not normally distributed. Explain whether this invalidates the conclusion of the test.
Question 7
View details
7 The continuous random variable \(X\) has probability density function
\(f ( x ) = \begin{cases} k x ^ { n } & 0 \leqslant x \leqslant 1 ,
0 & \text { otherwise, } \end{cases}\)
where \(k\) is a constant and \(n\) is a parameter whose value is positive. It is given that the median of \(X\) is 0.8816 correct to 4 decimal places. Ten independent observations of \(X\) are obtained. Find the expected number of observations that are less than 0.8 .
Question 8
View details
8 The critical region for an \(r\) \% two-tailed Wilcoxon signed-rank test, based on a large sample of size \(n\), is \(\left\{ W _ { + } \leqslant 113 \right\} \cup \left\{ W _ { + } \geqslant 415 \right\}\).
  1. Show that \(n = 32\).
  2. Using a suitable approximation, determine the value of \(r\).
Question 9
View details
9 The head teacher of a school believes that, on average, pupil absences on the days Monday, Tuesday, Wednesday, Thursday and Friday are in the ratio \(3 : 2 : 2 : 2 : 3\). The head teacher takes a random sample of 120 pupil absences. The results are as follows.
Day of weekMondayTuesdayWednesdayThursdayFriday
Number of absences2816241636
  1. Test at the \(5 \%\) significance level whether these results are consistent with the head teacher's belief. A significance test at the \(5 \%\) level is also carried out on a second, independent, random sample of \(n\) pupil absences. All the numbers of absences are integers. The ratio of the numbers of absences for each day in this sample is identical to the ratio of the numbers of absences for each day in the original sample of size 120.
  2. Determine the smallest value of \(n\) for which the conclusion of this significance test is that the data are not consistent with the head teacher's belief.