OCR MEI Further Statistics B AS (Further Statistics B AS) Specimen

Question 1
View details
1 Abby runs a stall at a charity event. Visitors to the stall pay to play a game in which six fair dice are rolled. If the difference between the highest and lowest scores is less than 3 then the player wins \(\pounds 5\). Otherwise the player wins nothing. Abby designs the spreadsheet shown in Fig. 1 to estimate the probability of a player winning, by simulating 20 goes at the game. Cell C5, highlighted, shows that the 2nd dice in simulated game 4 scores 5 . Cells H5 and I5 show the highest and lowest scores, respectively, in game 4, and cell J5 gives the difference between them. \begin{table}[h]
C5\(\times \vee f _ { x }\)=RANDBETWEEN(1,6)
ABCDEFGHJ
1dice 1dice 2dice 3dice 4dice 5dice 6High scoreLow scoreDifference
2game 1224233422
3game 2263212615
4game 3315346615
5game 4652563624
6game 5633532624
7game 6563514615
8game 7231264615
9game 8666615615
10game 9362541615
11game 10511461615
12game 11256165615
13game 12256666624
14game 13222244422
15game 14166635615
16game 15223351514
17game 16123433413
18game 17524216615
19game 18615215615
20game 19135135514
21game 20543251
\captionsetup{labelformat=empty} \caption{Fig. 1}
\end{table}
  1. (A) Write down the numbers in columns H , I and J for game 20 .
    (B) Use the spreadsheet to estimate the probability of a player winning a game.
  2. State how the estimate of probability in (i) (B) could be improved.
  3. Give one advantage and one disadvantage of using this simulation technique compared with working out the theoretical probability. All profit made by the stall is given to charity. Abby has to decide how much to charge players to play.
  4. If Abby charges \(\pounds 1\) per game, estimate the total profit when 50 players each play the game once.
Question 2
View details
2 The cumulative distribution function of the continuous random variable, \(Y\), is given below. $$\mathrm { F } ( y ) = \left\{ \begin{array} { c c } 0 & y < 0
\frac { y ^ { 3 } - y ^ { 2 } } { 4 } & 1 \leq y \leq 2
1 & y > 2 \end{array} \right.$$
  1. Find \(\mathrm { P } ( Y \leq 1.5 )\)
  2. Verify that the median of \(Y\) lies between 1.6 and 1.7.
  3. Find the probability density function of \(Y\).
Question 3
View details
3 At a factory, flour is packed into bags. A model for the mass in grams of flour packed into each bag is \(1500 + X\), where \(X\) is a continuous random variable with probability density function $$f ( x ) = \left\{ \begin{array} { c c } k x ( 6 - x ) & 0 \leq x \leq 6
0 & \text { elsewhere, } \end{array} \right.$$ where \(k\) is a constant.
  1. Show that \(k = \frac { 1 } { 36 }\).
  2. Find the probability that a randomly selected bag of flour contains 1505 grams of flour or more.
  3. Find
    • the mean of \(X\),
    • the standard deviation of \(X\).
Question 4
View details
4 An online encyclopedia claims that the average mass of an adult European hedgehog is 720 g . In an investigation to check this average figure, the masses in grams of twelve randomly chosen adult European hedgehogs are measured and shown below.
705730720691718680
731723745708724736
  1. What assumption is required to carry out a Wilcoxon test in this situation?
  2. Given that this assumption is met, carry out a 2 -tail Wilcoxon test at the \(5 \%\) level to test whether the median mass is 720 g . You should state your hypotheses and complete the table of calculations in the Printed Answer Booklet.
Question 5
View details
5 A particular alloy of bronze is specified as containing \(11.5 \%\) copper on average. A researcher takes a random sample of 14 specimens of this bronze and undertakes an analysis of each of them. The percentages of copper are found to be as follows.
11.1211.2911.4211.4311.2011.2511.65
11.3311.5611.3411.4411.2411.6011.52
The researcher uses software to draw a Normal probability plot for these data and to conduct a Kolmogorov-Smirnov test for Normality. The output is shown in Fig 5.1. \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{0de8222f-7df5-4e17-ab68-0f9d84fc615d-4_428_1550_1434_299} \captionsetup{labelformat=empty} \caption{Fig 5.1}
\end{figure}
  1. Comment on what the Normal probability plot and the \(p\)-value of the test suggest about the data. The researcher uses software to produce a \(99 \%\) confidence interval for the mean percentage of copper in the alloy, based on the \(t\) distribution. The output from the software is shown in Fig 5.2. \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{0de8222f-7df5-4e17-ab68-0f9d84fc615d-5_1058_615_434_726} \captionsetup{labelformat=empty} \caption{Fig 5.2}
    \end{figure}
  2. State the confidence interval which the software gives, in the form \(a < \mu < b\).
  3. (A) State an assumption necessary for the use of the \(t\) distribution in the construction of this confidence interval.
    (B) State whether the assumption in part (iii) (A) seems reasonable.
  4. Does the confidence interval suggest that the copper content is different from \(11.5 \%\), on average? Explain your answer.
  5. In the output from the software shown in Fig 5.2, SE stands for 'standard error'.
    (A) Explain what a standard error is.
    (B) Show how the standard error was calculated in this case.
  6. Suggest a way in which the researcher could produce a narrower confidence interval.
Question 6
View details
6 The table below shows the mean and variance of the test scores of a random samples of 70 girls who are starting an A level Mathematics course.
Sample meanSample variance
118.8686.57
  1. Showing your working, find a \(95 \%\) confidence interval for the population mean.
  2. Explain why you can construct the interval in part (i) despite no information about the distribution of the parent population being given.
  3. The same random sample of girls repeats the test. The mean improvement in score is 0.9 . The \(95 \%\) confidence interval for the improvement is \([ - 1.5,3.3 ]\). What is the sample variance for the improvement in score?
Question 7
View details
7 Two flatmates work at the same location. One of them takes the bus to work and the other one cycles. Journey times, measured in minutes, are distributed as follows.
  • By bus: Normally distributed with mean 23 and standard deviation 6
  • By bicycle: Normally distributed with mean 21 and standard deviation 2
You should assume that all journey times are independent.
  1. One morning the two flatmates set out at the same time. Find the probability that the person who takes the bus arrives before the cyclist.
  2. Find the probability that the total time taken for 5 bus journeys is less than 2 hours.
  3. Comment on the assumption that all journey times are independent. \section*{END OF QUESTION PAPER} OCR is committed to seeking permission to reproduce all third-party content that it uses in the assessment materials. OCR has attempted to identify and contact all copyright holders whose work is used in this paper. To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced in the OCR Copyright Acknowledgements booklet. This is produced for each series of examinations and is freely available to download from our public website (\href{http://www.ocr.org.uk}{www.ocr.org.uk}) after the live examination series. If OCR has unwittingly failed to correctly acknowledge or clear any third-party content in this assessment material, OCR will be happy to correct its mistake at the earliest possible opportunity.
    For queries or further information please contact the Copyright Team, First Floor, 9 Hills Road, Cambridge CB2 1GE.
    OCR is part of the Cambridge Assessment Group; Cambridge Assessment is the brand name of University of Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge. }\section*{}