Questions (30179 questions)

Browse by board
AQA AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further AS Paper 1 Further AS Paper 2 Discrete Further AS Paper 2 Mechanics Further AS Paper 2 Statistics Further Paper 1 Further Paper 2 Further Paper 3 Discrete Further Paper 3 Mechanics Further Paper 3 Statistics M1 M2 M3 Paper 1 Paper 2 Paper 3 S1 S2 S3 CAIE FP1 FP2 Further Paper 1 Further Paper 2 Further Paper 3 Further Paper 4 M1 M2 P1 P2 P3 S1 S2 Edexcel AEA AS Paper 1 AS Paper 2 C1 C12 C2 C3 C34 C4 CP AS CP1 CP2 D1 D2 F1 F2 F3 FD1 FD1 AS FD2 FD2 AS FM1 FM1 AS FM2 FM2 AS FP1 FP1 AS FP2 FP2 AS FP3 FS1 FS1 AS FS2 FS2 AS M1 M2 M3 M4 M5 P1 P2 P3 P4 PMT Mocks Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 OCR AS Pure C1 C2 C3 C4 D1 D2 FD1 AS FM1 AS FP1 FP1 AS FP2 FP3 FS1 AS Further Additional Pure Further Additional Pure AS Further Discrete Further Discrete AS Further Mechanics Further Mechanics AS Further Pure Core 1 Further Pure Core 2 Further Pure Core AS Further Statistics Further Statistics AS H240/01 H240/02 H240/03 M1 M2 M3 M4 Mechanics 1 PURE Pure 1 S1 S2 S3 S4 Stats 1 OCR MEI AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further Extra Pure Further Mechanics A AS Further Mechanics B AS Further Mechanics Major Further Mechanics Minor Further Numerical Methods Further Pure Core Further Pure Core AS Further Pure with Technology Further Statistics A AS Further Statistics B AS Further Statistics Major Further Statistics Minor M1 M2 M3 M4 Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 WJEC Further Unit 1 Further Unit 2 Further Unit 3 Further Unit 4 Further Unit 5 Further Unit 6 Unit 1 Unit 2 Unit 3 Unit 4
OCR MEI Further Statistics A AS Specimen Q5
8 marks Moderate -0.8
5 In a recent report, it was stated that \(40 \%\) of working people have a degree. For the whole of this question, you should assume that this is true. A researcher wishes to interview a working person who has a degree. He asks working people at random whether they have a degree and counts the number of people he has to ask until he finds one with a degree.
  1. Find the probability that he has to ask 5 people.
  2. Find the mean number of people the researcher has to ask. Subsequently, the researcher decides to take a random sample from the population of working people.
  3. A random sample of 5 working people is chosen. What is the probability that at least one of them has a degree?
  4. How large a random sample of working people would the researcher need to take to ensure that the probability that at least one person has a degree is 0.99 or more?
OCR MEI Further Statistics A AS Specimen Q6
12 marks Standard +0.3
6 A motorist decides to check the fuel consumption, \(y\) miles per gallon, of her car at particular speeds, \(x \mathrm { mph }\), on flat roads. She carries out the check on a suitable stretch of motorway. Fig. 6 shows her results. \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{880026ad-1cd3-40bb-bc87-8dcc94bd9bbd-4_707_1091_1320_477} \captionsetup{labelformat=empty} \caption{Fig. 6}
\end{figure}
  1. Explain why it would not be appropriate to carry out a hypothesis test for correlation based on the product moment correlation coefficient.
  2. (A) One of the results is an outlier. Circle the outlier on the copy of Fig. 6 in the Printed Answer Booklet.
    (B) Suggest one possible reason for the outlier in part (ii) (A) not being used in any analysis. The motorist decides to remove this item of data from any analysis. The table below shows part of a spreadsheet that was used to analyse the 14 remaining data items (with the outlier removed). Some rows of the spreadsheet have been deliberately omitted.
    Data item\(x\)\(y\)\(x ^ { 2 }\)\(y ^ { 2 }\)\(x y\)
    15053.625002872.962680
    25053.325002840.892665
    137044.849002007.043136
    147044.249001953.643094
    Sum8406865115033779.740812
  3. Calculate the equation of the regression line of \(y\) on \(x\).
  4. Use the equation of the regression line to predict the fuel consumption of the car at
    (A) 58 mph ,
    (B) 30 mph .
  5. Comment on the reliability of your predictions in part (iv). OCR is committed to seeking permission to reproduce all third-party content that it uses in the assessment materials. OCR has attempted to identify and contact all copyright holders whose work is used in this paper. To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced in the OCR Copyright Acknowledgements booklet. This is produced for each series of examinations and is freely available to download from our public website (\href{http://www.ocr.org.uk}{www.ocr.org.uk}) after the live examination series. If OCR has unwittingly failed to correctly acknowledge or clear any third-party content in this assessment material, OCR will be happy to correct its mistake at the earliest possible opportunity.
    For queries or further information please contact the Copyright Team, First Floor, 9 Hills Road, Cambridge CB2 1GE.
    OCR is part of the Cambridge Assessment Group; Cambridge Assessment is the brand name of University of Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge. }\section*{}
OCR MEI Further Statistics B AS 2018 June Q1
6 marks Moderate -0.8
1 The birth weights, in kilograms, of a random sample of 9 captive-bred elephants are as follows. $$\begin{array} { l l l l l l l l l } 94 & 138 & 130 & 118 & 146 & 165 & 82 & 115 & 69 \end{array}$$ A researcher uses software to produce a \(99 \%\) confidence interval for the mean birth weight of captive-bred elephants. The output from the software is shown in Fig. 1. \begin{table}[h]
Result
T Estimate of a Mean
Mean
s
SE
N
df
Lower limit
Upper limit
Interval
\captionsetup{labelformat=empty} \caption{Fig. 1}
\end{table}
  1. State an assumption about the distribution of the population from which these weights come that is necessary in order to produce this interval.
  2. State the confidence interval which the software gives, in the form \(a < \mu < b\).
  3. Explain
    • what the label df means,
    • how the value of df is calculated for a confidence interval produced using the \(t\) distribution.
    • State two ways in which the researcher could have obtained a narrower confidence interval.
OCR MEI Further Statistics B AS 2018 June Q2
9 marks Standard +0.3
2 A supermarket sells oranges. Their weights are modelled by the random variable \(X\) which has a Normal distribution with mean 345 grams and standard deviation 15 grams. When the oranges have been peeled, their weights in grams, \(Y\), are modelled by \(Y = 0.7 X\).
  1. Find the probability that a randomly chosen peeled orange weighs less than 250 grams. I randomly choose 5 oranges to buy.
  2. Find the probability that the total weight of the 5 unpeeled oranges is at least 1800 grams.
  3. I peel three of the oranges and leave the remaining two unpeeled. Find the probability that the total weight of the two unpeeled oranges is greater than the total weight of the three peeled ones.
OCR MEI Further Statistics B AS 2018 June Q3
10 marks Standard +0.3
3 The probability density function of the continuous random variable \(X\) is given by $$\mathrm { f } ( x ) = \begin{cases} c + x & - c \leqslant x \leqslant 0 \\ c - x & 0 \leqslant x \leqslant c \\ 0 & \text { otherwise } \end{cases}$$ where \(c\) is a positive constant.
  1. (A) Sketch the graph of the probability density function.
    (B) Show that \(c = 1\).
  2. Find \(\mathrm { P } \left( X < \frac { 1 } { 4 } \right)\).
  3. Find
    • the mean of \(X\),
    • the standard deviation of \(X\).
OCR MEI Further Statistics B AS 2018 June Q4
15 marks Easy -1.2
4 The random variable \(X\) has a continuous uniform distribution on [ 0,10 ].
  1. Find \(\mathrm { P } ( 3 < X < 6 )\).
  2. Find each of the following.
    • \(\mathrm { E } ( X )\)
    • \(\operatorname { Var } ( X )\)
    Marisa is investigating the sample mean, \(Y\), of 8 independent values of \(X\). She designs a simulation shown in the spreadsheet in Fig. 4.1. Each of the 25 rows below the heading row consists of 8 values of \(X\) together with the value of \(Y\). All of the values in the spreadsheet have been rounded to 2 decimal places. \begin{table}[h]
    1ABCDEFGHIJ
    1\(X _ { 1 }\)\(X _ { 2 }\)\(X _ { 3 }\)\(X _ { 4 }\)\(X _ { 5 }\)\(X _ { 6 }\)\(X _ { 7 }\)\(X _ { 8 }\)\(Y\)
    26.312.453.273.064.161.530.437.993.65
    31.701.527.108.936.442.709.967.835.77
    49.150.524.956.996.523.150.815.354.68
    50.652.717.929.650.504.876.462.674.43
    63.096.113.960.090.184.670.676.203.12
    77.065.841.973.609.361.974.483.474.72
    81.461.575.450.373.767.568.489.124.72
    99.421.854.911.611.948.001.775.344.36
    102.985.322.914.129.161.769.976.885.39
    112.833.443.287.851.000.938.774.034.01
    124.510.595.849.878.653.947.180.235.10
    134.490.693.658.784.968.963.771.434.59
    146.578.084.856.757.920.279.694.046.02
    158.351.098.638.047.232.122.579.595.95
    165.249.536.088.213.617.076.657.636.75
    177.895.503.090.716.475.496.474.955.07
    188.367.272.359.040.582.263.017.905.10
    193.761.019.619.657.899.986.284.346.56
    209.946.843.385.530.268.535.725.125.66
    217.259.100.342.884.662.656.377.635.11
    227.187.145.380.044.096.474.964.234.94
    238.695.044.902.942.004.234.130.974.11
    243.466.330.489.350.231.187.976.374.42
    252.377.267.161.245.262.803.553.844.19
    262.168.307.173.322.961.309.110.314.33
    27
    \captionsetup{labelformat=empty} \caption{Fig. 4.1}
    \end{table}
  3. Use the spreadsheet to estimate \(\mathrm { P } ( 3 < Y < 6 )\).
  4. Explain why it is not surprising that this estimated probability is substantially greater than the value which you calculated in part (i). Marisa wonders whether, even though the sample size is only 8, use of the Central Limit Theorem will provide a good approximation to \(\mathrm { P } ( 3 < Y < 6 )\).
  5. Calculate an estimate of \(\mathrm { P } ( 3 < Y < 6 )\) using the Central Limit Theorem. A Normal probability plot of the 25 simulated values of \(Y\) is shown in Fig. 4.2. \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{0c58d4d7-10e9-473a-888a-b407ec90bf08-5_800_1291_306_386} \captionsetup{labelformat=empty} \caption{Fig. 4.2}
    \end{figure}
  6. Explain what the Normal probability plot suggests about the use of the Central Limit Theorem to approximate \(\mathrm { P } ( 3 < Y < 6 )\). Marisa now decides to use a spreadsheet with 1000 rows below the heading row, rather than the 25 which she used in the initial simulation shown in Fig. 4.1. She uses a counter to count the number of values of \(Y\) between 3 and 6. This value is 808.
  7. Explain whether the value 808 supports the suggestion that the Central Limit Theorem provides a good approximation to \(\mathrm { P } ( 3 < Y < 6 )\). Marisa decides to repeat each of her two simulations many times in order to investigate how variable the probability estimates are in each case.
  8. Explain whether you would expect there to be more, the same or less variability in the probability estimates based on 1000 rows than in the probability estimates based on 25 rows.
OCR MEI Further Statistics B AS 2018 June Q5
10 marks Moderate -0.3
5 The flight time between two airports is known to be Normally distributed with mean 3.75 hours and standard deviation 0.21 hours. A new airline starts flying the same route. The flight times for a random sample of 12 flights with the new airline are shown in the spreadsheet (Fig. 5), together with the sample mean. \begin{table}[h]
ABCDEFGHIJKL
13.5953.7233.5843.6433.6693.6973.5503.6743.9243.5633.3303.706
2
3Mean3.638
\captionsetup{labelformat=empty} \caption{Fig. 5}
\end{table} \section*{(i) In this question you must show detailed reasoning.} You should assume that:
  • the flight times for the new airline are Normally distributed,
  • the standard deviation of the flight times is still 0.21 hours.
Carry out a test at the \(5 \%\) significance level to investigate whether the mean flight time for the new airline is less than 3.75 hours.
(ii) If both of the assumptions in part (i) were false, name an alternative test that you could carry out to investigate average flight times, stating any assumption necessary for this test.
(iii) If instead the flight times were still Normally distributed but the standard deviation was not known to be 0.21 hours, name another test that you could carry out.
OCR MEI Further Statistics B AS 2018 June Q6
10 marks Standard +0.3
6 A company has a large fleet of cars. It is claimed that use of a fuel additive will reduce fuel consumption. In order to test this claim a researcher at the company randomly selects 40 of the cars. The fuel consumption of each of the cars is measured, both with and without the fuel additive. The researcher then calculates the difference \(d\) litres per kilometre between the two figures for each car, where \(d\) is the fuel consumption without the additive minus the fuel consumption with the additive. The sample mean of \(d\) is 0.29 and the sample standard deviation is 1.64 .
  1. Showing your working, find a 95\% confidence interval for the population mean difference.
  2. Explain whether the confidence interval suggests that, on average, the fuel additive does reduce fuel consumption.
  3. Explain why you can construct the interval in part (i) despite not having any information about the distribution of the population of differences.
  4. Explain why the sample used was random.
OCR MEI Further Statistics B AS 2019 June Q1
6 marks Standard +0.3
1 It is known that the red blood cell count of adults in a particular country, measured in suitable units, has mean 4.96 and variance 0.15.
  1. Find the probability that the mean red blood cell count of a random sample of 50 adults from this country is at least 5.00.
  2. Explain how you can find the probability in part (a) despite the fact that you do not know the distribution of red blood cell counts.
OCR MEI Further Statistics B AS 2019 June Q2
8 marks Moderate -0.8
2 Leila and Caleb are playing a game, using fair six-sided dice and unbiased coins.
  • Leila rolls two dice, and her score \(L\) is the total of the scores on the two dice.
  • Caleb spins 4 coins and his score \(C\) is three times the number of heads obtained.
The winner of a game is the player with the higher score. If the two scores are equal, the result of the game is a draw. The spreadsheet in Fig. 2 shows a simulation of 20 plays of the game. \begin{table}[h]
ABCDEFGH
1First diceSecond diceTotal (Leila's score) \(\boldsymbol { L }\)Coin 1Coin 2Coin 3Coin 4Caleb's score \(\boldsymbol { C }\)
2123HTTT3
3617THTT3
4268HHTT6
5257THHH9
6156THTT3
7527HHHH12
8112HTHT6
9268THTH6
10628HTHT6
11134THHH9
12617THTT3
13314TTTT0
14369HTHH9
15235THHH9
16257HHHH12
17156HHTH9
185611THHH9
19426THHT6
206511TTHH6
21112TTTT0
\captionsetup{labelformat=empty} \caption{Fig. 2}
\end{table}
  1. Explain why the value of \(C\) in row 2 is 3 .
  2. Use the spreadsheet to estimate \(\mathrm { P } ( C > 6 )\) and \(\mathrm { P } ( L > 6 )\).
  3. Use the spreadsheet to estimate the probability that Leila loses a randomly chosen game.
  4. Explain why your answers to parts (b) and (c) may not be very close to the true values.
  5. Leila claims that the game is fair (that Leila and Caleb each have an equal chance of winning) because both she and Caleb can get a maximum score of 12 and also in the simulation she won exactly \(50 \%\) of the games.
    Make 2 comments about Leila's claim.
OCR MEI Further Statistics B AS 2019 June Q3
9 marks Moderate -0.3
3 A bus runs from point A on the outskirts of a city, stops at point B outside the rail station, and continues to point C in the city centre.
The journey times for the sections A to B and B to C vary according to traffic conditions, and are modelled by independent Normal distributions with means and standard deviations as shown in the table.
\multirow{2}{*}{}Journey time (minutes)
\cline { 2 - 3 }MeanStandard deviation
A to B213
B to C294
  1. Find the probability that a randomly chosen journey from A to B takes less than the scheduled time of 23 minutes. For every journey, the bus stops for 1 minute when it reaches B to drop off and pick up passengers.
  2. Find the probability that a randomly chosen journey from A to C takes less than the scheduled time of 50 minutes. Mary travels on the bus from the station at B to her workplace at C every working day. You should assume that times for her bus journeys on different days are independent.
  3. Find the probability that the total time taken for her five journeys on the bus in a randomly chosen week is at least \(2 \frac { 1 } { 2 }\) hours.
  4. Comment on the assumption that times on different days are independent.
OCR MEI Further Statistics B AS 2019 June Q4
12 marks Standard +0.8
4 The cumulative distribution function of the continuous random variable \(X\) is given by \(\mathrm { F } ( x ) = \begin{cases} 0 & x < 0 , \\ k \left( 12 x - x ^ { 2 } \right) & 0 \leqslant x \leqslant 2 , \\ 1 & x > 2 , \end{cases}\) where \(k\) is a constant.
  1. Show that \(k = 0.05\).
  2. Find \(\mathrm { P } ( 1 \leqslant X \leqslant 1.5 )\).
  3. Find the median of \(X\), correct to 3 significant figures.
  4. Find which of the median, mean and mode of \(X\) is the largest of the three measures of central tendency.
OCR MEI Further Statistics B AS 2019 June Q5
14 marks Standard +0.3
5 A technician is investigating whether a batch of nylon 66 (a particular type of nylon) is contaminated by another type of nylon.
The average melting point of nylon 66 is \(264 ^ { \circ } \mathrm { C }\). However, if the batch is contaminated by the other type of nylon the melting point will be lower. The melting points, in \({ } ^ { \circ } \mathrm { C }\), of a random sample of 8 pieces of nylon from the batch are as follows. \(\begin{array} { l l l l l l l l } 262.7 & 265.0 & 264.1 & 261.7 & 262.9 & 263.5 & 261.3 & 262.6 \end{array}\)
  1. Find
    • the sample mean,
    • the sample standard deviation.
    The technician produces a Normal probability plot and carries out a Kolmogorov-Smirnov test for these data as shown in Fig. 5. \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{1db026a2-dffc-4877-b927-247fbf0e7a78-5_560_1358_982_246} \captionsetup{labelformat=empty} \caption{Fig. 5}
    \end{figure}
  2. Comment on what the Normal probability plot and the \(p\)-value of the test suggest about the data.
  3. In this question you must show detailed reasoning. Carry out a suitable test at the \(5 \%\) significance level to investigate whether the batch appears to be contaminated with another type of nylon.
  4. Name an alternative test that could have been carried out if the population standard deviation had been known.
OCR MEI Further Statistics B AS 2019 June Q6
11 marks Standard +0.3
6 The label on a pack of strawberries in a large batch states that it holds 250 g of strawberries. A random sample of 40 packs from the batch is selected and software is used to produce a \(95 \%\) confidence interval for the mean weight of strawberries per pack. An extract from the software output is shown in Fig. 6. \begin{table}[h]
Sample Mean248.92
Standard Error0.61506
Sample Size40
Confidence Level0.95
Interval\(248.92 \pm 1.2055\)
\captionsetup{labelformat=empty} \caption{Fig. 6}
\end{table}
  1. Explain whether the confidence interval suggests that the mean weight of strawberries per pack in the batch is different from 250 g .
  2. A manager looking at the data says that the conclusion would have been different if a \(90 \%\) confidence interval had been used.
    Determine whether the manager is correct.
  3. Explain briefly whether or not it is appropriate for the manager to vary the confidence level before coming to any conclusions.
    [0pt]
  4. On another occasion, using the same sample size, a 95\% confidence interval for the mean weight of strawberries per pack is [248.05, 249.95].
    Find the sample variance in this case.
  5. Explain the meaning of a 95\% confidence interval.
OCR MEI Further Statistics B AS 2022 June Q1
10 marks Moderate -0.3
1 Each working day, Beth takes a bus to her place of work. She believes that the mean time that her journey takes is 30 minutes. In order to check this, Beth selects a random sample of 8 journeys. The times in minutes for these 8 journeys are as follows. \(\begin{array} { l l l l l l l l } 31.9 & 28.5 & 35.9 & 31.0 & 30.2 & 34.9 & 28.9 & 31.3 \end{array}\)
  1. What assumption does Beth need to make in order to construct a confidence interval for the mean journey time based on the \(t\) distribution?
  2. In this question you must show detailed reasoning. Given that the assumption in part (a) is valid, determine a 95\% confidence interval for the mean journey time.
  3. Explain whether the confidence interval suggests that Beth may be correct in the belief that her mean journey time is 30 minutes.
OCR MEI Further Statistics B AS 2022 June Q2
6 marks Moderate -0.8
2 The continuous random variable \(X\) has cumulative distribution function given by \(F ( x ) = \begin{cases} 0 & x < a , \\ \frac { x - a } { b - a } & a \leqslant x \leqslant b , \\ 1 & x > b , \end{cases}\) where \(a\) and \(b\) are constants with \(0 < \mathrm { a } < \mathrm { b }\).
  1. Find \(\mathrm { P } \left( \mathrm { X } < \frac { 1 } { 2 } ( \mathrm { a } + \mathrm { b } ) \right)\).
  2. Sketch the graph of the probability density function of \(X\).
  3. Find the variance of \(X\) when \(a = 2\) and \(b = 8\).
OCR MEI Further Statistics B AS 2022 June Q3
8 marks Standard +0.3
3 A local council collects domestic kitchen waste for composting. Householders place their kitchen waste in a 'compost bin' and this is emptied weekly by the council. The average weight of kitchen waste collected per household each week is known to be 3.4 kg . The council runs a campaign to try to increase the amount of kitchen waste per household which is put in the compost bin. After the campaign, a random sample of 40 households is selected and the weights in kg of kitchen waste in their compost bins are measured. A hypothesis test is carried out in order to investigate whether the campaign has been successful, using software to analyse the sample. The output from the software is shown below.
â–ˇ
Z Test of a Mean
Null Hypothesis \(\mu = 3.4\) Alternative Hypothesis \(\bigcirc < 0 > 0 \neq\) Sample
Mean3.565
s1.05
N40
Result
Z Test of a Mean
Mean3.565
S1.05
SE0.1660
N40
Z0.994
p0.160
  1. Explain why the test is based on the Normal distribution even though the distribution of the population of amounts of kitchen waste per household is not known.
  2. Using the output from the software, complete the test at the \(5 \%\) significance level.
  3. Show how the value of \(Z\) in the software output was calculated.
  4. Calculate the least value of the sample mean which would have resulted in the conclusion of the test in part (b) being different. You should assume that the standard error is unchanged.
OCR MEI Further Statistics B AS 2022 June Q4
10 marks Standard +0.3
4 A wood contains a large number of mature beech trees. The diameters in centimetres of a random sample of 10 of these trees are as follows. \(\begin{array} { l l l l l l l l l l } 82.6 & 79.2 & 77.8 & 38.4 & 88.1 & 32.2 & 26.5 & 23.4 & 94.3 & 104.2 \end{array}\) A tree surgeon wants to know if the average diameter of mature beech trees in this wood is 50 cm . The tree surgeon produces a Normal probability plot for these data. \includegraphics[max width=\textwidth, alt={}, center]{4caa7409-cb32-41da-ad64-012a45753296-4_796_1230_589_230}
  1. Explain why the tree surgeon should not carry out a test based on the \(t\) distribution.
  2. Carry out a suitable test at the \(5 \%\) significance level to investigate whether the average diameter of mature beech trees in this wood is 50 cm .
OCR MEI Further Statistics B AS 2022 June Q5
8 marks Standard +0.3
5 Layla works at an internet café. Each terminal at the café has its own keyboard, and keyboards need to be replaced whenever faults develop. Layla knows that the number of weeks for which a keyboard lasts before it needs to be replaced can be modelled by the random variable \(X\), which has an exponential distribution with mean 20 and variance 400 . She wants to investigate how likely it is that the keyboard at a terminal will need to be replaced at least 3 times within a year (taken as being a period of 52 weeks). Layla designs the simulation shown in the spreadsheet below. Each of the 20 rows below the heading row consists of 3 values of \(X\) together with their sum \(T\). All of the values in the spreadsheet have been rounded to 1 decimal place.
ABCD
1\(\mathrm { X } _ { 1 }\)\(\mathrm { X } _ { 2 }\)\(\mathrm { X } _ { 3 }\)T
210.921.55.337.7
323.952.485.3161.6
45.210.424.039.6
52.914.40.818.1
69.043.349.7102.0
70.416.212.429.0
844.139.522.1105.7
99.243.613.966.7
1040.410.96.157.4
113.254.815.773.7
125.36.11.613.0
1320.528.922.972.3
1437.32.128.668.0
157.113.650.170.8
1618.62.09.329.9
179.01.249.960.1
181.99.569.881.2
199.02.110.421.5
2028.71.493.8123.9
211.82.934.839.5
  1. Explain why \(T\) represents the number of weeks after which the third keyboard at a terminal will need to be replaced.
  2. Use the information in the spreadsheet to write down an estimate of \(\mathrm { P } ( T > 52 )\).
  3. Explain how you could obtain a more reliable estimate of \(\mathrm { P } ( T > 52 )\).
  4. The internet café has 50 terminals. You are given that faults in keyboards occur independently of each other. Determine an estimate of the probability that the mean number of weeks before which the third keyboard at a terminal needs to be replaced is more than 52 .
OCR MEI Further Statistics B AS 2022 June Q6
9 marks Standard +0.3
6 The length \(L\) of a particular type of fence panel is Normally distributed with mean 179.2 cm and standard deviation 0.8 cm . You should assume that the lengths of individual fence panels are independent of each other.
  1. Find the probability that the length of a randomly chosen fence panel is at least 180 cm .
  2. Find the probability that the total length of 5 randomly chosen fence panels is less than 895 cm . The width \(W\) of a fence post is Normally distributed with mean 9.8 cm and standard deviation 0.3 cm . A straight fence is constructed using 6 posts and 5 panels with no gaps between them. Fig. 6 shows a view from above of the first two posts, the first panel and the start of the second panel. You should assume that the lengths of fence panels and widths of fence posts are independent. \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{4caa7409-cb32-41da-ad64-012a45753296-6_213_1522_934_244} \captionsetup{labelformat=empty} \caption{Fig. 6}
    \end{figure}
  3. Determine the probability that the total length of the fence, including the posts, is less than 9.5 m .
  4. State another assumption that is necessary for the calculation of the probability in part (c) to be valid.
OCR MEI Further Statistics B AS 2022 June Q7
9 marks Standard +0.3
7 Many cars have pollen filters to try to remove as much pollen as possible from the passenger compartment. In a test car, the amount of pollen is regularly monitored. The amount of pollen is measured using a scale from 0 to 1 , and is modelled by the continuous random variable \(X\) with probability density function given by \(f ( x ) = \begin{cases} k \left( 5 x ^ { 4 } - 16 x ^ { 2 } + 11 x \right) & 0 \leqslant x \leqslant 1 , \\ 0 & \text { otherwise } , \end{cases}\) where \(k\) is a positive constant.
  1. Show that \(k = \frac { 6 } { 7 }\).
  2. Determine \(\mathrm { P } ( X < \mathrm { E } ( X ) )\).
  3. Verify that the median amount of pollen according to the model lies between 0.417 and 0.418.
OCR MEI Further Statistics B AS 2021 November Q1
9 marks Moderate -0.8
1 Over time LED light bulbs gradually lose brightness. For a particular type of LED bulb, it is known that the mean reduction in brightness after 10000 hours is \(2.6 \%\). A manufacturer produces a new version of this bulb, which costs less to make, but is claimed to have the same reduction in brightness after 10000 hours as the previous version. In order to check this claim, a random sample of 10 bulbs is selected. For each bulb, the original brightness and the brightness after 10000 hours are measured, in suitable units. A spreadsheet is used to produce a \(95 \%\) confidence interval for the mean percentage reduction in brightness. A screenshot of the spreadsheet is shown in Fig. 1. \begin{table}[h]
ABCDEFGH1JK
1Original brightness1075112111061095110111091114112311081115
2After 10000 hours1042108410761065107010791081109110801082
3Percentage reduction3.073.302.712.742.822.712.962.852.532.96
4
5
6Sample mean (\%)2.8650
7Sample sd (\%)0.2179
8SE0.0689
9DF9
10tvalue2.262
11Lower limit2.709
12Upper limit3.021
1.3
\captionsetup{labelformat=empty} \caption{Fig. 1}
\end{table}
  1. State the confidence interval in the form \(a < \mu < b\).
  2. Explain whether the confidence interval suggests that the mean percentage reduction in brightness after 10000 hours is different from 2.6\%.
  3. Explain how the value in cell B8 was calculated.
  4. State an assumption necessary for this confidence interval to be calculated.
  5. Explain the advantage of using the same bulbs for both measurements.
OCR MEI Further Statistics B AS 2021 November Q2
10 marks Moderate -0.3
2 Natasha is investigating binomial distributions. She constructs the spreadsheet in Fig. 2 which shows the first 3 and last 4 rows of a simulation involving two independent variables, \(X\) and \(Y\), with distributions \(\mathrm { B } ( 10,0.3 )\) and \(\mathrm { B } ( 50,0.3 )\) respectively. The spreadsheet also shows the corresponding value of the random variable \(Z\), defined by \(Z = 5 X - Y\), for each pair of values of \(X\) and \(Y\). There are 100 simulated values of each of \(X , Y\) and \(Z\). The spreadsheet also shows whether each value of \(Z\) is greater than 6, and cells D103 and D104 show the number of values of \(Z\) which are greater than 6 and not greater than 6 respectively. \begin{table}[h]
1ABCDE
1XY\(\mathbf { Z } = \mathbf { 5 } \mathbf { X } - \mathbf { Y }\)\(\mathbf { Z } > \mathbf { 6 }\)
24137Y
34173N
4321-6N
5
6
98114-9N
9951213Y
100318-3N
1013150N
102
103Number of Y19
104Number of N81
105
\captionsetup{labelformat=empty} \caption{Fig. 2}
\end{table}
  1. Use the information in the spreadsheet to write down an estimate of \(\mathrm { P } ( Z > 6 )\).
  2. Explain how a more reliable estimate of \(\mathrm { P } ( Z > 6 )\) could be obtained.
    1. State the greatest possible value of \(Z\).
    2. Explain why it is very unlikely that \(Z\) would have this value.
  3. Use the Central Limit Theorem to calculate an estimate of the probability that the mean of 20 independent values of \(Z\) is greater than 2 .
OCR MEI Further Statistics B AS 2021 November Q3
12 marks Standard +0.3
3 The weights in kg of male otters in a large river system are known to be Normally distributed with mean 8.3 and standard deviation 1.8. A researcher believes that weights of male otters in another river are higher because of what he suspects is better availability of food. The researcher records the weights of a random sample of 9 male otters in this other river. The sum of these 9 weights is 83.79 kg .
  1. In this question you must show detailed reasoning. You should assume that:
    • the weights of otters in the other river are Normally distributed,
    • the standard deviation of the weights of otters in the other river is also 1.8 kg .
    Show that a test at the \(5 \%\) significance level provides sufficient evidence to conclude that the mean weight of male otters in the other river is greater than 8.3 kg .
  2. Explain whether the result of the test suggests that the weights are higher due to better availability of food.
  3. If the standard deviation of the weights of otters in the other river could not be assumed to be 1.8 kg , name an alternative test that the researcher could carry out to investigate otter weights.
  4. Explain why, even if a test at the \(5 \%\) significance level results in the rejection of the null hypothesis, you cannot be sure that the alternative hypothesis is true.
OCR MEI Further Statistics B AS 2021 November Q4
9 marks Standard +0.3
4 John regularly downloads podcasts onto his mobile phone. From past experience he knows that the average time to download one 30 -minute podcast is 12.7 s . He believes that this time has recently increased. At each of 12 randomly chosen times, he downloads a 30-minute podcast. The times in seconds to download the 12 podcasts are as follows. \(\begin{array} { l l l l l l l l l l l } 12.63 & 13.24 & 11.73 & 14.91 & 13.17 & 13.53 & 12.33 & 14.27 & 11.48 & 13.51 & 13.05 \end{array} 13.83\)
  1. Given that it is not known whether the times are Normally distributed, carry out a suitable test at the \(5 \%\) significance level to investigate whether the average download time has increased.
  2. What assumption is required to carry out the test in part (a)?