Questions (30179 questions)

Browse by board
AQA AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further AS Paper 1 Further AS Paper 2 Discrete Further AS Paper 2 Mechanics Further AS Paper 2 Statistics Further Paper 1 Further Paper 2 Further Paper 3 Discrete Further Paper 3 Mechanics Further Paper 3 Statistics M1 M2 M3 Paper 1 Paper 2 Paper 3 S1 S2 S3 CAIE FP1 FP2 Further Paper 1 Further Paper 2 Further Paper 3 Further Paper 4 M1 M2 P1 P2 P3 S1 S2 Edexcel AEA AS Paper 1 AS Paper 2 C1 C12 C2 C3 C34 C4 CP AS CP1 CP2 D1 D2 F1 F2 F3 FD1 FD1 AS FD2 FD2 AS FM1 FM1 AS FM2 FM2 AS FP1 FP1 AS FP2 FP2 AS FP3 FS1 FS1 AS FS2 FS2 AS M1 M2 M3 M4 M5 P1 P2 P3 P4 PMT Mocks Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 OCR AS Pure C1 C2 C3 C4 D1 D2 FD1 AS FM1 AS FP1 FP1 AS FP2 FP3 FS1 AS Further Additional Pure Further Additional Pure AS Further Discrete Further Discrete AS Further Mechanics Further Mechanics AS Further Pure Core 1 Further Pure Core 2 Further Pure Core AS Further Statistics Further Statistics AS H240/01 H240/02 H240/03 M1 M2 M3 M4 Mechanics 1 PURE Pure 1 S1 S2 S3 S4 Stats 1 OCR MEI AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further Extra Pure Further Mechanics A AS Further Mechanics B AS Further Mechanics Major Further Mechanics Minor Further Numerical Methods Further Pure Core Further Pure Core AS Further Pure with Technology Further Statistics A AS Further Statistics B AS Further Statistics Major Further Statistics Minor M1 M2 M3 M4 Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 WJEC Further Unit 1 Further Unit 2 Further Unit 3 Further Unit 4 Further Unit 5 Further Unit 6 Unit 1 Unit 2 Unit 3 Unit 4
OCR MEI Further Statistics Major 2019 June Q9
15 marks Moderate -0.5
9 Every weekday Jonathan takes an underground train to work. On any weekday the time in minutes that he has to wait at the station for a train is modelled by the continuous uniform distribution over \([ 0,5 ]\).
  1. Find the probability that Jonathan has to wait at least 3 minutes for a train. The total time that Jonathan has to wait on two days is modelled by the continuous random variable \(X\) with probability density function given by \(\mathrm { f } ( x ) = \begin{cases} \frac { 1 } { 25 } x & 0 \leqslant x \leqslant 5 , \\ \frac { 1 } { 25 } ( 10 - x ) & 5 < x \leqslant 10 , \\ 0 & \text { otherwise } . \end{cases}\)
  2. Find the probability that Jonathan has to wait a total of at most 6 minutes on two days. Jonathan's friend suggests that the total waiting time for 5 days, \(T\) minutes, will almost certainly be less than 18 minutes. In order to investigate this suggestion, Jonathan constructs the simulation shown in Fig. 9. All of the numbers in the simulation have been rounded to 2 decimal places. \begin{table}[h]
    ABCDEF
    1MonTueWedThuFriTotal T
    21.784.362.743.884.6417.41
    30.951.304.834.291.8113.18
    44.274.904.571.413.6618.81
    50.800.063.201.760.356.17
    60.034.821.263.530.139.77
    73.884.731.193.751.2914.84
    84.113.544.330.774.5017.25
    93.540.113.852.861.5811.94
    101.871.823.003.531.8312.05
    114.002.984.591.731.7615.06
    121.913.852.081.722.8212.38
    130.104.862.510.522.1710.15
    141.244.260.951.331.789.57
    152.990.693.853.412.4213.36
    164.671.762.133.483.1015.14
    171.941.070.910.633.347.89
    180.112.290.714.210.868.18
    190.434.584.891.862.8414.60
    204.230.882.714.884.2016.91
    213.724.583.114.893.1819.49
    \captionsetup{labelformat=empty} \caption{Fig. 9}
    \end{table}
  3. Use the simulation to estimate \(\mathrm { P } ( T > 18 )\).
  4. Explain how Jonathan could obtain a better estimate. Jonathan thinks that he can use the Central Limit Theorem to provide a very good approximation to the distribution of \(T\).
  5. Find each of the following.
    • \(\mathrm { E } ( T )\)
    • \(\operatorname { Var } ( T )\)
    • Use the Central Limit Theorem to estimate \(\mathrm { P } ( T > 18 )\).
    • Comment briefly on the use of the Central Limit Theorem in this case.
    Jonathan travels to work on 200 days in a year.
  6. Find the probability that the total waiting time for Jonathan in a year is more than 510 minutes.
    [0pt] [3]
OCR MEI Further Statistics Major 2019 June Q10
14 marks Standard +0.8
10 The probability density function of the continuous random variable \(X\) is given by \(f ( x ) = \begin{cases} k x ^ { m } & 0 \leqslant x \leqslant a , \\ 0 & \text { otherwise, } \end{cases}\) where \(a , k\) and \(m\) are positive constants.
  1. Show that \(k = \frac { m + 1 } { a ^ { m + 1 } }\).
  2. Find the cumulative distribution function of \(X\) in terms of \(x , a\) and \(m\).
  3. Given that \(\mathrm { P } \left( \frac { 1 } { 4 } a < X < \frac { 1 } { 2 } a \right) = \frac { 1 } { 10 }\),
    1. show that \(2 p ^ { 2 } - 10 p + 5 = 0\), where \(p = 2 ^ { m }\),
    2. find the value of \(m\). \section*{END OF QUESTION PAPER}
OCR MEI Further Statistics Major 2022 June Q1
7 marks Moderate -0.8
1 During a meteor shower, the number of meteors that can be seen at a particular location can be modelled by a Poisson distribution with mean 1.2 per minute.
  1. Find the probability that exactly 2 meteors are seen in a period of 1 minute.
  2. Find the probability that more than 3 meteors are seen in a period of 1 minute.
  3. Find the probability that no more than 8 meteors are seen in a period of 10 minutes.
  4. Explain what the fact that the number of meteors seen can be modelled by a Poisson distribution tells you about the occurrence of meteors.
OCR MEI Further Statistics Major 2022 June Q2
7 marks Standard +0.3
2 A manufacturer is testing how long coloured LED lights will last before the battery runs out, using two different battery types. The times in hours before the battery runs out are modelled by independent Normal distributions with means and standard deviations as shown in the table.
\cline { 2 - 3 } \multicolumn{1}{c|}{}Time
TypeMean
Standard
deviation
A232.8
B353.6
  1. In a particular test, a battery of type A is used and the time taken for it to run out is recorded. This process is repeated until a total of 5 randomly selected batteries have been used. Determine the probability that the total time the 5 batteries last is at least 120 hours.
  2. In a similar test, 3 randomly selected batteries of type A are used, one after the other. Then 2 randomly selected batteries of type B are used, one after the other. Determine the probability that the 3 type A batteries last longer in total than the 2 type B batteries.
  3. Explain why it is necessary that the Normal distributions are independent in order to be able to find the probability in part (b).
OCR MEI Further Statistics Major 2022 June Q3
6 marks Standard +0.3
3 The table shows the probability distribution of the random variable \(X\), where \(a\) and \(b\) are constants.
\(r\)01234
\(\mathrm { P } ( X = r )\)\(a\)\(b\)0.240.32\(b ^ { 2 }\)
  1. Given that \(\mathrm { E } ( X ) = 1.8\), determine the values of \(a\) and \(b\). The random variable \(Y\) is given by \(Y = 10 - 3 X\).
  2. Using the values of \(a\) and \(b\) which you found in part (a), find each of the following.
    • \(\mathrm { E } ( Y )\)
    • \(\operatorname { Var } ( Y )\)
OCR MEI Further Statistics Major 2022 June Q4
5 marks Standard +0.8
4 A pack of \(k\) cards is labelled \(1,2 , \ldots , k\). A card is drawn at random from the pack. The random variable \(X\) represents the number on the card.
  1. Given that \(k > 10\), find \(\mathrm { P } ( X \geqslant 10 )\). You are now given that \(k = 20\).
  2. A card is drawn at random from the pack and the number on it is noted. The card is then returned to the pack. This process is repeated until the second occasion on which the number noted is less than 9 . Find the probability that no more than 4 cards have to be drawn. Answer all the questions. Section B (95 marks)
OCR MEI Further Statistics Major 2022 June Q5
11 marks Moderate -0.3
5 A motorist is investigating the relationship between tyre pressure and temperature. As the temperature increases during a hot day, she records the pressure (measured in bars) of one of her car tyres at specific temperatures of \(20 ^ { \circ } \mathrm { C } , 22 ^ { \circ } \mathrm { C } , \ldots , 36 ^ { \circ } \mathrm { C }\). The results are shown in Table 5.1. \begin{table}[h]
Temperature \(\left( t ^ { \circ } \mathrm { C } \right)\)202224262830323436
Tyre pressure \(( P\) bar \()\)2.0122.0362.0652.0742.1142.1402.1492.1762.192
\captionsetup{labelformat=empty} \caption{Table 5.1}
\end{table}
  1. Calculate the equation of the regression line of pressure on temperature. Give your answer in the form \(P = a t + b\), giving the values of \(a\) and \(b\) to \(\mathbf { 4 }\) significant figures.
  2. Table 5.2 shows the residuals for most of the data values. Complete the copy of the table in the Printed Answer Booklet. \begin{table}[h]
    Temperature202224262830323436
    Residual tyre
    pressure
    - 0.003- 0.0020.004- 0.0100.011- 0.0030.001
    \captionsetup{labelformat=empty} \caption{Table 5.2}
    \end{table}
  3. With reference to the values of the residuals, comment on the goodness of fit of the regression line.
  4. Use your answer to part (a) to calculate an estimate of the pressure in the tyre at each of the following temperatures, giving your answers to \(\mathbf { 3 }\) decimal places.
    • \(25 ^ { \circ } \mathrm { C }\)
    • \(10 ^ { \circ } \mathrm { C }\)
    • Comment on the reliability of each of your estimates.
OCR MEI Further Statistics Major 2022 June Q7
8 marks Standard +0.8
7 Amir is trying to thread a needle. On each attempt the probability that he is successful is 0.3 , independently of any other attempt. The random variable \(X\) represents the number of attempts that he takes to thread the needle.
  1. Find \(\mathrm { P } ( X = 5 )\).
  2. During the course of a day, Amir has to thread 6 needles. Determine the probability that it takes him more than 3 attempts to be successful for at least 4 of the 6 needles.
  3. Amaya is also trying to thread a needle. On each attempt the probability that she is successful is \(p\), independently of any other attempt. The probability that Amaya takes 2 attempts to thread a particular needle is \(\frac { 28 } { 121 }\). Determine the possible values of \(p\).
OCR MEI Further Statistics Major 2022 June Q8
14 marks Standard +0.3
8 A swimming coach is investigating whether there is correlation between the times taken by teenage swimmers to swim 50 m Butterfly and 50 m Freestyle. The coach selects a random sample of 11 teenage swimmers and records the times that each of them take for each event. The spreadsheet shows the data, together with a scatter diagram to illustrate the data. \includegraphics[max width=\textwidth, alt={}, center]{77eabbd6-a058-457f-9601-d66f3c2db005-06_712_1465_456_274}
  1. In the scatter diagram, Butterfly times have been plotted on the horizontal axis and Freestyle times on the vertical axis. A student states that the variables should have been plotted the other way around. Explain whether the student is correct. The student decides to carry out a hypothesis test to investigate whether there is any correlation between the times taken for the two events.
  2. Explain why the student decides to carry out a test based on Spearman's rank correlation coefficient.
  3. In this question you must show detailed reasoning. Carry out the test at the 5\% significance level.
  4. The student concludes that there is definitely no correlation between the times. Comment on the student's conclusion.
OCR MEI Further Statistics Major 2022 June Q9
11 marks Easy -1.2
9 The random variable \(X\) has a discrete uniform distribution over the values \(\{ 0,1,2 , \ldots , 20 \}\).
  1. Find \(\mathrm { P } ( X \leqslant 7 )\).
  2. Find each of the following.
    • \(\mathrm { E } ( X )\)
    • \(\operatorname { Var } ( X )\)
    The spreadsheet shows a simulation of the distribution of \(X\). Each of the 25 rows of the spreadsheet below the heading row shows a simulation of 10 independent values of \(X\) together with the value of the mean of the 10 values, denoted by \(Y\).
    \includegraphics[max width=\textwidth, alt={}]{77eabbd6-a058-457f-9601-d66f3c2db005-07_38_45_880_279}ABCDEFGHIJKL
    1\(X _ { 1 }\)\(X _ { 2 }\)\(X _ { 3 }\)\(X _ { 4 }\)\(X _ { 5 }\)\(X _ { 6 }\)\(X _ { 7 }\)\(X _ { 8 }\)\(X _ { 9 }\)\(X _ { 10 }\)\(Y\)
    216211864911116.9
    313141224111601608.8
    441711641012218139.7
    5281214161221588.0
    6715160471130208.3
    71513101120201516610.8
    81413171221816189412.3
    9202123173018151310.3
    10212512260910157.3
    115111310917104201511.4
    12149976202211169.6
    1315191819766203812.1
    1451064119158171810.3
    150315151112039168.4
    16112115041111926.6
    171250838121913129.2
    1895113541811197.6
    19162202012172782012.4
    20181732818701169.0
    211510720405611149.2
    223910142186076.0
    23111011101911371009.2
    241214665201118101411.6
    25111514111011205.6
    26014711185102011910.5
    27
  3. Use the spreadsheet to estimate \(\mathrm { P } ( Y \leqslant 7 )\).
  4. Explain why the true value of \(\mathrm { P } ( Y \leqslant 7 )\) is less than \(\mathrm { P } ( X \leqslant 7 )\), relating your answer to \(\operatorname { Var } ( X )\) and \(\operatorname { Var } ( Y )\).
  5. The random variable \(W\) is the mean of 30 independent values of \(X\). Determine an estimate of \(\mathrm { P } ( W \leqslant 7 )\).
OCR MEI Further Statistics Major 2022 June Q10
13 marks Standard +0.3
10 A scientist is researching dietary fat intake and cholesterol level. A random sample of 60 people is selected and their dietary fat intakes and cholesterol levels are measured. Dietary fat intakes are classified as low, medium and high, and cholesterol levels are classified as normal and high. The scientist decides to carry out a chi-squared test to investigate whether there is any association between dietary fat intake and cholesterol level. Tables \(\mathbf { 1 0 . 1 }\) and \(\mathbf { 1 0 . 2 }\) show the data and some of the expected frequencies for the test. \begin{table}[h]
\multirow{2}{*}{}Dietary fat intake
LowMediumHighTotal
\multirow{2}{*}{Cholesterol level}Normal918532
High3131228
Total12311760
\captionsetup{labelformat=empty} \caption{Table 10.1}
\end{table} \begin{table}[h]
Expected frequencyDietary fat intake
\cline { 3 - 5 }LowMediumHigh
\multirow{2}{*}{
Cholesterol
level
}
Normal9.0667
\cline { 2 - 5 }High7.9333
\captionsetup{labelformat=empty} \caption{Table 10.2}
\end{table}
  1. Complete the table of expected frequencies in the Printed Answer Booklet.
  2. Determine the contribution to the chi-squared test statistic for people with normal cholesterol level and high dietary fat intake, giving your answer to \(\mathbf { 4 }\) decimal places. The contributions to the chi-squared test statistic for the remaining categories are shown in Table 10.3. \begin{table}[h]
    Dietary fat intake
    \cline { 2 - 5 }LowMediumHigh
    \multirow{2}{*}{
    Cholesterol
    level
    }
    Normal1.05630.1301
    \cline { 2 - 5 }High1.20710.14872.0846
    \captionsetup{labelformat=empty} \caption{Table 10.3} \end{table}
  3. In this question you must show detailed reasoning. Carry out the test at the 5\% significance level.
  4. For each level of dietary fat intake, give a brief interpretation of what the data suggest about the level of cholesterol.
  5. OCR MEI Further Statistics Major 2022 June Q11
    13 marks Standard +0.3
    11 A particular dietary supplement, when taken for a period of 1 month, is claimed to increase lean body mass of adults by an average of 1 kg . A researcher believes that this claim overestimates the increase. She selects a random sample of 10 adults who then each take the supplement for a month. The increases in lean body masses in kg are as follows. $$\begin{array} { l l l l l l l l l l } - 0.84 & - 0.76 & - 0.16 & 0.43 & 1.31 & 1.32 & 1.47 & 1.64 & 1.93 & 2.14 \end{array}$$ A Normal probability plot and the \(p\)-value of the Kolmogorov-Smirnov test for these data are shown below. \includegraphics[max width=\textwidth, alt={}, center]{77eabbd6-a058-457f-9601-d66f3c2db005-09_575_1485_689_242}
    1. The researcher decides to carry out a hypothesis test in order to investigate the claim. Comment on the type of hypothesis test that should be used. You should refer to
      • The Normal probability plot
      • The \(p\)-value of the Kolmogorov-Smirnov test
      • Carry out a test at the \(5 \%\) significance level to investigate whether the researcher's belief may be correct.
      • If the Normal probability plot had been different, giving a \(p\)-value of 0.65 for the KolmogorovSmirnov test, a different procedure could have been used to investigate the researcher's belief.
      • State what alternative test could have been used in this case.
      • State what the hypotheses would have been.
    OCR MEI Further Statistics Major 2022 June Q12
    14 marks Challenging +1.2
    12 The continuous random variable \(X\) has cumulative distribution function given by $$F ( x ) = \begin{cases} 0 & x < 0 \\ k \left( a x - 0.5 x ^ { 2 } \right) & 0 \leqslant x \leqslant a \\ 1 & x > a \end{cases}$$ where \(a\) and \(k\) are positive constants.
    1. Determine the median of \(X\) in terms of \(a\).
    2. Given that \(a = 10\), determine the probability that \(X\) is within one standard deviation of its mean.
    OCR MEI Further Statistics Major 2023 June Q1
    10 marks Standard +0.3
    1 A website simulates the outcome of throwing four fair dice. Ten thousand people take part in a challenge using the website in which they have one attempt at getting four sixes in the four throws of the dice. The number of people who succeed in getting four sixes is denoted by the random variable \(X\).
    1. Show that, for each person, the probability that the person gets four sixes is equal to \(\frac { 1 } { 1296 }\).
    2. Explain why you could use either a binomial distribution or a Poisson distribution to model the distribution of \(X\).
    3. Use a Poisson distribution to calculate each of the following probabilities.
      • \(\mathrm { P } ( X = 10 )\)
      • \(\mathrm { P } ( X > 10 )\)
      • In another challenge on the website, 50 people are each given 20 independent attempts to try to get four sixes as often as they can.
      Determine the probability that no more than 2 people succeed in getting four sixes at least once in their 20 attempts.
    OCR MEI Further Statistics Major 2023 June Q2
    5 marks Easy -1.2
    2 A student is investigating the link between temperature and electricity consumption in the winter months. The student finds the average minimum temperature, \(x ^ { \circ } \mathrm { C }\), from across the country on a day. The student then finds the total electricity consumption for that day, \(y \mathrm { GWh }\). The scatter diagram below shows the values of \(x\) and \(y\) obtained from a random sample of 10 winter days. It also shows the equation of the regression line of \(y\) on \(x\) and the value of \(r ^ { 2 }\), where \(r\) is the product moment correlation coefficient. \includegraphics[max width=\textwidth, alt={}, center]{c692fb20-436f-4bc1-89bd-10fdba41ceba-03_776_1043_609_244}
    1. Use the regression line to estimate the electricity consumption at each of the following average minimum temperatures.
      • \(5 ^ { \circ } \mathrm { C }\)
      • \(- 4 ^ { \circ } \mathrm { C }\)
      • Comment on the reliability of your estimates.
    OCR MEI Further Statistics Major 2023 June Q3
    10 marks Moderate -0.3
    3 A tennis player is practising her serve. Each time she serves, she has a \(55 \%\) chance of being successful (getting the serve in the required area without hitting the net). You should assume that whether she is successful on any serve is independent of whether she is successful on any other serve.
    1. Find the probability that the player is not successful in any of her first three serves.
    2. Determine the probability that the player is successful at least 10 times in her first 20 serves.
    3. Determine the probability that the player is successful for the first time on her fifth serve.
    4. Determine the probability that the player is successful for the fifth time on her tenth serve. Another player is also practising his serve. Each time he serves, he has a probability \(p\) of being successful. You should assume that whether he is successful on any serve is independent of whether he is successful on any other serve. The probability that he is successful for the first time on his second serve is 0.2496 and the probability that he is successful on both of his first two serves is less than 0.25 .
    5. Determine the value of \(p\).
    OCR MEI Further Statistics Major 2023 June Q4
    11 marks Standard +0.3
    4 A machine manufactures batches of 100 titanium sheets. The thickness of every sheet in a batch is Normally distributed with mean \(\mu \mathrm { mm }\) and standard deviation 0.03 mm . You should assume that each sheet is of uniform thickness and that the thicknesses of different sheets are independent of each other. The values of \(\mu\) for three different batches, A, B and C, are 3.125, 3.117 and 3.109 respectively.
    1. Determine the probability that the total thickness of 10 sheets from Batch A is less than 31.0 mm .
    2. Determine the probability that, if a single sheet from Batch A is cut into pieces and 10 of the pieces are stacked together, the total thickness of the stack is less than 31.0 mm .
    3. Determine the probability that, if one sheet from each of Batches A, B and C are stacked together, the total thickness of the stack is at least 9.4 mm .
    4. Determine the probability that the total thickness of 10 sheets from Batch A is less than the total thickness of 10 sheets from Batch B.
    OCR MEI Further Statistics Major 2023 June Q5
    13 marks Standard +0.3
    5 Amari is investigating how accurately people can estimate a short time period. He asks each of a random sample of 40 people to estimate a period of 20 seconds. For each person, he starts a stopwatch and then stops it when they tell him that they think that 20 s has elapsed. The times which he records are denoted by \(x \mathrm {~s}\). You are given that \(\sum x = 765 , \quad \sum x ^ { 2 } = 15065\).
    1. Determine a 95\% confidence interval for the mean estimated time.
    2. Amari says that the confidence interval supports the suggestion that people can estimate 20 s accurately. Make two comments about Amari's statement.
    3. Discuss whether you could have constructed the confidence interval if there had only been 10 people involved in the experiment. Amari thinks that people would be able to estimate more accurately if he gave them a second attempt. He repeats the experiment with each person and again records the times. Software is used to produce a \(95 \%\) confidence interval for the mean estimated time. The output from the software is shown below. Z Estimate of a Mean Confidence level 0.95 Sample
      Mean19.68
      s1.38
      N40
      Result
      Z Estimate of a Mean
      Mean19.68
      s1.38
      SE0.2182
      N40
      Interval\(19.68 \pm 0.4277\)
    4. State the confidence interval in the form \(\mathrm { a } < \mu < \mathrm { b }\).
    5. Make two comments based on this confidence interval about Amari's opinion that second attempts result in more accurate estimates.
    OCR MEI Further Statistics Major 2023 June Q6
    12 marks Standard +0.3
    6 A student wonders if there is any correlation between download and upload speeds of data to and from the internet. The student decides to carry out a hypothesis test to investigate this and so measures the download speed \(x\) and upload speed \(y\) in suitable units on 20 randomly chosen occasions. The scatter diagram below illustrates the data which the student collected. \includegraphics[max width=\textwidth, alt={}, center]{c692fb20-436f-4bc1-89bd-10fdba41ceba-07_824_1411_440_246}
    1. Explain why the student decides to carry out a test based on the product moment correlation coefficient. Summary statistics for the 20 occasions are as follows. $$\sum x = 342.10 \quad \sum y = 273.65 \quad \sum x ^ { 2 } = 5989.53 \quad \sum y ^ { 2 } = 3919.53 \quad \sum x y = 4713.62$$
    2. In this question you must show detailed reasoning. Calculate the product moment correlation coefficient.
    3. Carry out a hypothesis test at the \(5 \%\) significance level to investigate whether there is any correlation between download speed and upload speed.
    4. Both of the variables, download speed and upload speed, are random. Explain why, if download speed had been a non-random variable, the student could not have carried out the hypothesis test to investigate whether there was any correlation between download speed and upload speed.
    OCR MEI Further Statistics Major 2023 June Q7
    13 marks Standard +0.3
    7 An analyst routinely examines bottles of hair shampoo in order to check that the average percentage of a particular chemical which the shampoo contains does not exceed the value of \(1.0 \%\) specified by the manufacturer. The percentages of the chemical in a random sample of 12 bottles of the shampoo are as follows. \(\begin{array} { l l l l l l l l l l l } 1.087 & 1.171 & 1.047 & 0.846 & 0.909 & 1.052 & 1.042 & 0.893 & 1.021 & 1.085 & 1.096 \end{array} 0.931\) The analyst uses software to draw a Normal probability plot for these data, and to carry out a Normality test as shown below. \includegraphics[max width=\textwidth, alt={}, center]{c692fb20-436f-4bc1-89bd-10fdba41ceba-08_524_1539_694_264}
    1. The analyst is going to carry out a hypothesis test to check whether the average percentage exceeds 1.0\%. Explain which test the analyst should use, referring to each of the following.
      • The Normal probability plot
      • The \(p\)-value of the Kolmogorov-Smirnov test
      • In this question you must show detailed reasoning.
      Carry out the test at the 5\% significance level.
    OCR MEI Further Statistics Major 2023 June Q8
    12 marks Moderate -0.5
    8 The random variable \(X\) has a continuous uniform distribution over [0,10].
    1. Find the probability that, if two independent values of \(X\) are taken, one is less than 3 and the other is greater than 3 . The random variable \(T\) denotes the sum of 5 independent values of \(X\).
    2. State the value of \(\mathrm { P } ( T \leqslant 25 )\). The spreadsheet below shows the heading row and the first 20 data rows from a total of 100 data rows of a simulation of the distribution of \(X\). Each of the 100 rows shows a simulation of 5 independent values of \(X\), together with \(T\), the sum of the 5 values. All of the values have been rounded to 2 decimal places. In column I the spreadsheet shows the number of values of \(T\) that are less than or equal to the corresponding values in column H . For example, there are 75 simulated values of \(T\) that are less than or equal to 30 .
      ABcDEFGHI
      1\(\mathrm { X } _ { 1 }\)\(\mathrm { X } _ { 2 }\)\(\mathrm { X } _ { 3 }\)\(\mathrm { X } _ { 4 }\)\(\mathrm { X } _ { 5 }\)TtNumber \(\leqslant \mathrm { t }\)
      23.736.654.930.419.3325.0600
      34.956.584.482.517.2625.7950
      48.104.874.263.830.7921.85101
      56.704.105.101.826.7624.48154
      63.738.388.499.871.3131.792023
      73.224.360.121.349.4918.532548
      89.177.135.474.352.4428.553075
      93.421.936.042.998.8523.243593
      100.980.689.829.837.2828.584099
      115.861.677.774.087.1426.5245100
      129.200.315.825.316.4527.1050100
      137.044.302.060.064.1617.62
      140.315.021.485.371.7713.94
      153.776.041.217.675.0123.69
      161.215.541.901.436.9117.00
      179.271.985.809.379.3435.76
      184.305.662.801.561.1915.51
      197.153.196.895.412.1824.82
      206.186.323.016.499.1231.13
      215.035.995.196.973.5526.73
    3. Use the spreadsheet output to estimate each of the following.
      • \(\mathrm { P } ( T \leqslant 25 )\)
      • \(\mathrm { P } ( T > 35 )\)
      • In this question you must show detailed reasoning.
      The random variable \(Y\) is the mean of 100 independent values of \(T\). Determine an estimate of \(\mathrm { P } ( Y > 26 )\).
    OCR MEI Further Statistics Major 2023 June Q9
    10 marks Standard +0.3
    9 A cyclist who lives on an island suspects that car drivers with locally registered number plates allow more space when passing her than those with non-locally registered number plates. She decides to carry out a hypothesis test and so over a period of time selects a random sample of 250 cars which pass her. For each car she estimates whether the car driver allows at least the recommended 1.5 metres when passing her. The table shows the data which she collected.
    Where registered
    \cline { 3 - 4 } \multicolumn{2}{|c|}{}LocalNon-local
    \multirow{2}{*}{
    Passing
    distance
    }
    Under 1.5 m1211
    \cline { 2 - 4 }At least 1.5 m15770
    1. In this question you must show detailed reasoning. Carry out the test at the \(5 \%\) significance level to examine whether there is any association between where the car is registered and passing distance.
    2. A friend of the cyclist suggests that there may be a problem with the data, since the cyclist may have introduced some bias in estimating whether cars were allowing the recommended distance. Explain how any bias might have arisen.
    OCR MEI Further Statistics Major 2023 June Q10
    15 marks Challenging +1.2
    10 The continuous random variable \(X\) has probability density function given by \(f ( x ) = \begin{cases} \frac { 4 } { 15 } \left( \frac { a } { x ^ { 2 } } + 3 x ^ { 2 } - \frac { 7 } { 2 } \right) & 1 \leqslant x \leqslant 2 , \\ 0 & \text { otherwise, } \end{cases}\) where \(a\) is a positive constant.
    1. Find the cumulative distribution function of \(X\) in terms of \(a\).
    2. Hence or otherwise determine the value of \(a\).
    3. Show that the median value \(m\) of \(X\) satisfies the equation $$8 m ^ { 4 } - 28 m ^ { 2 } + 9 m - 4 = 0 .$$
    4. Verify that the median value of \(X\) is 1.74, correct to \(\mathbf { 2 }\) decimal places.
    5. Find \(\mathrm { E } ( X )\).
    6. Determine the mode of \(X\).
    OCR MEI Further Statistics Major 2023 June Q11
    9 marks Moderate -0.5
    11 The random variable \(X\) takes the value 1 with probability \(p\) and the value 0 with probability \(1 - p\).
    1. Find each of the following.
      • \(\mathrm { E } ( X )\)
      • \(\operatorname { Var } ( X )\)
      • The random variable \(Y \sim \mathrm {~B} ( 50,0.2 )\) has mean \(\mu\) and variance \(\sigma ^ { 2 }\).
      Use the results of part (a) to prove that
      • \(\mu = 10\)
      • \(\sigma ^ { 2 } = 8\).
    OCR MEI Further Statistics Major 2024 June Q1
    5 marks Easy -1.8
    1 The number of insurance policy sales made per month by a salesperson is modelled by the random variable \(X\), with probability distribution shown in the table.
    \(r\)0123456
    \(\mathrm { P } ( \mathrm { X } = \mathrm { r } )\)0.050.10.250.30.150.10.05
    1. Find each of the following.
      • \(\mathrm { E } ( X )\)
      • \(\operatorname { Var } ( X )\)
      The salesperson is paid a basic salary of \(\pounds 1000\) per month plus \(\pounds 500\) for each policy that is sold.
    2. Find the mean and standard deviation of the salesperson's monthly salary.