OCR MEI Further Statistics B AS (Further Statistics B AS) 2021 November

Question 1
View details
1 Over time LED light bulbs gradually lose brightness. For a particular type of LED bulb, it is known that the mean reduction in brightness after 10000 hours is \(2.6 \%\). A manufacturer produces a new version of this bulb, which costs less to make, but is claimed to have the same reduction in brightness after 10000 hours as the previous version. In order to check this claim, a random sample of 10 bulbs is selected. For each bulb, the original brightness and the brightness after 10000 hours are measured, in suitable units. A spreadsheet is used to produce a \(95 \%\) confidence interval for the mean percentage reduction in brightness. A screenshot of the spreadsheet is shown in Fig. 1. \begin{table}[h]
ABCDEFGH1JK
1Original brightness1075112111061095110111091114112311081115
2After 10000 hours1042108410761065107010791081109110801082
3Percentage reduction3.073.302.712.742.822.712.962.852.532.96
4
5
6Sample mean (\%)2.8650
7Sample sd (\%)0.2179
8SE0.0689
9DF9
10tvalue2.262
11Lower limit2.709
12Upper limit3.021
1.3
\captionsetup{labelformat=empty} \caption{Fig. 1}
\end{table}
  1. State the confidence interval in the form \(a < \mu < b\).
  2. Explain whether the confidence interval suggests that the mean percentage reduction in brightness after 10000 hours is different from 2.6\%.
  3. Explain how the value in cell B8 was calculated.
  4. State an assumption necessary for this confidence interval to be calculated.
  5. Explain the advantage of using the same bulbs for both measurements.
Question 2
View details
2 Natasha is investigating binomial distributions. She constructs the spreadsheet in Fig. 2 which shows the first 3 and last 4 rows of a simulation involving two independent variables, \(X\) and \(Y\), with distributions \(\mathrm { B } ( 10,0.3 )\) and \(\mathrm { B } ( 50,0.3 )\) respectively. The spreadsheet also shows the corresponding value of the random variable \(Z\), defined by \(Z = 5 X - Y\), for each pair of values of \(X\) and \(Y\). There are 100 simulated values of each of \(X , Y\) and \(Z\). The spreadsheet also shows whether each value of \(Z\) is greater than 6, and cells D103 and D104 show the number of values of \(Z\) which are greater than 6 and not greater than 6 respectively. \begin{table}[h]
1ABCDE
1XY\(\mathbf { Z } = \mathbf { 5 } \mathbf { X } - \mathbf { Y }\)\(\mathbf { Z } > \mathbf { 6 }\)
24137Y
34173N
4321-6N
5
6
98114-9N
9951213Y
100318-3N
1013150N
102
103Number of Y19
104Number of N81
105
\captionsetup{labelformat=empty} \caption{Fig. 2}
\end{table}
  1. Use the information in the spreadsheet to write down an estimate of \(\mathrm { P } ( Z > 6 )\).
  2. Explain how a more reliable estimate of \(\mathrm { P } ( Z > 6 )\) could be obtained.
    1. State the greatest possible value of \(Z\).
    2. Explain why it is very unlikely that \(Z\) would have this value.
  3. Use the Central Limit Theorem to calculate an estimate of the probability that the mean of 20 independent values of \(Z\) is greater than 2 .
Question 3
View details
3 The weights in kg of male otters in a large river system are known to be Normally distributed with mean 8.3 and standard deviation 1.8. A researcher believes that weights of male otters in another river are higher because of what he suspects is better availability of food. The researcher records the weights of a random sample of 9 male otters in this other river. The sum of these 9 weights is 83.79 kg .
  1. In this question you must show detailed reasoning. You should assume that:
    • the weights of otters in the other river are Normally distributed,
    • the standard deviation of the weights of otters in the other river is also 1.8 kg .
    Show that a test at the \(5 \%\) significance level provides sufficient evidence to conclude that the mean weight of male otters in the other river is greater than 8.3 kg .
  2. Explain whether the result of the test suggests that the weights are higher due to better availability of food.
  3. If the standard deviation of the weights of otters in the other river could not be assumed to be 1.8 kg , name an alternative test that the researcher could carry out to investigate otter weights.
  4. Explain why, even if a test at the \(5 \%\) significance level results in the rejection of the null hypothesis, you cannot be sure that the alternative hypothesis is true.
Question 4
View details
4 John regularly downloads podcasts onto his mobile phone. From past experience he knows that the average time to download one 30 -minute podcast is 12.7 s . He believes that this time has recently increased. At each of 12 randomly chosen times, he downloads a 30-minute podcast. The times in seconds to download the 12 podcasts are as follows.
\(\begin{array} { l l l l l l l l l l l } 12.63 & 13.24 & 11.73 & 14.91 & 13.17 & 13.53 & 12.33 & 14.27 & 11.48 & 13.51 & 13.05 \end{array} 13.83\)
  1. Given that it is not known whether the times are Normally distributed, carry out a suitable test at the \(5 \%\) significance level to investigate whether the average download time has increased.
  2. What assumption is required to carry out the test in part (a)?
Question 5
View details
5 A food company makes mini apple pies. The weight of pastry in a pie is Normally distributed with mean 75 g and standard deviation 4 g . The weight of filling in a pie is Normally distributed with mean 130 g and standard deviation 8 g . You should assume that the weights of pastry and filling in a pie are independent.
  1. Find the probability that the weight of pastry in a randomly chosen pie is between 70 g and 80 g .
  2. Find the probability that the mean weight of filling in 10 randomly chosen pies is at least 125 g. The pies are sold in packs of 4 . The weight of the packaging is Normally distributed with mean 165 g and standard deviation 6 g .
  3. In order to find the probability that the total weight of a pack of 4 pies is less than 1 kg , you must assume that the weight of the packaging is independent of the weight of the pies.
    1. State another necessary assumption.
    2. Given that the assumptions are valid, calculate this probability.
Question 6
View details
6 The probability density function of the continuous random variable \(X\) is given by
\(f ( x ) = \begin{cases} 2 ( 1 + a x ) & 0 \leqslant x \leqslant 1 ,
0 & \text { otherwise } , \end{cases}\)
where \(a\) is a constant.
  1. Show that \(a = - 1\).
  2. Find the cumulative distribution function of \(X\).
  3. Find \(\mathrm { P } ( X < 0.5 )\).
  4. Show that \(\mathrm { E } ( X )\) is greater than the median of \(X\).