5.05c Hypothesis test: normal distribution for population mean

681 questions

Sort by: Default | Easiest first | Hardest first
CAIE S2 2015 November Q3
5 marks Standard +0.3
3 From a random sample of 65 people in a certain town, the proportion who own a bicycle was noted. From this result an \(\alpha \%\) confidence interval for the proportion, \(p\), of all people in the town who own a bicycle was calculated to be \(0.284 < p < 0.516\).
  1. Find the proportion of people in the sample who own a bicycle.
  2. Calculate the value of \(\alpha\) correct to 2 significant figures.
CAIE S2 2016 November Q5
8 marks Standard +0.3
5
  1. The masses, in grams, of certain tomatoes are normally distributed with standard deviation 9 grams. A random sample of 100 tomatoes has a sample mean of 63 grams. Find a \(90 \%\) confidence interval for the population mean mass of these tomatoes.
  2. The masses, in grams, of certain potatoes are normally distributed with known population standard deviation but unknown population mean. A random sample of potatoes is taken in order to find a confidence interval for the population mean. Using a sample of size 50 , a \(95 \%\) confidence interval is found to have width 8 grams.
    1. Using another sample of size 50 , an \(\alpha \%\) confidence interval has width 4 grams. Find \(\alpha\).
    2. Find the sample size \(n\), such that a \(95 \%\) confidence interval has width 4 grams.
CAIE S2 2016 November Q7
11 marks Standard +0.3
7 In the past the time, in minutes, taken for a particular rail journey has been found to have mean 20.5 and standard deviation 1.2. Some new railway signals are installed. In order to test whether the mean time has decreased, a random sample of 100 times for this journey are noted. The sample mean is found to be 20.3 minutes. You should assume that the standard deviation is unchanged.
  1. Carry out a significance test, at the \(4 \%\) level, of whether the population mean time has decreased. Later another significance test of the same hypotheses, using another random sample of size 100 , is carried out at the \(4 \%\) level.
  2. Given that the population mean is now 20.1, find the probability of a Type II error.
  3. State what is meant by a Type II error in this context.
CAIE S2 2016 November Q4
5 marks Challenging +1.2
4 The manufacturer of a tablet computer claims that the mean battery life is 11 hours. A consumer organisation wished to test whether the mean is actually greater than 11 hours. They invited a random sample of members to report the battery life of their tablets. They then calculated the sample mean. Unfortunately a fire destroyed the records of this test except for the following partial document. \includegraphics[max width=\textwidth, alt={}, center]{c460afa4-1387-421d-87ac-74a64be99714-2_467_593_1612_776} Given that the population of battery lives is normally distributed with standard deviation 1.6 hours, find the set of possible values of the sample size, \(n\).
CAIE Further Paper 4 2022 June Q1
8 marks Standard +0.3
1 A manager is investigating the times taken by employees to complete a particular task as a result of the introduction of new technology. He claims that the mean time taken to complete the task is reduced by more than 0.4 minutes. He chooses a random sample of 10 employees. The times taken, in minutes, before and after the introduction of the new technology are recorded in the table.
Employee\(A\)\(B\)\(C\)D\(E\)\(F\)G\(H\)IJ
Time before new technology10.29.812.411.610.811.214.610.612.311.0
Time after new technology9.68.512.410.910.210.612.810.812.510.6
  1. Test at the 10\% significance level whether the manager's claim is justified.
  2. State an assumption that is necessary for this test to be valid.
CAIE Further Paper 4 2022 June Q5
9 marks Standard +0.3
5 Raman is researching the heights of male giraffes in a particular region. Raman assumes that the heights of male giraffes in this region are normally distributed. He takes a random sample of 8 male giraffes from the region and measures the height, in metres, of each giraffe. These heights are as follows. $$\begin{array} { c c c c c c c c } 5.2 & 5.8 & 4.9 & 6.1 & 5.5 & 5.9 & 5.4 & 5.6 \end{array}$$
  1. Find a \(90 \%\) confidence interval for the population mean height of male giraffes in this region. [5]
    Raman claims that the population mean height of male giraffes in the region is less than 5.9 metres.
  2. Test at the \(2.5 \%\) significance level whether this sample provides sufficient evidence to support Raman's claim.
CAIE Further Paper 4 2022 June Q6
12 marks Standard +0.8
6 A company has two machines, \(A\) and \(B\), which independently fill small bottles with a liquid. The volumes of liquid per bottle, in suitable units, filled by machines \(A\) and \(B\) are denoted by \(x\) and \(y\) respectively. A scientist at the company takes a random sample of 40 bottles filled by machine \(A\) and a random sample of 50 bottles filled by machine \(B\). The results are summarised as follows. $$\sum x = 1120 \quad \sum x ^ { 2 } = 31400 \quad \sum y = 1370 \quad \sum y ^ { 2 } = 37600$$ The population means of the volumes of liquid in the bottles filled by machines \(A\) and \(B\) are denoted by \(\mu _ { A }\) and \(\mu _ { B }\).
  1. Test at the \(2 \%\) significance level whether there is any difference between \(\mu _ { A }\) and \(\mu _ { B }\).
  2. Find the set of values of \(\alpha\) for which there would be evidence at the \(\alpha \%\) significance level that \(\mu _ { \mathrm { A } } - \mu _ { \mathrm { B } }\) is greater than 0.25.
    If you use the following page to complete the answer to any question, the question number must be clearly shown.
CAIE Further Paper 4 2023 June Q2
8 marks Standard +0.3
2 The children at two large schools, \(P\) and \(Q\), are all given the same puzzle to solve. A random sample of size 10 is taken from the children at school \(P\). Their individual times to complete the puzzle give a sample mean of 9.12 minutes and an unbiased variance estimate of 2.16 minutes \({ } ^ { 2 }\). A random sample of size 12 is taken from the children at school \(Q\). Their individual times, \(x\) minutes, to complete the puzzle are summarised by $$\sum x = 99.6 \quad \sum ( x - \bar { x } ) ^ { 2 } = 21.5$$ where \(\bar { x }\) is the sample mean. Times to complete the puzzle are assumed to be normally distributed with the same population variance. Test at the \(5 \%\) significance level whether the population mean time taken to complete the puzzle by children at school \(P\) is greater than the population mean time taken to complete the puzzle by children at school \(Q\).
CAIE Further Paper 4 2023 June Q4
9 marks Standard +0.3
4 An inspector is checking the lengths of metal rods produced by two machines, \(X\) and \(Y\). These rods should be of the same length, but the inspector suspects that those made by machine \(X\) are shorter, on average, than those made by machine \(Y\). The inspector chooses a random sample of 80 rods made by machine \(X\) and a random sample of 60 rods made by machine \(Y\). The lengths of these rods are \(x \mathrm {~cm}\) and \(y \mathrm {~cm}\) respectively. Her results are summarised as follows. $$\sum x = 164.0 \quad \sum x ^ { 2 } = 338.1 \quad \sum y = 124.8 \quad \sum y ^ { 2 } = 261.1$$
  1. Test at the \(10 \%\) significance level whether the data supports the inspector's suspicion.
  2. Give a reason why it is not necessary to make any assumption about the distributions of the lengths of the rods.
CAIE Further Paper 4 2024 June Q6
8 marks Standard +0.3
6 Jade is a swimming instructor at a sports college. She claims that, as a result of an intensive training course, the mean time taken by students to swim 50 metres has reduced by more than 1 second. She chooses a random sample of 10 students. The times taken, in seconds, before and after the training course are recorded in the table.
StudentABCD\(E\)\(F\)G\(H\)IJ
Time before course54.247.452.159.055.351.048.952.258.451.4
Time after course50.146.352.558.851.448.449.548.758.351.4
  1. Test, at the 10\% significance level, whether Jade's claim is justified.
  2. State an assumption that is necessary for this test to be valid.
CAIE Further Paper 4 2024 June Q6
10 marks Standard +0.8
6 Seva is investigating the lengths of the tails of adult wallabies in two regions of Australia, \(X\) and \(Y\). He chooses a random sample of 50 adult wallabies from region \(X\) and records the lengths, \(x \mathrm {~cm}\), of their tails. He also chooses a random sample of 40 adult wallabies from region \(Y\) and records the lengths, \(y \mathrm {~cm}\), of their tails. His results are summarised as follows. $$\sum x = 1080 \quad \sum x ^ { 2 } = 23480 \quad \sum y = 940 \quad \sum y ^ { 2 } = 22220$$ It cannot be assumed that the population variances of the two distributions are the same.
  1. Find a \(90 \%\) confidence interval for the difference between the population mean lengths of the tails of adult wallabies in regions \(X\) and \(Y\). \includegraphics[max width=\textwidth, alt={}, center]{b5ff998a-fcb6-4a1b-ae86-ec66b0dccc3c-10_2718_38_141_2010} The population mean lengths of the tails of adult wallabies in regions \(X\) and \(Y\) are \(\mu _ { X } \mathrm {~cm}\) and \(\mu _ { Y } \mathrm {~cm}\) respectively.
  2. Test, at the \(10 \%\) significance level, the null hypothesis \(\mu _ { Y } - \mu _ { X } = 1.1\) against the alternative hypothesis \(\mu _ { Y } - \mu _ { X } > 1.1\). State your conclusion in the context of the question.
    If you use the following page to complete the answer to any question, the question number must be clearly shown.
CAIE Further Paper 4 2020 November Q4
8 marks Standard +0.3
4 Members of the Sprints athletics club have been taking part in an intense training scheme, aimed at reducing their times taken to run 400 m . For a random sample of 9 athletes from the club, the times taken, in seconds, before and after the training scheme are given in the following table.
Athlete\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)
Time before48.848.250.349.649.448.947.650.348.4
Time after47.947.849.649.149.648.947.749.148.1
The organiser of the training scheme claims that on average an athlete's time will be reduced by at least 0.3 seconds. Test at the 10\% significance level whether the organiser's claim is justified, stating any assumption that you make.
CAIE Further Paper 4 2020 November Q1
5 marks Moderate -0.3
1 The heights of the members of a large sports club are normally distributed. A random sample of 11 members of the club is chosen and their heights, \(x \mathrm {~cm}\), are measured. The results are summarised as follows, where \(\bar { x }\) denotes the sample mean of \(x\). $$\bar { x } = 176.2 \quad \sum ( x - \bar { x } ) ^ { 2 } = 313.1$$ Test, at the \(5 \%\) significance level, the null hypothesis that the population mean height for members of this club is equal to 172.5 cm against the alternative hypothesis that the mean differs from 172.5 cm . [5]
CAIE Further Paper 4 2020 November Q6
12 marks Challenging +1.2
6 Nassa is researching the lengths of a particular type of snake in two countries, \(A\) and \(B\).
  1. He takes a random sample of 10 snakes of this type from country \(A\) and measures the length, \(x \mathrm {~m}\), of each snake. He then calculates a \(90 \%\) confidence interval for the population mean length, \(\mu \mathrm { m }\), for snakes of this type, assuming that snake lengths have a normal distribution. This confidence interval is \(3.36 \leqslant \mu \leqslant 4.22\). Find the sample mean and an unbiased estimate for the population variance.
  2. Nassa also measures the lengths, \(y \mathrm {~m}\), of a random sample of 8 snakes of this type taken from country \(B\). His results are summarised as follows. $$\sum y = 27.86 \quad \sum y ^ { 2 } = 98.02$$ Nassa claims that the mean length of snakes of this type in country \(B\) is less than the mean length of snakes of this type in country \(A\). Nassa assumes that his sample from country \(B\) also comes from a normal distribution, with the same variance as the distribution from country \(A\). Test at the \(10 \%\) significance level whether there is evidence to support Nassa's claim.
    If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
CAIE Further Paper 4 2021 November Q1
7 marks Standard +0.3
1 The times taken for students at a college to run 200 m have a normal distribution with mean \(\mu \mathrm { s }\). The times, \(x\) s, are recorded for a random sample of 10 students from the college. The results are summarised as follows, where \(\bar { x }\) is the sample mean. $$\bar { x } = 25.6 \quad \sum ( x - \bar { x } ) ^ { 2 } = 78.5$$
  1. Find a 90\% confidence interval for \(\mu\).
    A test of the null hypothesis \(\mu = k\) is carried out on this sample, using a \(10 \%\) significance level. The test does not support the alternative hypothesis \(\mu < k\).
  2. Find the greatest possible value of \(k\).
CAIE Further Paper 4 2021 November Q4
8 marks Standard +0.8
4 Manet has developed a new training course to help athletes improve their time taken to run 800 m . Manet claims that his course will decrease an athlete's time by more than 2 s on average. For a random sample of 10 athletes the times taken, in seconds, before and after the course are given in the following table.
Athlete\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)
Before150146131135126142130129137134
After145138129135122135132128127137
Use a \(t\)-test, at the \(5 \%\) significance level, to test whether Manet's claim is justified, stating any assumption that you make.
CAIE Further Paper 4 2021 November Q6
10 marks Standard +0.8
6 A scientist is investigating the masses of a particular type of fish found in lakes \(A\) and \(B\). He chooses a random sample of 10 fish of this type from lake \(A\) and records their masses, \(x \mathrm {~kg}\), as follows.
0.9
1.8
1.8
1.9
2.1
2.4
2.6
2.2
2.5
3.0 The scientist also chooses a random sample of 12 fish of this type from lake \(B\), but he only has a summary of their masses, \(y \mathrm {~kg}\), as follows. $$\sum y = 24.48 \quad \sum y ^ { 2 } = 53.75$$ Test at the \(10 \%\) significance level whether the mean mass of fish of this type in lake \(A\) is greater than the mean mass of fish of this type in lake \(B\). You should state any assumptions that you need to make for the test to be valid.
[0pt] [10]
If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
CAIE Further Paper 4 2022 November Q6
9 marks Standard +0.8
6 A company manufactures copper pipes. The pipes are produced by two different machines, \(A\) and \(B\). An inspector claims that the mean diameter of the pipes produced by machine \(A\) is greater than the mean diameter of the pipes produced by machine \(B\). He takes a random sample of 12 pipes produced by machine \(A\) and measures their diameters, \(x \mathrm {~cm}\). His results are summarised as follows. $$\sum x = 6.24 \quad \sum x ^ { 2 } = 3.26$$ He also takes a random sample of 10 pipes produced by machine \(B\) and measures their diameters in cm. His results are as follows. $$\begin{array} { l l l l l l l l l l } 0.48 & 0.53 & 0.47 & 0.54 & 0.54 & 0.55 & 0.46 & 0.55 & 0.50 & 0.48 \end{array}$$ The diameters of the pipes produced by each machine are assumed to be normally distributed with equal population variances. Test at the \(2.5 \%\) significance level whether the data supports the inspector's claim.
If you use the following page to complete the answer to any question, the question number must be clearly shown.
CAIE Further Paper 4 2022 November Q3
8 marks Standard +0.3
3 A scientist is investigating the masses of birds of a certain species in country \(X\) and country \(Y\). She takes a random sample of 50 birds of this species from country \(X\) and a random sample of 80 birds of this species from country \(Y\). She records their masses in \(\mathrm { kg } , x\) and \(y\), respectively. Her results are summarised as follows. $$\sum x = 75.5 \quad \sum x ^ { 2 } = 115.2 \quad \sum y = 116.8 \quad \sum y ^ { 2 } = 172.6$$ The population mean masses of these birds in countries \(X\) and \(Y\) are \(\mu _ { x } \mathrm {~kg}\) and \(\mu _ { y } \mathrm {~kg}\) respectively.
Test, at the \(5 \%\) significance level, the null hypothesis \(\mu _ { \mathrm { x } } = \mu _ { \mathrm { y } }\) against the alternative hypothesis \(\mu _ { \mathrm { x } } > \mu _ { \mathrm { y } }\). State your conclusion in the context of the question.
CAIE Further Paper 4 2023 November Q1
6 marks Standard +0.3
1 Maya is an athlete who competes in 1500-metre races. Last summer her practice run times had mean 4.22 minutes. Over the winter she has done some intense training to try to improve her times. A random sample of 10 of her practice run times, \(x\) minutes, this summer are summarised as follows. $$\sum x = 42.05 \quad \sum x ^ { 2 } = 176.83$$ Maya's new practice run times are normally distributed. She believes that on average her times have improved as a result of her training. Test, at the \(5 \%\) significance level, whether Maya's belief is supported by the data.
CAIE Further Paper 4 2023 November Q3
8 marks Standard +0.3
3 Scientists are studying the effects of exercise on LDL blood cholesterol levels. Over a three-month period, a large group of people exercised for 20 minutes each day. For a randomly chosen sample of 10 of these people, the LDL blood cholesterol levels were measured at the beginning and the end of the three-month period. The results, measured in suitable units, are as follows.
\cline { 2 - 12 } \multicolumn{1}{c|}{}Person\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)
\multirow{2}{*}{
Cholesterol
level
}
Beginning72841209010213564758088
\cline { 2 - 12 }End64761059210511567757584
  1. Test, at the \(2.5 \%\) significance level, whether there is evidence that the population mean LDL blood cholesterol level has reduced by more than 2 units after the three-month period.
  2. State any assumption that you have made in part (a). \includegraphics[max width=\textwidth, alt={}, center]{44829994-2ef0-488d-aa3b-99fb0e36d733-06_399_1383_269_324} As shown in the diagram, the continuous random variable \(X\) has probability density function f given by $$f ( x ) = \begin{cases} m x & 0 \leqslant x \leqslant 2 \\ \frac { k } { x ^ { 2 } } + c & 2 \leqslant x \leqslant 6 \\ 0 & \text { otherwise } \end{cases}$$ where \(m , k\) and \(c\) are constants.
CAIE Further Paper 4 2023 November Q5
16 marks Standard +0.8
5 A company is deciding which of two machines, \(X\) and \(Y\), can make a certain type of electrical component more quickly. The times taken, in minutes, to make one component of this type are recorded for a random sample of 8 components made by machine \(X\) and a random sample of 9 components made by machine \(Y\). These times are as follows.
Machine \(X\)4.04.64.74.85.05.25.65.8
Machine \(Y\)4.54.95.15.35.45.75.96.36.4
The manager claims that on average the time taken by machine \(X\) to make one component is less than that taken by machine \(Y\).
  1. Carry out a Wilcoxon rank-sum test at the \(5 \%\) significance level to test whether the manager's claim is supported by the data.
  2. Assuming that the times taken to produce the components by the two machines are normally distributed with equal variances, carry out a \(t\)-test at the \(5 \%\) significance level to test whether the manager's claim is supported by the data.
    \section*{Question 5(c) is printed on the next page.}
  3. In general, would you expect the conclusions from the tests in parts (a) and (b) to be the same? Give a reason for your answer.
    If you use the following page to complete the answer to any question, the question number must be clearly shown.
CAIE Further Paper 4 2023 November Q3
8 marks Standard +0.3
3 Scientists are studying the effects of exercise on LDL blood cholesterol levels. Over a three-month period, a large group of people exercised for 20 minutes each day. For a randomly chosen sample of 10 of these people, the LDL blood cholesterol levels were measured at the beginning and the end of the three-month period. The results, measured in suitable units, are as follows.
\cline { 2 - 12 } \multicolumn{1}{c|}{}Person\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)
\multirow{2}{*}{
Cholesterol
level
}
Beginning72841209010213564758088
\cline { 2 - 12 }End64761059210511567757584
  1. Test, at the \(2.5 \%\) significance level, whether there is evidence that the population mean LDL blood cholesterol level has reduced by more than 2 units after the three-month period.
  2. State any assumption that you have made in part (a). \includegraphics[max width=\textwidth, alt={}, center]{b6635fbc-3c9d-4f93-b51a-b1cbd71ddbb1-06_399_1383_269_324} As shown in the diagram, the continuous random variable \(X\) has probability density function f given by $$f ( x ) = \begin{cases} m x & 0 \leqslant x \leqslant 2 \\ \frac { k } { x ^ { 2 } } + c & 2 \leqslant x \leqslant 6 \\ 0 & \text { otherwise } \end{cases}$$ where \(m , k\) and \(c\) are constants.
CAIE Further Paper 4 2024 November Q1
6 marks Standard +0.3
1 Ellie is investigating the heights of two types of beech tree, \(A\) and \(B\), in a certain region. She has chosen a random sample of 60 beech trees of type \(A\) in the region, recorded their heights, \(x \mathrm {~m}\), and calculated unbiased estimates for the population mean and population variance as 35.6 m and \(4.95 \mathrm {~m} ^ { 2 }\) respectively. Ellie also chooses a random sample of 50 beech trees of type \(B\) in the region and records their heights, \(y \mathrm {~m}\). Her results are summarised as follows. $$\sum y = 1654 \quad \sum y ^ { 2 } = 54850$$ Find a \(95 \%\) confidence interval for the difference between the population mean heights of type \(A\) and type \(B\) beech trees in the region.
CAIE Further Paper 4 2024 November Q6
8 marks Standard +0.8
6 Ansal is investigating the wingspans of Monarch butterflies in two different regions, \(X\) and \(Y\). He takes a random sample of 8 Monarch butterflies from region \(X\) and records their wingspans, \(x \mathrm {~cm}\). His results are as follows. $$\begin{array} { l l l l l l l l } 8.2 & 7.0 & 7.3 & 8.8 & 7.8 & 8.5 & 9.2 & 7.4 \end{array}$$ Ansal also takes a random sample of 9 Monarch butterflies from region \(Y\) and records their wingspans, \(y \mathrm {~cm}\). His results are summarised as follows. $$\sum y = 71.10 \quad \sum y ^ { 2 } = 567.13$$ Ansal suspects that the mean wingspan of Monarch butterflies from region \(X\) is greater than the mean wingspan of Monarch butterflies from region \(Y\). It is known that the wingspans of Monarch butterflies in regions \(X\) and \(Y\) are normally distributed with equal population variances. Test, at the 10\% significance level, whether Ansal's suspicion is supported by the data. \includegraphics[max width=\textwidth, alt={}, center]{e2a45d19-7d48-4aa5-93f9-6ef90f99d7c4-12_2717_35_109_2012} \includegraphics[max width=\textwidth, alt={}, center]{e2a45d19-7d48-4aa5-93f9-6ef90f99d7c4-13_2726_35_97_20}
If you use the following page to complete the answer to any question, the question number must be clearly shown. \includegraphics[max width=\textwidth, alt={}, center]{e2a45d19-7d48-4aa5-93f9-6ef90f99d7c4-14_2715_33_109_2012}