5.05c Hypothesis test: normal distribution for population mean

681 questions

Sort by: Default | Easiest first | Hardest first
CAIE S2 2018 June Q7
12 marks Standard +0.3
7 The number of absences by girls from a certain class on any day is modelled by a random variable with distribution \(\operatorname { Po } ( 0.2 )\). The number of absences by boys from the same class on any day is modelled by an independent random variable with distribution \(\operatorname { Po } ( 0.3 )\).
  1. Find the probability that, during a randomly chosen 2-day period, the total number of absences is less than 3 .
  2. Find the probability that, during a randomly chosen 5-day period, the number of absences by boys is more than 3.
  3. The teacher claims that, during the football season, there are more absences by boys than usual. In order to test this claim at the 5\% significance level, he notes the number of absences by boys during a randomly chosen 5-day period during the football season.
    1. State what is meant by a Type I error in this context.
    2. State appropriate null and alternative hypotheses and find the probability of a Type I error.
    3. In fact there were 4 absences by boys during this period. Test the teacher's claim at the 5\% significance level.
      If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
CAIE S2 2018 June Q4
9 marks Standard +0.3
4 The mean mass of packets of sugar is supposed to be 505 g . A random sample of 10 packets filled by a certain machine was taken and the masses, in grams, were found to be as follows. $$\begin{array} { l l l l l l l l l l } 500 & 499 & 496 & 495 & 498 & 490 & 492 & 501 & 494 & 494 \end{array}$$
  1. Find unbiased estimates of the population mean and variance.
    The mean mass of packets produced by this machine was found to be less than 505 g , so the machine was adjusted. Following the adjustment, the masses of a random sample of 150 packets from the machine were measured and the total mass was found to be 75660 g .
  2. Given that the population standard deviation is 3.6 g , test at the \(2 \%\) significance level whether the machine is still producing packets with mean mass less than 505 g .
  3. Explain why the use of the normal distribution is justified in carrying out the test in part (ii). [1]
CAIE S2 2018 June Q7
12 marks Standard +0.3
7 A ten-sided spinner has edges numbered \(1,2,3,4,5,6,7,8,9,10\). Sanjeev claims that the spinner is biased so that it lands on the 10 more often than it would if it were unbiased. In an experiment, the spinner landed on the 10 in 3 out of 9 spins.
  1. Test at the \(1 \%\) significance level whether Sanjeev's claim is justified.
  2. Explain why a Type I error cannot have been made.
    In fact the spinner is biased so that the probability that it will land on the 10 on any spin is 0.5 .
  3. Another test at the \(1 \%\) significance level, also based on 9 spins, is carried out. Calculate the probability of a Type II error.
    If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
CAIE S2 2019 June Q2
7 marks Moderate -0.3
2 The time, in minutes, that John takes to travel to work has a normal distribution. Last year the mean and standard deviation were 26.5 and 4.8 respectively. This year John uses a different route and he finds that the mean time for his first 150 journeys is 27.5 minutes.
  1. Stating a necessary assumption, test at the \(1 \%\) significance level whether the mean time for his journey to work has increased.
  2. State, with a reason, whether it was necessary to use the Central Limit theorem in your answer to part (i).
CAIE S2 2019 June Q3
7 marks Moderate -0.3
3 Sumitra has a six-sided die. She suspects that it is biased so that it shows a six less often than it would if it were fair. She decides to test the die by throwing it 30 times and noting the number of throws on which it shows a six.
  1. It shows a six on exactly 2 throws. Use a binomial distribution to carry out the test at the \(5 \%\) significance level.
  2. Later, Sumitra repeats the test at the \(5 \%\) significance level by throwing the die 30 times again. Find the probability of a Type I error in this second test.
CAIE S2 2019 June Q3
8 marks Standard +0.3
3 It is claimed that, on average, a particular train journey takes less than 1.9 hours. The times, \(t\) hours, taken for this journey on a random sample of 50 days were recorded. The results are summarised below. $$n = 50 \quad \Sigma t = 92.5 \quad \Sigma t ^ { 2 } = 175.25$$
  1. Calculate unbiased estimates of the population mean and variance.
  2. Test the claim at the \(5 \%\) significance level.
CAIE S2 2019 June Q5
8 marks Standard +0.3
5 The manufacturer of a certain type of biscuit claims that \(10 \%\) of packets include a free offer printed on the packet. Jyothi suspects that the true proportion is less than \(10 \%\). He plans to test the claim by looking at 40 randomly selected packets and, if the number which include the offer is less than 2 , he will reject the manufacturer's claim.
  1. State suitable hypotheses for the test.
  2. Find the probability of a Type I error.
    On another occasion Jyothi looks at 80 randomly selected packets and finds that exactly 6 include the free offer.
  3. Calculate an approximate \(90 \%\) confidence interval for the proportion of packets that include the offer.
  4. Use your confidence interval to comment on the manufacturer's claim. \(6 X\) is a random variable with probability density function given by $$f ( x ) = \begin{cases} \frac { a } { x ^ { 2 } } & 1 \leqslant x \leqslant b \\ 0 & \text { otherwise } \end{cases}$$ where \(a\) and \(b\) are constants.
CAIE S2 2019 June Q7
11 marks Standard +0.3
7 All the seats on a certain daily flight are always sold. The number of passengers who have bought seats but fail to arrive for this flight on a particular day is modelled by the distribution \(\mathbf { B } ( 320,0.005 )\).
  1. Explain what the number 320 represents in this context.
  2. The total number of passengers who have bought seats but fail to arrive for this flight on 2 randomly chosen days is denoted by \(X\). Use a suitable approximating distribution to find \(\mathrm { P } ( 2 < X < 6 )\).
  3. Justify the use of your approximating distribution.
    After some changes, the airline wishes to test whether the mean number of passengers per day who fail to arrive for this flight has decreased.
  4. During 5 randomly chosen days, a total of 2 passengers failed to arrive. Carry out the test at the 2.5\% significance level.
    If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
CAIE S2 2019 June Q5
6 marks Challenging +1.2
5 The amount of money, in dollars, spent by a customer on one visit to a certain shop is modelled by the distribution \(\mathrm { N } ( \mu , 1.94 )\). In the past, the value of \(\mu\) has been found to be 20.00 , but following a rearrangement in the shop, the manager suspects that the value of \(\mu\) has changed. He takes a random sample of 6 customers and notes how much they each spend, in dollars. The results are as follows.
15.50
17.60
17.30
22.00
23.50
31.00 The manager carries out a hypothesis test using a significance level of \(\alpha \%\). The test does not support his suspicion. Find the largest possible value of \(\alpha\).
CAIE S2 2016 March Q3
5 marks Standard +0.3
3 In the past, Arvinder has found that the mean time for his journey to work is 35.2 minutes. He tries a different route to work, hoping that this will reduce his journey time. Arvinder decides to take a random sample of 25 journeys using the new route. If the sample mean is less than 34.7 minutes he will conclude that the new route is quicker. Assume that, for the new route, the journey time has a normal distribution with standard deviation 5.6 minutes.
  1. Find the probability that a Type I error occurs.
  2. Arvinder finds that the sample mean is 34.5 minutes. Explain briefly why it is impossible for him to make a Type II error.
CAIE S2 2017 March Q2
4 marks Easy -1.2
2 Karim has noted the lifespans, in weeks, of a large random sample of certain insects. He carries out a test, at the \(1 \%\) significance level, for the population mean, \(\mu\). Karim's null hypothesis is \(\mu = 6.4\).
  1. Given that Karim's test is two-tail, state the alternative hypothesis.
    Karim finds that the value of the test statistic is \(z = 2.43\).
  2. Explain what conclusion he should draw.
  3. Explain briefly when a one-tail test is appropriate, rather than a two-tail test.
CAIE S2 2024 March Q7
6 marks Standard +0.8
7 The heights, in centimetres, of adult females in Litania have mean \(\mu\) and standard deviation \(\sigma\). It is known that in 2004 the values of \(\mu\) and \(\sigma\) were 163.21 and 6.95 respectively. The government claims that the value of \(\mu\) this year is greater than it was in 2004. In order to test this claim a researcher plans to carry out a hypothesis test at the \(1 \%\) significance level. He records the heights of a random sample of 300 adult females in Litania this year and finds the value of the sample mean.
  1. State the probability of a Type I error. \includegraphics[max width=\textwidth, alt={}]{ff3433b0-baab-45e3-845e-56a794739bba-12_74_1577_557_322} ........................................................................................................................................ You should assume that the value of \(\sigma\) after 2004 remains at 6.95 .
  2. Given that the value of \(\mu\) this year is actually 164.91 , find the probability of a Type II error.
    If you use the following page to complete the answer to any question, the question number must be clearly shown.
CAIE S2 2020 November Q2
7 marks Moderate -0.8
2 In a survey, a random sample of 250 adults in Fromleigh were asked to fill in a questionnaire about their travel.
  1. It was found that 102 adults in the sample travel by bus. Find an approximate \(90 \%\) confidence interval for the proportion of all the adults in Fromleigh who travel by bus.
  2. The survey included a question about the amount, \(x\) dollars, spent on travel per year. The results are summarised as follows. $$n = 250 \quad \Sigma x = 50460 \quad \Sigma x ^ { 2 } = 19854200$$ Find unbiased estimates of the population mean and variance of the amount spent per year on travel.
    A councillor wanted to select a random sample of houses in Fromleigh. He planned to select the first house on each of the 143 streets in Fromleigh.
  3. Explain why this would not provide a random sample.
CAIE S2 2020 November Q6
12 marks Standard +0.3
6 The time, in minutes, for Anjan's journey to work on Mondays has mean 38.4 and standard deviation 6.9.
  1. Find the probability that Anjan's mean journey time for a random sample of 30 Mondays is between 38 and 40 minutes.
    Anjan wishes to test whether his mean journey time is different on Tuesdays. He chooses a random sample of 30 Tuesdays and finds that his mean journey time for these 30 Tuesdays is 40.2 minutes. Assume that the standard deviation for his journey time on Tuesdays is 6.9 minutes.
    1. State, with a reason, whether Anjan should use a one-tail or a two-tail test.
    2. Carry out the test at the \(10 \%\) significance level.
    3. Explain whether it was necessary to use the Central Limit theorem in part (b)(ii).
      If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
CAIE S2 2020 November Q2
4 marks Moderate -0.8
2 A six-sided die has faces marked \(1,2,3,4,5,6\). When the die is thrown 300 times it shows a six on 56 throws.
  1. Calculate an approximate \(96 \%\) confidence interval for the probability that the die shows a six on one throw.
  2. Maroulla claims that the die is biased. Use your answer to part (a) to comment on this claim.
CAIE S2 2020 November Q4
8 marks Standard +0.3
4 The areas, \(X \mathrm {~cm} ^ { 2 }\), of petals of a certain kind of flower have mean \(\mu \mathrm { cm } ^ { 2 }\). In the past it has been found that \(\mu = 8.9\). Following a change in the climate, a botanist claims that the mean is no longer 8.9. The areas of a random sample of 200 petals from this kind of flower are measured, and the results are summarized by $$\Sigma x = 1850 , \quad \Sigma x ^ { 2 } = 17850 .$$ Test the botanist's claim at the \(2.5 \%\) significance level.
CAIE S2 2017 November Q3
4 marks Standard +0.3
3 After an election 153 adults, from a random sample of 200 adults, said that they had voted. Using this information, an \(\alpha \%\) confidence interval for the proportion of all adults who voted in the election was found to be 0.695 to 0.835 , both correct to 3 significant figures. Find the value of \(\alpha\), correct to the nearest integer.
CAIE S2 2017 November Q8
12 marks Challenging +1.2
8 In order to test the effect of a drug, a researcher monitors the concentration, \(X\), of a certain protein in the blood stream of patients. For patients who are not taking the drug the mean value of \(X\) is 0.185 . A random sample of 150 patients taking the drug was selected and the values of \(X\) were found. The results are summarised below. $$n = 150 \quad \Sigma x = 27.0 \quad \Sigma x ^ { 2 } = 5.01$$ The researcher wishes to test at the \(1 \%\) significance level whether the mean concentration of the protein in the blood stream of patients taking the drug is less than 0.185 .
  1. Carry out the test.
  2. Given that, in fact, the mean concentration for patients taking the drug is 0.175 , find the probability of a Type II error occurring in the test.
CAIE S2 2017 November Q3
8 marks Standard +0.3
3 The masses, \(m \mathrm {~kg}\), of packets of flour are normally distributed. The mean mass is supposed to be 1.01 kg . A quality control officer measures the masses of a random sample of 100 packets. The results are summarised below. $$n = 100 \quad \Sigma m = 98.2 \quad \Sigma m ^ { 2 } = 104.52$$
  1. Test at the \(5 \%\) significance level whether the population mean mass is less than 1.01 kg .
  2. Explain whether it was necessary to use the Central Limit theorem in your answer to part (i).
CAIE S2 2017 November Q6
10 marks Standard +0.3
6 In a certain factory the number of items per day found to be defective has had the distribution \(\operatorname { Po } ( 1.03 )\). After the introduction of new quality controls, the management wished to test at the \(10 \%\) significance level whether the mean number of defective items had decreased. They noted the total number of defective items produced in 5 randomly chosen days. It is assumed that defective items occur randomly and that a Poisson model is still appropriate.
  1. Given that the total number of defective items produced during the 5 days was 2 , carry out the test.
  2. Using another random sample of 5 days the same test is carried out again, with the same significance level. Find the probability of a Type I error.
  3. Explain what is meant by a Type I error in this context.
CAIE S2 2018 November Q7
12 marks Challenging +1.2
7 A mill owner claims that the mean mass of sacks of flour produced at his mill is 51 kg . A quality control officer suspects that the mean mass is actually less than 51 kg . In order to test the owner's claim she finds the mass, \(x \mathrm {~kg}\), of each of a random sample of 150 sacks and her results are summarised as follows. $$n = 150 \quad \Sigma x = 7480 \quad \Sigma x ^ { 2 } = 380000$$
  1. Carry out the test at the \(2.5 \%\) significance level.
    You may now assume that the population standard deviation of the masses of sacks of flour is 6.856 kg . The quality control officer weighs another random sample of 150 sacks and carries out another test at the 2.5\% significance level.
  2. Given that the population mean mass is 49 kg , find the probability of a Type II error.
    If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
CAIE S2 2018 November Q5
10 marks Standard +0.3
5 The numbers of basketball courts in a random sample of 70 schools in South Mowland are summarised in the table.
Number of basketball courts01234\(> 4\)
Number of schools228261040
  1. Calculate unbiased estimates for the population mean and variance of the number of basketball courts per school in South Mowland.
    The mean number of basketball courts per school in North Mowland is 1.9 .
  2. Test at the \(5 \%\) significance level whether the mean number of basketball courts per school in South Mowland is less than the mean for North Mowland.
  3. State, with a reason, which of the errors, Type I or Type II, might have been made in the test in part (ii).
CAIE S2 2019 November Q2
8 marks Standard +0.3
2 The heights of a certain species of animal have been found to have mean 65.2 cm and standard deviation 7.1 cm . A researcher suspects that animals of this species in a certain region are shorter on average than elsewhere. She takes a large random sample of \(n\) animals of this species from this region and finds that their mean height is 63.2 cm . She then carries out an appropriate hypothesis test.
  1. She finds that the value of the test statistic \(z\) is - 2.182 , correct to 3 decimal places.
    1. Stating a necessary assumption, calculate the value of \(n\).
    2. Carry out the hypothesis test at the \(4 \%\) significance level.
    3. Explain why it was necessary to use the Central Limit theorem in carrying out the test.
CAIE S2 2019 November Q3
6 marks Standard +0.3
3 The times, in minutes, taken by competitors to complete a puzzle have mean \(\mu\) and standard deviation 3 . The times taken by a random sample of 10 competitors are noted and the results are given below. \(\begin{array} { l l l } 25.2 & 26.8 & 18.5 \end{array}\) 25.5
30.1 \(28.9 \quad 27.0\) \(26.1 \quad 26.0\) 24.9
  1. Stating a necessary assumption, calculate a \(97 \%\) confidence interval for \(\mu\).
  2. Two more random samples, each of 10 competitors, are taken. Their times are used to calculate two more \(97 \%\) confidence intervals for \(\mu\). Find the probability that neither of these intervals contains the true value of \(\mu\).
CAIE S2 2019 November Q7
10 marks Standard +0.8
7 Bob is a self-employed builder. In the past his weekly income had mean \(\\) 546\( and standard deviation \)\\( 120\). Following a change in Bob's working pattern, his mean weekly income for 40 randomly chosen weeks was \(\\) 581\(. You should assume that the standard deviation remains unchanged at \)\\( 120\).
  1. Test at the \(2.5 \%\) significance level whether Bob's mean weekly income has increased.
    Bob finds his mean weekly income for another random sample of 40 weeks and carries out a similar test at the \(2.5 \%\) significance level.
  2. Given that Bob's mean weekly income is now in fact \(\\) 595$, find the probability of a Type II error.
    If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.