5.05c Hypothesis test: normal distribution for population mean

681 questions

Sort by: Default | Easiest first | Hardest first
OCR MEI S4 2009 June Q3
24 marks Standard +0.3
3
  1. At a waste disposal station, two methods for incinerating some of the rubbish are being compared. Of interest is the amount of particulates in the exhaust, which can be measured over the working day in a convenient unit of concentration. It is assumed that the underlying distributions of concentrations of particulates are Normal. It is also assumed that the underlying variances are equal. During a period of several months, measurements are made for method A on a random sample of 10 working days and for method B on a separate random sample of 7 working days, with results, in the convenient unit, as follows.
    Method A124.8136.4116.6129.1140.7120.2124.6127.5111.8130.3
    Method B130.4136.2119.8150.6143.5126.1130.7
    Use a \(t\) test at the \(10 \%\) level of significance to examine whether either method is better in resulting, on the whole, in a lower concentration of particulates. State the null and alternative hypotheses under test.
  2. The company's statistician criticises the design of the trial in part (i) on the grounds that it is not paired. Summarise the arguments the statistician will have used. A new trial is set up with a paired design, measuring the concentrations of particulates on a random sample of 9 paired occasions. The results are as follows.
    PairIIIIIIIVVVIVIIVIIIIX
    Method A119.6127.6141.3139.5141.3124.1116.6136.2128.8
    Method B112.2128.8130.2134.0135.1120.4116.9134.4125.2
    Use a \(t\) test at the \(5 \%\) level of significance to examine the same hypotheses as in part (i). State the underlying distributional assumption that is needed in this case.
  3. State the names of procedures that could be used in the situations of parts (i) and (ii) if the underlying distributional assumptions could not be made. What hypotheses would be under test?
OCR MEI S4 2011 June Q3
24 marks Challenging +1.2
3
  1. Explain the meaning of the following terms in the context of hypothesis testing: Type I error, Type II error, operating characteristic, power.
  2. A market research organisation is designing a sample survey to investigate whether expenditure on everyday food items has increased in 2011 compared with 2010. For one of the populations being studied, the random variable \(X\) is used to model weekly expenditure, in \(\pounds\), on these items in 2011, where \(X\) is Normally distributed with mean \(\mu\) and variance \(\sigma ^ { 2 }\). As the corresponding mean value in 2010 was 94 , the hypotheses to be examined are $$\begin{aligned} & \mathrm { H } _ { 0 } : \mu = 94 \\ & \mathrm { H } _ { 1 } : \mu > 94 \end{aligned}$$ By comparison with the corresponding 2010 value, \(\sigma ^ { 2 }\) is assumed to be 25 .
    The following criteria for the survey are laid down.
    A random sample of size \(n\) is to be taken and the usual Normal test based on \(\bar { X }\) is to be used, with a critical value of \(c\) such that \(\mathrm { H } _ { 0 }\) is rejected if the value of \(\bar { X }\) exceeds \(c\). Find \(c\) and the smallest value of \(n\) that is required.
  3. Sketch the power function of an ideal test for examining the hypotheses in part (ii).
OCR MEI S4 2013 June Q3
24 marks Standard +0.3
3
  1. Explain the meaning of the following terms in the context of hypothesis testing: Type I error, Type II error, operating characteristic, power.
  2. A test is to be carried out concerning a parameter \(\theta\). The null hypothesis is that \(\theta\) has the particular value \(\theta _ { 0 }\). The alternative hypothesis is \(\theta \neq \theta _ { 0 }\). Draw a sketch of the operating characteristic for a perfect test that never makes an error.
  3. The random variable \(X\) is distributed as \(\mathrm { N } ( \mu , 9 )\). A random sample of size 25 is available. The null hypothesis \(\mu = 0\) is to be tested against the alternative hypothesis \(\mu \neq 0\). The null hypothesis will be accepted if \(- 1 < \bar { x } < 1\) where \(\bar { x }\) is the value of the sample mean, otherwise it will be rejected. Calculate the probability of a Type I error. Calculate the probability of a Type II error if in fact \(\mu = 0.5\); comment on the value of this probability.
  4. Without carrying out any further calculations, draw a sketch of the operating characteristic for the test in part (iii).
CAIE FP2 2010 June Q10
12 marks Standard +0.3
10 Carpal Tunnel syndrome is a condition which affects a person's ability to grip with their hands. Researchers tested a treatment for this syndrome which was applied to 8 randomly chosen patients. A pre-treatment and a post-treatment test of grip was given to each patient, with the following results, measured in kg.
Patient12345678
Pre-treatment grip24.329.528.028.521.528.725.126.3
Post-treatment grip28.334.630.331.621.529.826.027.5
Stating any required assumption, test, at the \(1 \%\) significance level, whether the mean grip of people with the syndrome increases after undergoing the treatment. It is given that there is evidence at the \(10 \%\) significance level that the mean grip increases by more than \(w \mathrm {~kg}\). Find an inequality for \(w\).
CAIE FP2 2011 June Q7
8 marks Standard +0.3
7 A greengrocer claims that his cabbages have a mean mass of more than 1.2 kg . In order to check his claim, he weighs 10 cabbages, chosen at random from his stock. The masses, in kg, are as follows. $$\begin{array} { l l l l l l l l l l } 1.26 & 1.24 & 1.17 & 1.23 & 1.18 & 1.25 & 1.19 & 1.20 & 1.21 & 1.18 \end{array}$$ Stating any assumption that you make, test at the \(10 \%\) significance level whether the greengrocer's claim is supported by this evidence.
CAIE FP2 2011 June Q9
9 marks Challenging +1.2
9 Mr Lee asserts that boys are slower than girls at completing a particular mathematical puzzle. In order to test his assertion, a random sample of 40 boys and a random sample of 60 girls are selected from a large group of students who attempted the puzzle. The times taken by the boys, \(b\) minutes, and the times taken by the girls, \(g\) minutes, are summarised as follows. $$\Sigma b = 92.0 \quad \Sigma b ^ { 2 } = 216.5 \quad \Sigma g = 129.8 \quad \Sigma g ^ { 2 } = 288.8$$ Test at the \(2.5 \%\) significance level whether this evidence supports Mr Lee's assertion.
CAIE FP2 2011 June Q8
11 marks Standard +0.3
8 A company decides that its employees should follow an exercise programme for 30 minutes each day, with the aim that they lose weight and increase productivity. The weights, in kg , of a random sample of 8 employees at the start of the programme and after following the programme for 6 weeks are shown in the table.
Employee\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Weight before \(( \mathrm { kg } )\)98.687.390.485.2100.592.489.991.3
Weight after \(( \mathrm { kg } )\)93.585.288.284.695.489.386.087.6
Assuming that loss in weight is normally distributed, find a 95\% confidence interval for the mean loss in weight of the company's employees. Test at the \(5 \%\) significance level whether, after the exercise programme, there is a reduction of more than 2.5 kg in the population mean weight.
CAIE FP2 2012 June Q6
7 marks Standard +0.3
6 A random sample of 10 observations of a normal random variable \(X\) has mean \(\bar { x }\), where $$\bar { x } = 8.254 , \quad \Sigma ( x - \bar { x } ) ^ { 2 } = 0.912 .$$ Using a \(5 \%\) significance level, test whether the mean of \(X\) is greater than 8.05.
CAIE FP2 2012 June Q10
12 marks Standard +0.3
10 Engineers are investigating the speed of the internet connection received by households in two towns \(P\) and \(Q\). The speeds, in suitable units, in \(P\) and \(Q\) are denoted by \(x\) and \(y\) respectively. For a random sample of 50 houses in town \(P\) and a random sample of 40 houses in town \(Q\) the results are summarised as follows. $$\Sigma x = 240 \quad \Sigma x ^ { 2 } = 1224 \quad \Sigma y = 168 \quad \Sigma y ^ { 2 } = 754$$ Calculate a \(95 \%\) confidence interval for \(\mu _ { P } - \mu _ { Q }\), where \(\mu _ { P }\) and \(\mu _ { Q }\) are the population mean speeds for \(P\) and \(Q\). Test, at the \(1 \%\) significance level, whether \(\mu _ { P }\) is greater than \(\mu _ { Q }\).
CAIE FP2 2013 June Q9
14 marks Challenging +1.2
9 A gardener \(P\) claims that a new type of fruit tree produces a higher annual mass of fruit than the type that he has previously grown. The old type of tree produced 5.2 kg of fruit per tree, on average. A random sample of 10 trees of the new type is chosen. The masses, \(x \mathrm {~kg}\), of fruit produced are summarised as follows. $$\Sigma x = 61.0 \quad \Sigma x ^ { 2 } = 384.0$$ Test, at the \(5 \%\) significance level, whether gardener \(P\) 's claim is justified, assuming a normal distribution. Another gardener \(Q\) has his own type of fruit tree. The masses, \(y \mathrm {~kg}\), of fruit produced by a random sample of 10 trees grown by gardener \(Q\) are summarised as follows. $$\Sigma y = 70.0 \quad \Sigma y ^ { 2 } = 500.6$$ Test, at the \(5 \%\) significance level, whether the mean mass of fruit produced by gardener \(Q\) 's trees is greater than the mean mass of fruit produced by gardener \(P\) 's trees. You may assume that both distributions are normal and you should state any additional assumption.
CAIE FP2 2013 June Q7
8 marks Standard +0.3
7 Each of a random sample of 6 cyclists from a cycling club is timed over two different 10 km courses. Their times, in minutes, are recorded in the following table.
Cyclist\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)
Course 118.517.819.222.316.520.0
Course 220.220.418.120.618.520.5
Assuming that differences in time over the two courses are normally distributed, test at the \(10 \%\) significance level whether the mean times over the two courses are different.
CAIE FP2 2014 June Q8
9 marks Standard +0.3
8 Weekly expenses claimed by employees at two different branches, \(A\) and \(B\), of a large company are being compared. Expenses claimed by an employee at branch \(A\) and by an employee at branch \(B\) are denoted by \(\\) x\( and \)\\( y\) respectively. A random sample of 60 employees from branch \(A\) and a random sample of 50 employees from branch \(B\) give the following summarised data. $$\Sigma x = 6060 \quad \Sigma x ^ { 2 } = 626220 \quad \Sigma y = 4750 \quad \Sigma y ^ { 2 } = 464500$$ Using a \(2 \%\) significance level, test whether, on average, employees from branch \(A\) claim the same as employees from branch \(B\).
CAIE FP2 2014 June Q11 OR
Challenging +1.2
The time taken for a randomly chosen student at College \(P\) to complete a particular puzzle has a normal distribution with mean \(\mu\) minutes. The times, \(x\) minutes, are recorded for a random sample of 8 students chosen from the college. The results are summarised as follows. $$\Sigma x = 42.8 \quad \Sigma x ^ { 2 } = 236.0$$ Find a 95\% confidence interval for \(\mu\). A test is carried out on this sample data, at the \(10 \%\) significance level. The test supports the claim that \(\mu > k\). Find the greatest possible value of \(k\). A random sample, of size 12, is taken from the students at College \(Q\). Their times to complete the puzzle give a sample mean of 4.60 minutes and an unbiased variance estimate of 1.962 minutes \({ } ^ { 2 }\). Use a 2 -sample test at the \(10 \%\) significance level to test whether the mean time for students at College \(Q\) to complete the puzzle is less than the mean time for students at College \(P\) to complete the puzzle. You should state any assumptions necessary for the test to be valid.
CAIE FP2 2015 June Q10
13 marks Standard +0.3
10 Young children at a primary school are learning to throw a ball as far as they can. The distance thrown at the beginning of the school year and the distance thrown at the end of the same school year are recorded for each child. The distance thrown, in metres, at the beginning of the year is denoted by \(x\); the distance thrown, in metres, at the end of the year is denoted by \(y\). For a random sample of 10 children, the results are shown in the following table.
Child\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)
\(x\)5.24.13.75.47.66.13.24.03.58.0
\(y\)6.24.85.05.67.77.04.04.53.68.5
$$\left[ \Sigma x = 50.8 , \quad \Sigma x ^ { 2 } = 284.16 , \quad \Sigma y = 56.9 , \quad \Sigma y ^ { 2 } = 347.59 , \quad \Sigma x y = 313.28 . \right]$$ A particular child threw the ball a distance of 7.0 metres at the beginning of the year, but he could not throw at the end of the year because he had broken his arm. By finding the equation of an appropriate regression line, estimate the distance this child would have thrown at the end of the year. The teacher suspects that, on average, the distance thrown by a child increases between the two throws by more than 0.4 metres. Stating suitable hypotheses and assuming a normal distribution, test the teacher's suspicion at the \(5 \%\) significance level.
CAIE FP2 2015 June Q8
12 marks Standard +0.8
8 A large number of long jumpers are competing in a national long jump competition. The distances, in metres, jumped by a random sample of 7 competitors are as follows. $$\begin{array} { l l l l l l l } 6.25 & 7.01 & 5.74 & 6.89 & 7.24 & 5.64 & 6.52 \end{array}$$ Assuming that distances are normally distributed, test, at the \(5 \%\) significance level, whether the mean distance jumped by long jumpers in this competition is greater than 6.2 metres. The distances jumped by another random sample of 8 long jumpers in this competition are recorded. Using the data from this sample of 8 long jumpers, a \(95 \%\) confidence interval for the population mean, \(\mu\) metres, is calculated as \(5.89 < \mu < 6.75\). Find the unbiased estimates for the population mean and population variance used in this calculation.
CAIE FP2 2015 June Q10 OR
Challenging +1.3
The times taken, in hours, by cyclists from two different clubs, \(A\) and \(B\), to complete a 50 km time trial are being compared. The times taken by a cyclist from club \(A\) and by a cyclist from club \(B\) are denoted by \(t _ { A }\) and \(t _ { B }\) respectively. A random sample of 50 cyclists from \(A\) and a random sample of 60 cyclists from \(B\) give the following summarised data. $$\Sigma t _ { A } = 102.0 \quad \Sigma t _ { A } ^ { 2 } = 215.18 \quad \Sigma t _ { B } = 129.0 \quad \Sigma t _ { B } ^ { 2 } = 282.3$$ Using a 5\% significance level, test whether, on average, cyclists from club \(A\) take less time to complete the time trial than cyclists from club \(B\). A test at the \(\alpha \%\) significance level shows that there is evidence that the population mean time for cyclists from club \(B\) exceeds the population mean time for cyclists from club \(A\) by more than 0.05 hours. Find the set of possible values of \(\alpha\).
CAIE FP2 2016 June Q7
8 marks Standard +0.3
7 A random sample of 9 observations of a normal variable \(X\) is taken. The results are summarised as follows. $$\Sigma x = 24.6 \quad \Sigma x ^ { 2 } = 68.5$$ Test, at the \(5 \%\) significance level, whether the population mean is greater than 2.5.
CAIE FP2 2016 June Q11 OR
Challenging +1.8
Petra is studying a particular species of bird. She takes a random sample of 12 birds from nature reserve \(A\) and measures the wing span, \(x \mathrm {~cm}\), for each bird. She then calculates a \(95 \%\) confidence interval for the population mean wing span, \(\mu \mathrm { cm }\), for birds of this species, assuming that wing spans are normally distributed. Later, she is not able to find the summary of the results for the sample, but she knows that the \(95 \%\) confidence interval is \(25.17 \leqslant \mu \leqslant 26.83\). Find the values of \(\sum x\) and \(\sum x ^ { 2 }\) for this sample. Petra also measures the wing spans of a random sample of 7 birds from nature reserve \(B\). Their wing spans, \(y \mathrm {~cm}\), are as follows. $$\begin{array} { l l l l l l l } 23.2 & 22.4 & 27.6 & 25.3 & 28.4 & 26.5 & 23.6 \end{array}$$ She believes that the mean wing span of birds found in nature reserve \(A\) is greater than the mean wing span of birds found in nature reserve \(B\). Assuming that this second sample also comes from a normal distribution, with variance the same as the first distribution, test, at the \(10 \%\) significance level, whether there is evidence to support Petra's belief.
CAIE FP2 2018 June Q7
7 marks Standard +0.3
7 A large number of athletes are taking part in a competition. The masses, in kg , of a random sample of 7 athletes are as follows. $$\begin{array} { l l l l l l l } 98.1 & 105.0 & 92.2 & 89.8 & 99.9 & 95.4 & 101.2 \end{array}$$ Assuming that masses are normally distributed, test, at the \(10 \%\) significance level, whether the mean mass of athletes in this competition is equal to 94 kg .
CAIE FP2 2018 June Q10
12 marks Standard +0.8
10 The times taken to run 400 metres by students at two large colleges \(P\) and \(Q\) are being compared. There is no evidence that the population variances are equal. The time taken by a student at college \(P\) and the time taken by a student at college \(Q\) are denoted by \(x\) seconds and \(y\) seconds respectively. A random sample of 50 students from college \(P\) and a random sample of 60 students from college \(Q\) give the following summarised data. $$\Sigma x = 2620 \quad \Sigma x ^ { 2 } = 138200 \quad \Sigma y = 3060 \quad \Sigma y ^ { 2 } = 157000$$
  1. Using a 10\% significance level, test whether, on average, students from college \(P\) take longer to run 400 metres than students from college \(Q\).
  2. Find a \(90 \%\) confidence interval for the difference in the mean times taken to run 400 metres by students from colleges \(P\) and \(Q\).
CAIE FP2 2018 June Q10
12 marks Standard +0.3
10 During the summer months, all members of a large swimming club take part in intensive training. The times taken to swim 50 metres at the beginning of the summer and at the end of the summer are recorded for each member of the club. The time taken, in seconds, at the beginning of the summer is denoted by \(x\) and the time taken at the end of the summer is denoted by \(y\). For a random sample of 9 members the results are shown in the following table.
Member\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)
\(x\)38.540.232.335.136.241.432.038.238.2
\(y\)37.438.131.634.734.238.631.836.336.8
The swimming coach believes that, on average, the time taken by a swimmer to swim 50 metres will decrease by more than one second as a result of the intensive training.
  1. Stating suitable hypotheses and assuming a normal distribution, test the coach's belief at the \(10 \%\) significance level.
  2. Find a 95\% confidence interval for the population mean time taken to swim 50 metres after the intensive training, assuming a normal distribution.
CAIE FP2 2019 June Q8
8 marks Standard +0.3
8 A large number of runners are attending a summer training camp. A random sample of 6 runners is chosen and their times to run 1500 m at the beginning of the camp and at the end of the camp are recorded. Their times, in minutes, are shown in the following table.
Runner\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)
Time at beginning of camp3.823.623.553.713.753.92
Time at end of camp3.723.553.523.683.543.73
The organiser of the training camp claims that a runner's time will improve by more than 0.05 minutes between the beginning and end of the camp. Assuming that differences in time over the two runs are normally distributed, test at the \(10 \%\) significance level whether the organiser's claim is justified. [8]
CAIE FP2 2019 June Q11 OR
Challenging +1.2
A farmer grows two different types of cherries, Type \(A\) and Type \(B\). He assumes that the masses of each type are normally distributed. He chooses a random sample of 8 cherries of Type \(A\). He finds that the sample mean mass is 15.1 g and that a \(95 \%\) confidence interval for the population mean mass, \(\mu \mathrm { g }\), is \(13.5 \leqslant \mu \leqslant 16.7\).
  1. Find an unbiased estimate for the population variance of the masses of cherries of Type \(A\).
    The farmer now chooses a random sample of 6 cherries of Type \(B\) and records their masses as follows.
    12.2
    13.3
    13.9
    14.0
    15.4
    16.4
  2. Test at the \(5 \%\) significance level whether the mean mass of cherries of Type \(B\) is less than the mean mass of cherries of Type \(A\). You should assume that the population variances for the two types of cherry are equal.
    If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
CAIE FP2 2019 June Q9
10 marks Standard +0.3
9 A farmer grows large amounts of a certain crop. On average, the yield per plant has been 0.75 kg . The farmer has improved the soil in which the crop grows, and she claims that the yield per plant has increased. A random sample of 10 plants grown in the improved soil is chosen. The yields, \(x \mathrm {~kg}\), are summarised as follows. $$\Sigma x = 7.85 \quad \Sigma x ^ { 2 } = 6.19$$
  1. Test at the \(5 \%\) significance level whether the farmer's claim is justified, assuming a normal distribution.
  2. Find a 95\% confidence interval for the population mean yield for plants grown in the new soil.
CAIE FP2 2019 June Q11 OR
Challenging +1.2
A company produces packets of sweets. Two different machines, \(A\) and \(B\), are used to fill the packets. The manager decides to assess the performance of the two machines. He selects a random sample of 50 packets filled by machine \(A\) and a random sample of 60 packets filled by machine \(B\). The masses of sweets, \(x \mathrm {~kg}\), in packets filled by machine \(A\) and the masses of sweets, \(y \mathrm {~kg}\), in packets filled by machine \(B\) are summarised as follows. $$\Sigma x = 22.4 \quad \Sigma x ^ { 2 } = 10.1 \quad \Sigma y = 28.8 \quad \Sigma y ^ { 2 } = 16.3$$ A test at the \(\alpha \%\) significance level provides evidence that the mean mass of sweets in packets filled by machine \(A\) is less than the mean mass of sweets in packets filled by machine \(B\). Find the set of possible values of \(\alpha\).
If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.