7. Two methods of extracting juice from an orange are to be compared. Eight oranges are halved. One half of each orange is chosen at random and allocated to Method \(A\) and the other half is allocated to Method \(B\). The amounts of juice extracted, in ml , are given in the table.
The lengths of components produced by the machines can be assumed to follow normal distributions.
- Use a two tail test to show, at the \(10 \%\) significance level, that the variances of the lengths of components produced by each machine can be assumed to be equal.
- Showing your working clearly, find a \(95 \%\) confidence interval for \(\mu _ { B } - \mu _ { A }\), where \(\mu _ { A }\) and \(\mu _ { B }\) are the mean lengths of the populations of components produced by machine \(A\) and machine \(B\) respectively.
There are serious consequences for the production at the factory if the difference in mean lengths of the components produced by the two machines is more than 0.7 cm .
- State, giving your reason, whether or not the factory manager should be concerned.
5. Rolls of cloth delivered to a factory contain defects at an average rate of \(\lambda\) per metre. A quality assurance manager selects a random sample of 15 metres of cloth from each delivery to test whether or not there is evidence that \(\lambda > 0.3\). The criterion that the manager uses for rejecting the hypothesis that \(\lambda = 0.3\) is that there are 9 or more defects in the sample. - Find the size of the test.
Table 1 gives some values, to 2 decimal places, of the power function of this test.
\begin{table}[h]
- Use a paired \(t\)-test to determine, at the \(10 \%\) level of significance, whether or not there is a difference in the mean blood pressure measured using the two methods. State your hypotheses clearly.
- State an assumption about the underlying distribution of measured blood pressure required for this test.
2. The value of orders, in \(\pounds\), made to a firm over the internet has distribution \(\mathrm { N } \left( \mu , \sigma ^ { 2 } \right)\). A random sample of \(n\) orders is taken and \(\bar { X }\) denotes the sample mean. - Write down the mean and variance of \(\bar { X }\) in terms of \(\mu\) and \(\sigma ^ { 2 }\).
A second sample of \(m\) orders is taken and \(\bar { Y }\) denotes the mean of this sample.
An estimator of the population mean is given by
$$U = \frac { n \bar { X } + m \bar { Y } } { n + m }$$ - Show that \(U\) is an unbiased estimator for \(\mu\).
- Show that the variance of \(U\) is \(\frac { \sigma ^ { 2 } } { n + m }\).
- State which of \(\bar { X }\) or \(U\) is a better estimator for \(\mu\). Give a reason for your answer.
3. The lengths, \(x \mathrm {~mm}\), of the forewings of a random sample of male and female adult butterflies are measured. The following statistics are obtained from the data. - Stating your hypotheses clearly, and using a \(10 \%\) level of significance, test whether or not there is evidence of a difference between the variances of the marks of the two groups.
- State clearly an assumption you have made to enable you to carry out the test in part (a).
- Use a two tailed test, with a \(5 \%\) level of significance, to determine if the playing of music during the test has made any difference in the mean marks of the two groups. State your hypotheses clearly.
- Write down what you can conclude about the effect of music on a student's performance during the test.
3. The weights, in grams, of mice are normally distributed. A biologist takes a random sample of 10 mice. She weighs each mouse and records its weight.
The ten mice are then fed on a special diet. They are weighed again after two weeks.
Their weights in grams are as follows: - State an assumption that needs to be made in order to carry out a \(t\)-test in this case.
- State why a paired \(t\)-test is suitable for use with these data.
- Using a \(5 \%\) level of significance, test whether or not there is evidence that the device reduces \(\mathrm { CO } _ { 2 }\) emissions from cars.
- Explain, in context, what a type II error would be in this case.
3. Define, in terms of \(\mathrm { H } _ { 0 }\) and/or \(\mathrm { H } _ { 1 }\), - the size of a hypothesis test,
- the power of a hypothesis test.
The probability of getting a head when a coin is tossed is denoted by \(p\). This coin is tossed 12 times in order to test the hypotheses \(\mathrm { H } _ { 0 } : p = 0.5\) against \(\mathrm { H } _ { 1 } : p \neq 0.5\), using a \(5 \%\) level of significance.
- Find the largest critical region for this test, such that the probability in each tail is less than \(2.5 \%\).
- Given that \(p = 0.4\)
- find the probability of a type II error when using this test,
- find the power of this test.
- Suggest two ways in which the power of the test can be increased.
4. A farmer set up a trial to assess whether adding water to dry feed increases the milk yield of his cows. He randomly selected 22 cows. Thirteen of the cows were given dry feed and the other 9 cows were given the feed with water added. The milk yields, in litres per day, were recorded with the following results.
You may assume that the times taken to complete the task by the students are two independent random samples from normal distributions. - Stating your hypotheses clearly, test, at the \(10 \%\) level of significance, whether or not the variances of the times taken to complete the task with and without background music are equal.
- Find a \(99 \%\) confidence interval for the difference in the mean times taken to complete the task with and without background music.
Experiments like this are often performed using the same people in each group.
- Explain why this would not be appropriate in this case.
2. As part of an investigation, a random sample of 10 people had their heart rate, in beats per minute, measured whilst standing up and whilst lying down. The results are summarized below.
Stating your hypotheses clearly, test, at the \(10 \%\) level of significance, whether or not the mean amount of juice produced by machine \(B\) is more than the mean amount produced by machine \(A\).
4. A proportion \(p\) of letters sent by a company are incorrectly addressed and if \(p\) is thought to be greater than 0.05 then action is taken.
Using \(\mathrm { H } _ { 0 } : p = 0.05\) and \(\mathrm { H } _ { 1 } : p > 0.05\), a manager from the company takes a random sample of 40 letters and rejects \(\mathrm { H } _ { 0 }\) if the number of incorrectly addressed letters is more than 3 . - Find the size of this test.
- Find the probability of a Type II error in the case where \(p\) is in fact 0.10 .
Table 1 below gives some values, to 2 decimal places, of the power function of this test.
The student decides to carry out a paired \(t\)-test to investigate whether, on average, the blood pressure of a person when sitting down is more than their blood pressure after standing up.
- State clearly the hypotheses that should be used and any necessary assumption that needs to be made.
- Carry out the test at the \(1 \%\) level of significance.
2. A biologist investigating the shell size of turtles takes random samples of adult female and adult make turtles and records the length, \(x \mathrm {~cm}\), of the shell. The results are summarised below.
Assuming that the scores are normally distributed and stating your hypotheses clearly, test at the \(5 \%\) level of significance whether or not there is evidence to support the teacher's belief.
(8)
6. A machine fills bottles with water. The amount of water in each bottle is normally distributed. To check the machine is working properly, a random sample of 12 bottles is selected and the amount of water, in ml , in each bottle is recorded. Unbiased estimates for the mean and variance are
$$\mu = 502 \quad s ^ { 2 } = 5.6$$
Stating your hypotheses clearly, test at the \(1 \%\) level of significance - whether or not the mean amount of water in a bottle is more than 500 ml ,
- whether or not the standard deviation of the amount of water in a bottle is less than 3 ml .
7. A machine produces bricks. The lengths, \(x \mathrm {~mm}\), of the bricks are distributed \(\mathrm { N } \left( \mu , 2 ^ { 2 } \right)\). At the start of each week a random sample of \(n\) bricks is taken to check the machine is working correctly.
A test is then carried out at the \(1 \%\) level of significance with
$$\mathrm { H } _ { 0 } : \mu = 202 \quad \text { and } \quad \mathrm { H } _ { 1 } : \mu < 202$$ - Find, in terms of \(n\), the critical region of the test.
The probability of a type II error, when \(\mu = 200\), is less than 0.05 .
- Find the minimum value of \(n\).