Two-sample z-test large samples - T-tests (unknown variance)

CAIE Further Paper 4 2021 June Q4

8 marks Standard +0.3

4 A scientist is investigating the lengths of the leaves of birch trees in different regions. He takes a random sample of 50 leaves from birch trees in region $A$ and a random sample of 60 leaves from birch trees in region $B$. He records their lengths in $\mathrm { cm } , x$ and $y$, respectively. His results are summarised as follows. $$\sum x = 282 \quad \sum x ^ { 2 } = 1596 \quad \sum y = 328 \quad \sum y ^ { 2 } = 1808$$ The population mean lengths of leaves from birch trees in regions $A$ and $B$ are $\mu _ { A } \mathrm {~cm}$ and $\mu _ { B } \mathrm {~cm}$ respectively. Carry out a test at the $5 \%$ significance level to test the null hypothesis $\mu _ { \mathrm { A } } = \mu _ { \mathrm { B } }$ against the alternative hypothesis $\mu _ { \mathrm { A } } \neq \mu _ { \mathrm { B } }$.

CAIE Further Paper 4 2020 June Q4

9 marks Standard +0.8

4 A company has two different machines, $X$ and $Y$, each of which fills empty cups with coffee. The manager is investigating the volumes of coffee, $x$ and $y$, measured in appropriate units, in the cups filled by machines $X$ and $Y$ respectively. She chooses a random sample of 50 cups filled by machine $X$ and a random sample of 40 cups filled by machine $Y$. The volumes are summarised as follows. $$\sum x = 15.2 \quad \sum x ^ { 2 } = 5.1 \quad \sum y = 13.4 \quad \sum y ^ { 2 } = 4.8$$ The manager claims that there is no difference between the mean volume of coffee in cups filled by machine $X$ and the mean volume of coffee in cups filled by machine $Y$. Test the manager's claim at the $10 \%$ significance level.

CAIE Further Paper 4 2023 June Q4

9 marks Standard +0.3

4 An inspector is checking the lengths of metal rods produced by two machines, $X$ and $Y$. These rods should be of the same length, but the inspector suspects that those made by machine $X$ are shorter, on average, than those made by machine $Y$. The inspector chooses a random sample of 80 rods made by machine $X$ and a random sample of 60 rods made by machine $Y$. The lengths of these rods are $x \mathrm {~cm}$ and $y \mathrm {~cm}$ respectively. Her results are summarised as follows. $$\sum x = 164.0 \quad \sum x ^ { 2 } = 338.1 \quad \sum y = 124.8 \quad \sum y ^ { 2 } = 261.1$$

Test at the $10 \%$ significance level whether the data supports the inspector's suspicion.
Give a reason why it is not necessary to make any assumption about the distributions of the lengths of the rods.

CAIE Further Paper 4 2022 November Q3

8 marks Standard +0.3

3 A scientist is investigating the masses of birds of a certain species in country $X$ and country $Y$. She takes a random sample of 50 birds of this species from country $X$ and a random sample of 80 birds of this species from country $Y$. She records their masses in $\mathrm { kg } , x$ and $y$, respectively. Her results are summarised as follows. $$\sum x = 75.5 \quad \sum x ^ { 2 } = 115.2 \quad \sum y = 116.8 \quad \sum y ^ { 2 } = 172.6$$ The population mean masses of these birds in countries $X$ and $Y$ are $\mu _ { x } \mathrm {~kg}$ and $\mu _ { y } \mathrm {~kg}$ respectively.
Test, at the $5 \%$ significance level, the null hypothesis $\mu _ { \mathrm { x } } = \mu _ { \mathrm { y } }$ against the alternative hypothesis $\mu _ { \mathrm { x } } > \mu _ { \mathrm { y } }$. State your conclusion in the context of the question.

OCR S3 2016 June Q6

11 marks Challenging +1.2

6 The masses at birth, in kg, of random samples of babies were recorded for each of the years 1970 and 2010. The table shows the sample mean and an unbiased estimate of the population variance for each year.

Year

No. of babies

Sample

mean

Unbiased estimate of

population variance

1970

285

3.303

0.2043

2010

260

3.352

0.2323

A researcher tests the null hypothesis that babies born in 2010 are 0.04 kg heavier, on average, than babies born in 1970, against the alternative hypothesis that they are more than 0.04 kg heavier on average. Show that, at the $5 \%$ level of significance, the null hypothesis is not rejected.
Another researcher chooses samples of equal size, $n$, for the two years. Using the same hypothesis as in part (i), she finds that the null hypothesis is rejected at the $5 \%$ level of significance. Assuming that the sample means and unbiased estimates of population variance for the two years are as given in the table, find the smallest possible value of $n$.

CAIE FP2 2011 June Q9

9 marks Challenging +1.2

9 Mr Lee asserts that boys are slower than girls at completing a particular mathematical puzzle. In order to test his assertion, a random sample of 40 boys and a random sample of 60 girls are selected from a large group of students who attempted the puzzle. The times taken by the boys, $b$ minutes, and the times taken by the girls, $g$ minutes, are summarised as follows. $$\Sigma b = 92.0 \quad \Sigma b ^ { 2 } = 216.5 \quad \Sigma g = 129.8 \quad \Sigma g ^ { 2 } = 288.8$$ Test at the $2.5 \%$ significance level whether this evidence supports Mr Lee's assertion.

CAIE FP2 2014 June Q8

9 marks Standard +0.3

8 Weekly expenses claimed by employees at two different branches, $A$ and $B$, of a large company are being compared. Expenses claimed by an employee at branch $A$ and by an employee at branch $B$ are denoted by $\$ x$ and $\$ y$ respectively. A random sample of 60 employees from branch $A$ and a random sample of 50 employees from branch $B$ give the following summarised data. $$\Sigma x = 6060 \quad \Sigma x ^ { 2 } = 626220 \quad \Sigma y = 4750 \quad \Sigma y ^ { 2 } = 464500$$ Using a $2 \%$ significance level, test whether, on average, employees from branch $A$ claim the same as employees from branch $B$.

CAIE FP2 2012 November Q9

10 marks Standard +0.3

9 Experiments are conducted to test the breaking strength of each of two types of rope, $P$ and $Q$. A random sample of 50 ropes of type $P$ and a random sample of 70 ropes of type $Q$ are selected. The breaking strengths, $p$ and $q$, measured in appropriate units, are summarised as follows. $$\Sigma p = 321.2 \quad \Sigma p ^ { 2 } = 2120.0 \quad \Sigma q = 475.3 \quad \Sigma q ^ { 2 } = 3310.0$$ Test, at the $10 \%$ significance level, whether the mean breaking strengths of type $P$ and type $Q$ ropes are the same.

OCR MEI Paper 2 2024 June Q15

17 marks Standard +0.3

15 Bottles of Fizzipop nominally contain 330 ml of drink. A consumer affairs researcher collects a random sample of 55 bottles of Fizzipop and records the volume of drink in each bottle. Summary statistics for the researcher's sample are shown in the table.

$n$	55
$\sum x$	18535
$\sum x ^ { 2 }$	6247066.6

1. Calculate the mean volume of drink in a bottle of Fizzipop.
2. Show that the standard deviation of the volume of drink in a bottle of Fizzipop is 3.78 ml . The researcher uses software to produce a histogram with equal class intervals, which is shown below. \includegraphics[max width=\textwidth, alt={}, center]{8e48bbd3-2166-49e7-8906-833261f331ca-10_533_759_1181_251}
Explain why the researcher decides that the Normal distribution is a suitable model for the volume of drink in a bottle of Fizzipop.
Use your answers to parts (a) and (b) to determine the expected number of bottles which contain less than 330 ml in a random sample of 100 bottles. In order to comply with new regulations, no more than 1\% of bottles of Fizzipop should contain less than 330 ml . The manufacturer decides to meet the new regulations by adjusting the manufacturing process so that the mean volume of drink in a bottle of Fizzipop is increased. The standard deviation is unaltered.
Determine the minimum mean volume of drink in a bottle of Fizzipop which should ensure that the new regulations are met. Give your answer to $\mathbf { 3 }$ significant figures. The mean volume of drink in a bottle of Fizzipop is set to 340 ml . After several weeks the quality control manager suspects the mean volume may have reduced. She collects a random sample of 100 bottles of Fizzipop. The mean volume of drink in a bottle in the sample is found to be 339.37 ml .
Assuming the standard deviation is unaltered, conduct a hypothesis test at the $5 \%$ level to determine whether there is any evidence to suggest that the mean volume of drink in a bottle of Fizzipop is less than 340 ml .

OCR MEI Paper 2 2020 November Q9

9 marks Standard +0.3

9 A company supplies computers to businesses. In the past the company has found that computers are kept by businesses for a mean time of 5 years before being replaced. Claud, the manager of the company, thinks that the mean time before replacing computers is now different.

Describe how Claud could obtain a cluster sample of 120 computers used by businesses the company supplies. Claud decides to conduct a hypothesis test at the $5 \%$ level to test whether there is evidence to suggest that the mean time that businesses keep computers is not 5 years. He takes a random sample of 120 computers. Summary statistics for the length of time computers in this sample are kept are shown in Fig. 9. \begin{table}[h]
Statistics
$n$ 120
Mean 4.8855
$\sigma$ 2.6941
$s$ 2.7054
$\Sigma x$ 586.2566
$\Sigma x ^ { 2 }$ 3735.1475
Min 0.1213
Q1 2.5472
Median 4.8692
Q3 7.0349
Max 9.9856
\captionsetup{labelformat=empty} \caption{Fig. 9}
\end{table} \section*{(b) In this question you must show detailed reasoning.}
- State the hypotheses for this test, explaining why the alternative hypothesis takes the form it does.
- Use a suitable distribution to carry out the test.

Edexcel S3 2023 January Q5

10 marks Standard +0.3

5 Claire grows strawberries on her farm. She wants to compare two brands of fertiliser, brand $A$ and brand $B$. She grows two sets of plants of the same variety of strawberries under the same conditions, fertilising one set with brand $A$ and the other with brand $B$. The yields per plant, in grams, from each set of plants are summarised below.

	Mean	Standard deviation	Number of plants
Fertiliser A	1377	17.8	50
Fertiliser B	1368	18.4	40

Stating your hypotheses clearly, carry out a suitable test to assess whether the mean yield from plants using fertiliser $A$ is greater than the mean yield from plants using fertiliser $B$.
Use a 1\% level of significance and state your test statistic and critical value. The total cost of fertiliser $A$ for Claire's 50 plants was $\pounds 75$ The total cost of fertiliser $B$ for Claire's 40 plants was $\pounds 50$ Claire sells all her strawberries at $\pounds 3$ per kilogram.
Use this information, together with your answer in part (a), to advise Claire on which of the two brands of fertiliser she should use next year in order to maximise her expected profit per plant, giving a reason for your answer.

Edexcel S3 2014 June Q3

7 marks Standard +0.3

3. A grocer believes that the average weight of a grapefruit from farm $A$ is greater than the average weight of a grapefruit from farm $B$. The weights, in grams, of 80 grapefruit selected at random from farm $A$ have a mean value of 532 g and a standard deviation, $s _ { A }$, of 35 g . A random sample of 100 grapefruit from farm $B$ have a mean weight of 520 g and a standard deviation, $s _ { B }$, of 28 g . Stating your hypotheses clearly and using a 1\% level of significance, test whether or not the grocer's belief is supported by the data.

Edexcel S3 2017 June Q5

11 marks Standard +0.3

5. A dance studio has 800 dancers of which \begin{displayquote} 452 are beginners
251 are intermediates
97 are professionals

Explain in detail how a stratified sample of size 50 could be taken.
State an advantage of stratified sampling rather than simple random sampling in this situation. \end{displayquote} Independent random samples of 80 beginners and 60 intermediates are chosen. Each of these dancers is given an assessment score, $x$, based on the quality of their dancing. The results are summarised in the table below.
$\bar { x }$ $s ^ { 2 }$ $n$
Beginners 31.7 57.3 80
Intermediates 36.9 38.1 60
The studio manager believes that the mean score of intermediates is more than 3 points greater than the mean score of beginners.
Stating your hypotheses clearly and using a $5 \%$ level of significance, test whether or not these data support the studio manager's belief.

Edexcel S3 2023 June Q6

13 marks Standard +0.3

Roxane, a scientist, carries out an investigation into the fat content of different brands of crisps.

Roxane took random samples of different brands of crisps and recorded, in grams, the fat content ( $x$ ) of a 30 gram serving. The table below shows some results for just two of these brands.

Brand	$\sum x$	$\sum \boldsymbol { x } ^ { \mathbf { 2 } }$	$\bar { x }$	$s$	Sample size
A	350	1753.9744	5.0	0.24	70
B	331.5	1694.65	$\alpha$	β	65

Calculate the value of $\alpha$ and the value of $\beta$ Roxane claims that these results show that the crisps from brand A have a lower fat content than the crisps from brand B , as the mean fat content for brand A is, statistically, significantly less than the mean fat content for brand B .
Stating your hypotheses clearly, carry out a suitable test, at the $5 \%$ level of significance, to assess Roxane's claim.
You should state your test statistic and critical value.
For the test in part (b), state whether or not it is necessary to assume that the fat content of crisps is normally distributed. Give a reason for your answer.
State an assumption you have made in carrying out the test in part (b).

Edexcel S3 2021 October Q6

12 marks Standard +0.3

6. Amala believes that the resting heart rate is lower in men who exercise regularly compared to men who do not exercise regularly. She measures the resting heart rate, $h$, of a random sample of 50 men who exercise regularly and a random sample of 40 men who do not exercise regularly. Her results are summarised in the table below.

\cline { 2 - 6 } \multicolumn{1}{c|}{}

Sample

size

$\sum \boldsymbol { h }$

$\sum \boldsymbol { h } ^ { 2 }$

Unbiased

estimate of

the mean

Unbiased

estimate of

the variance

Exercise regularly

50

3270

214676

$\alpha$

$\beta$

Do not exercise

regularly

40

2832

201660

70.8

29.6

Calculate the value of $\alpha$ and the value of $\beta$
Test, at the $5 \%$ level of significance, whether there is evidence to support Amala's belief. State your hypotheses clearly.
Explain the significance of the central limit theorem to the test in part (b).
State two assumptions you have made in carrying out the test in part (b).

Edexcel S3 2018 Specimen Q6

13 marks Standard +0.3

6. As part of an investigation, a random sample was taken of 50 footballers who had completed an obstacle course in the early morning. The time taken by each of these footballers to complete the obstacle course, $x$ minutes, was recorded and the results are summarised by $$\sum x = 1570 \quad \text { and } \quad \sum x ^ { 2 } = 49467.58$$

Find unbiased estimates for the mean and variance of the time taken by footballers to complete the obstacle course in the early morning. An independent random sample was taken of 50 footballers who had completed the same obstacle course in the late afternoon. The time taken by each of these footballers to complete the obstacle course, $y$ minutes, was recorded and the results are summarised as $$\bar { y } = 30.9 \quad \text { and } \quad s _ { y } ^ { 2 } = 3.03$$
Test, at the $5 \%$ level of significance, whether or not the mean time taken by footballers to complete the obstacle course in the early morning, is greater than the mean time taken by footballers to complete the obstacle course in the late afternoon. State your hypotheses clearly.
Explain the relevance of the Central Limit Theorem to the test in part (b).
State an assumption you have made in carrying out the test in part (b).

Edexcel S3 2003 June Q5

11 marks Standard +0.3

5. A scientist monitored the levels of river pollution near a factory. Before the factory was closed down she took 100 random samples of water from different parts of the river and found an average weight of pollutants of $10 \mathrm { mg } \mathrm { l } ^ { - 1 }$ with a standard deviation of $2.64 \mathrm { mg } \mathrm { l } ^ { - 1 }$. After the factory was closed down the scientist collected a further 120 random samples and found that they contained $8 \mathrm { mg } \mathrm { l } ^ { - 1 }$ of pollutants on average with a standard deviation of $1.94 \mathrm { mg } \mathrm { l } ^ { - 1 }$. Test, at the $5 \%$ level of significance, whether or not the mean river pollution fell after the factory closed down.

Edexcel S3 2007 June Q5

14 marks Standard +0.3

In a trial of $\operatorname { diet } A$ a random sample of 80 participants were asked to record their weight loss, $x \mathrm {~kg}$, after their first week of using the diet. The results are summarised by

$$\sum x = 361.6 \text { and } \sum x ^ { 2 } = 1753.95$$

Find unbiased estimates for the mean and variance of weight lost after the first week of using diet $A$. The designers of diet $A$ believe it can achieve a greater mean weight loss after the first week than a standard diet $B$. A random sample of 60 people used diet $B$. After the first week they had achieved a mean weight loss of 4.06 kg , with an unbiased estimate of variance of weight loss of $2.50 \mathrm {~kg} ^ { 2 }$.
Test, at the $5 \%$ level of significance, whether or not the mean weight loss after the first week using $\operatorname { diet } A$ is greater than that using diet $B$. State your hypotheses clearly.
Explain the significance of the central limit theorem to the test in part (b).
State an assumption you have made in carrying out the test in part (b).

Edexcel S3 2012 June Q5

9 marks Standard +0.3

5. Mr Alan and Ms Burns are two Mathematics teachers teaching mixed ability groups of students in a large college. At the end of the college year all students took the same examination. A random sample of 29 of Mr Alan's students and a random sample of 26 of Ms Burns' students are chosen. The results are summarised in the table below.

	Sample Size, $n$	Mean, $\bar { x }$	Standard Deviation, $s$
Mr Alan	29	80	10
Ms Burns	26	74	15

Stating your hypotheses clearly, test, at the $10 \%$ level of significance whether there is evidence that there is a difference in the mean scores of their students. Ms Burns thinks the comparison was unfair as the examination was set by Mr Alan. She looks up a different set of examination results for these students and, although Mr Alan's sample has a higher mean, she calculates the test statistic for this new set of results to be 1.6 However, Mr Alan now claims that the mean marks of his students are higher than the mean marks of Ms Burns' students.
Test Mr Alan's claim, stating the hypotheses and critical values you would use. Use a $10 \%$ level of significance.

Edexcel S3 2014 June Q5

13 marks Standard +0.3

5. A student believes that there is a difference in the mean lengths of English and French films. He goes to the university video library and randomly selects a sample of 120 English films and a sample of 70 French films. He notes the length, $x$ minutes, of each of the films in his samples. His data are summarised in the table below.

	$\Sigma x$	$\Sigma x ^ { 2 }$	$s ^ { 2 }$	$n$
English films	10650	956909	98.5	120
French films	6510	615849	151	70

Verify that the unbiased estimate of the variance, $s ^ { 2 }$, of the lengths of English films is 98.5 minutes ${ } ^ { 2 }$
Stating your hypotheses clearly, test, at the 1\% level of significance, whether or not the mean lengths of English and French films are different.
Explain the significance of the Central Limit Theorem to the test in part (b).
The university video library contained 724 English films and 473 French films. Explain how the student could have taken a stratified sample of 190 of these films.

AQA S3 2009 June Q4

8 marks Standard +0.8

4 Holly, a horticultural researcher, believes that the mean height of stems on Tahiti daffodils exceeds that on Jetfire daffodils by more than 15 cm . She measures the heights, $x$ centimetres, of stems on a random sample of 65 Tahiti daffodils and finds that their mean, $\bar { x }$, is 40.7 and that their standard deviation, $s _ { x }$, is 3.4 . She also measures the heights, $y$ centimetres, of stems on a random sample of 75 Jetfire daffodils and finds that their mean, $\bar { y }$, is 24.4 and that their standard deviation, $s _ { y }$, is 2.8 . Investigate, at the $1 \%$ level of significance, Holly's belief.

AQA S3 2012 June Q2

7 marks Moderate -0.3

2 As part of a comparison of two varieties of cucumber, Fanfare and Marketmore, random samples of harvested cucumbers of each variety were selected and their lengths measured, in centimetres. The results are summarised in the table.

\multirow{2}{*}{}		\multirow[b]{2}{*}{Sample size}	Length (cm)
			Sample mean	Sample standard deviation
\multirow{2}{*}{Cucumber variety}	Fanfare	50	22.0	1.31
	Marketmore	75	21.6	0.702

Test, at the $1 \%$ level of significance, the hypothesis that there is no difference between the mean length of harvested Fanfare cucumbers and that of harvested Marketmore cucumbers.
In addition to length, name one other characteristic of cucumbers that could be used for comparative purposes.

CAIE FP2 2009 November Q8

9 marks Challenging +1.2

150 sheep, chosen from a large flock of sheep, were divided into two groups of 75. Over a fixed period, one group had their grazing controlled and the other group grazed freely. The gains in weight, in kg, were recorded for each animal and the table below shows the sample means and the unbiased estimates of the population variances for the two samples.

	Sample mean	Unbiased estimate of population variance
Controlled grazing	19.14	20.54
Free grazing	15.36	9.84

It is required to test whether the population mean for sheep having their grazing controlled exceeds the population mean for sheep grazing freely by less than 5 kg. State, giving a reason, if it is necessary for the validity of the test to assume that the two population variances are equal. [1] Stating any other assumption, carry out the test at the 5\% significance level. [8]

Edexcel S3 2006 June Q3

9 marks Standard +0.3

A biologist investigated whether or not the diet of chickens influenced the amount of cholesterol in their eggs. The cholesterol content of 70 eggs selected at random from chickens fed diet A had a mean value of 198 mg and a standard deviation of 47 mg. A random sample of 90 eggs from chickens fed diet B had a mean cholesterol content of 201 mg and a standard deviation of 23 mg.

Stating your hypotheses clearly and using a 5\% level of significance, test whether or not there is a difference between the mean cholesterol content of eggs laid by chickens fed on these two diets. [7]
State, in the context of this question, an assumption you have made in carrying out the test in part (a). [2]

Edexcel S3 2009 June Q6

10 marks Standard +0.3

The lengths of a random sample of 120 limpets taken from the upper shore of a beach had a mean of 4.97 cm and a standard deviation of 0.42 cm. The lengths of a second random sample of 150 limpets taken from the lower shore of the same beach had a mean of 5.05 cm and a standard deviation of 0.67 cm.

Test, using a 5\% level of significance, whether or not the mean length of limpets from the upper shore is less than the mean length of limpets from the lower shore. State your hypotheses clearly. [8]
State two assumptions you made in carrying out the test in part (a). [2]

Statistics
\(n\)	120
Mean	4.8855
\(\sigma\)	2.6941
\(s\)	2.7054
\(\Sigma x\)	586.2566
\(\Sigma x ^ { 2 }\)	3735.1475
Min	0.1213
Q1	2.5472
Median	4.8692
Q3	7.0349
Max	9.9856

	\(\bar { x }\)	\(s ^ { 2 }\)	\(n\)
Beginners	31.7	57.3	80
Intermediates	36.9	38.1	60

Brand	\(\sum x\)	\(\sum \boldsymbol { x } ^ { \mathbf { 2 } }\)	\(\bar { x }\)	\(s\)	Sample size
A	350	1753.9744	5.0	0.24	70
B	331.5	1694.65	\(\alpha\)	β	65

	Sample Size, \(n\)	Mean, \(\bar { x }\)	Standard Deviation, \(s\)
Mr Alan	29	80	10
Ms Burns	26	74	15