Paired comparison or matched samples

Questions involving the same subjects measured twice or matched pairs (e.g., same person testing two bikes, same plots with/without treatment) requiring a paired t-test approach rather than independent samples.

9 questions · Standard +0.5

Sort by: Default | Easiest first | Hardest first
CAIE Further Paper 4 2021 November Q6
10 marks Standard +0.8
6 A scientist is investigating the masses of a particular type of fish found in lakes \(A\) and \(B\). He chooses a random sample of 10 fish of this type from lake \(A\) and records their masses, \(x \mathrm {~kg}\), as follows.
0.9
1.8
1.8
1.9
2.1
2.4
2.6
2.2
2.5
3.0 The scientist also chooses a random sample of 12 fish of this type from lake \(B\), but he only has a summary of their masses, \(y \mathrm {~kg}\), as follows. $$\sum y = 24.48 \quad \sum y ^ { 2 } = 53.75$$ Test at the \(10 \%\) significance level whether the mean mass of fish of this type in lake \(A\) is greater than the mean mass of fish of this type in lake \(B\). You should state any assumptions that you need to make for the test to be valid.
[0pt] [10]
If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
CAIE Further Paper 4 2022 November Q6
9 marks Standard +0.8
6 A company manufactures copper pipes. The pipes are produced by two different machines, \(A\) and \(B\). An inspector claims that the mean diameter of the pipes produced by machine \(A\) is greater than the mean diameter of the pipes produced by machine \(B\). He takes a random sample of 12 pipes produced by machine \(A\) and measures their diameters, \(x \mathrm {~cm}\). His results are summarised as follows. $$\sum x = 6.24 \quad \sum x ^ { 2 } = 3.26$$ He also takes a random sample of 10 pipes produced by machine \(B\) and measures their diameters in cm. His results are as follows. $$\begin{array} { l l l l l l l l l l } 0.48 & 0.53 & 0.47 & 0.54 & 0.54 & 0.55 & 0.46 & 0.55 & 0.50 & 0.48 \end{array}$$ The diameters of the pipes produced by each machine are assumed to be normally distributed with equal population variances. Test at the \(2.5 \%\) significance level whether the data supports the inspector's claim.
If you use the following page to complete the answer to any question, the question number must be clearly shown.
CAIE Further Paper 4 2024 November Q6
8 marks Standard +0.8
6 Ansal is investigating the wingspans of Monarch butterflies in two different regions, \(X\) and \(Y\). He takes a random sample of 8 Monarch butterflies from region \(X\) and records their wingspans, \(x \mathrm {~cm}\). His results are as follows. $$\begin{array} { l l l l l l l l } 8.2 & 7.0 & 7.3 & 8.8 & 7.8 & 8.5 & 9.2 & 7.4 \end{array}$$ Ansal also takes a random sample of 9 Monarch butterflies from region \(Y\) and records their wingspans, \(y \mathrm {~cm}\). His results are summarised as follows. $$\sum y = 71.10 \quad \sum y ^ { 2 } = 567.13$$ Ansal suspects that the mean wingspan of Monarch butterflies from region \(X\) is greater than the mean wingspan of Monarch butterflies from region \(Y\). It is known that the wingspans of Monarch butterflies in regions \(X\) and \(Y\) are normally distributed with equal population variances. Test, at the 10\% significance level, whether Ansal's suspicion is supported by the data. \includegraphics[max width=\textwidth, alt={}, center]{e2a45d19-7d48-4aa5-93f9-6ef90f99d7c4-12_2717_35_109_2012} \includegraphics[max width=\textwidth, alt={}, center]{e2a45d19-7d48-4aa5-93f9-6ef90f99d7c4-13_2726_35_97_20}
If you use the following page to complete the answer to any question, the question number must be clearly shown. \includegraphics[max width=\textwidth, alt={}, center]{e2a45d19-7d48-4aa5-93f9-6ef90f99d7c4-14_2715_33_109_2012}
CAIE Further Paper 4 2024 November Q6
8 marks Standard +0.3
6 Ansal is investigating the wingspans of Monarch butterflies in two different regions, \(X\) and \(Y\). He takes a random sample of 8 Monarch butterflies from region \(X\) and records their wingspans, \(x \mathrm {~cm}\). His results are as follows. $$\begin{array} { l l l l l l l l } 8.2 & 7.0 & 7.3 & 8.8 & 7.8 & 8.5 & 9.2 & 7.4 \end{array}$$ Ansal also takes a random sample of 9 Monarch butterflies from region \(Y\) and records their wingspans, \(y \mathrm {~cm}\). His results are summarised as follows. $$\sum y = 71.10 \quad \sum y ^ { 2 } = 567.13$$ Ansal suspects that the mean wingspan of Monarch butterflies from region \(X\) is greater than the mean wingspan of Monarch butterflies from region \(Y\). It is known that the wingspans of Monarch butterflies in regions \(X\) and \(Y\) are normally distributed with equal population variances. Test, at the 10\% significance level, whether Ansal's suspicion is supported by the data. \includegraphics[max width=\textwidth, alt={}, center]{8b2a13d7-62f4-45a7-84c5-7d5bc870b8ce-12_2715_44_110_2006} \includegraphics[max width=\textwidth, alt={}, center]{8b2a13d7-62f4-45a7-84c5-7d5bc870b8ce-13_2726_35_97_20}
If you use the following page to complete the answer to any question, the question number must be clearly shown. \includegraphics[max width=\textwidth, alt={}, center]{8b2a13d7-62f4-45a7-84c5-7d5bc870b8ce-14_2714_38_109_2010}
OCR S3 2006 January Q6
13 marks Standard +0.3
6 A company with a large fleet of cars compared two types of tyres, \(A\) and \(B\). They measured the stopping distances of cars when travelling at a fixed speed on a dry road. They selected 20 cars at random from the fleet and divided them randomly into two groups of 10 , one group being fitted with tyres of type \(A\) and the other group with tyres of type \(B\). One of the cars fitted with tyres of type \(A\) broke down so these tyres were tested on only 9 cars. The stopping distances, \(x\) metres, for the two samples are summarised by $$n _ { A } = 9 , \quad \bar { x } _ { A } = 17.30 , \quad s _ { A } ^ { 2 } = 0.7400 , \quad n _ { B } = 10 , \quad \bar { x } _ { B } = 14.74 , \quad s _ { B } ^ { 2 } = 0.8160 ,$$ where \(s _ { A } ^ { 2 }\) and \(s _ { B } ^ { 2 }\) are unbiased estimates of the two population variances.
It is given that the two populations have the same variance.
  1. Show that an unbiased estimate of this variance is 0.780 , correct to 3 decimal places. The population mean stopping distances for cars with tyres of types \(A\) and \(B\) are denoted by \(\mu _ { A }\) metres and \(\mu _ { B }\) metres respectively.
  2. Stating any further assumption you need to make, calculate a \(98 \%\) confidence interval for \(\mu _ { A } - \mu _ { B }\). The manufacturers of Type \(B\) tyres assert that \(\mu _ { B } < \mu _ { A } - 2\).
  3. Carry out a significance test of this assertion at the \(5 \%\) significance level. \section*{[Question 7 is printed overleaf.]}
OCR S3 2007 January Q3
10 marks Standard +0.3
3 A new treatment of cotton thread, designed to increase the breaking strength, was tested on a random sample of 6 pieces of a standard length. The breaking strengths, in grams, were as follows. $$\begin{array} { l l l l l l } 17.3 & 18.4 & 18.6 & 17.2 & 17.5 & 19.3 \end{array}$$ The breaking strengths of a random sample of 5 similar pieces of the thread which had not been treated were as follows. \section*{\(\begin{array} { l l l l l } 18.6 & 17.2 & 16.3 & 17.4 & 16.8 \end{array}\)} A test of whether the treatment has been successful is to be carried out.
  1. State what distributional assumptions are needed.
  2. Carry out the test at the \(10 \%\) significance level.
OCR S3 2012 January Q7
18 marks Standard +0.3
7 The manufacturer's specification for batteries used in a certain electronic game is that the mean lifetime should be 32 hours. The manufacturer tests a random sample of 10 batteries made in Factory \(A\), and the lifetimes ( \(x\) hours) are summarised by $$n = 10 , \sum x = 289.0 \text { and } \sum x ^ { 2 } = 8586.19 .$$ It may be assumed that the population of lifetimes has a normal distribution.
  1. Carry out a one-tail test at the \(5 \%\) significance level of whether the specification is being met.
  2. Justify the use of a one-tail test in this context. Batteries made with the same specification are also made in Factory \(B\). The lifetimes of these batteries are also normally distributed. A random sample of 12 batteries from this factory was tested. The lifetimes are summarised by $$n = 12 , \sum x = 363.0 \text { and } \sum x ^ { 2 } = 11290.95 \text {. }$$
  3. (a) State what further assumption must be made in order to test whether there is any difference in the mean lifetimes of batteries made at the two factories.
    Use the data to comment on whether this assumption is reasonable.
    (b) Carry out the test at the \(10 \%\) significance level.
OCR MEI S4 2006 June Q3
24 marks Standard +0.3
3 The human resources department of a large company is investigating two methods, A and B, for training employees to carry out a certain complicated and intricate task.
  1. Two separate random samples of employees who have not previously performed the task are taken. The first sample is of size 10 ; each of the employees in it is trained by method A. The second sample is of size 12; each of the employees in it is trained by method B. After completing the training, the time for each employee to carry out the task is measured, in controlled conditions. The times are as follows, in minutes.
    Employees trained by method A:35.247.825.838.053.631.033.9
    35.421.642.5
    Employees trained by method B:43.057.568.620.931.444.962.8
    27.641.846.139.861.6
    Stating appropriate assumptions concerning the underlying populations, use a \(t\) test at the \(5 \%\) significance level to examine whether either training method is better in respect of leading, on the whole, to a lower time to carry out the task.
  2. A further trial of method B is carried out to see if the performance of experienced and skilled workers can be improved by re-training them. A random sample of 8 such workers is taken. The times in minutes, under controlled conditions, for each worker to carry out the task before and after re-training are as follows.
    Worker\(W _ { 1 }\)\(W _ { 2 }\)\(W _ { 3 }\)\(W _ { 4 }\)\(W _ { 5 }\)\(W _ { 6 }\)\(W _ { 7 }\)\(W _ { 8 }\)
    Time before32.628.522.927.634.928.834.231.3
    Time after26.224.119.028.629.320.036.019.2
    Stating an appropriate assumption, use a \(t\) test at the \(5 \%\) significance level to examine whether the re-training appears, on the whole, to lead to a lower time to carry out the task.
  3. Explain how the test procedure in part (ii) is enhanced by designing it as a paired comparison.
OCR S3 2014 June Q8
10 marks Standard +0.3
8 A random sample of 20 plots of land, each of equal area, was used to test whether the addition of phosphorus would increase the yield of corn. 10 plots were treated with phosphorus and 10 plots were untreated. The yields of corn, in litres, on a treated plot and on an untreated plot are denoted by \(X\) and \(Y\) respectively. You are given that $$\sum x = 2112 , \quad \sum y = 2008$$ You are also given that an unbiased estimate for the variance of treated plots is 87.96 and an unbiased estimate for the variance of untreated plots is 31.96 , both correct to 4 significant figures.
  1. You may assume that the population variance estimates are sufficiently similar for the assumption of common variance to be made. What other assumption needs to be made for a \(t\)-test to be valid?
  2. Carry out a suitable \(t\)-test at the \(1 \%\) significance level, to test whether the use of phosphorus increases the yield of corn.