Paired t-test

A question is this type if and only if it asks to perform a paired t-test on matched or repeated measures data, assuming normality of differences.

33 questions · Standard +0.4

Sort by: Default | Easiest first | Hardest first
OCR S3 2011 June Q7
15 marks Standard +0.3
7 In order to improve their mathematics results 10 students attended an intensive Summer School course. Each student took a test at the start of the course and a similar test at the end of the course. The table shows the scores achieved in each test.
Student12345678910
First test score37273847542752396223
Second test score47295044723763457632
It is desired to test whether there has been an increase in the population mean score.
  1. Explain why a two-sample \(t\)-test would not be appropriate.
  2. Stating any necessary assumptions, carry out a suitable \(t\)-test at the \(\frac { 1 } { 2 } \%\) significance level.
  3. The Summer School director claims that after taking the course the population mean score increases by more than 5 . Is there sufficient evidence for this claim?
OCR S3 Specimen Q7
15 marks Standard +0.3
7 A factory manager wished to compare two methods of assembling a new component, to determine which method could be carried out more quickly, on average, by the workforce. A random sample of 12 workers was taken, and each worker tried out each of the methods of assembly. The times taken, in seconds, are shown in the table.
Worker\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)\(K\)\(L\)
Time in seconds for Method 1483847596241505258544960
Time in seconds for Method 2474038555742424062474751
  1. (a) Carry out an appropriate \(t\)-test, using a \(2 \%\) significance level, to test whether there is any difference in the times for the two methods of assembly.
    (b) State an assumption needed in carrying out this test.
    (c) Calculate a \(95 \%\) confidence interval for the population mean time difference for the two methods of assembly.
  2. Instead of using the same 12 workers to try both methods, the factory manager could have used two independent random samples of workers, allocating Method 1 to the members of one sample and Method 2 to the members of the other sample.
    (a) State one disadvantage of a procedure based on two independent random samples.
    (b) State any assumptions that would need to be made to carry out a \(t\)-test based on two independent random samples.
OCR MEI S3 2006 January Q3
18 marks Standard +0.3
3 A production line has two machines, A and B , for delivering liquid soap into bottles. Each machine is set to deliver a nominal amount of 250 ml , but it is not expected that they will work to a high level of accuracy. In particular, it is known that the ambient temperature affects the rate of flow of the liquid and leads to variation in the amounts delivered. The operators think that machine B tends to deliver a somewhat greater amount than machine A , no matter what the ambient temperature. This is being investigated by an experiment. A random sample of 10 results from the experiment is shown below. Each column of data is for a different ambient temperature.
Ambient temperature\(T _ { 1 }\)\(T _ { 2 }\)\(T _ { 3 }\)\(T _ { 4 }\)\(T _ { 5 }\)\(T _ { 6 }\)\(T _ { 7 }\)\(T _ { 8 }\)\(T _ { 9 }\)\(T _ { 10 }\)
Amount delivered by machine A246.2251.6252.0246.6258.4251.0247.5247.1248.1253.4
Amount delivered by machine B248.3252.6252.8247.2258.8250.0247.2247.9249.0254.5
  1. Use an appropriate \(t\) test to examine, at the \(5 \%\) level of significance, whether the mean amount delivered by machine B may be taken as being greater than that delivered by machine A , stating carefully your null and alternative hypotheses and the required distributional assumption.
  2. Using the data for machine A in the table above, provide a two-sided \(95 \%\) confidence interval for the mean amount delivered by this machine, stating the required distributional assumption. Explain whether you would conclude that the machine appears to be working correctly in terms of the nominal amount as set.
OCR MEI S3 2007 June Q3
18 marks Standard +0.3
3 The management of a large chain of shops aims to reduce the level of absenteeism among its workforce by means of an incentive bonus scheme. In order to evaluate the effectiveness of the scheme, the management measures the percentage of working days lost before and after its introduction for each of a random sample of 11 shops. The results are shown below.
ShopABCDEFGHIJK
\% days lost before3.55.03.53.24.54.94.16.06.88.16.0
\% days lost after1.84.32.94.54.45.83.56.76.45.45.1
  1. The management decides to carry out a \(t\) test to investigate whether there has been a reduction in absenteeism.
    1. State clearly the hypotheses that should be used together with any necessary assumptions.
    2. Carry out the test using a \(5 \%\) significance level.
  2. Find a 95\% confidence interval for the true mean percentage of days lost after the introduction of the incentive scheme and state any assumption needed. The management has set a target that the mean percentage should be 3.5. Do you think this has been achieved? Explain your answer.
OCR MEI S3 2008 June Q3
18 marks Standard +0.3
3
  1. A tea grower is testing two types of plant for the weight of tea they produce. A trial is set up in which each type of plant is grown at each of 8 sites. The total weight, in grams, of tea leaves harvested from each plant is measured and shown below.
    SiteABCDEFGH
    Type I225.2268.9303.6244.1230.6202.7242.1247.5
    Type II215.2242.1260.9241.7245.5204.7225.8236.0
    1. The grower intends to perform a \(t\) test to examine whether there is any difference in the mean yield of the two types of plant. State the hypotheses he should use and also any necessary assumption.
    2. Carry out the test using a \(5 \%\) significance level.
  2. The tea grower deals with many types of tea and employs tasters to rate them. The tasters do this by giving each tea a score out of 100. The tea grower wishes to compare the scores given by two of the tasters. Their scores for a random selection of 10 teas are as follows.
    TeaQRSTUVWXYZ
    Taster 169798563816585868977
    Taster 274759966756496949686
    Use a Wilcoxon test to examine, at the \(5 \%\) level of significance, whether it appears that, on the whole, the scores given to teas by these two tasters differ.
OCR S3 2015 June Q3
12 marks Standard +0.3
3 A tutor gave an assessment to 6 randomly chosen new eleven-year-old students. After each student had received 30 hours of tuition, they were given a second assessment. The scores are shown in the table.
Student\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)
1st assessment124121111113118119
2nd assessment127119114110120122
  1. Show that, at the \(5 \%\) significance level, there is insufficient evidence that students' scores are higher, on average, after tuition than before tuition. State a necessary assumption.
  2. Disappointed by this result, the tutor looked again at the first assessment. She discovered that the first assessment was too easy, in fact being a test for ten-year-olds, not eleven-year-olds. She decided to reduce each score for the first assessment by a constant integer \(k\). Find the least value of \(k\) for which there is evidence at the \(5 \%\) significance level that the students' scores have, on average, improved.
CAIE Further Paper 4 2020 Specimen Q3
8 marks Standard +0.3
3 Empø es at ap rtich ar comp y \(\mathbf { h }\) \textbackslash & b en wo kg seqnb s each \(\mathbf { d y }\) ,frm 9 am to 4 p .To try to red e ab en e,th cm \(\mathbf { p } \mathrm { y }\) d cid s to in rd e 'flex-time'ad all emp 甲 es to wo k th ir see nb s each d y at ay time b tween 7 am ad 9 p .Fo a rach sampe \(\mathbf { 6 } \mathbf { 0 }\) emp \(\boldsymbol { \varphi }\) es,th m b rs 6 b s of ab en e in th \(y\) arb fo e ad th \(y\) ar after th in rd tin 6 flex-time are g t rit t fb low ig ab e.
Emp \(\varphi\) e\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)
Befo e2364058360
After3302326300
Test,at th \textbackslash %sig ficance le l,wh th r th \(\mathbf { p }\) atin mean m br \(\mathbf { 6 }\) b s \(\mathbf { 6 }\) ab en e \(\mathbf { s }\) d creased b lw ig b in rd tim flex-time,s tatig pr ssm p it b tm ak.[ア
OCR S3 2010 January Q6
12 marks Standard +0.3
6 The deterioration of a certain drug over time was investigated as follows. The drug strength was measured in each of a random sample of 8 bottles containing the drug. These were stored for two years and the strengths were then re-measured. The original and final strengths, in suitable units, are shown in the following table.
Bottle12345678
Original strength8.79.49.28.99.68.29.98.8
Final strength8.19.09.08.89.38.09.58.5
  1. Stating any required assumption, test at the \(5 \%\) significance level whether the mean strength has decreased by more than 0.2 over the two years.
  2. Calculate a 95\% confidence interval for the mean reduction in strength over the two years.
OCR S3 2013 January Q2
8 marks Standard +0.3
2 A new running track has been developed and part of the testing procedure involves 7 randomly chosen athletes. They each run 100 m on both the old and new tracks.
The results are as follows.
Athlete1234567
Time on old track \(( s )\)12.210.311.513.011.811.711.9
Time on new track \(( s )\)11.110.511.012.611.010.912.0
The population mean times on the old and new tracks are denoted by \(\mu _ { \mathrm { O } }\) seconds and \(\mu _ { \mathrm { N } }\) seconds respectively. Stating any necessary assumption, carry out a suitable \(t\)-test of the null hypothesis \(\mu _ { \mathrm { O } } - \mu _ { \mathrm { N } } = 0\) against the alternative hypothesis \(\mu _ { \mathrm { O } } - \mu _ { \mathrm { N } } > 0\). Use a \(2 \frac { 1 } { 2 } \%\) significance level .
OCR S3 2012 June Q2
9 marks Standard +0.3
2 Four pairs of randomly chosen twins were each given identical puzzles to solve. The times taken (in seconds) are shown in the following table.
Twin pair1234
Time for first-born46384449
Time for second-born40413746
Stating any necessary assumption, test at the \(10 \%\) significance level whether there is a difference between the population mean times of first-born and second-born twins.
OCR MEI S3 2010 January Q3
18 marks Standard +0.3
3 Cholesterol is a lipid (fat) which is manufactured by the liver from the fatty foods that we eat. It plays a vital part in allowing the body to function normally. However, when high levels of cholesterol are present in the blood there is a risk of arterial disease. Among the factors believed to assist with achieving and maintaining low cholesterol levels are weight loss and exercise. A doctor wishes to test the effectiveness of exercise in lowering cholesterol levels. For a random sample of 12 of her patients, she measures their cholesterol levels before and after they have followed a programme of exercise. The measurements obtained are as follows.
PatientABCDEFGHIJKL
Before5.75.74.06.87.45.56.76.47.27.27.14.4
After5.84.05.25.76.05.05.84.27.35.26.44.1
  1. A \(t\) test is to be used in order to see if, on average, the exercise programme seems to be effective in lowering cholesterol levels. State the distributional assumption necessary for the test, and carry out the test using a \(1 \%\) significance level.
  2. A second random sample of 12 patients gives a \(95 \%\) confidence interval of \(( - 0.5380,1.4046 )\) for the true mean reduction (before - after) in cholesterol level. Find the mean and standard deviation for this sample. How might the doctor interpret this interval in relation to the exercise programme?
OCR MEI S3 2010 June Q3
18 marks Standard +0.3
3
  1. In order to prevent and/or control the spread of infectious diseases, the Government has various vaccination programmes. One such programme requires people to receive a booster injection at the age of 18. It is felt that the proportion of people receiving this booster could be increased and a publicity campaign is undertaken for this purpose. In order to assess the effectiveness of this campaign, health authorities across the country are asked to report the percentage of 18-year-olds receiving the booster before and after the campaign. The results for a randomly chosen sample of 9 authorities are as follows.
    AuthorityABCDEFGHI
    Before769888818684839380
    After829793778395919589
    This sample is to be tested to see whether the campaign appears to have been successful in raising the percentage receiving the booster.
    1. Explain why the use of paired data is appropriate in this context.
    2. Carry out an appropriate Wilcoxon signed rank test using these data, at the \(5 \%\) significance level.
  2. Benford's Law predicts the following probability distribution for the first significant digit in some large data sets.
    Digit123456789
    Probability0.3010.1760.1250.0970.0790.0670.0580.0510.046
    On one particular day, the first significant digits of the stock market prices of the shares of a random sample of 200 companies gave the following results.
    Digit123456789
    Frequency55342716151712159
    Test at the \(10 \%\) level of significance whether Benford's Law provides a reasonable model in the context of share prices.
OCR MEI S3 2012 June Q1
18 marks Standard +0.3
1 Technologists at a company that manufactures paint are trying to develop a new type of gloss paint with a shorter drying time than the current product. In order to test whether the drying time has been reduced, the technologists paint a square metre of each of the new and old paints on each of 10 different surfaces. The lengths of time, in hours, that each square metre takes to dry are as follows.
SurfaceABCDEFGHIJ
Old paint16.617.016.515.616.316.516.415.916.316.1
New paint15.916.316.315.915.516.616.116.016.215.6
  1. Explain why a paired sample is used in this context.
  2. The mean reduction in drying time is to be investigated. Why might a \(t\) test be appropriate in this context and what assumption needs to be made?
  3. Using a significance level of \(5 \%\), carry out a test to see if there appears to be any reduction in mean drying time.
  4. Find a 95\% confidence interval for the true mean reduction in drying time.
CAIE FP2 2010 June Q10
12 marks Standard +0.3
10 Carpal Tunnel syndrome is a condition which affects a person's ability to grip with their hands. Researchers tested a treatment for this syndrome which was applied to 8 randomly chosen patients. A pre-treatment and a post-treatment test of grip was given to each patient, with the following results, measured in kg.
Patient12345678
Pre-treatment grip24.329.528.028.521.528.725.126.3
Post-treatment grip28.334.630.331.621.529.826.027.5
Stating any required assumption, test, at the \(1 \%\) significance level, whether the mean grip of people with the syndrome increases after undergoing the treatment. It is given that there is evidence at the \(10 \%\) significance level that the mean grip increases by more than \(w \mathrm {~kg}\). Find an inequality for \(w\).
CAIE FP2 2010 November Q9
10 marks Standard +0.3
9 A national athletics coach suspects that, on average, 200-metre runners' indoor times exceed their outdoor times by more than 0.1 seconds. In order to test this, the coach randomly selects eight 200 -metre runners and records their indoor and outdoor times. The results, in seconds, are shown in the table.
Runner\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Indoor time21.521.820.921.221.421.421.221.0
Outdoor time21.121.720.720.921.321.021.120.8
Stating suitable hypotheses and any necessary assumption that you make, test the coach's suspicion at the 2.5\% level of significance.
Edexcel S4 2006 January Q5
13 marks Standard +0.3
5. Seven pipes of equal length are selected at random. Each pipe is cut in half. One piece of each pipe is coated with protective paint and the other is left uncoated. All of the pieces of pipe are buried to the same depth in various soils for 6 months. The table gives the percentage area of the pieces of pipe in the various soils that are subject to corrosion.
SoilABCDEFG
\% Corrosion
coated pipe
39404332423336
\% Corrosion
uncoated pipe
41366148424845
  1. Stating your hypotheses clearly and using a \(5 \%\) significance level, carry out a paired \(t\)-test to assess whether or not there is a difference between the mean percentage of corrosion on the coated pipes and the mean percentage of corrosion on the uncoated pipes.
    1. State an assumption that has been made in order to carry out this test.
    2. Comment on the validity of this assumption.
  2. State what difference would be made to the conclusion in part (a) if the test had been to determine whether or not the percentage of corrosion on the uncoated pipes was higher than the mean percentage of corrosion on the coated pipes. Justify your answer.
Edexcel S4 2004 June Q4
9 marks Standard +0.3
4. A doctor believes that the span of a person's dominant hand is greater than that of the weaker hand. To test this theory, the doctor measures the spans of the dominant and weaker hands of a random sample of 8 people. He subtracts the span of the weaker hand from that of the dominant hand. The spans, in mm , are summarised in the table below.
\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Dominant hand202251215235210195191230
Weaker hand195249218234211197181225
Test, at the 5\% significance level, the doctor's belief.
(9)
Edexcel S4 2005 June Q4
13 marks Standard +0.3
4. A farmer set up a trial to assess the effect of two different diets on the increase in the weight of his lambs. He randomly selected 20 lambs. Ten of the lambs were given \(\operatorname { diet } A\) and the other 10 lambs were given diet \(B\). The gain in weight, in kg , of each lamb over the period of the trial was recorded.
  1. State why a paired \(t\)-test is not suitable for use with these data.
  2. Suggest an alternative method for selecting the sample which would make the use of a paired \(t\)-test valid.
  3. Suggest two other factors that the farmer might consider when selecting the sample. The following paired data were collected.
    Diet \(A\)5674.66.15.76.27.453
    Diet \(B\)77.286.45.17.98.26.26.15.8
  4. Using a paired \(t\)-test, at the \(5 \%\) significance level, test whether or not there is evidence of a difference in the weight gained by the lambs using \(\operatorname { diet } A\) compared with those using \(\operatorname { diet } B\).
  5. State, giving a reason, which diet you would recommend the farmer to use for his lambs.
    (Total 13 marks)
Edexcel S4 2006 June Q3
9 marks Standard +0.3
3. As part of an investigation into the effectiveness of solar heating, a pair of houses was identified where the mean weekly fuel consumption was the same. One of the houses was then fitted with solar heating and the other was not. Following the fitting of the solar heating, a random sample of 9 weeks was taken and the table below shows the weekly fuel consumption for each house.
Week123456789
Without solar heating191918146753143
With solar heating1322111614102038
Units of fuel used per week
  1. Stating your hypotheses clearly, test, at the \(5 \%\) level of significance, whether or not there is evidence that the solar heating reduces the mean weekly fuel consumption.
    (8)
  2. State an assumption about weekly fuel consumption that is required to carry out this test.
Edexcel S4 2007 June Q1
9 marks Standard +0.3
  1. A medical student is investigating two methods of taking a person's blood pressure. He takes a random sample of 10 people and measures their blood pressure using an arm cuff and a finger monitor. The table below shows the blood pressure for each person, measured by each method.
Person\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)
Arm cuff140110138127142112122128132160
Finger monitor154112156152142104126132144180
  1. Use a paired \(t\)-test to determine, at the \(10 \%\) level of significance, whether or not there is a difference in the mean blood pressure measured using the two methods. State your hypotheses clearly.
    (8)
  2. State an assumption about the underlying distribution of measured blood pressure required for this test.
    (1)
Edexcel S4 2008 June Q3
8 marks Standard +0.3
  1. The weights, in grams, of mice are normally distributed. A biologist takes a random sample of 10 mice. She weighs each mouse and records its weight.
The ten mice are then fed on a special diet. They are weighed again after two weeks.
Their weights in grams are as follows:
MouseABCD\(E\)\(F\)G\(H\)I\(J\)
Weight before diet50.048.347.554.038.942.750.146.840.341.2
Weight after diet52.147.650.152.342.244.351.848.041.943.6
Stating your hypotheses clearly, and using a \(1 \%\) level of significance, test whether or not the diet causes an increase in the mean weight of the mice.
Edexcel S4 2009 June Q2
12 marks Standard +0.3
2. An emission-control device is tested to see if it reduces \(\mathrm { CO } _ { 2 }\) emissions from cars. The emissions from 6 randomly selected cars are measured with and without the device. The results are as follows.
CarA\(B\)CDE\(F\)
Emissions without device151.4164.3168.5148.2139.4151.2
Emissions with device148.9162.7166.9150.1140.0146.7
  1. State an assumption that needs to be made in order to carry out a \(t\)-test in this case.
  2. State why a paired \(t\)-test is suitable for use with these data.
  3. Using a \(5 \%\) level of significance, test whether or not there is evidence that the device reduces \(\mathrm { CO } _ { 2 }\) emissions from cars.
  4. Explain, in context, what a type II error would be in this case.
Edexcel S4 2011 June Q3
8 marks Standard +0.3
3. Manuel is planning to buy a new machine to squeeze oranges in his cafe and he has two models, at the same price, on trial. The manufacturers of machine \(B\) claim that their machine produces more juice from an orange than machine \(A\). To test this claim Manuel takes a random sample of 8 oranges, cuts them in half and puts one half in machine \(A\) and the other half in machine \(B\). The amount of juice, in ml , produced by each machine is given in the table below.
Orange12345678
Machine \(A\)6058555352515456
Machine \(B\)6160585255505258
Stating your hypotheses clearly, test, at the \(10 \%\) level of significance, whether or not the mean amount of juice produced by machine \(B\) is more than the mean amount produced by machine \(A\).
Edexcel S4 2012 June Q1
9 marks Standard +0.3
  1. A medical student is investigating whether there is a difference in a person's blood pressure when sitting down and after standing up. She takes a random sample of 12 people and measures their blood pressure, in mmHg , when sitting down and after standing up.
The results are shown below.
PersonA\(B\)CD\(E\)F\(G\)\(H\)I\(J\)\(K\)\(L\)
Sitting down135146138146141158136135146161119151
Standing up131147132140138160127136142154130144
The student decides to carry out a paired \(t\)-test to investigate whether, on average, the blood pressure of a person when sitting down is more than their blood pressure after standing up.
  1. State clearly the hypotheses that should be used and any necessary assumption that needs to be made.
  2. Carry out the test at the \(1 \%\) level of significance.
    \section*{L}
Edexcel S4 2014 June Q1
9 marks Standard +0.3
  1. In a trial for a new cough medicine, a random sample of 8 healthy patients were given steadily increasing doses of a pepper extract until they started coughing. The level of pepper that triggered the coughing was recorded. Each patient completed the trial after taking a standard cough medicine and, at a later time, after taking the new medicine. The results are given in the table below.
Level of pepper extract that triggers coughing
Patient\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Standard medicine461218312316279
New medicine5316134911343822
  1. Using a suitable test, at the \(5 \%\) level of significance, state whether or not, on the basis of this trial, you would recommend using the new medicine. State your hypotheses clearly.
  2. State an assumption needed to carry out this test.