Wilcoxon tests

123 questions · 15 question types identified

Sort by: Question count | Difficulty
Paired t-test

A question is this type if and only if it asks to perform a paired t-test on matched or repeated measures data, assuming normality of differences.

33 Standard +0.4
26.8% of questions
Show example »
2 Four pairs of randomly chosen twins were each given identical puzzles to solve. The times taken (in seconds) are shown in the following table.
Twin pair1234
Time for first-born46384449
Time for second-born40413746
Stating any necessary assumption, test at the \(10 \%\) significance level whether there is a difference between the population mean times of first-born and second-born twins.
View full question →
Easiest question Standard +0.3 »
7 In order to improve their mathematics results 10 students attended an intensive Summer School course. Each student took a test at the start of the course and a similar test at the end of the course. The table shows the scores achieved in each test.
Student12345678910
First test score37273847542752396223
Second test score47295044723763457632
It is desired to test whether there has been an increase in the population mean score.
  1. Explain why a two-sample \(t\)-test would not be appropriate.
  2. Stating any necessary assumptions, carry out a suitable \(t\)-test at the \(\frac { 1 } { 2 } \%\) significance level.
  3. The Summer School director claims that after taking the course the population mean score increases by more than 5 . Is there sufficient evidence for this claim?
View full question →
Hardest question Challenging +1.3 »
  1. A random sample of 8 people were given a new drug designed to help people sleep.
In a two-week period the drug was given for one week and a placebo (a tablet that contained no drug) was given for one week. In the first week 4 people, selected at random, were given the drug and the other 4 people were given the placebo. Those who were given the drug in the first week were given the placebo in the second week. Those who were given the placebo in the first week were given the drug in the second week. The mean numbers of hours of sleep per night for each of the people are shown in the table.
Person\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Hours of sleep with drug10.87.28.76.89.410.911.17.6
Hours of sleep with placebo10.06.59.05.68.78.09.86.8
  1. State one assumption that needs to be made in order to carry out a paired \(t\)-test.
  2. Stating your hypotheses clearly, test, at the \(1 \%\) level of significance, whether or not the drug increases the mean number of hours of sleep per night by more than 10 minutes. State the critical value for this test.
View full question →
Wilcoxon rank-sum test (Mann-Whitney U test)

A question is this type if and only if it asks to test for differences between two independent samples using the Wilcoxon rank-sum or Mann-Whitney U test.

26 Standard +0.3
21.1% of questions
Show example »
6 In a two-tail Wilcoxon rank-sum test, the sample sizes are 13 and 15. The sum of the ranks for the sample of size 13 is 135 . Carry out the test at the \(5 \%\) level of significance.
View full question →
Easiest question Moderate -0.8 »
6 In a two-tail Wilcoxon rank-sum test, the sample sizes are 13 and 15. The sum of the ranks for the sample of size 13 is 135 . Carry out the test at the \(5 \%\) level of significance.
View full question →
Hardest question Challenging +1.2 »
5 An historian has reason to believe that the average age at which men got married in the seventeenth century was higher in urban areas compared to rural areas. The historian collected data from a random sample of 8 men in an urban area and a random sample of 6 men in a rural area, all of whom were married in the seventeenth century. The results were as follows, given in the form years/months.
Urban:\(18 / 3\)\(18 / 5\)\(19 / 9\)\(20 / 7\)\(25 / 6\)\(34 / 6\)\(41 / 8\)\(46 / 3\)
Rural:\(18 / 0\)\(18 / 1\)\(18 / 4\)\(19 / 11\)\(22 / 2\)\(28 / 11\)
  1. Use an appropriate non-parametric method to test at the \(5 \%\) significance level whether the average age at marriage of men is higher in urban areas than in rural areas.
  2. When checking the data, the historian found that the age of one of the men, Mr X, which had been recorded as 28/11, had been wrongly recorded. When corrected, the result of the test in part (a) was unchanged. Determine the youngest age that Mr X could have been, given that it was not the same, in years and months, as that of any of the other men in the sample.
View full question →
Wilcoxon matched-pairs signed-rank test

A question is this type if and only if it asks to test for differences between two related/paired samples (same subjects measured twice, or matched pairs) using the Wilcoxon signed-rank test.

20 Standard +0.3
16.3% of questions
Show example »
4 A psychologist investigated the scores of pairs of twins on an aptitude test. Seven pairs of twins were chosen randomly, and the scores are given in the following table.
Elder twin65376079394088
Younger twin58396162502684
  1. Carry out an appropriate Wilcoxon test at the \(10 \%\) significance level to investigate whether there is evidence of a difference in test scores between the elder and the younger of a pair of twins.
  2. Explain the advantage in this case of a Wilcoxon test over a sign test.
View full question →
Easiest question Moderate -0.3 »
4 A company has many factories. It is concerned about incidents of trespassing and, in the hope of reducing if not eliminating these, has embarked on a programme of installing new fencing.
  1. Records for a random sample of 9 factories of the numbers of trespass incidents in typical weeks before and after installation of the new fencing are as follows.
    FactoryABCDEFGHI
    Number before installation81264142241314
    Number after installation6110118101154
    Use a Wilcoxon test to examine at the \(5 \%\) level of significance whether it appears that, on the whole, the number of trespass incidents per week is lower after the installation of the new fencing than before.
  2. Records are also available of the costs of damage from typical trespass incidents before and after the introduction of the new fencing for a random sample of 7 factories, as follows (in £).
    FactoryTUVWXYZ
    Cost before installation1215955464672356236550
    Cost after installation12681105784802417318620
    Stating carefully the required distributional assumption, provide a two-sided \(99 \%\) confidence interval based on a \(t\) distribution for the population mean difference between costs of damage before and after installation of the new fencing. Explain why this confidence interval should not be based on the Normal distribution.
View full question →
Hardest question Standard +0.8 »
2 A company wishes to buy a new lathe for making chair legs. Two models of lathe, 'Allegro' and 'Vivace', were trialled. The company asked 12 randomly selected employees to make a particular type of chair leg on each machine. The times, in seconds, for each employee are shown in the table.
Employee123456789101112
Time on Allegro162111194159202210183168165150185160
Time on Vivace182130193181192205186184192180178189
The company wishes to test whether there is any difference in average times for the two machines.
  1. State the circumstances under which a non-parametric test should be used.
  2. Use two different non-parametric tests and show that they lead to different conclusions at the 5\% significance level.
  3. State, with a reason, which conclusion is to be preferred.
View full question →
Wilcoxon signed-rank test (single sample)

A question is this type if and only if it asks to test whether a population median equals a specific value using a single sample of observations.

19 Standard +0.1
15.4% of questions
Show example »
1 A Wilcoxon signed-rank test is carried out at the \(5 \%\) level of significance on a random sample of size 32 . The hypotheses are \(\mathrm { H } _ { 0 } : m = m _ { 0 } , \mathrm { H } _ { 1 } : m < m _ { 0 }\) where \(m\) is the population median and \(m _ { 0 }\) is a specific numerical value. The value obtained for the test statistic \(T\) is 162 . Find the outcome of the test.
View full question →
Easiest question Easy -1.8 »
2
    1. Give two reasons why an investigator might need to take a sample in order to obtain information about a population.
    2. State two requirements of a sample.
    3. Discuss briefly the advantage of the sampling being random.
    1. Under what circumstances might one use a Wilcoxon single sample test in order to test a hypothesis about the median of a population? What distributional assumption is needed for the test?
    2. On a stretch of road leading out of the centre of a town, highways officials have been monitoring the speed of the traffic in case it has increased. Previously it was known that the median speed on this stretch was 28.7 miles per hour. For a random sample of 12 vehicles on the stretch, the following speeds were recorded. $$\begin{array} { l l l l l l l l l l l l } 32.0 & 29.1 & 26.1 & 35.2 & 34.4 & 28.6 & 32.3 & 28.5 & 27.0 & 33.3 & 28.2 & 31.9 \end{array}$$ Carry out a test, with a \(5 \%\) significance level, to see whether the speed of the traffic on this stretch of road seems to have increased on the whole.
      [0pt] [10]
View full question →
Hardest question Standard +0.8 »
6. A zoologist knows that the median body length of adults in a species of fire-bellied toads is 4.2 cm . The zoologist thinks he has discovered a new subspecies of fire-bellied toads. If there is sufficient evidence to suggest the median body length differs from 4.2 cm , he will continue his studies to confirm whether he has discovered a new subspecies. Otherwise, he will abandon his studies on fire-bellied toads. The lengths of 10 randomly selected adult toads from the group being investigated are given below. $$\begin{array} { l l l l l l l l l l } 5 \cdot 0 & 3 \cdot 2 & 4 \cdot 9 & 4 \cdot 0 & 3 \cdot 3 & 4 \cdot 2 & 6 \cdot 1 & 4 \cdot 3 & 4 \cdot 8 & 5 \cdot 9 \end{array}$$ Carry out a suitable Wilcoxon signed rank test at a significance level as close to \(1 \%\) as possible and give your conclusion in context.
View full question →
Critical region or test statistic properties

A question is this type if and only if it asks to find critical regions, critical values, possible values of test statistics, or theoretical properties of test statistics.

6 Standard +0.4
4.9% of questions
Show example »
8 The critical region for an \(r\) \% two-tailed Wilcoxon signed-rank test, based on a large sample of size \(n\), is \(\left\{ W _ { + } \leqslant 113 \right\} \cup \left\{ W _ { + } \geqslant 415 \right\}\).
  1. Show that \(n = 32\).
  2. Using a suitable approximation, determine the value of \(r\).
View full question →
Two-sample t-test

A question is this type if and only if it asks to perform a two-sample t-test comparing means of two independent groups, assuming normality and possibly equal variances.

5 Standard +0.3
4.1% of questions
Show example »
  1. The times, \(x\) seconds, taken by the competitors in the 100 m freestyle events at a school swimming gala are recorded. The following statistics are obtained from the data.
\cline { 2 - 4 } \multicolumn{1}{c|}{}No. of competitorsSample mean \(\overline { \boldsymbol { x } }\)\(\sum \boldsymbol { x } ^ { \mathbf { 2 } }\)
Girls883.155746
Boys788.956130
Following the gala, a mother claims that girls are faster swimmers than boys. Assuming that the times taken by the competitors are two independent random samples from normal distributions,
  1. test, at the \(10 \%\) level of significance, whether or not the variances of the two distributions are the same. State your hypotheses clearly.
  2. Stating your hypotheses clearly, test the mother's claim. Use a \(5 \%\) level of significance.
View full question →
Sign test

A question is this type if and only if it asks to perform or discuss the sign test as an alternative to other non-parametric tests.

3 Moderate -0.1
2.4% of questions
Show example »
1 Ten archers shot at targets with two types of bow. Their scores out of 100 are shown in the table.
Archer\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)
Bow type \(P\)95979285879290899877
Bow type \(Q\)91918890808893859484
  1. Use the sign test, at the \(5 \%\) level of significance, to test the hypothesis that bow type \(P\) is better than bow type \(Q\).
  2. Why would a Wilcoxon signed rank test, if valid, be a better test than the sign test?
View full question →
Spearman's rank correlation test

A question is this type if and only if it asks to test for rank correlation between two variables using Spearman's method.

1 Standard +0.3
0.8% of questions
Show example »
  1. A random sample of size \(n = 8\) of paired data is taken from a population. The data are plotted below. \includegraphics[max width=\textwidth, alt={}, center]{ba41c616-0805-4466-81b8-b985b0bdd94b-06_572_983_335_541}
Test, at the \(1 \%\) level of significance, whether or not there is evidence of a negative rank correlation between the two variables. You should state your hypotheses and critical value and show your working clearly.
View full question →
Confidence intervals (paired data)

A question is this type if and only if it asks to calculate a confidence interval for the mean difference in paired data.

1 Standard +0.3
0.8% of questions
Show example »
2. Every 6 months some engineers are tested to see if their times, in minutes, to assemble a particular component have changed. The times taken to assemble the component are normally distributed. A random sample of 8 engineers was chosen and their times to assemble the component were recorded in January and in July. The data are given in the table below.
Engineer\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
January1719222615281821
July1918252417251619
  1. Calculate a \(95 \%\) confidence interval for the mean difference in times.
  2. Use your confidence interval to state, giving a reason, whether or not there is evidence of a change in the mean time to assemble a component. State your hypotheses clearly.
View full question →
Justifying choice of test

A question is this type if and only if it asks to explain why a particular test (parametric vs non-parametric, paired vs unpaired) is or is not appropriate for given data.

1 Moderate -0.3
0.8% of questions
Show example »
6 A biologist is studying the effect of nutrients on the heights to which plants grow. A random sample of 24 similar young plants is divided into two equal groups \(A\) and \(B\). The plants in group \(A\) are fed with nutrients and water and the plants in group \(B\) are given only water. After four weeks, the height, in cm, of each plant is measured and the results are as follows.
Group \(A\)12.311.812.113.211.110.613.812.012.212.413.513.9
Group \(B\)11.710.810.911.311.212.611.010.511.912.510.711.6
The biologist decides to carry out a test at the \(5 \%\) significance level to test whether the nutrients have resulted in an increase in growth.
  1. She carries out a Wilcoxon rank-sum test. Give a reason why this is an appropriate choice of test.
  2. Carry out the Wilcoxon rank-sum test for these results.
    If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
View full question →
Type I and Type II errors

A question is this type if and only if it asks to explain or identify Type I or Type II errors in the context of a hypothesis test.

0
0.0% of questions
Sampling methods and design

A question is this type if and only if it asks about how to select samples (random, stratified, systematic) or design experiments (pairing, blocking, randomization).

0
0.0% of questions
Comparing parametric and non-parametric tests

A question is this type if and only if it asks to compare advantages/disadvantages of parametric tests versus non-parametric tests or to perform both and compare results.

0
0.0% of questions
Effect of data corrections on test outcome

A question is this type if and only if it asks whether correcting or changing one or more data values would alter the conclusion of a hypothesis test.

0
0.0% of questions
Stating assumptions for tests

A question is this type if and only if it asks to state the assumptions required for a particular statistical test to be valid.

0
0.0% of questions
Unclassified

Questions not yet assigned to a type.

8
6.5% of questions
Show 8 unclassified »
2 The times, in milliseconds, taken by a computer to perform a certain task were recorded on 10 randomly chosen occasions. The times were as follows. $$\begin{array} { l l l l l l l l l l } 6.44 & 6.16 & 5.62 & 5.82 & 6.51 & 6.62 & 6.19 & 6.42 & 6.34 & 6.28 \end{array}$$ It is claimed that the median time to complete the task is 6.4 milliseconds.
  1. Carry out a Wilcoxon signed-rank test at the \(5 \%\) significance level to test this claim.
  2. State an underlying assumption that is made when using a Wilcoxon signed-rank test.
5 Georgio has designed two new uniforms \(X\) and \(Y\) for the employees of an airline company. A random sample of 11 employees are each asked to assess each of the two uniforms for practicality and appearance, and to give a total score out of 100. The scores are given in the table.
Employee\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)\(K\)
Uniform \(X\)8274425960739498623650
Uniform \(Y\)7875635667829990724861
  1. Give a reason why a Wilcoxon signed-rank test may be more appropriate than a \(t\)-test for investigating whether there is any evidence of a preference for one of the uniforms.
  2. Carry out a Wilcoxon matched-pairs signed-rank test at the \(10 \%\) significance level.
6 A teacher at a large college gave a mathematical puzzle to all the students. The median time taken by a random sample of 24 students to complete the puzzle was 18.0 minutes. The students were then given practice in solving puzzles. Two weeks later, the students were given another mathematical puzzle of the same type as the first. The times, in minutes, taken by the random sample of 24 students to complete this puzzle are as follows.
18.217.516.415.120.526.519.223.2
17.918.825.819.917.716.217.316.6
17.120.120.312.616.021.422.718.4
The teacher claims that the practice has not made any difference to the average time taken to complete a puzzle of this type. Carry out a Wilcoxon signed-rank test, at the 10\% significance level, to test whether there is sufficient evidence to reject the teacher's claim.
If you use the following page to complete the answer to any question, the question number must be clearly shown.
4 A random sample of 13 technology companies is chosen and the numbers of employees in 2018 and in 2022 are recorded.
CompanyABCD\(E\)\(F\)G\(H\)IJ\(K\)\(L\)M
Number in 2018104191262349705143514942912863041104
Number in 20221062412722810125253215644924782941154
A researcher claims that there has been an increase in the median number of employees at technology companies between 2018 and 2022.
  1. Carry out a Wilcoxon matched-pairs signed-rank test, at the \(5 \%\) significance level, to test whether the data supports this claim.
    The researcher notices that the figures for company \(G\) have been recorded incorrectly. In fact, the number of employees in 2018 was 32 and the number of employees in 2022 was 35.
  2. Explain, with numerical justification, whether or not the conclusion of the test in part (a) remains the same.
3 A factory produces metal discs. The manager claims that the diameters of these discs have a median of 22.0 mm . The diameters, in mm , of a random sample of 12 discs produced by this factory are as follows. $$\begin{array} { l l l l l l l l l l l l } 22.4 & 20.9 & 22.8 & 21.5 & 23.2 & 22.9 & 23.9 & 21.7 & 19.8 & 23.6 & 22.6 & 23.0 \end{array}$$
  1. Carry out a Wilcoxon signed-rank test, at the \(10 \%\) significance level, to test whether there is any evidence against the manager's claim.
  2. State an assumption that is necessary for this test to be valid.
2 Metal rods produced by a certain factory are claimed to have a median breaking strength of 200 tonnes. For a random sample of 9 rods, the breaking strengths, measured in tonnes, were as follows. $$\begin{array} { l l l l l l l l l } 210 & 186 & 188 & 208 & 184 & 191 & 215 & 198 & 196 \end{array}$$ A scientist believes that the median breaking strength of metal rods produced by this factory is less than 200 tonnes.
  1. Use a Wilcoxon signed-rank test, at the \(5 \%\) significance level, to test whether there is evidence to support the scientist’s belief.
  2. Give a reason why a Wilcoxon signed-rank test is preferable to a sign test, when both are valid.
6 The blood cholesterol levels, measured in suitable units, of a random sample of 11 women and a random sample of 12 men are shown below.
Women51552421671522567513798238235
Men3112621703021753202202607235186333
Carry out a Wilcoxon rank-sum test, at the \(5 \%\) significance level, to test whether, on average, there is a difference in cholesterol levels between women and men.
If you use the following lined page to complete the answer(s) to any question(s), the question number(s) must be clearly shown.
3 A large college is holding a piano competition. Each student has played a particular piece of music and two judges have each awarded a mark out of 80 . The marks awarded to a random sample of 14 students are shown in the following table.
Student\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)\(K\)\(L\)\(M\)\(N\)
Judge 17954637469525057554263555648
Judge 27562607376413151455549506536
  1. One of the students claims that on average Judge 1 is awarding higher marks than Judge 2. Carry out a Wilcoxon matched-pairs signed-rank test at the 5\% significance level to test whether the data supports the student's claim.
  2. Give a reason why it is preferable to use a Wilcoxon matched-pairs signed-rank test in this situation rather than a paired sample \(t\)-test.