5.07b Sign test: and Wilcoxon signed-rank

84 questions

Sort by: Default | Easiest first | Hardest first
OCR Further Statistics 2022 June Q8
7 marks Challenging +1.8
8 The critical region for an \(r\) \% two-tailed Wilcoxon signed-rank test, based on a large sample of size \(n\), is \(\left\{ W _ { + } \leqslant 113 \right\} \cup \left\{ W _ { + } \geqslant 415 \right\}\).
  1. Show that \(n = 32\).
  2. Using a suitable approximation, determine the value of \(r\).
OCR Further Statistics 2023 June Q5
10 marks Challenging +1.2
5 An historian has reason to believe that the average age at which men got married in the seventeenth century was higher in urban areas compared to rural areas. The historian collected data from a random sample of 8 men in an urban area and a random sample of 6 men in a rural area, all of whom were married in the seventeenth century. The results were as follows, given in the form years/months.
Urban:\(18 / 3\)\(18 / 5\)\(19 / 9\)\(20 / 7\)\(25 / 6\)\(34 / 6\)\(41 / 8\)\(46 / 3\)
Rural:\(18 / 0\)\(18 / 1\)\(18 / 4\)\(19 / 11\)\(22 / 2\)\(28 / 11\)
  1. Use an appropriate non-parametric method to test at the \(5 \%\) significance level whether the average age at marriage of men is higher in urban areas than in rural areas.
  2. When checking the data, the historian found that the age of one of the men, Mr X, which had been recorded as 28/11, had been wrongly recorded. When corrected, the result of the test in part (a) was unchanged. Determine the youngest age that Mr X could have been, given that it was not the same, in years and months, as that of any of the other men in the sample.
OCR Further Statistics 2024 June Q4
6 marks Challenging +1.8
4
  1. Write down the number of ways of choosing 5 objects from 12 distinct objects.
  2. Each possible set of 5 different integers selected from the integers \(1,2 , \ldots , 12\) is obtained, and for each set, the sum of the 5 integers is found. The sum \(S\) can take values between 15 and 50 inclusive. Part of the frequency distribution of \(S\) is shown in the following table, together with the cumulative frequencies.
    S151617181920212223
    Frequency112357101317
    Cumulative Frequency12471219294259
    Use these numbers to determine the critical region for a 1-tail Wilcoxon rank-sum test at the \(2 \%\) significance level when \(m = 5\) and \(n = 7\).
  3. A student says that, for a Wilcoxon rank-sum test on samples of size \(m\) and \(n\), where \(m\) and \(n\) are large, the mean and variance of the test statistic \(R _ { m }\) are 200 and \(616 \frac { 2 } { 3 }\) respectively. Show that at least one of these values must be incorrect.
OCR Further Statistics 2021 November Q7
12 marks Standard +0.3
7 In a school opinion poll a random sample of 8 pupils were asked to rate school lunches on a scale of 0 to 20 . The results were as follows. \(\begin{array} { l l l l l l l l } 0 & 1 & 2 & 3 & 4 & 10 & 11 & 13 \end{array}\) After a new menu was introduced, the test was repeated with a different random sample of 8 pupils. The results were as follows. \(\begin{array} { l l l l l l l l } 7 & 8 & 9 & 14 & 15 & 17 & 19 & 20 \end{array}\)
  1. Carry out an appropriate Wilcoxon test at the \(5 \%\) significance level to test whether pupils' opinions of school lunches have changed. A statistics student tells the organisers of the opinion poll that it would have been better to have asked the same 8 pupils both times.
  2. Explain why the statistics student's suggestion would produce a better test.
  3. State which test should be used if the student's suggestion is followed.
  4. You are given that there are 12870 ways in which 8 different integers can be chosen from the integers 1 to 16 inclusive. Estimate the number of ways of selecting 8 different digits between 1 and 16 inclusive that have a sum less than or equal to the critical value used in the test in part (a).
OCR Further Statistics Specimen Q4
7 marks Standard +0.3
4 A psychologist investigated the scores of pairs of twins on an aptitude test. Seven pairs of twins were chosen randomly, and the scores are given in the following table.
Elder twin65376079394088
Younger twin58396162502684
  1. Carry out an appropriate Wilcoxon test at the \(10 \%\) significance level to investigate whether there is evidence of a difference in test scores between the elder and the younger of a pair of twins.
  2. Explain the advantage in this case of a Wilcoxon test over a sign test.
Edexcel S4 2006 January Q7
16 marks Standard +0.3
7. A psychologist gives a test to students from two different schools, \(A\) and \(B\). A group of 9 students is randomly selected from school \(A\) and given instructions on how to do the test.
A group of 7 students is randomly selected from school \(B\) and given the test without the instructions. The table shows the time taken, to the nearest second, to complete the test by the two groups.
\(A\)111212131415161717
\(B\)8101113131414
Stating your hypotheses clearly,
  1. test at the \(10 \%\) significance level, whether or not the variance of the times taken to complete the test by students from school \(A\) is the same as the variance of the times taken to complete the test by students from school \(B\). (You may assume that times taken for each school are normally distributed.)
  2. test at the \(5 \%\) significance level, whether or not the mean time taken to complete the test by students from school \(A\) is greater than the mean time taken to complete the test by students from school \(B\).
  3. Why does the result to part (a) enable you to carry out the test in part (b)?
  4. Give one factor that has not been taken into account in your analysis.
Edexcel S4 2011 June Q3
8 marks Standard +0.3
3. Manuel is planning to buy a new machine to squeeze oranges in his cafe and he has two models, at the same price, on trial. The manufacturers of machine \(B\) claim that their machine produces more juice from an orange than machine \(A\). To test this claim Manuel takes a random sample of 8 oranges, cuts them in half and puts one half in machine \(A\) and the other half in machine \(B\). The amount of juice, in ml , produced by each machine is given in the table below.
Orange12345678
Machine \(A\)6058555352515456
Machine \(B\)6160585255505258
Stating your hypotheses clearly, test, at the \(10 \%\) level of significance, whether or not the mean amount of juice produced by machine \(B\) is more than the mean amount produced by machine \(A\).
Edexcel S4 2013 June Q5
8 marks Standard +0.3
  1. Students studying for their Mathematics GCSE are assessed by two examination papers. A teacher believes that on average the score on paper I is more than 1 mark higher than the score on paper II. To test this belief the scores of 8 randomly selected students are recorded. The results are given in the table below.
Student\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Score on paper I5763688143655231
Score on paper II5362617844644329
Assuming that the scores are normally distributed and stating your hypotheses clearly, test at the \(5 \%\) level of significance whether or not there is evidence to support the teacher's belief.
Edexcel S4 2013 June Q2
10 marks Standard +0.3
2. Every 6 months some engineers are tested to see if their times, in minutes, to assemble a particular component have changed. The times taken to assemble the component are normally distributed. A random sample of 8 engineers was chosen and their times to assemble the component were recorded in January and in July. The data are given in the table below.
Engineer\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
January1719222615281821
July1918252417251619
  1. Calculate a \(95 \%\) confidence interval for the mean difference in times.
  2. Use your confidence interval to state, giving a reason, whether or not there is evidence of a change in the mean time to assemble a component. State your hypotheses clearly.
Edexcel S4 2014 June Q1
9 marks Standard +0.3
  1. In a trial for a new cough medicine, a random sample of 8 healthy patients were given steadily increasing doses of a pepper extract until they started coughing. The level of pepper that triggered the coughing was recorded. Each patient completed the trial after taking a standard cough medicine and, at a later time, after taking the new medicine. The results are given in the table below.
Level of pepper extract that triggers coughing
Patient\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Standard medicine461218312316279
New medicine5316134911343822
  1. Using a suitable test, at the \(5 \%\) level of significance, state whether or not, on the basis of this trial, you would recommend using the new medicine. State your hypotheses clearly.
  2. State an assumption needed to carry out this test.
Edexcel S4 2014 June Q4
9 marks Challenging +1.3
  1. A random sample of 8 people were given a new drug designed to help people sleep.
In a two-week period the drug was given for one week and a placebo (a tablet that contained no drug) was given for one week. In the first week 4 people, selected at random, were given the drug and the other 4 people were given the placebo. Those who were given the drug in the first week were given the placebo in the second week. Those who were given the placebo in the first week were given the drug in the second week. The mean numbers of hours of sleep per night for each of the people are shown in the table.
Person\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Hours of sleep with drug10.87.28.76.89.410.911.17.6
Hours of sleep with placebo10.06.59.05.68.78.09.86.8
  1. State one assumption that needs to be made in order to carry out a paired \(t\)-test.
  2. Stating your hypotheses clearly, test, at the \(1 \%\) level of significance, whether or not the drug increases the mean number of hours of sleep per night by more than 10 minutes. State the critical value for this test.
Edexcel S4 2015 June Q1
9 marks Standard +0.3
  1. The Sales Manager of a large chain of convenience stores is studying the sale of lottery tickets in her stores. She randomly selects 8 of her stores. From these stores she collects data for the total sales of lottery tickets in the previous January and July. The data are shown below
StoreABCDEFGH
January ticket sales \(( \boldsymbol { \pounds } )\)10801639710110891510661322819
July ticket sales \(( \boldsymbol { \pounds } )\)11131702831104886110901303852
  1. Use a paired \(t\)-test to determine whether or not there is evidence, at the \(5 \%\) level of significance, that the mean sales of lottery tickets in this chain's stores are higher in July than in January. You should state your hypotheses and show your working clearly.
  2. State what assumption the Sales Manager needs to make about the sales of lottery tickets in her stores for the test in part (a) to be valid.
Edexcel S4 2016 June Q1
9 marks Standard +0.3
  1. A new diet has been designed. Its designers claim that following the diet for a month will result in a mean weight loss of more than 2 kg . In a trial, a random sample of 10 people followed the new diet for a month. Their weights, in kg, before starting the diet and their weights after following the diet for a month were recorded. The results are given in the table below.
Person\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)
Weight before diet (kg)96110116981219198106110116
Weight after diet (kg)91101111961219190101104110
  1. Using a suitable \(t\)-test, at the \(5 \%\) level of significance, state whether or not the trial supports the designers' claim. State your hypotheses and show your working clearly.
  2. State an assumption necessary for the test in part (a).
OCR MEI Further Statistics B AS 2022 June Q4
10 marks Standard +0.3
4 A wood contains a large number of mature beech trees. The diameters in centimetres of a random sample of 10 of these trees are as follows. \(\begin{array} { l l l l l l l l l l } 82.6 & 79.2 & 77.8 & 38.4 & 88.1 & 32.2 & 26.5 & 23.4 & 94.3 & 104.2 \end{array}\) A tree surgeon wants to know if the average diameter of mature beech trees in this wood is 50 cm . The tree surgeon produces a Normal probability plot for these data. \includegraphics[max width=\textwidth, alt={}, center]{4caa7409-cb32-41da-ad64-012a45753296-4_796_1230_589_230}
  1. Explain why the tree surgeon should not carry out a test based on the \(t\) distribution.
  2. Carry out a suitable test at the \(5 \%\) significance level to investigate whether the average diameter of mature beech trees in this wood is 50 cm .
OCR MEI Further Statistics B AS 2021 November Q4
9 marks Standard +0.3
4 John regularly downloads podcasts onto his mobile phone. From past experience he knows that the average time to download one 30 -minute podcast is 12.7 s . He believes that this time has recently increased. At each of 12 randomly chosen times, he downloads a 30-minute podcast. The times in seconds to download the 12 podcasts are as follows. \(\begin{array} { l l l l l l l l l l l } 12.63 & 13.24 & 11.73 & 14.91 & 13.17 & 13.53 & 12.33 & 14.27 & 11.48 & 13.51 & 13.05 \end{array} 13.83\)
  1. Given that it is not known whether the times are Normally distributed, carry out a suitable test at the \(5 \%\) significance level to investigate whether the average download time has increased.
  2. What assumption is required to carry out the test in part (a)?
OCR MEI Further Statistics B AS Specimen Q4
8 marks Standard +0.3
4 An online encyclopedia claims that the average mass of an adult European hedgehog is 720 g . In an investigation to check this average figure, the masses in grams of twelve randomly chosen adult European hedgehogs are measured and shown below.
705730720691718680
731723745708724736
  1. What assumption is required to carry out a Wilcoxon test in this situation?
  2. Given that this assumption is met, carry out a 2 -tail Wilcoxon test at the \(5 \%\) level to test whether the median mass is 720 g . You should state your hypotheses and complete the table of calculations in the Printed Answer Booklet.
OCR MEI Further Statistics Major 2019 June Q8
13 marks Standard +0.3
8 A student doing a school project wants to test a claim which she read in a newspaper that drinking a cup of tea will improve a person's arithmetic skills.
She chooses 13 students from her school and gets each of them to drink a cup of tea. She then gives each of them an arithmetic test. She knows that the average score for this test in students of the same age group as those she has chosen is 33.5.
The scores of the students she tests, arranged in ascending order, are as follows. \(\begin{array} { l l l l l l l l l l l l l } 26 & 28 & 29 & 30 & 31 & 32 & 34 & 42 & 49 & 54 & 55 & 56 & 61 \end{array}\) The student decides to use software to draw a Normal probability plot for these data, and to carry out a Normality test as shown in Fig. 8. \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{3a89edc4-ac93-4691-ade8-4d4665b55202-09_536_1234_792_244} \captionsetup{labelformat=empty} \caption{Fig. 8}
\end{figure}
  1. The student uses the output from the software to help in deciding on a suitable hypothesis test to use for investigating the claim about drinking tea.
    Explain what the student should conclude.
  2. The student's teacher agrees with the student's choice of hypothesis test, but says that even this test may not be valid as there may be some unsatisfactory features in the student's project. Give three features that the teacher might identify as unsatisfactory.
  3. Assuming that the student's procedures can be justified, carry out an appropriate test at the \(5 \%\) significance level to investigate the claim about drinking tea.
OCR MEI Further Statistics Major 2022 June Q11
13 marks Standard +0.3
11 A particular dietary supplement, when taken for a period of 1 month, is claimed to increase lean body mass of adults by an average of 1 kg . A researcher believes that this claim overestimates the increase. She selects a random sample of 10 adults who then each take the supplement for a month. The increases in lean body masses in kg are as follows. $$\begin{array} { l l l l l l l l l l } - 0.84 & - 0.76 & - 0.16 & 0.43 & 1.31 & 1.32 & 1.47 & 1.64 & 1.93 & 2.14 \end{array}$$ A Normal probability plot and the \(p\)-value of the Kolmogorov-Smirnov test for these data are shown below. \includegraphics[max width=\textwidth, alt={}, center]{77eabbd6-a058-457f-9601-d66f3c2db005-09_575_1485_689_242}
  1. The researcher decides to carry out a hypothesis test in order to investigate the claim. Comment on the type of hypothesis test that should be used. You should refer to
WJEC Further Unit 5 2023 June Q4
12 marks Standard +0.3
4. Llŷr believes that he will have more social media followers by appearing on a certain Welsh television show. To investigate his belief, he collects data on 9 randomly selected contestants who have appeared on the show. Llŷr records the number of social media followers one week before and one week after the contestants appeared on the show. The data he collects are shown in the table below.
ContestantABCDEFGH1
Before48010080344351781876741457
After8419987513449545428201011644
    1. Carry out a Wilcoxon signed-rank test on this data set, at a significance level as close to 10\% as possible.
    2. Suggest a possible course of action that Llŷr might take.
  1. Give two reasons why the Wilcoxon signed-rank test is appropriate in this case.
WJEC Further Unit 5 2023 June Q6
7 marks Standard +0.3
6. A triathlon race organiser wishes to know whether competitors who are members of a triathlon club race more frequently than competitors who are not members of a triathlon club. Six competitors from a triathlon club and six competitors who are not members of a triathlon club are selected at random. The table below shows the number of triathlon races they each entered last year.
Club
members
11412537
Not club
members
294086
  1. Use a Mann-Whitney U test at a significance level as close to \(5 \%\) as possible to carry out the race organiser's investigation.
  2. Briefly explain why a Wilcoxon signed-rank test is not appropriate in this case.
Edexcel FS2 2019 June Q5
7 marks Standard +0.3
5 Alexa believes that students are equally likely to achieve the same percentage score on each of two tests, paper I and paper II. She randomly selects 8 students and gives them each paper I and paper II. The percentage scores for each paper are recorded. The following paired data are collected.
Student\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
Paper I (\%)7070848064656590
Paper II (\%)6476727468645876
Test, at the \(1 \%\) significance level, whether or not there is evidence to support Alexa's belief. State your hypotheses clearly and show your working.
OCR S4 2010 June Q3
7 marks Challenging +1.8
  1. Assuming that all rankings are equally likely, show that \(\mathrm { P } ( R \leqslant 17 ) = \frac { 2 } { 231 }\). The marks of 5 randomly chosen students from School \(A\) and 6 randomly chosen students from School \(B\), who took the same examination, achieving different marks, were ranked. The rankings are shown in the table.
    Rank1234567891011
    School\(A\)\(A\)\(A\)\(B\)\(A\)\(A\)\(B\)\(B\)\(B\)\(B\)\(B\)
  2. For a Wilcoxon rank-sum test, obtain the exact smallest significance level for which there is evidence of a difference in performance at the two schools.
OCR Further Statistics 2018 March Q4
9 marks Moderate -0.8
4 Sheena travels to school by bus. She records the number of minutes, \(T\), that her bus is late on each of 32 days. She believes that on average \(T\) is greater than 5, and she carries out a significance test at the \(5 \%\) level.
  1. State a condition needed for a Wilcoxon test to be valid in this case. Assume now that this condition is satisfied.
  2. State an advantage of using a Wilcoxon test rather than a sign test.
  3. Calculate the critical region for the test, in terms of a variable which should be defined.
OCR Further Statistics 2018 September Q8
8 marks Standard +0.3
8 In an experiment to investigate the effect of background music in carrying out work, ten students were each given a task. Five of the students did the task in silence and the other five did the task with background music. The scores on the tasks were as follows.
Silence4346555861
Background music1931385270
  1. Use a Wilcoxon rank-sum test to test at the 10\% level whether the presence of background music affects scores.
  2. A statistician suggests that the experiment is redesigned so that each student takes one task in silence and another task with background music. The differences in the test scores would then be analysed using a paired-sample method. State an advantage in redesigning the experiment in this way.
OCR Further Statistics 2018 December Q6
15 marks Standard +0.3
6 The reaction times, in milliseconds, of all adult males in a standard experiment have a symmetrical distribution with mean and median both equal to 700 and standard deviation 125. The reaction times of a random sample of 6 international athletes are measured and the results are as follows: \(\begin{array} { l l l l l l } 702 & 631 & 540 & 714 & 575 & 480 \end{array}\) It is required to test whether international athletes have a mean reaction time which is less than 700.
  1. Assume first that the reaction times of international athletes have the distribution \(\mathrm { N } \left( \mu , 125 ^ { 2 } \right)\). Test at the \(5 \%\) significance level whether \(\mu < 700\).
  2. Now assume only that the distribution of the data is symmetrical, but not necessarily normal.
    1. State with a reason why a Wilcoxon test is preferable to a sign test.
    2. Use an appropriate Wilcoxon test at the \(5 \%\) significance level to test whether the median reaction time of international athletes is less than 700 .
  3. Explain why the significance tests in part (a) and part (b)(ii) could produce different results.