2.05a Hypothesis testing language: null, alternative, p-value, significance

282 questions

Sort by: Default | Easiest first | Hardest first
Edexcel S4 2004 June Q3
9 marks Challenging +1.2
3. It is suggested that a Poisson distribution with parameter \(\lambda\) can model the number of currants in a currant bun. A random bun is selected in order to test the hypotheses \(\mathrm { H } _ { 0 } : \lambda = 8\) against \(\mathrm { H } _ { 1 } : \lambda \neq 8\), using a \(10 \%\) level of significance.
  1. Find the critical region for this test, such that the probability in each tail is as close as possible to \(5 \%\).
  2. Given that \(\lambda = 10\), find
    1. the probability of a type II error,
    2. the power of the test.
      (4)
Edexcel S4 2005 June Q5
12 marks Standard +0.3
5. Define
  1. a Type I error,
  2. the size of a test. Jane claims that she can read Alan's mind. To test this claim Alan randomly chooses a card with one of 4 symbols on it. He then concentrates on the symbol. Jane then attempts to read Alan's mind by stating what symbol she thinks is on the card. The experiment is carried out 8 times and the number of times \(X\) that Jane is correct is recorded. The probability of Jane stating the correct symbol is denoted by \(p\).
    To test the hypothesis \(\mathrm { H } _ { 0 } : p = 0.25\) against \(\mathrm { H } _ { 1 } : p > 0.25\), a critical region of \(X > 6\) is used.
  3. Find the size of this test.
  4. Show that the power function of this test is \(8 p ^ { 7 } - 7 p ^ { 8 }\). Given that \(p = 0.3\), calculate
  5. the power of this test,
  6. the probability of a Type II error.
  7. Suggest two ways in which you might reduce the probability of a Type II error.
Edexcel S4 2006 June Q5
17 marks Challenging +1.8
5. Rolls of cloth delivered to a factory contain defects at an average rate of \(\lambda\) per metre. A quality assurance manager selects a random sample of 15 metres of cloth from each delivery to test whether or not there is evidence that \(\lambda > 0.3\). The criterion that the manager uses for rejecting the hypothesis that \(\lambda = 0.3\) is that there are 9 or more defects in the sample.
  1. Find the size of the test. Table 1 gives some values, to 2 decimal places, of the power function of this test. \begin{table}[h]
    \(\lambda\)0.40.50.60.70.80.91.0
    Power0.150.34\(r\)0.720.850.920.96
    \captionsetup{labelformat=empty} \caption{Table 1}
    \end{table}
  2. Find the value of \(r\). The manager would like to design a test, of whether or not \(\lambda > 0.3\), that uses a smaller length of cloth. He chooses a length of 10 m and requires the probability of a type I error to be less than \(10 \%\).
  3. Find the criterion to reject the hypothesis that \(\lambda = 0.3\) which makes the test as powerful as possible.
  4. Hence state the size of this second test. Table 2 gives some values, to 2 decimal places, of the power function for the test in part (c). \begin{table}[h]
    \(\lambda\)0.40.50.60.70.80.91.0
    Power0.210.380.550.70\(s\)0.880.93
    \captionsetup{labelformat=empty} \caption{Table 2}
    \end{table}
  5. Find the value of \(s\).
  6. Using the same axes, on graph paper draw the graphs of the power functions of these two tests.
    1. State the value of \(\lambda\) where the graphs cross.
    2. Explain the significance of \(\lambda\) being greater than this value. The cost of wrongly rejecting a delivery of cloth with \(\lambda = 0.3\) is low. Deliveries of cloth with \(\lambda > 0.7\) are unusual.
  7. Suggest, giving your reasons, which the test manager should adopt.
    (2)
Edexcel S4 2014 June Q2
7 marks Challenging +1.2
2. (a) Define
  1. a Type I error,
  2. a Type II error. Rolls of material, manufactured by a machine, contain defects at a mean rate of 6 per roll. The machine is modified. A single roll is selected at random and a test is carried out to see whether or not the mean number of defects per roll has decreased. The significance level is chosen to be as close as possible to \(5 \%\).
    (b) Calculate the probability of a Type I error for this test.
    (c) Given that the true mean number of defects per roll of material made by the machine is now 4, calculate the probability of a Type II error.
Edexcel S4 2017 June Q2
8 marks Standard +0.8
  1. The number of accidents per year in Daftstown follows a Poisson distribution with mean \(\lambda\). The value of \(\lambda\) has previously been 6 but Jonty claims that since the Council increased the speed limit, the value of \(\lambda\) has increased.
Jonty records the number of accidents in Daftstown in the first year after the speed limit was increased. He plans to test, at the \(5 \%\) significance level, whether or not there is evidence of an increase in the mean number of accidents in Daftstown per year.
  1. Stating your hypotheses clearly, calculate the probability of a Type I error for this test. Given that there were 9 accidents in the first year after the speed limit was increased,
  2. state, giving a reason, whether or not there is evidence to support Jonty's claim.
  3. Given that the value of \(\lambda\) has actually increased to 8, calculate the probability of drawing the conclusion, using this test, that the number of accidents per year in Daftstown has not increased.
Edexcel S4 2018 June Q5
11 marks Challenging +1.2
  1. A machine makes posts. The length of a post is normally distributed with unknown mean \(\mu\) and standard deviation 4 cm .
A random sample of size \(n\) is taken to test, at the \(5 \%\) significance level, the hypotheses $$\mathrm { H } _ { 0 } : \mu = 150 \quad \mathrm { H } _ { 1 } : \mu > 150$$
  1. State the probability of a Type I error for this test. The manufacturer requires the probability of a Type II error to be less than 0.1 when the actual value of \(\mu\) is 152
  2. Calculate the minimum value of \(n\).
Edexcel S4 Q3
13 marks Standard +0.8
3. A certain vaccine is known to be only \(70 \%\) effective against a particular virus; thus \(30 \%\) of those vaccinated will actually catch the virus. In order to test whether or not a new and more expensive vaccine provides better protection against the same virus, a random sample of 30 people were chosen and given the new vaccine. If fewer than 6 people contracted the virus the new vaccine would be considered more effective than the current one.
  1. Write down suitable hypotheses for this test.
  2. Find the probability of making a Type I error.
  3. Find the power of this test if the new vaccine is
    1. \(80 \%\) effective,
    2. \(90 \%\) effective. An independent research organisation decided to test the new vaccine on a random sample of 50 people to see if it could be considered more than \(70 \%\) effective. They required the probability of a Type I error to be as close as possible to 0.05 .
  4. Find the critical region for this test.
  5. State the size of this critical region.
  6. Find the power of this test if the new vaccine is
    1. \(80 \%\) effective,
    2. \(90 \%\) effective.
  7. Give one advantage and one disadvantage of the second test.
OCR MEI Further Statistics Minor 2021 November Q4
14 marks Standard +0.3
4 A scientist is investigating sea salinity (the level of salt in the sea) in a particular area. She wishes to check whether satellite measurements, \(y\), of salinity are similar to those directly measured, \(x\). Both variables are measured in parts per thousand in suitable units. The scientist obtains a random sample of 10 values of \(x\) and the related values of \(y\). Below is a screenshot of a scatter diagram to illustrate the data. She decides to carry out a hypothesis test to check if there is any correlation between direct measurement, \(x\), and satellite measurement, \(y\). \includegraphics[max width=\textwidth, alt={}, center]{691e8b55-e9a1-4fff-b9ee-a71ff1f73ead-5_830_837_589_246}
  1. Explain why the scientist might decide to carry out a test based on the product moment correlation coefficient. Summary statistics for \(x\) and \(y\) are as follows. \(n = 10 \quad \sum x = 351.9 \quad \sum y = 350.0 \quad \sum x ^ { 2 } = 12384.5 \quad \sum y ^ { 2 } = 12251.2 \quad \sum \mathrm { xy } = 12317.2\)
  2. In this question you must show detailed reasoning. Calculate the product moment correlation coefficient.
  3. Carry out a hypothesis test at the \(5 \%\) significance level to investigate whether there is positive correlation between directly measured and satellite measured salinity levels.
  4. Explain why it would be preferable to use a larger sample. The scientist is also interested in whether there is any correlation between salinity and numbers of a particular species of shrimp in the water. She takes a large sample and finds that the product moment correlation coefficient for this sample is 0.165 . The result of a test based on this sample is to reject the null hypothesis and conclude that there is correlation between salinity and numbers of shrimp.
  5. Comment on the outcome of the hypothesis test with reference to the effect size of 0.165 .
OCR MEI Further Statistics Major 2022 June Q8
14 marks Standard +0.3
8 A swimming coach is investigating whether there is correlation between the times taken by teenage swimmers to swim 50 m Butterfly and 50 m Freestyle. The coach selects a random sample of 11 teenage swimmers and records the times that each of them take for each event. The spreadsheet shows the data, together with a scatter diagram to illustrate the data. \includegraphics[max width=\textwidth, alt={}, center]{77eabbd6-a058-457f-9601-d66f3c2db005-06_712_1465_456_274}
  1. In the scatter diagram, Butterfly times have been plotted on the horizontal axis and Freestyle times on the vertical axis. A student states that the variables should have been plotted the other way around. Explain whether the student is correct. The student decides to carry out a hypothesis test to investigate whether there is any correlation between the times taken for the two events.
  2. Explain why the student decides to carry out a test based on Spearman's rank correlation coefficient.
  3. In this question you must show detailed reasoning. Carry out the test at the 5\% significance level.
  4. The student concludes that there is definitely no correlation between the times. Comment on the student's conclusion.
Edexcel FS1 2019 June Q5
12 marks Challenging +1.2
  1. Information was collected about accidents on the Seapron bypass. It was found that the number of accidents per month could be modelled by a Poisson distribution with mean 2.5 Following some work on the bypass, the numbers of accidents during a series of 3-month periods were recorded. The data were used to test whether or not there was a change in the mean number of accidents per month.
    1. Stating your hypotheses clearly and using a \(5 \%\) level of significance, find the critical region for this test. You should state the probability in each tail.
    2. State P(Type I error) using this test.
    Data from the series of 3-month periods are recorded for 2 years.
  2. Find the probability that at least 2 of these 3-month periods give a significant result. Given that the number of accidents per month on the bypass, after the work is completed, is actually 2.1 per month,
  3. find P (Type II error) for the test in part (a)
Edexcel FS1 2021 June Q7
8 marks Challenging +1.8
  1. A manufacturer has a machine that produces lollipop sticks.
The length of a lollipop stick produced by the machine is normally distributed with unknown mean \(\mu\) and standard deviation 0.2 Farhan believes that the machine is not working properly and the mean length of the lollipop sticks has decreased.
He takes a random sample of size \(n\) to test, at the 1\% level of significance, the hypotheses $$\mathrm { H } _ { 0 } : \mu = 15 \quad \mathrm { H } _ { 1 } : \mu < 15$$
  1. Write down the size of this test. Given that the actual value of \(\mu\) is 14.9
    1. calculate the minimum value of \(n\) such that the probability of a Type II error is less than 0.05
      Show your working clearly.
    2. Farhan uses the same sample size, \(n\), but now carries out the test at a \(5 \%\) level of significance. Without doing any further calculations, state how this would affect the probability of a Type II error.
OCR MEI Paper 2 2020 November Q12
15 marks Standard +0.3
  1. Given that \(q < 2 p\), determine the values of \(p\) and \(q\).
  2. The spinner is spun 10 times. Calculate the probability that exactly one 5 is obtained. Elaine's teacher believes that the probability that the spinner shows a 1 is greater than 0.2 . The spinner is spun 100 times and gives a score of 1 on 28 occasions.
  3. Conduct a hypothesis test at the \(5 \%\) level to determine whether there is any evidence to suggest that the probability of obtaining a score of 1 is greater than 0.2 .
OCR H240/02 2018 March Q9
10 marks Standard +0.3
9 A bag contains 100 black discs and 200 white discs. Paula takes five discs at random, without replacement. She notes the number \(X\) of these discs that are black.
  1. Find \(\mathrm { P } ( X = 3 )\). Paula decides to use the binomial distribution as a model for the distribution of \(X\).
  2. Explain why this model will give probabilities that are approximately, but not exactly, correct.
  3. Paula uses the binomial model to find an approximate value for \(\mathrm { P } ( X = 3 )\). Calculate the percentage by which her answer will differ from the answer in part (ii). Paula now assumes that the binomial distribution is a good model for \(X\). She uses a computer simulation to generate 1000 values of \(X\). The number of times that \(X = 3\) occurs is denoted by \(Y\).
  4. Calculate estimates of the limits between which two thirds of the values of \(Y\) will lie.
OCR H240/02 2018 March Q10
12 marks Moderate -0.8
10 A researcher is investigating the actual lengths of time that patients spend at their appointments with the doctors at a certain clinic. There are 12 doctors at the clinic, and each doctor has 24 appointments per day. The researcher plans to choose a sample of 24 appointments on a particular day.
  1. The researcher considers the following two methods for choosing the sample. Method A: Choose a random sample of 24 appointments from the 288 on that day.
    Method B: Choose one doctor's 1st and 2nd appointments. Choose another doctor's 3rd and 4th appointments and so on until the last doctor's 23rd and 24th appointments. For each of A and B state a disadvantage of using this method. Appointments are scheduled to last 10 minutes. The researcher suspects that the actual times that patients spend are more than 10 minutes on average. To test this suspicion, he uses method A , and takes a random sample of 24 appointments. He notes the actual time spent for each appointment and carries out a hypothesis test at the \(1 \%\) significance level.
  2. Explain why a 1-tail test is appropriate. The population mean of the actual times that patients spend at their appointments is denoted by \(\mu\) minutes.
  3. Assuming that \(\mu = 10\), state the probability that the conclusion of the test will be that \(\mu\) is not greater than 10 . The actual lengths of time, in minutes, that patients spend for their appointments may be assumed to have a normal distribution with standard deviation 3.4.
    [0pt]
  4. Given that the total length of time spent for the 24 appointments is 285 minutes, carry out the test. [7]
  5. In part (iv) it was necessary to use the fact that the sample mean is normally distributed. Give a reason why you know that this is true in this case.
OCR H240/02 2018 September Q11
8 marks Moderate -0.3
11 In an experiment involving a bivariate distribution ( \(X , Y\) ) a random sample of 7 pairs of values was obtained and Pearson's product-moment correlation coefficient \(r\) was calculated for these values.
  1. The value of \(r\) was found to be 0.894 . Use the table below to test, at the \(5 \%\) significance level, whether there is positive linear correlation in the population, stating your hypotheses and conclusion clearly.
    1-tail test 2-tail test5\%2.5\%1\%0.5\%
    10\%5\%2\%1\%
    \(n\)
    1----
    2----
    30.98770.99690.99950.9999
    40.90000.95000.98000.9900
    50.80540.87830.93430.9587
    60.72930.81140.88220.9587
    70.66940.75450.83290.9745
    80.62150.70670.78870.8343
    90.58820.66640.74980.7977
    100.54940.63190.71550.7646
    Scatter diagrams for four sets of bivariate data, are shown. \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{85de9a39-f8be-40ee-b0c8-e2e632be93d8-8_380_371_301_191} \captionsetup{labelformat=empty} \caption{Diagram A}
    \end{figure} \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{85de9a39-f8be-40ee-b0c8-e2e632be93d8-8_373_373_301_628} \captionsetup{labelformat=empty} \caption{Diagram B}
    \end{figure} \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{85de9a39-f8be-40ee-b0c8-e2e632be93d8-8_378_373_301_1064} \captionsetup{labelformat=empty} \caption{Diagram C}
    \end{figure} \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{85de9a39-f8be-40ee-b0c8-e2e632be93d8-8_378_373_301_1503} \captionsetup{labelformat=empty} \caption{Diagram D}
    \end{figure} It is given that \(r = 0.894\) for one of these diagrams.
  2. For each of the other diagrams, state how you can tell that \(r \neq 0.894\).
OCR H240/02 2018 September Q12
8 marks Moderate -0.3
12 In the past, the time spent by customers in a certain shop had mean 10.5 minutes and standard deviation 4.2 minutes. Following a change of layout in the shop, the mean time spent in the shop by a random sample of 50 customers is found to be 12.0 minutes.
  1. Assuming that the standard deviation is unchanged, test at the \(1 \%\) significance level whether the mean time spent by customers in the shop has changed.
  2. Another random sample of 50 customers is chosen and a similar test at the \(1 \%\) significance level is carried out. Given that the population mean time has not changed, state the probability that the conclusion of the test will be that the population mean time has changed.
OCR S2 Q7
10 marks Standard +0.3
7 The random variable \(X\) has the distribution \(\mathrm { N } \left( \mu , 8 ^ { 2 } \right)\). The mean of a random sample of 12 observations of \(X\) is denoted by \(\bar { X }\). A test is carried out at the \(1 \%\) significance level of the null hypothesis \(\mathrm { H } _ { 0 } : \mu = 80\) against the alternative hypothesis \(\mathrm { H } _ { 1 } : \mu < 80\). The test is summarised as follows: 'Reject \(\mathrm { H } _ { 0 }\) if \(\bar { X } < c\); otherwise do not reject \(\mathrm { H } _ { 0 } { } ^ { \prime }\).
  1. Calculate the value of \(c\).
  2. Assuming that \(\mu = 80\), state whether the conclusion of the test is correct, results in a Type I error, or results in a Type II error if:
    1. \(\bar { X } = 74.0\),
    2. \(\bar { X } = 75.0\).
    3. Independent repetitions of the above test, using the value of \(c\) found in part (i), suggest that in fact the probability of rejecting the null hypothesis is 0.06 . Use this information to calculate the value of \(\mu\).
Edexcel AS Paper 2 2018 June Q3
7 marks Moderate -0.3
  1. Naasir is playing a game with two friends. The game is designed to be a game of chance so that the probability of Naasir winning each game is \(\frac { 1 } { 3 }\) Naasir and his friends play the game 15 times.
    1. Find the probability that Naasir wins
      1. exactly 2 games,
      2. more than 5 games.
    Naasir claims he has a method to help him win more than \(\frac { 1 } { 3 }\) of the games. To test this claim, the three of them played the game again 32 times and Naasir won 16 of these games.
  2. Stating your hypotheses clearly, test Naasir's claim at the \(5 \%\) level of significance.
Edexcel AS Paper 2 Specimen Q5
9 marks Easy -1.2
5. (a) The discrete random variable \(X \sim \mathrm {~B} ( 40,0.27 )\) $$\text { Find } \quad \mathrm { P } ( X \geqslant 16 )$$ Past records suggest that \(30 \%\) of customers who buy baked beans from a large supermarket buy them in single tins. A new manager suspects that there has been a change in the proportion of customers who buy baked beans in single tins. A random sample of 20 customers who had bought baked beans was taken.
(b) Write down the hypotheses that should be used to test the manager's suspicion.
(c) Using a \(10 \%\) level of significance, find the critical region for a two-tailed test to answer the manager's suspicion. You should state the probability of rejection in each tail, which should be less than 0.05
(d) Find the actual significance level of a test based on your critical region from part (c). One afternoon the manager observes that 12 of the 20 customers who bought baked beans, bought their beans in single tins.
(e) Comment on the manager's suspicion in the light of this observation. Later it was discovered that the local scout group visited the supermarket that afternoon to buy food for their camping trip.
(f) Comment on the validity of the model used to obtain the answer to part (e), giving a reason for your answer.
Edexcel AS Paper 2 Specimen Q2
8 marks Standard +0.3
2. The discrete random variable \(X \sim \mathrm {~B} ( 30,0.28 )\)
  1. Find \(\mathrm { P } ( 5 \leq X < 12 )\). Past records from a large supermarket show that \(25 \%\) of people who buy eggs, buy organic eggs. On one particular day a random sample of 40 people is taken from those that had bought eggs and 16 people are found to have bought organic eggs.
  2. Test, at the \(1 \%\) significance level, whether or not the proportion \(p\) of people who bought organic eggs that day had increased. State your hypotheses clearly.
  3. State the conclusion you would have reached if a \(5 \%\) significance level had been used for this test. \section*{(Total for Question 2 is 8 marks)}
Edexcel Paper 3 2018 June Q5
14 marks Challenging +1.2
  1. The lifetime, \(L\) hours, of a battery has a normal distribution with mean 18 hours and standard deviation 4 hours.
Alice's calculator requires 4 batteries and will stop working when any one battery reaches the end of its lifetime.
  1. Find the probability that a randomly selected battery will last for longer than 16 hours. At the start of her exams Alice put 4 new batteries in her calculator. She has used her calculator for 16 hours, but has another 4 hours of exams to sit.
  2. Find the probability that her calculator will not stop working for Alice's remaining exams. Alice only has 2 new batteries so, after the first 16 hours of her exams, although her calculator is still working, she randomly selects 2 of the batteries from her calculator and replaces these with the 2 new batteries.
  3. Show that the probability that her calculator will not stop working for the remainder of her exams is 0.199 to 3 significant figures. After her exams, Alice believed that the lifetime of the batteries was more than 18 hours. She took a random sample of 20 of these batteries and found that their mean lifetime was 19.2 hours.
  4. Stating your hypotheses clearly and using a \(5 \%\) level of significance, test Alice's belief.
AQA Further AS Paper 2 Statistics 2019 June Q7
9 marks Standard +0.3
7 Mohammed is conducting a medical trial to study the effect of two drugs, \(A\) and \(B\), on the amount of time it takes to recover from a particular illness. Drug \(A\) is used by one group of 60 patients and drug \(B\) is used by a second group of 60 patients. The results are summarised in the table:
AQA Further AS Paper 2 Statistics 2022 June Q6
8 marks Standard +0.3
6 The number of computers sold per day by a shop can be modelled by the random variable \(Y\) where \(Y \sim \operatorname { Po } ( 42 )\) 6
  1. State the variance of \(Y\) 6
  2. One month ago, the shop started selling a new model of computer.
    On a randomly chosen day in the last month, the shop sold 53 computers.
    Carry out a hypothesis test, at the \(5 \%\) level of significance, to investigate whether the mean number of computers sold per day has increased in the last month.
    [0pt] [6 marks]
    6
  3. Describe, in the context of the hypothesis test in part (b), what is meant by a Type II error.
AQA Further AS Paper 2 Statistics 2023 June Q4
4 marks Standard +0.3
4 The proportion, \(p\), of people in a particular town who use the local supermarket is unknown. A random sample of 30 people in the town is taken and each person is asked if they use the local supermarket. The manager of the supermarket claims that 35\% of the people in the town use the local supermarket. The random sample is used to conduct a hypothesis test at the \(5 \%\) level of significance with the hypotheses $$\begin{aligned} & \mathrm { H } _ { 0 } : p = 0.35 \\ & \mathrm { H } _ { 1 } : p \neq 0.35 \end{aligned}$$ Show that the probability that a Type I error is made is 0.0356 , correct to four decimal places.
AQA Further AS Paper 2 Statistics 2024 June Q7
11 marks Standard +0.3
7 Over a period of time, it has been shown that the mean number of customers entering a small store is 6 per hour. The store runs a promotion, selling many products at lower prices. 7
  1. Luke randomly selects an hour during the promotion and counts 11 customers entering the store. He claims that the promotion has changed the mean number of customers per hour entering the store. Investigate Luke's claim, using the \(5 \%\) level of significance.
    7
  2. Luke randomly selects another hour and carries out the same investigation as in part (a). Find the probability of a Type I error, giving your answer to four decimal places.
    Fully justify your answer.
    7
  3. When observing the store, Luke notices that some customers enter the store together as a group. Explain why the model used in parts (a) and (b) might not be valid.
    DO NOT WRITE/ON THIS PAGE ANSWER IN THE/SPACES PROVIDED number Additional page, if required. Write the question numbers in the left-hand margin.
    Additional page, if required. number Additional page, if required.
    Write the question numbers in the left-hand margin. Additional page, if required. number Additional page, if required.
    Write the question numbers in the left-hand margin.
    Write the question n