2.05b Hypothesis test for binomial proportion

279 questions

Sort by: Default | Easiest first | Hardest first
OCR S2 2007 January Q4
7 marks Moderate -0.8
4 A set of observations of a random variable \(W\) can be summarised as follows: $$n = 14 , \quad \Sigma w = 100.8 , \quad \Sigma w ^ { 2 } = 938.70 .$$
  1. Calculate an unbiased estimate of the variance of \(W\).
  2. The mean of 70 observations of \(W\) is denoted by \(\bar { W }\). State the approximate distribution of \(\bar { W }\), including unbiased estimate(s) of any parameter(s).
OCR S2 2007 January Q7
11 marks Standard +0.3
7 A television company believes that the proportion of households that can receive Channel C is 0.35 .
  1. In a random sample of 14 households it is found that 2 can receive Channel C. Test, at the \(2.5 \%\) significance level, whether there is evidence that the proportion of households that can receive Channel C is less than 0.35.
  2. On another occasion the test is carried out again, with the same hypotheses and significance level as in part (i), but using a new sample, of size \(n\). It is found that no members of the sample can receive Channel C. Find the largest value of \(n\) for which the null hypothesis is not rejected. Show all relevant working.
OCR S2 2008 January Q3
8 marks Standard +0.8
3 The random variable \(G\) has the distribution \(\operatorname { Po } ( \lambda )\). A test is carried out of the null hypothesis \(\mathrm { H } _ { 0 } : \lambda = 4.5\) against the alternative hypothesis \(\mathrm { H } _ { 1 } : \lambda \neq 4.5\), based on a single observation of \(G\). The critical region for the test is \(G \leqslant 1\) and \(G \geqslant 9\).
  1. Find the significance level of the test.
  2. Given that \(\lambda = 5.5\), calculate the probability that the test results in a Type II error.
OCR S2 2008 January Q8
13 marks Standard +0.3
8 Consultations are taking place as to whether a site currently in use as a car park should be developed as a shopping mall. An agency acting on behalf of a firm of developers claims that at least \(65 \%\) of the local population are in favour of the development. In a survey of a random sample of 12 members of the local population, 6 are in favour of the development.
  1. Carry out a test, at the \(10 \%\) significance level, to determine whether the result of the survey is consistent with the claim of the agency.
  2. A local residents' group claims that no more than \(35 \%\) of the local population are in favour of the development. Without further calculations, state with a reason what can be said about the claim of the local residents' group.
  3. A test is carried out, at the \(15 \%\) significance level, of the agency's claim. The test is based on a random sample of size \(2 n\), and exactly \(n\) of the sample are in favour of the development. Find the smallest possible value of \(n\) for which the outcome of the test is to reject the agency's claim.
    [0pt] [4] 4
OCR S2 2005 June Q6
11 marks Standard +0.3
6 A factory makes chocolates of different types. The proportion of milk chocolates made on any day is denoted by \(p\). It is desired to test the null hypothesis \(\mathrm { H } _ { 0 } : p = 0.8\) against the alternative hypothesis \(\mathrm { H } _ { 1 } : p < 0.8\). The test consists of choosing a random sample of 25 chocolates. \(\mathrm { H } _ { 0 }\) is rejected if the number of milk chocolates is \(k\) or fewer. The test is carried out at a significance level as close to \(5 \%\) as possible.
  1. Use tables to find the value of \(k\), giving the values of any relevant probabilities.
  2. The test is carried out 20 times, and each time the value of \(p\) is 0.8 . Each of the tests is independent of all the others. State the expected number of times that the test will result in rejection of the null hypothesis.
  3. The test is carried out once. If in fact the value of \(p\) is 0.6 , find the probability of rejecting \(\mathrm { H } _ { 0 }\).
  4. The test is carried out twice. Each time the value of \(p\) is equally likely to be 0.8 or 0.6 . Find the probability that exactly one of the two tests results in rejection of the null hypothesis.
OCR S2 2006 June Q2
7 marks Standard +0.3
2
  1. The random variable \(R\) has the distribution \(\mathrm { B } ( 6 , p )\). A random observation of \(R\) is found to be 6. Carry out a \(5 \%\) significance test of the null hypothesis \(\mathrm { H } _ { 0 } : p = 0.45\) against the alternative hypothesis \(\mathrm { H } _ { 1 } : p \neq 0.45\), showing all necessary details of your calculation.
  2. The random variable \(S\) has the distribution \(\mathrm { B } ( n , p ) . \mathrm { H } _ { 0 }\) and \(\mathrm { H } _ { 1 }\) are as in part (i). A random observation of \(S\) is found to be 1 . Use tables to find the largest value of \(n\) for which \(\mathrm { H } _ { 0 }\) is not rejected. Show the values of any relevant probabilities.
OCR S2 2007 June Q3
3 marks Easy -1.8
3 The proportion of adults in a large village who support a proposal to build a bypass is denoted by \(p\). A random sample of size 20 is selected from the adults in the village, and the members of the sample are asked whether or not they support the proposal.
  1. Name the probability distribution that would be used in a hypothesis test for the value of \(p\).
  2. State the properties of a random sample that explain why the distribution in part (i) is likely to be a good model. \(4 X\) is a continuous random variable.
OCR S2 2007 June Q6
9 marks Standard +0.3
6 In a rearrangement code, the letters of a message are rearranged so that the frequency with which any particular letter appears is the same as in the original message. In ordinary German the letter \(e\) appears \(19 \%\) of the time. A certain encoded message of 20 letters contains one letter \(e\).
  1. Using an exact binomial distribution, test at the \(10 \%\) significance level whether there is evidence that the proportion of the letter \(e\) in the language from which this message is a sample is less than in German, i.e., less than \(19 \%\).
  2. Give a reason why a binomial distribution might not be an appropriate model in this context.
OCR S2 2014 June Q6
12 marks Standard +0.3
6 In a city the proportion of inhabitants from ethnic group \(\mathbf { Z }\) is known to be \(\mathbf { 0 . 4 }\). A sample of \(\mathbf { 1 2 }\) employees of a large company in this city is obtained and it is found that 2 of them are from ethnic group \(Z\). A test is carried out, at the \(5 \%\) significance level, of whether the proportion of employees in this company from ethnic group \(Z\) is less than in the city as a whole.
[0pt]
  1. State an assumption that must be made about the sample for a significance test to be valid. [1]
    [0pt]
  2. Describe briefly an appropriate way of obtaining the sample. [2]
    [0pt]
  3. Carry out the test. [7]
  4. A manager believes that the company discriminates against ethnic group \(Z\). Explain whether carrying out the test at the 10\% significance level would be more supportive or less supportive of the manager's belief. [2]
OCR S2 Specimen Q8
14 marks Challenging +1.2
8 The proportion of left-handed adults in a country is known to be \(15 \%\). It is suggested that for mathematicians the proportion is greater than \(15 \%\). A random sample of 12 members of a university mathematics department is taken, and it is found to include five who are left-handed.
  1. Stating your hypotheses, test whether the suggestion is justified, using a significance level as close to \(5 \%\) as possible.
  2. In fact the significance test cannot be carried out at a significance level of exactly \(5 \%\). State the probability of making a Type I error in the test.
  3. Find the probability of making a Type II error in the test for the case when the proportion of mathematicians who are left-handed is actually \(20 \%\).
  4. Determine, as accurately as the tables of cumulative binomial probabilities allow, the actual proportion of mathematicians who are left-handed for which the probability of making a Type II error in the test is 0.01 .
OCR S2 2013 January Q1
4 marks Standard +0.8
1 A random variable has the distribution \(\mathrm { B } ( n , p )\). It is required to test \(\mathrm { H } _ { 0 } : p = \frac { 2 } { 3 }\) against \(\mathrm { H } _ { 1 } : p < \frac { 2 } { 3 }\) at a significance level as close to \(1 \%\) as possible, using a sample of size \(n = 8,9\) or 10 . Use tables to find which value of \(n\) gives such a test, stating the critical region for the test and the corresponding significance level.
[0pt] [4]
OCR S2 2013 January Q9
8 marks Standard +0.8
9 The random variable \(A\) has the distribution \(\mathrm { B } ( 30 , p )\). A test is carried out of the hypotheses \(\mathrm { H } _ { 0 } : p = 0.6\) against \(\mathrm { H } _ { 1 } : p < 0.6\). The critical region is \(A \leqslant 13\).
  1. State the probability that \(\mathrm { H } _ { 0 }\) is rejected when \(p = 0.6\).
  2. Find the probability that a Type II error occurs when \(p = 0.5\).
  3. It is known that on average \(p = 0.5\) on one day in five, and on other days the value of \(p\) is 0.6 . On each day two tests are carried out. If the result of the first test is that \(\mathrm { H } _ { 0 }\) is rejected, the value of \(p\) is adjusted if necessary, to ensure that \(p = 0.6\) for the rest of the day. Otherwise the value of \(p\) remains the same as for the first test. Calculate the probability that the result of the second test is to reject \(\mathrm { H } _ { 0 }\). \section*{THERE ARE NO QUESTIONS WRITTEN ON THIS PAGE.}
OCR S2 2015 June Q5
8 marks Standard +0.3
5
  1. State an advantage of using random numbers in selecting samples.
  2. It is known that in analysing the digits in large sets of financial records, the probability that the leading digit is 1 is 0.25 . A random sample of 18 leading digits from a certain large set of financial records is obtained and it is found that 8 of the leading digits are 1 s . Test, at the \(5 \%\) significance level, whether the probability that the leading digit is 1 in this set of records is greater than 0.25 .
OCR S2 2015 June Q8
7 marks Standard +0.8
8 The random variable \(S\) has the distribution \(\mathrm { B } ( 14 , p )\). A significance test is carried out of the null hypothesis \(\mathrm { H } _ { 0 } : p = 0.3\) against the alternative hypothesis \(\mathrm { H } _ { 1 } : p > 0.3\). The critical region for the test is \(S \geqslant 8\).
  1. Find the significance level of the test, correct to 3 significant figures.
  2. It is given that, on each occasion that the test is carried out, the true value of \(p\) is equally likely to be \(0.3,0.5\) or 0.7 , independently of any other test. Four independent tests are carried out. Find the probability that at least one of the tests results in a Type II error.
OCR MEI S1 2012 January Q6
17 marks Moderate -0.3
6 It is known that \(25 \%\) of students in a particular city are smokers. A random sample of 20 of the students is selected.
  1. (A) Find the probability that there are exactly 4 smokers in the sample.
    (B) Find the probability that there are at least 3 but no more than 6 smokers in the sample.
    (C) Write down the expected number of smokers in the sample. A new health education programme is introduced. This programme aims to reduce the percentage of students in this city who are smokers. After the programme has been running for a year, it is decided to carry out a hypothesis test to assess the effectiveness of the programme. A random sample of 20 students is selected.
  2. (A) Write down suitable null and alternative hypotheses for the test.
    (B) Explain why the alternative hypothesis has the form that it does.
  3. Find the critical region for the test at the \(5 \%\) level, showing all of your calculations.
  4. In fact there are 3 smokers in the sample. Complete the test, stating your conclusion clearly.
OCR MEI S1 2013 January Q7
18 marks Standard +0.3
7 A coffee shop provides free internet access for its customers. It is known that the probability that a randomly selected customer is accessing the internet is 0.35 , independently of all other customers.
  1. 10 customers are selected at random.
    (A) Find the probability that exactly 5 of them are accessing the internet.
    (B) Find the probability that at least 5 of them are accessing the internet.
    (C) Find the expected number of these customers who are accessing the internet. Another coffee shop also provides free internet access. It is suspected that the probability that a randomly selected customer at this coffee shop is accessing the internet may be different from 0.35 . A random sample of 20 customers at this coffee shop is selected. Of these, 10 are accessing the internet.
  2. Carry out a hypothesis test at the \(5 \%\) significance level to investigate whether the probability for this coffee shop is different from 0.35 . Give a reason for your choice of alternative hypothesis.
  3. To get a more reliable result, a much larger random sample of 200 customers is selected over a period of time, and another hypothesis test is carried out. You are given that 90 of the 200 customers were accessing the internet. You are also given that, if \(X\) has the binomial distribution with parameters \(n = 200\) and \(p = 0.35\), then \(\mathrm { P } ( X \geqslant 90 ) = 0.0022\). Using the same hypotheses and significance level which you used in part (ii), complete this test.
OCR MEI S1 2009 June Q8
18 marks Standard +0.3
8 The Department of Health 'eat five a day' advice recommends that people should eat at least five portions of fruit and vegetables per day. In a particular school, \(20 \%\) of pupils eat at least five a day.
  1. 15 children are selected at random.
    (A) Find the probability that exactly 3 of them eat at least five a day.
    (B) Find the probability that at least 3 of them eat at least five a day.
    (C) Find the expected number who eat at least five a day. A programme is introduced to encourage children to eat more portions of fruit and vegetables per day. At the end of this programme, the diets of a random sample of 15 children are analysed. A hypothesis test is carried out to examine whether the proportion of children in the school who eat at least five a day has increased.
  2. (A) Write down suitable null and alternative hypotheses for the test.
    (B) Give a reason for your choice of the alternative hypothesis.
  3. Find the critical region for the test at the \(10 \%\) significance level, showing all of your calculations. Hence complete the test, given that 7 of the 15 children eat at least five a day.
OCR MEI S1 2010 June Q6
18 marks Standard +0.3
6 A manufacturer produces tiles. On average 10\% of the tiles produced are faulty. Faulty tiles occur randomly and independently. A random sample of 18 tiles is selected.
  1. (A) Find the probability that there are exactly 2 faulty tiles in the sample.
    (B) Find the probability that there are more than 2 faulty tiles in the sample.
    (C) Find the expected number of faulty tiles in the sample. A cheaper way of producing the tiles is introduced. The manufacturer believes that this may increase the proportion of faulty tiles. In order to check this, a random sample of 18 tiles produced using the cheaper process is selected and a hypothesis test is carried out.
  2. (A) Write down suitable null and alternative hypotheses for the test.
    (B) Explain why the alternative hypothesis has the form that it does.
  3. Find the critical region for the test at the \(5 \%\) level, showing all of your calculations.
  4. In fact there are 4 faulty tiles in the sample. Complete the test, stating your conclusion clearly.
OCR MEI S1 2012 June Q5
8 marks Moderate -0.3
5 A manufacturer produces titanium bicycle frames. The bicycle frames are tested before use and on average \(5 \%\) of them are found to be faulty. A cheaper manufacturing process is introduced and the manufacturer wishes to check whether the proportion of faulty bicycle frames has increased. A random sample of 18 bicycle frames is selected and it is found that 4 of them are faulty. Carry out a hypothesis test at the \(5 \%\) significance level to investigate whether the proportion of faulty bicycle frames has increased.
OCR MEI S1 2013 June Q5
8 marks Moderate -0.3
5 A researcher is investigating whether people can identify whether a glass of water they are given is bottled water or tap water. She suspects that people do no better than they would by guessing. Twenty people are selected at random; thirteen make a correct identification. She carries out a hypothesis test.
  1. Explain why the null hypothesis should be \(p = 0.5\), where \(p\) represents the probability that a randomly selected person makes a correct identification.
  2. Briefly explain why she uses an alternative hypothesis of \(p > 0.5\).
  3. Complete the test at the \(5 \%\) significance level.
OCR MEI S1 2015 June Q7
17 marks Standard +0.3
7 A drug for treating a particular minor illness cures, on average, \(78 \%\) of patients. Twenty people with this minor illness are selected at random and treated with the drug.
  1. \(( A )\) Find the probability that exactly 19 patients are cured.
    (B) Find the probability that at most 18 patients are cured. \(( C )\) Find the expected number of patients who are cured.
  2. A pharmaceutical company is trialling a new drug to treat this illness. Researchers at the company hope that a higher percentage of patients will be cured when given this new drug. Twenty patients are selected at random, and given the new drug. Of these, 19 are cured. Carry out a hypothesis test at the \(1 \%\) significance level to investigate whether there is any evidence to suggest that the new drug is more effective than the old one.
  3. If the researchers had chosen to carry out the hypothesis test at the \(5 \%\) significance level, what would the result have been? Justify your answer.
OCR S2 2009 January Q4
10 marks Moderate -0.3
4 A television company believes that the proportion of adults who watched a certain programme is 0.14 . Out of a random sample of 22 adults, it is found that 2 watched the programme.
  1. Carry out a significance test, at the \(10 \%\) level, to determine, on the basis of this sample, whether the television company is overestimating the proportion of adults who watched the programme.
  2. The sample was selected randomly. State what properties of this method of sampling are needed to justify the use of the distribution used in your test.
OCR S2 2011 January Q9
11 marks Standard +0.3
9 A pharmaceutical company is developing a new drug to treat a certain disease. The company will continue to develop the drug if the proportion \(p\) of those who have the disease and show a substantial improvement after treatment is greater than 0.7 . The company carries out a test, at the \(5 \%\) significance level, on a random sample of 14 patients who suffer from the disease.
  1. Find the critical region for the test.
  2. Given that 12 of the 14 patients in the sample show a substantial improvement, carry out the test.
  3. Find the probability that the test results in a Type II error if in fact \(p = 0.8\). RECOGNISING ACHIEVEMENT
OCR S2 2009 June Q3
7 marks Moderate -0.3
3 An electronics company is developing a new sound system. The company claims that \(60 \%\) of potential buyers think that the system would be good value for money. In a random sample of 12 potential buyers, 4 thought that it would be good value for money. Test, at the 5\% significance level, whether the proportion claimed by the company is too high.
OCR S2 2010 June Q4
7 marks Moderate -0.3
4 The proportion of commuters in a town who travel to work by train is 0.4 . Following the opening of a new station car park, a random sample of 16 commuters is obtained, and 11 of these travel to work by train. Test at the \(1 \%\) significance level whether there is evidence of an increase in the proportion of commuters in this town who travel to work by train.