5.02d Binomial: mean np and variance np(1-p)

88 questions

Sort by: Default | Easiest first | Hardest first
Edexcel S4 2017 June Q6
19 marks Challenging +1.2
6. The independent random variables \(X _ { 1 }\) and \(X _ { 2 }\) are each distributed \(\mathrm { B } ( n , p )\), where \(n > 1\) An unbiased estimator for \(p\) is given by $$\hat { p } = \frac { a X _ { 1 } + b X _ { 2 } } { n }$$ where \(a\) and \(b\) are constants.
[0pt] [You may assume that if \(X _ { 1 }\) and \(X _ { 2 }\) are independent then \(\mathrm { E } \left( X _ { 1 } X _ { 2 } \right) = \mathrm { E } \left( X _ { 1 } \right) \mathrm { E } \left( X _ { 2 } \right)\) ]
  1. Show that \(a + b = 1\)
  2. Show that \(\operatorname { Var } ( \hat { p } ) = \frac { \left( 2 a ^ { 2 } - 2 a + 1 \right) p ( 1 - p ) } { n }\)
  3. Hence, justifying your answer, determine the value of \(a\) and the value of \(b\) for which \(\hat { p }\) has minimum variance.
    1. Show that \(\hat { p } ^ { 2 }\) is a biased estimator for \(p ^ { 2 }\)
    2. Show that the bias \(\rightarrow 0\) as \(n \rightarrow \infty\)
  4. By considering \(\mathrm { E } \left[ X _ { 1 } \left( X _ { 1 } - 1 \right) \right]\) find an unbiased estimator for \(p ^ { 2 }\)
OCR MEI Further Statistics A AS 2018 June Q4
9 marks Moderate -0.3
4 The probability that an expert darts player hits the bullseye on any throw is 0.12 , independently of any other throw. The player throws darts at the bullseye until she hits it.
  1. Find the probability that the player has to throw exactly six darts.
  2. Find the probability that the player has to throw more than six darts.
  3. (A) Find the mean number of darts that the player has to throw.
    (B) Find the variance of the number of darts that the player has to throw. The player continues to throw more darts at the bullseye after she has hit it for the first time.
  4. Find the probability that the player hits the bullseye at least twice in the first ten throws.
  5. Find the probability that the player hits the bullseye for the second time on the tenth throw.
OCR MEI Further Statistics A AS 2019 June Q2
9 marks Moderate -0.3
2 Almost all plants of a particular species have red flowers. However on average 1 in every 1500 plants of this species have white flowers. A random sample of 2000 plants of this species is selected. The random variable \(X\) represents the number of plants in the sample that have white flowers.
  1. Name two distributions which could be used to model the distribution of \(X\), stating the parameters of each of these distributions. You may use either of the distributions you have named in the rest of this question.
  2. Calculate each of the following.
    Calculate the probability that there are at least 10 plants in the sample that have white flowers.
OCR MEI Further Statistics A AS 2023 June Q3
7 marks Moderate -0.8
3 At a pottery which manufactures mugs, it is known that \(5 \%\) of mugs are faulty. The mugs are produced in batches of 20 . Faults are modelled as occurring randomly and independently. The number of faulty mugs in a batch is denoted by the random variable \(X\).
  1. Determine \(\mathrm { P } ( X \geqslant 2 )\).
  2. Find \(\operatorname { Var } ( X )\). Independently of the mugs, the pottery also manufactures cups, and it is known that \(7 \%\) of cups are faulty. The cups are produced in batches of 30 . Faults are modelled as occurring randomly and independently. The number of faulty cups in a batch is denoted by the random variable \(Y\).
  3. Determine the standard deviation of \(X + Y\). When 10 batches of cups have been produced, a sample of 15 cups is tested to ensure that the handles of the cups are properly attached.
  4. Explain why it might not be sensible to select a sample of 15 cups from the same batch.
OCR MEI Further Statistics A AS 2023 June Q4
10 marks Standard +0.3
4 At a parcel delivery company it is known that the probability that a parcel is delivered to the wrong address is 0.0005 . On a particular day, 15000 parcels are delivered. The number of parcels delivered to the wrong address is denoted by the random variable \(X\).
  1. Explain why the binomial distribution and the Poisson distribution could both be suitable models for the distribution of \(X\).
  2. Use a Poisson distribution to find each of the following.
    • \(\mathrm { P } ( X = 5 )\)
    • \(\mathrm { P } ( X \geqslant 8 )\)
    You are given that 15000 parcels are delivered each day in a 5-day working week.
    1. Determine the probability that at least 40 parcels are delivered to the wrong address during the week.
    2. Determine the probability that at least 8 parcels are delivered to the wrong address on each of the 5 days in the week.
OCR MEI Further Statistics A AS 2024 June Q6
10 marks Moderate -0.8
6 A bank monitors the amounts of cash withdrawn from a cash machine. It categorises any withdrawal of an amount of \(\pounds 50\) or less as 'small' and any withdrawal of an amount greater than \(\pounds 50\) as 'large'. Over a long period of time the bank finds that the proportion of withdrawals that are small is 0.43 .
The bank wishes to model a sample of 10 withdrawals to examine the number of small withdrawals.
    1. State a suitable probability distribution for such a model, justifying your answer.
    2. State one assumption needed for the model to be valid.
    1. Find the probability that exactly 4 of the 10 withdrawals are small.
    2. Find the probability that exactly 4 of the 10 withdrawals are large.
    3. Find the probability that no more than 4 of the 10 withdrawals are large.
  1. Find the probability that, in the 10 withdrawals, the 7th withdrawal is large and there are exactly 3 that are small.
OCR MEI Further Statistics Minor 2019 June Q1
7 marks Easy -1.3
1 In a game at a charity fair, a spinner is spun 4 times.
On each spin the chance that the spinner lands on a score of 5 is 0.2 .
The random variable \(X\) represents the number of spins on which the spinner lands on a score of 5 .
  1. Find \(\mathrm { P } ( X = 3 )\).
  2. Find each of the following.
    • \(\mathrm { E } ( X )\)
    • \(\operatorname { Var } ( X )\)
    One game costs \(\pounds 1\) to play and, for each spin that lands on a score of 5 , the player receives 50 pence.
    1. Find the expected total amount of money gained by a player in one game.
    2. Find the standard deviation of the total amount of money gained by a player in one game.
OCR MEI Further Statistics Minor 2020 November Q2
11 marks Standard +0.3
2 On computer monitor screens there are often one or more tiny dots which are permanently dark and do not display any of the image. Such dots are known as 'dead pixels'. Dead pixels occur on screens randomly and independently of each other. A company manufactures three types of monitor, Types A, B and C. For a monitor of Type A, the screen has a total of 2304000 pixels. For this type of monitor, the probability of a randomly chosen pixel being dead is 1 in 500000 . Let \(X\) represent the number of dead pixels on a monitor screen of this type.
  1. Explain why you could use either a binomial distribution or a Poisson distribution to model the distribution of \(X\).
  2. Use a Poisson distribution to calculate estimates of each of the following probabilities.
    For a monitor of Type B, the probability of a randomly chosen pixel being dead is also 1 in 500 000. The screen of a monitor of Type B has a total of \(n\) pixels. Use a binomial distribution to find the least value of \(n\) for which the probability of finding at least 1 dead pixel is greater than 0.99 . Give your answer in millions correct to 3 significant figures. For a monitor of Type C, the number of dead pixels on the screen is modelled by a Poisson distribution with mean \(\lambda\).
  3. Given that the probability of finding at least one dead pixel is 0.8 , find \(\lambda\).
OCR MEI Further Statistics Major 2023 June Q11
9 marks Moderate -0.5
11 The random variable \(X\) takes the value 1 with probability \(p\) and the value 0 with probability \(1 - p\).
  1. Find each of the following.
    Use the results of part (a) to prove that
Edexcel FS1 2020 June Q2
4 marks Moderate -0.5
  1. The discrete random variables \(W , X\) and \(Y\) are distributed as follows
$$W \sim \mathrm {~B} ( 10,0.4 ) \quad X \sim \operatorname { Po } ( 4 ) \quad Y \sim \operatorname { Po } ( 3 )$$
  1. Explain whether or not \(\mathrm { Po } ( 4 )\) would be a good approximation to \(\mathrm { B } ( 10,0.4 )\)
  2. State the assumption required for \(X + Y\) to be distributed as \(\operatorname { Po } ( 7 )\) Given the assumption in part (b) holds,
  3. find \(\mathrm { P } ( X + Y < \operatorname { Var } ( W ) )\)
Edexcel FS1 2020 June Q4
8 marks Standard +0.8
  1. The discrete random variable \(X\) has the following probability distribution.
\(x\)- 5- 234
\(\mathrm { P } ( X = x )\)\(\frac { 1 } { 12 }\)\(\frac { 1 } { 6 }\)\(\frac { 1 } { 4 }\)\(\frac { 1 } { 2 }\)
  1. Find \(\operatorname { Var } ( X )\) The discrete random variable \(Y\) is defined in terms of the discrete random variable \(X\) When \(X\) is negative, \(Y = X ^ { 2 }\) When \(X\) is positive, \(Y = 3 X - 2\)
  2. Find \(\mathrm { P } ( Y < 9 )\)
  3. Find \(\mathrm { E } ( X Y )\)
Edexcel FS1 2021 June Q4
10 marks Standard +0.3
  1. Members of a photographic group may enter a maximum of 5 photographs into a members only competition.
    Past experience has shown that the number of photographs, \(N\), entered by a member follows the probability distribution shown below.
\(n\)012345
\(\mathrm { P } ( N = n )\)\(a\)0.20.050.25\(b\)\(c\)
Given that \(\mathrm { E } ( 4 N + 2 ) = 14.8\) and \(\mathrm { P } ( N = 5 \mid N > 2 ) = \frac { 1 } { 2 }\)
  1. show that \(\operatorname { Var } ( N ) = 2.76\) The group decided to charge a 50p entry fee for the first photograph entered and then 20p for each extra photograph entered into the competition up to a maximum of \(\pounds 1\) per person. Thus a member who enters 3 photographs pays 90 p and a member who enters 4 or 5 photographs just pays £1 Assuming that the probability distribution for the number of photographs entered by a member is unchanged,
  2. calculate the expected entry fee per member. Bai suggests that, as the mean and variance are close, a Poisson distribution could be used to model the number of photographs entered by a member next year.
  3. State a limitation of the Poisson distribution in this case.
OCR Further Statistics AS 2018 June Q6
5 marks Standard +0.8
6 In this question you must show detailed reasoning. The random variable \(T\) has a binomial distribution. It is known that \(\mathrm { E } ( T ) = 5.625\) and the standard deviation of \(T\) is 1.875 . Find the values of the parameters of the distribution.
OCR Further Statistics 2018 March Q5
8 marks Standard +0.3
5 A spinner has 5 edges. Each edge is numbered with a different integer from 1 to 5 . When the spinner is spun, it is equally likely to come to rest on any one of the edges. The spinner is spun 100 times. The number of times on which the spinner comes to rest on the edge numbered 5 is denoted by \(X\).
  1. \(\mathrm { E } ( X )\),
  2. \(\operatorname { Var } ( X )\).
    1. Write down
    2. Use a normal distribution with the same mean and variance as in your answers to part (i) to estimate the smallest value of \(n\) such that \(\mathrm { P } ( X \geqslant n ) < 0.02\).
    3. Use the binomial distribution to find exactly the smallest value of \(n\) such that \(\mathrm { P } ( X \geqslant n ) < 0.02\). Show the values of all relevant calculations.
OCR Further Statistics 2018 September Q6
10 marks Standard +0.8
6 A bag contains 7 red counters and 5 blue counters.
  1. Fred chooses 4 counters at random, without replacement. Show that the probability that Fred chooses exactly 2 red counters is \(\frac { 14 } { 33 }\).
  2. Lina chooses 4 counters at random from the bag, records whether or not exactly 2 red counters are chosen, and returns the counters to the bag. She carries out this experiment 99 times.
    1. Find the mean of the number of experiments that result in choosing exactly 2 red counters.
    2. Find the variance of the number of experiments that result in choosing exactly 2 red counters.
    3. Alex arranges all 12 counters in a random order in a straight line. A is the event: no two blue counters are next to one another. B is the event: all the blue counters are next to one another. Find \(\mathrm { P } ( A \cup B )\).
AQA S1 2005 January Q5
15 marks Standard +0.3
5 Each evening Aaron sets his alarm for 7 am. He believes that the probability that he wakes before his alarm rings each morning is 0.4 , and is independent from morning to morning.
  1. Assuming that Aaron's belief is correct, determine the probability that, during a week (7 mornings), he wakes before his alarm rings:
    1. on 2 or fewer mornings;
    2. on more than 1 but fewer than 5 mornings.
  2. Assuming that Aaron's belief is correct, calculate the probability that, during a 4 -week period, he wakes before his alarm rings on exactly 7 mornings.
  3. Assuming that Aaron's belief is correct, calculate values for the mean and standard deviation of the number of mornings in a week when Aaron wakes before his alarm rings.
    (2 marks)
  4. During a 50-week period, Aaron records, each week, the number of mornings on which he wakes before his alarm rings. The results are as follows.
    Number of mornings01234567
    Frequency108775544
    1. Calculate the mean and standard deviation of these data.
    2. State, giving reasons, whether your answers to part (d)(i) support Aaron's belief that the probability that he wakes before his alarm rings each morning is 0.4 , and is independent from morning to morning.
      (3 marks)
AQA S1 2010 January Q6
14 marks Moderate -0.3
6 During the winter, the probability that Barry's cat, Sylvester, chooses to stay outside all night is 0.35 , and the cat's choice is independent from night to night.
  1. Determine the probability that, during a period of 2 weeks ( 14 nights) in winter, Sylvester chooses to stay outside:
    1. on at most 7 nights;
    2. on at least 11 nights;
    3. on more than 5 nights but fewer than 10 nights.
  2. Calculate the probability that, during a period of \(\mathbf { 3 }\) weeks in winter, Sylvester chooses to stay outside on exactly 4 nights.
  3. Barry claims that, during the summer, the number of nights per week, \(S\), on which Sylvester chooses to stay outside can be modelled by a binomial distribution with \(n = 7\) and \(p = \frac { 5 } { 7 }\).
    1. Assuming that Barry's claim is correct, find the mean and the variance of \(S\).
    2. For a period of 13 weeks during the summer, the number of nights per week on which Sylvester chose to stay outside had a mean of 5 and a variance of 1.5 . Comment on Barry's claim.
      (2 marks)
AQA S1 2006 June Q5
17 marks Standard +0.3
5 Kirk and Les regularly play each other at darts.
  1. The probability that Kirk wins any game is 0.3 , and the outcome of each game is independent of the outcome of every other game. Find the probability that, in a match of 15 games, Kirk wins:
    1. exactly 5 games;
    2. fewer than half of the games;
    3. more than 2 but fewer than 7 games.
  2. Kirk attends darts coaching sessions for three months. He then claims that he has a probability of 0.4 of winning any game, and that the outcome of each game is independent of the outcome of every other game.
    1. Assuming this claim to be true, calculate the mean and standard deviation for the number of games won by Kirk in a match of 15 games.
    2. To assess Kirk's claim, Les keeps a record of the number of games won by Kirk in a series of 10 matches, each of 15 games, with the following results: $$\begin{array} { l l l l l l l l l l } 8 & 5 & 6 & 3 & 9 & 12 & 4 & 2 & 6 & 5 \end{array}$$ Calculate the mean and standard deviation of these values.
    3. Hence comment on the validity of Kirk's claim.
OCR MEI Further Statistics Major Specimen Q11
24 marks Standard +0.3
11 Two girls, Lili and Hui, play a game with a fair six-sided dice. The dice is thrown 10 times. \(X _ { 1 } , X _ { 2 } , \ldots , X _ { 10 }\) represent the scores on the \(1 ^ { \text {st } } , 2 ^ { \text {nd } } , \ldots , 10 ^ { \text {th } }\) throws of the dice. \(L\) denotes Lili's score and \(L = 10 X _ { 1 }\). \(H\) denotes Hui's score and \(H = X _ { 1 } + X _ { 2 } + X _ { 3 } + \ldots + X _ { 10 }\).
  1. Calculate
    The spreadsheet below shows a simulation of 25 plays of the game. The cell E3, highlighted, shows the score when the dice is thrown the fourth time in the first game. \begin{table}[h]
    ABCDEFGHIJKLMN
    1Throw of diceLili'sHui's
    212345678910scorescore
    3Game 135211311143022
    4Game 263244353356038
    5Game 364265215236036
    6Game 415166314621035
    7Game 544316441624035
    8Game 621512515232027
    9Game 711344563421033
    10Game 811363445231032
    11Game 922243215562032
    12Game 1035335343113031
    13Game 1153655421155037
    14Game 1264324133536034
    15Game 1323212222212019
    16Game 1441331266134030
    17Game 1551263463645040
    18Game 1636115313333029
    19Game 1752524522345034
    20Game 1836355231123031
    21Game 1966315634166041
    22Game 2026456524332040
    23Game 2153545336615041
    24Game 2263556356116041
    25Game 2354556421365041
    26Game 2435232432333030
    27Game 2552424522525033
    28
    29mean37.6033.68
    30sd17.395.77
    \captionsetup{labelformat=empty} \caption{Fig. 11}
    \end{table}
  2. Use the simulation to estimate \(\mathrm { P } ( L > 40 )\) and \(\mathrm { P } ( H > 40 )\).
  3. (A) Calculate the exact value of \(\mathrm { P } ( L > 40 )\).
    (B) Comment on how the exact value compares with your estimate of \(\mathrm { P } ( L > 40 )\) in part (v). Hui wonders whether it is appropriate to use the Central Limit Theorem to approximate the distribution of \(X _ { 1 } + X _ { 2 } + X _ { 3 } + \ldots + X _ { 10 }\).
  4. (A) State what type of diagram Hui could draw, based on the output from the spreadsheet, to investigate this.
    (B) Explain how she should interpret the diagram.
  5. (A) Calculate an approximate value of \(\mathrm { P } \left( X _ { 1 } + X _ { 2 } + X _ { 3 } + \ldots + X _ { 10 } > 40 \right)\) using the Central Limit Theorem.
    (B) Comment on how this value compares with your estimate of \(\mathrm { P } ( H > 40 )\) in part (v). \section*{Copyright Information:} }{www.ocr.org.uk}) after the live examination series. If OCR has unwittingly failed to correctly acknowledge or clear any third-party content in this assessment material, OCR will be happy to correct its mistake at the earliest possible opportunity.
    For queries or further information please contact the Copyright Team, First Floor, 9 Hills Road, Cambridge CB2 1GE.
    OCR is part of the
Edexcel S2 Q1
Easy -1.8
  1. It is estimated that \(4 \%\) of people have green eyes. In a random sample of size \(n\), the expected number of people with green eyes is 5 .
    1. Calculate the value of \(n\).
Edexcel S2 2024 October Q2
Standard +0.3
  1. A multiple-choice test consists of 25 questions, each having 5 responses, only one of which is correct.
Each correct answer gains 4 marks but each incorrect answer loses 1 mark.
Sam answers all 25 questions by choosing at random one response for each question.
Let \(X\) be the number of correct answers that Sam achieves.
  1. State the distribution of \(X\) Let \(M\) be the number of marks that Sam achieves.
    1. State the distribution of \(M\) in terms of \(X\)
    2. Hence, show clearly that the number of marks that Sam is expected to achieve is zero. In order to pass the test at least 30 marks are required.
  2. Find the probability that Sam will pass the test. Past records show that when the test is done properly, the probability that a student answers the first question correctly is 0.5 A random sample of 50 students that did the test properly was taken.
    Given that the probability that more than \(n\) but at most 30 students answered the first question correctly was 0.9328 to 4 decimal places,
  3. find the value of \(n\)
Pre-U Pre-U 9795/2 2015 June Q3
14 marks Challenging +1.2
3 The probability generating function of the random variable \(X\) is \(\frac { 1 } { 81 } \left( t + \frac { 2 } { t } \right) ^ { 4 }\).
  1. Use the probability generating function to find \(\mathrm { E } ( X )\) and \(\operatorname { Var } ( X )\).
  2. The random variable \(Y\) is defined by \(Y = \frac { 1 } { 2 } ( X + 4 )\). By finding the probability distribution of \(X\), or otherwise, show that \(Y \sim \mathrm {~B} ( n , p )\), stating the values of \(n\) and \(p\).
Pre-U Pre-U 9795/2 2016 June Q3
5 marks Standard +0.8
3
  1. Show that the probability generating function of a random variable with the distribution \(\mathrm { B } ( n , p )\) is \(( 1 - p + p t ) ^ { n }\).
  2. \(R\) and \(S\) are independent random variables with the distributions \(\mathrm { B } \left( 8 , \frac { 1 } { 4 } \right)\) and \(\mathrm { B } \left( 8 , \frac { 3 } { 4 } \right)\) respectively. Show that the probability generating function of \(R + S\) can be expressed as $$\left( \frac { 3 } { 16 } + \frac { 1 } { 16 } t ( 10 + 3 t ) \right) ^ { 8 }$$ and use this result to find \(\mathrm { P } ( R + S = 1 )\).
Pre-U Pre-U 9795/2 2016 June Q5
10 marks Challenging +1.2
5 The random variable \(R\) has the distribution \(\mathrm { B } ( n , p )\).
  1. State two conditions that \(n\) and \(p\) must satisfy if the distribution of \(R\) can be well approximated by a normal distribution. Assume now that these conditions hold. Using the normal approximation, it is given that \(\mathrm { P } ( R < 25 ) = 0.8282\) and \(\mathrm { P } ( R \geqslant 28 ) = 0.0393\), correct to 4 decimal places.
  2. Find the mean and standard deviation of the approximating normal distribution.
  3. Hence find the value of \(p\) and the value of \(n\).
Pre-U Pre-U 9795/2 2016 Specimen Q2
Standard +0.3
2
  1. The probability that a shopper obtains a parking space on the river embankment on any given Saturday morning is 0.2 . Using a suitable normal approximation, find the probability that, over a period of 100 Saturday mornings, the shopper finds a parking space at least 15 times. Justify the use of the normal approximation in this case.
  2. The number of parking tickets that a traffic warden issues on the river embankment during the course of a week has a Poisson distribution with mean 36 . The probability that the traffic warden issues more than \(N\) parking tickets is less than 0.05 . Using a suitable normal approximation, find the least possible value of \(N\).