5.02i Poisson distribution: random events model

479 questions

Sort by: Default | Easiest first | Hardest first
AQA Further Paper 3 Statistics 2024 June Q4
6 marks Moderate -0.3
4
8
16
256 2 The random variable \(T\) has an exponential distribution with mean 2 Find \(\mathrm { P } ( T \leq 1.4 )\) Circle your answer. \(\mathrm { e } ^ { - 2.8 }\) \(\mathrm { e } ^ { - 0.7 }\) \(1 - e ^ { - 0.7 }\) \(1 - \mathrm { e } ^ { - 2.8 }\) The continuous random variable \(Y\) has cumulative distribution function $$\mathrm { F } ( y ) = \left\{ \begin{array} { l r } 0 & y < 2 \\ - \frac { 1 } { 9 } y ^ { 2 } + \frac { 10 } { 9 } y - \frac { 16 } { 9 } & 2 \leq y < 5 \\ 1 & y \geq 5 \end{array} \right.$$ Find the median of \(Y\) Circle your answer. 2 \(\frac { 10 - 3 \sqrt { 2 } } { 2 }\) \(\frac { 7 } { 2 }\) \(\frac { 10 + 3 \sqrt { 2 } } { 2 }\) Turn over for the next question 4 Research has shown that the mean number of volcanic eruptions on Earth each day is 20 Sandra records 162 volcanic eruptions during a period of one week. Sandra claims that there has been an increase in the mean number of volcanic eruptions per week. Test Sandra's claim at the \(5 \%\) level of significance.
OCR MEI Further Statistics Major Specimen Q8
12 marks Standard +0.3
8 Natural background radiation consists of various particles, including neutrons. A detector is used to count the number of neutrons per second at a particular location.
  1. State the conditions required for a Poisson distribution to be a suitable model for the number of neutrons detected per second. The number of neutrons detected per second due to background radiation only is modelled by a Poisson distribution with mean 1.1.
  2. Find the probability that the detector detects
    (A) no neutrons in a randomly chosen second,
    (B) at least 60 neutrons in a randomly chosen period of 1 minute. A neutron source is switched on. It emits neutrons which should all be contained in a protective casing. The detector is used to check whether any neutrons have not been contained; these are known as stray neutrons. If the detector detects more than 8 neutrons in a period of 1 second, an alarm will be triggered in case this high reading is due to stray neutrons.
  3. Suppose that there are no stray neutrons and so the neutrons detected are all due to the background radiation. Find the expected number of times the alarm is triggered in 1000 randomly chosen periods of 1 second.
  4. Suppose instead that stray neutrons are being produced at a rate of 3.4 per second in addition to the natural background radiation. Find the probability that at least one alarm will be triggered in 10 randomly chosen periods of 1 second. You should assume that all stray neutrons produced are detected.
OCR FS1 AS 2021 June Q2
6 marks Moderate -0.8
2 In the manufacture of fibre optical cable (FOC), flaws occur randomly. Whether any point on a cable is flawed is independent of whether any other point is flawed. The number of flaws in 100 m of FOC of standard diameter is denoted by \(X\).
  1. State a further assumption needed for \(X\) to be well modelled by a Poisson distribution. Assume now that \(X\) can be well modelled by the distribution \(\operatorname { Po } ( 0.7 )\).
  2. Find the probability that in 300 m of FOC of standard diameter there are exactly 3 flaws. The number of flaws in 100 m of FOC of a larger diameter has the distribution \(\mathrm { Po } ( 1.6 )\).
  3. Find the probability that in 200 m of FOC of standard diameter and 100 m of FOC of the larger diameter the total number of flaws is at least 4 . Judith believes that mathematical ability and chess-playing ability are related. She asks 20 randomly chosen chess players, with known British Chess Federation (BCF) ratings \(X\), to take a mathematics aptitude test, with scores \(Y\). The results are summarised as follows. $$n = 20 , \Sigma x = 3600 , \Sigma x ^ { 2 } = 660500 , \Sigma y = 1440 , \Sigma y ^ { 2 } = 105280 , \Sigma x y = 260990$$
    1. Calculate the value of Pearson's product-moment correlation coefficient \(r\).
    2. State an assumption needed to be able to carry out a significance test on the value of \(r\).
    3. Assume now that the assumption in part (b) is valid. Test at the \(5 \%\) significance level whether there is evidence that chess players with higher BCF ratings are better at mathematics.
    4. There are two different grading systems for chess players, the BCF system and the international ELO system. The two sets of ratings are related by $$\text { ELO rating } = 8 \times \text { BCF rating } + 650$$ Magnus says that the experiment should have used ELO ratings instead of BCF ratings. Comment on Magnus's suggestion.
    5. Calculate the value of Pearson's product-moment correlation coefficient \(r\).
    6. State an assumption needed to be able to carry out a significance test on the value of \(r\).
    7. Assume now that the assumption in part (b) is valid. Test at the \(5 \%\) significance level whether there is evidence that chess players with higher BCF ratings are better at mathematics.
    8. There are two different grading systems for chess players, the BCF system and the international ELO system. The two sets of ratings are related by $$\mathrm { ELO } \text { rating } = 8 \times \mathrm { BCF } \text { rating } + 650 .$$ Magnus says that the experiment should have used ELO ratings instead of BCF ratings. Comment on Magnus's suggestion. An environmentalist measures the mean concentration, \(c\) milligrams per litre, of a particular chemical in a group of rivers, and the mean mass, \(m\) pounds, of fish of a certain species found in those rivers. The results are given in the table.
      QuestionAnswerMarksAOGuidance
      1(a)\(\begin{aligned}0.25 + 0.36 + x + x ^ { 2 } = 1
      x ^ { 2 } + x - 0.39 = 0
      x = 0.3 \text { (or } - 1.3 \text { ) }
      x \text { cannot be negative }
      \mathrm { E } ( W ) = 2.23
      \mathrm { E } \left( W ^ { 2 } \right) = \Sigma w ^ { 2 } \mathrm { p } ( w ) \quad [ = 5.83 ]
      \text { Subtract } [ \mathrm { E } ( W ) ] ^ { 2 } \text { to get } \mathbf { 0 . 8 5 7 1 } \end{aligned}\)\(\begin{gathered} \text { M1 }
      \text { A1 }
      \text { A1 }
      \text { B1ft }
      \text { B1 }
      \text { M1 }
      \text { A1 }
      { [ 7 ] } \end{gathered}\)
      3.1a
      1.1b
      1.1b
      2.3
      1.1b
      1.1
      2.1
      Equation using \(\Sigma p = 1\)
      Correct simplified quadratic Correctly obtain \(x = 0.3\)
      Explicitly reject other solution
      2.23 or exact equivalent only Use \(\Sigma w ^ { 2 } \mathrm { p } ( w )\)
      Correctly obtain given answer, www
      Can be implied
      Method needed ft on their quadratic Allow for \(\mathrm { E } ( W ) ^ { 2 } = 4.9729\)
      Need 2.23 or 4.9729 and 5.83 or full numerical \(\Sigma w ^ { 2 } \mathrm { p } ( w )\)
      1(b)\(9 \times 0.8571 = 7.7139\)
      B1
      [1]
      1.1bAllow 7.71 or 7.714
      2(a)Flaws must occur at constant average rate (uniform rate)
      B1
      [1]
      1.2
      Context (e.g. "flaws") needed
      Extra answers, e.g. "singly": B0
      Not "constant rate" or "average constant rate".
      2(b)\(\operatorname { Po(2.1)~or~ } e ^ { - \lambda } \frac { \lambda ^ { 3 } } { 3 ! }\)
      M1
      A1
      [2]
      1.1
      1.1b
      Po(2.1) stated or implied, or formula with \(\lambda = 2.1\) stated Awrt 0.189
      2(c)
      Po(3)
      \(1 - \mathrm { P } ( \leq 3 )\)
      M1
      M1
      A1
      [3]
      1.1
      1.1
      1.1b
      \(\operatorname { Po } ( 2 \times 0.7 + 1.6 )\) stated or implied
      Allow \(1 - \mathrm { P } ( \leq 4 ) = 0.1847\), or from wrong \(\lambda\)
      Awrt 0.353
      Or all combinations \(\leq 3\)
      \(1 -\) above, not just \(= 3\)
      QuestionAnswerMarksAOGuidance
      3(a)0.4(00)
      B2
      [2]
      1.1
      1.1b
      SC: if B0, give SC B1 for two of \(S _ { x x } = 12500 , S _ { y y } = 1600 , S _ { x y } = 1790\) and \(S _ { x y } / \sqrt { } \left( S _ { x x } S _ { y y } \right)\)Also allow SC B1 for equivalent methods using Covariance \SDs
      3(b)Data needs to have a bivariate normal distribution
      B1
      [1]
      1.2Needs "bivariate normal" or clear equivalent. Not just "both normally distributed"Allow "scatter diagram forms ellipse"
      3(c)
      \(\mathrm { H } _ { 0 }\) : higher maths scores are not associated with higher BCF grading; \(\mathrm { H } _ { 1 }\) : positively associated
      CV 0.3783
      \(0.400 > 0.3783\) so reject \(\mathrm { H } _ { 0 }\)
      Significant evidence that higher maths scores are associated with higher BCF grading
      B1
      B1
      M1ft
      A1ft
      [4]
      2.5
      1.1b
      2.2b
      3.5a
      Needs context and clearly onetailed \(O R \rho\) used and defined Not "evidence that ..."
      Allow 0.378
      Reject/do not reject \(\mathrm { H } _ { 0 }\)
      Contextualised, not too definite Needn't say "positive" if \(\mathrm { H } _ { 1 } \mathrm { OK }\)
      SC 2-tail: B0; 0.4438, or 0.3783 B1; then M1A0
      \(\mathrm { H } _ { 0 } : \rho = 0 , \mathrm { H } _ { 1 } : \rho > 0\) where \(\rho\) is population pmcc (not \(r\) )
      FT on their \(r\), but not CV
      Not "scores are associated
      ...". FT on their \(r\) only
      3(d)It makes no difference as this is a linear transformation
      B1
      [1]
      2.2aNeed both "unchanged" oe and reason, need "linear" or exact equivalent"oe" includes "their 0.4"
      4(a)Neither
      B1
      [1]
      2.5OENot "neither is independent of the other"
      4(b)\(c = 2.848 - 0.1567 m\)
      B1
      B1
      B1
      [3]
      1.1
      1.1
      1.1
      Correct \(a\), awrt 2.85
      Correct \(b\), awrt 0.157
      Letters correct from correct method
      (If both wrongly rounded, e.g. \(c = 2.84 - 0.156 m\), give B2)
      \(\mathrm { SC } : m\) on \(c\) :
      \(m = 15.65 - 4.832 c\) : B2
      \(y = 15.65 - 4.832 x\) : B1
      \(c = 15.65 - 4.832 m : \mathrm { B } 1\)
      If B0B0, give B1 for correct letters from valid working
      QuestionAnswerMarksAOGuidance
      4(c)\(a\) unchanged, \(b\) multiplied by 2.2 (allow " \(a\) unchanged, \(b\) increases", etc)B1 [1]2.2aoe, e.g. \(c = 2.848 - 0.345 m\); \(m = 7.114 - 2.196 c\)SC: \(m\) on \(c\) in (b): Both divided by 2.2 B1
      4(d)
      Draw approximate line of best fit
      Draw at least one vertical from line to point
      Say that "Best fit" line minimises the sum of squares of these distances
      M1
      M1
      A1
      [3]
      1.1
      2.4
      2.4
      Needs M2 and "minimises" and "sums of squares" oe
      SC: Horizontal(s):
      full marks (indept of (b))
OCR Further Statistics 2021 June Q3
11 marks Standard +0.3
3 The numbers of CD players sold in a shop on three consecutive weekends were 7,6 and 2 . It may be assumed that sales of CD players occur randomly and that nobody buys more than one CD player at a time. The number of CD players sold on a randomly chosen weekend is denoted by \(X\).
  1. How appropriate is the Poisson distribution as a model for \(X\) ? Now assume that a Poisson distribution with mean 5 is an appropriate model for \(X\).
  2. Find
    1. \(\mathrm { P } ( X = 6 )\),
    2. \(\mathrm { P } ( X \geqslant 8 )\). The number of integrated sound systems sold in a weekend at the same shop can be assumed to have the distribution \(\operatorname { Po } ( 7.2 )\).
  3. Find the probability that on a randomly chosen weekend the total number of CD players and integrated sound systems sold is between 10 and 15 inclusive.
  4. State an assumption needed for your answer to part (c) to be valid.
  5. Give a reason why the assumption in part (d) may not be valid in practice.
Edexcel S2 2024 October Q1
Standard +0.3
During an annual beach-clean, the people doing the clean are asked to conduct a litter survey.
At a particular beach-clean, litter was found at a rate of 4 items per square metre.
  1. Find the probability that, in a randomly selected area of 2 square metres on this beach, exactly 5 items of litter were found. Of the litter found on the beach, 30\% of the items were face masks.
  2. Find the probability that, in a randomly selected area of 5 square metres on this beach, more than 4 face masks were found.
  3. Using a suitable approximation, find the probability that, in a randomly selected area of 20 square metres on this beach, less than 60 items of litter were found that were not face masks.
Edexcel S2 2024 October Q3
Standard +0.3
During Monday afternoons, customers are known to enter a certain shop at a mean rate of 7 customers every 10 minutes.
  1. Suggest a suitable distribution to model the number of customers that enter this shop in a 10-minute interval on Monday afternoons.
  2. State two assumptions necessary for this distribution to be a suitable model of this situation. A new shop manager wants to find out if the rate of customers has changed since they took over.
  3. Write down suitable null and alternative hypotheses that the shop manager should use. The shop manager decides to monitor the number of customers entering the shop in a random 10-minute interval next Monday afternoon.
  4. Using a \(3 \%\) level of significance, find the critical region to test whether the rate of customers has changed.
  5. Find the actual significance level of this test based on your critical region from part (d) During the random 10-minute interval that Monday afternoon, 12 customers entered the shop.
  6. Comment on this finding, using the critical region in part (d)
Pre-U Pre-U 9795/2 2010 June Q7
8 marks Standard +0.3
7 The number of goals scored by a hockey team in an interval of time of length \(t\) minutes follows a Poisson distribution with mean \(\frac { 1 } { 24 } t\). The random variable \(T\) is defined as the length of time, in minutes, between successive goals.
  1. (a) Show that \(\mathrm { P } ( T < t ) = 1 - \mathrm { e } ^ { - \frac { 1 } { 24 } t }\) for \(t \geqslant 0\).
    (b) Hence find the probability density function of \(T\).
  2. Find the exact value of the interquartile range of \(T\).
Pre-U Pre-U 9795/2 2010 June Q9
10 marks Challenging +1.2
9
  1. Two independent discrete random variables \(X\) and \(Y\) follow Poisson distributions with means \(\lambda\) and \(\mu\) respectively. Prove that the discrete random variable \(Z = X + Y\) follows a Poisson distribution with mean \(\lambda + \mu\). A garage has a white limousine and a green limousine for hire. Demands to hire the white limousine occur at a constant mean rate of 3 per week and demands to hire the green limousine occur at a constant mean rate of 2 per week. Demands for hire are received independently and randomly.
  2. Calculate the probability that in a period of two weeks
    1. no demands for hire are received, giving your answer to 3 significant figures,
    2. seven demands for hire are received.
    3. Find the least value of \(n\) such that the probability of at least \(n\) demands for hire in a period of three weeks is less than 0.005 .
Pre-U Pre-U 9795/2 2011 June Q2
8 marks Moderate -0.3
2 The discrete random variable \(X\) has a Poisson distribution with mean 12.25.
  1. Calculate \(\mathrm { P } ( X \leqslant 5 )\).
  2. Calculate an approximate value for \(\mathrm { P } ( X \leqslant 5 )\) using a normal approximation to the Poisson distribution.
  3. Comment, giving a reason, on the accuracy of using a normal approximation to the Poisson distribution in this case.
Pre-U Pre-U 9795/2 2013 June Q1
3 marks Moderate -0.3
1 A company hires out narrowboats on a canal. It may be assumed that demands to hire a narrowboat occur independently and randomly at a constant mean rate of 25 per week. Using a suitable normal approximation, find
  1. the probability that 15 or fewer narrowboats are hired out during a certain week,
  2. the number of narrowboats that the company needs to have available for a week in order that the probability of running out of boats is 0.05 or less.
Pre-U Pre-U 9795/2 2013 June Q3
9 marks Standard +0.3
3
  1. Given that \(X \sim \operatorname { Po } ( 5 )\), find \(\mathrm { P } ( X > 6 \mid X > 3 )\).
  2. Given that \(Y \sim \operatorname { Po } ( \lambda )\) and \(\mathrm { P } ( Y \leqslant 1 ) = \frac { 1 } { 2 }\), show that \(\lambda\) satisfies the equation \(\lambda = \ln \{ 2 ( 1 + \lambda ) \}\).
  3. Starting with a suitable approximation from the table of cumulative Poisson probabilities, use iteration to find \(\lambda\) correct to 3 decimal places.
Pre-U Pre-U 9795/2 2013 November Q3
Standard +0.3
3 The number of signal failures in a certain region of the railway network averages 10 every 3 weeks. Assume that signal failures occur independently, randomly and at constant mean rate.
  1. Find the probability that
    1. there are between 7 and 12 (inclusive) signal failures in a three-week period,
    2. there are more than 4 signal failures in a one-week period.
    3. It has been calculated, using a suitable distributional approximation, that the probability of more than 62 signal failures in a period of \(n\) weeks is 0.0385 . Find the value of \(n\).
Pre-U Pre-U 9795/2 2015 June Q4
11 marks Challenging +1.2
4
  1. (a) Derive the moment generating function for a Poisson distribution with mean \(\lambda\).
    (b) The independent random variables \(X\) and \(Y\) are such that \(X \sim \operatorname { Po } ( \mu )\) and \(Y \sim \operatorname { Po } ( v )\). Use moment generating functions to show that \(( X + Y ) \sim \operatorname { Po } ( \mu + v )\).
  2. The number of goals scored per match by Camford Academicals FC may be modelled by a Poisson distribution with mean 2. The number of goals scored against Camford during a match may be modelled by an independent Poisson distribution with mean \(k\). The probability that no goals are scored, by either side, in a match involving Camford is 0.045 . Find
    (a) the value of \(k\),
    (b) the probability that exactly 3 goals are scored against Camford in a match,
    (c) the probability that the total number of goals scored, in a match involving Camford, is between 2 and 5 inclusive.
Pre-U Pre-U 9795/2 2016 June Q4
7 marks Standard +0.3
4 In a Football League match, the number of goals scored by the home team can be modelled by the distribution \(\mathrm { Po } ( 2.4 )\). The number of goals scored by the away team can be modelled by the distribution Po(1.8).
  1. State a necessary assumption for the total number of goals scored in one match to be modelled by the distribution \(\operatorname { Po } ( 4.2 )\).
  2. Assume now that this assumption holds.
    1. Write down an expression for the probability that the total number of goals scored in \(n\) randomly chosen games is less than 4 .
    2. Find the probability that the result of a randomly chosen game is either 0-0 or 1-1.
Pre-U Pre-U 9795/2 2016 Specimen Q4
10 marks Challenging +1.2
4
  1. The random variable \(X\) has the distribution \(\operatorname { Po } ( \lambda )\). Prove that the probability generating function, \(\mathrm { G } _ { X } ( t )\), is given by $$\mathrm { G } _ { X } ( t ) = \mathrm { e } ^ { \lambda ( t - 1 ) }$$
  2. The independent random variables \(X\) and \(Y\) have distributions \(\operatorname { Po } ( \lambda )\) and \(\operatorname { Po } ( \mu )\) respectively. Use probability generating functions to show that the distribution of \(X + Y\) is \(\operatorname { Po } ( \lambda + \mu )\).
  3. Given that \(X \sim \operatorname { Po } ( 1.5 )\) and \(Y \sim \operatorname { Po } ( 2.5 )\), find \(\mathrm { P } ( X \leqslant 2 \mid X + Y = 4 )\).
Pre-U Pre-U 9795/2 2016 Specimen Q2
Standard +0.3
2
  1. The probability that a shopper obtains a parking space on the river embankment on any given Saturday morning is 0.2 . Using a suitable normal approximation, find the probability that, over a period of 100 Saturday mornings, the shopper finds a parking space at least 15 times. Justify the use of the normal approximation in this case.
  2. The number of parking tickets that a traffic warden issues on the river embankment during the course of a week has a Poisson distribution with mean 36 . The probability that the traffic warden issues more than \(N\) parking tickets is less than 0.05 . Using a suitable normal approximation, find the least possible value of \(N\).
Pre-U Pre-U 9795/2 2017 June Q5
8 marks Standard +0.3
5 The number of calls to a car breakdown service during any one hour of the day is modelled by the distribution \(\operatorname { Po } ( 20 )\).
  1. Find the probability that in a randomly chosen 12 -minute period there are at least 7 calls to the service.
  2. Find the period of time, correct to the nearest second, for which the probability that no calls are made to the service is 0.6 .
  3. Use a suitable approximation to find the probability that, in a randomly chosen 3-hour period, there are no more than 65 calls to the service.
Pre-U Pre-U 9795/2 2018 June Q2
Moderate -0.8
2 Secret radio messages received under difficult conditions are subject to errors caused by random instantaneous breaks in transmission. The number of errors caused by breaks in transmission in a 10-minute period is denoted by \(B\).
  1. State two conditions, other than randomness, needed for a Poisson distribution to be a suitable model for \(B\). Assume now that \(B \sim \mathrm { Po } ( 5 )\).
  2. Calculate the probability that in a 15-minute period there are between 6 and 10 errors, inclusive, caused by random breaks in transmission. Secret radio messages are also subject to errors caused by mistakes made by the sender. The number of errors caused by mistakes made by the sender in a 10 -minute period, \(M\), has the independent distribution \(\operatorname { Po } ( 8 )\).
  3. Calculate the period of time, in seconds, for which the probability that a message contains no errors of either sort is 0.6 .
Pre-U Pre-U 9795/2 Specimen Q10
4 marks Standard +0.3
10
  1. \(X , Y\) and \(Z\) are independent random variables having Poisson distributions with means \(\lambda , \mu\) and \(\lambda + \mu\) respectively. Find \(\mathrm { P } ( X = 0\) and \(Y = 2 ) , \mathrm { P } ( X = 1\) and \(Y = 1 )\) and \(\mathrm { P } ( X = 2\) and \(Y = 0 )\). Hence verify that \(\mathrm { P } ( X + Y = 2 ) = \mathrm { P } ( Z = 2 )\).
  2. In an office the male absence rate, i.e. the number of working days lost each month due to the absence of male employees, has a Poisson distribution with mean 4.5. In the same office the female absence rate has an independent Poisson distribution with mean 4.1. Calculate the probability that
    1. during a particular month both the male absence rate and the female absence rate are equal to 3,
    2. during a particular month the total of the male and female absence rates is equal to 6,
    3. during a particular month the male and female absence rates were each equal to 3 , given that the total of the male and female absence rates was equal to 6 .
WJEC Unit 2 2022 June Q3
Moderate -0.3
In a study, samples of soil were collected during the summer. Soil samples of dimensions \(25 \mathrm {~cm} \times 25 \mathrm {~cm} \times 40 \mathrm {~cm}\) were collected for analysis. The study found that there were, on average, 11 earthworms per sample. a) Explain briefly the conditions under which a Poisson distribution could be used to model the number of earthworms per sample.
b) In July, pupils at a primary school are asked to dig a smaller hole, \(25 \mathrm {~cm} \times 25 \mathrm {~cm} \times 10 \mathrm {~cm}\), and to count the number of earthworms they find. Calculate the probability that the pupils find exactly 5 earthworms.
c) In the autumn, the average number of earthworms per sample is greater than in the summer. The probability that, in the autumn, there are fewer than 13 earthworms in a soil sample of dimensions \(25 \mathrm {~cm} \times 25 \mathrm {~cm} \times 40 \mathrm {~cm}\) is close to \(36 \%\). Find the mean number of earthworms, to the nearest whole number, per \(25 \mathrm {~cm} \times 25 \mathrm {~cm} \times 40 \mathrm {~cm}\) soil sample in the autumn.
04
Jessica is studying the relationship between hip girth, \(h \mathrm {~cm}\), and thigh girth, \(t \mathrm {~cm}\), for American adults who are physically active. She takes a random sample of 11 people from a very large dataset which she has downloaded into a spreadsheet software package. The results are shown below.
\(h ( \mathrm {~cm} )\)\(98 \cdot 6\)\(112 \cdot 1\)\(97 \cdot 9\)\(110 \cdot 2\)\(89 \cdot 2\)\(111 \cdot 7\)\(87 \cdot 0\)\(94 \cdot 7\)\(100 \cdot 4\)\(104 \cdot 0\)\(88 \cdot 4\)
\(t ( \mathrm {~cm} )\)\(48 \cdot 3\)\(87 \cdot 2\)\(55 \cdot 2\)\(68 \cdot 0\)\(48 \cdot 5\)\(63 \cdot 2\)\(49 \cdot 5\)\(55 \cdot 7\)\(59 \cdot 1\)\(64 \cdot 0\)\(52 \cdot 4\)
a) Jessica notes that, for the thigh girth data, the lower quartile is 49.5 and the upper quartile is \(64 \cdot 0\).
i) Show that 87.2 should be classified as an outlier for \(t\).
ii) Give a reason why Jessica might exclude the outlier.
iii) Give a reason why Jessica might include the outlier. Jessica decides to exclude the outlier and produces the following scatter diagram. \section*{Thigh girth versus Hip girth} \includegraphics[max width=\textwidth, alt={}, center]{77c62e6d-58e4-42d3-9982-5a8325e8e826-04_647_1250_1439_404}
b) Interpret, in context, the correlation in the data shown in the diagram. The equation of the regression line of \(t\) on \(h\) for this sample is $$t = 0.69 h - 11.26$$ c) Interpret the gradient of the regression line in this context.
d) Use your knowledge of large data sets and spreadsheet software packages to suggest a way in which Jessica could improve her investigation. A company, Run4Lyfe, sponsors an athletic event. The organisers of the event claim that \(70 \%\) of the participants know the name of the sponsoring company. Run4Lyfe is concerned that the proportion, \(p\), of participants knowing the name of the sponsoring company is less than \(70 \%\). They decide to survey 60 randomly selected participants to carry out a significance test.
a) State suitable hypotheses for carrying out the test.
b) i) Explain what is meant by the critical region for this test.
ii) Determine the critical region if the test is to be carried out at a significance level as close as possible to, but not exceeding, \(5 \%\).
iii) Given that 40 participants out of the 60 in the sample know the name of the company, complete the significance test.
c) State, with a reason, how you would advise Run4Lyfe with regards to sponsoring the event next year.
06
The fertility rate for a country is the average number of children that are born to a woman over her lifetime. The graphs and table below show some data on the fertility rates for 197 countries in the years 1914 and 2014. \begin{figure}[h]
\captionsetup{labelformat=empty} \caption{Fertility rates in 1914} \includegraphics[alt={},max width=\textwidth]{77c62e6d-58e4-42d3-9982-5a8325e8e826-06_671_1483_593_283}
\end{figure} \begin{figure}[h]
\captionsetup{labelformat=empty} \caption{Fertility rates in 2014} \includegraphics[alt={},max width=\textwidth]{77c62e6d-58e4-42d3-9982-5a8325e8e826-06_616_1219_1434_287}
\end{figure} \begin{figure}[h]
\captionsetup{labelformat=empty} \caption{Decreases in fertility rates from 1914 to 2014} \includegraphics[alt={},max width=\textwidth]{77c62e6d-58e4-42d3-9982-5a8325e8e826-06_476_613_2270_388}
\end{figure}
Minimum value- 0.71
Lower quartile2.08
Median3.19
Upper quartile3.94
Maximum value6.49
a) Comment on the shapes of the distributions of fertility rates for 1914 and 2014.
b) Interpret the minimum value, \(- 0 \cdot 71\), in the boxplot. You are also given the following information:
Country
Fertility rate
for 1914
Fertility rate
for 2014
France
Between 2
and 3
1.98
Ethiopia
Between 6
and 7
4.4
c) i) Find the best possible estimate for the decrease in the fertility rate from 1914 to 2014 for France.
ii) Find the best possible estimate for the decrease in the fertility rate from 1914 to 2014 for Ethiopia.
iii) Give one possible reason why the answers to i) and ii) are so different.
iv) Explain why these estimates may not be very accurate. \section*{Section B: Mechanics}
\(\mathbf { 0 }\)7
The diagram below shows a vehicle of mass 1300 kg towing a trailer of mass 500 kg by means of a light horizontal tow bar. The vehicle is moving forward along a straight horizontal road such that a constant resistance of magnitude 650 N acts on the vehicle and a constant resistance of magnitude 320 N acts on the trailer. The vehicle's engine produces a constant driving force of \(F \mathrm {~N}\).
\includegraphics[max width=\textwidth, alt={}]{77c62e6d-58e4-42d3-9982-5a8325e8e826-08_158_851_781_609}
Given that the acceleration of the vehicle and trailer is \(0.85 \mathrm {~ms} ^ { - 2 }\), show that \(F = 2500\) and determine the tension in the tow bar.
CAIE FP2 2012 June Q8
9 marks Standard +0.3
The number of flaws in a randomly chosen 100 metre length of ribbon is modelled by a Poisson distribution with mean 1.6. The random variable \(X\) metres is the distance between two successive flaws. Show that the distribution function of \(X\) is given by $$\text{F}(x) = \begin{cases} 1 - e^{-0.016x} & x \geq 0, \\ 0 & x < 0, \end{cases}$$ and deduce that \(X\) has a negative exponential distribution, stating its mean. [4] Find
  1. the median distance between two successive flaws, [3]
  2. the probability that there is a distance of at least 50 metres between two successive flaws. [2]
CAIE FP2 2012 June Q8
9 marks Standard +0.3
The number of flaws in a randomly chosen 100 metre length of ribbon is modelled by a Poisson distribution with mean 1.6. The random variable \(X\) metres is the distance between two successive flaws. Show that the distribution function of \(X\) is given by $$\text{F}(x) = \begin{cases} 1 - e^{-0.016x} & x \geqslant 0, \\ 0 & x < 0, \end{cases}$$ and deduce that \(X\) has a negative exponential distribution, stating its mean. [4] Find
  1. the median distance between two successive flaws, [3]
  2. the probability that there is a distance of at least 50 metres between two successive flaws. [2]
CAIE FP2 2018 November Q10
12 marks Standard +0.8
The number of accidents, \(x\), that occur each day on a motorway are recorded over a period of 40 days. The results are shown in the following table.
Number of accidents0123456\(\geqslant 7\)
Observed frequency358105720
\begin{enumerate}[label=(\roman*)] \item Show that the mean number of accidents each day is 2.95 and calculate the variance for this sample. Explain why these values suggest that a Poisson distribution might fit the data. [3] \item A Poisson distribution with mean 2.95, as found from the data, is used to calculate the expected frequencies, correct to 2 decimal places. The results are shown in the following table.
Number of accidents0123456\(\geqslant 7\)
Observed frequency358105720
Expected frequency2.096.189.118.966.613.901.921.23
Show how the expected frequency of 6.61 for \(x = 4\) is obtained. [2] \item Test at the 5% significance level the goodness of fit of this Poisson distribution to the data. [7] \end{enumerate]
CAIE S2 2021 June Q1
4 marks Standard +0.3
Accidents at two factories occur randomly and independently. On average, the numbers of accidents per month are 3.1 at factory \(A\) and 1.7 at factory \(B\). Find the probability that the total number of accidents in the two factories during a 2-month period is more than 3. [4]
CAIE S2 2021 June Q5
7 marks Standard +0.3
On average, 1 in 75000 adults has a certain genetic disorder.
  1. Use a suitable approximating distribution to find the probability that, in a random sample of 10000 people, at least 1 has the genetic disorder. [3]
  2. In a random sample of \(n\) people, where \(n\) is large, the probability that no-one has the genetic disorder is more than 0.9. Find the largest possible value of \(n\). [4]