CAIE S2 (Statistics 2) 2024 March

Question 1
View details
1 The lengths, \(X \mathrm {~cm}\), of a sample of 100 insects of a certain type were summarised as follows. $$n = 100 \quad \sum x = 36.8 \quad \sum x ^ { 2 } = 17.34$$
  1. Calculate unbiased estimates for the population mean and variance of \(X\).
  2. State a necessary condition for the estimates found in part (a) to be reliable.
Question 2
View details
2 A random sample of 250 people living in Barapet was chosen. It was found that 78 of these people owned a BETEC phone.
  1. Calculate an approximate \(98 \%\) confidence interval for the proportion of people living in Barapet who own a BETEC phone.
  2. Manjit claims that more than \(40 \%\) of the people living in Barapet own a BETEC phone. Use your answer to part (a) to comment on this claim.
Question 3
View details
3 In a certain lottery, on average 1 in every 10000 tickets is a prize-winning ticket. An agent sells 6000 tickets.
  1. Use a suitable approximating distribution to find the probability that at least 3 of the tickets sold by the agent are prize-winning tickets.
  2. Justify the use of your approximating distribution in this context.
Question 4
View details
4 Each year a transport firm uses \(X\) litres of gasoline and \(Y\) litres of diesel fuel, where \(X\) and \(Y\) have the independent distributions \(X \sim \mathrm {~N} ( 10700,950 ) ^ { 2 }\) and \(Y \sim \mathrm {~N} \left( 13400,1210 ^ { 2 } \right)\).
  1. Find the probability that in a randomly chosen year the firm uses more gasoline than diesel fuel.
    The costs per litre of gasoline and diesel fuel are \\(0.80 and
    )0.85 respectively.
  2. Find the probability that the total cost of gasoline and diesel fuel in a randomly chosen year is between \(
    ) 20000\( and \)\\( 22000\).
Question 5
View details
5 A teacher models the numbers of girls and boys who arrive late for her class on any day by the independent random variables \(G \sim \operatorname { Po } ( 0.10 )\) and \(B \sim \operatorname { Po } ( 0.15 )\) respectively.
  1. Find the probability that during a randomly chosen 2-day period no girls arrive late.
  2. Find the probability that during a randomly chosen 5-day period the total number of students who arrive late is less than 3 .
  3. It is given that the values of \(\mathrm { P } ( G = r )\) and \(\mathrm { P } ( B = r )\) for \(r \geqslant 3\) are very small and can be ignored. Find the probability that on a randomly chosen day more girls arrive late than boys.
    Following a timetable change the teacher claims that on average more students arrive late than before the change. During a randomly chosen 5-day period a total of 4 students are late.
  4. Test the teacher's claim at the \(5 \%\) significance level.
Question 6
View details
6 The graph of the probability density function f of a random variable \(X\) is symmetrical about the line \(x = 2\). It is given that \(\mathrm { P } ( 2 < X < 5 ) = \frac { 117 } { 256 }\).
  1. Using only this information show that \(\mathrm { P } ( X > - 1 ) = \frac { 245 } { 256 }\).
    It is now given that, for \(x\) in a suitable domain, $$f ( x ) = k \left( 12 + 4 x - x ^ { 2 } \right) , \text { where } k \text { is a constant. }$$
  2. Find the value of \(k\).
  3. A different random variable \(X\) has probability density function \(\mathbf { g } ( x ) = \frac { 2 } { 9 } \left( 2 + x - x ^ { 2 } \right)\). The domain of \(X\) is all values of \(x\) for which \(\mathrm { g } ( x ) \geqslant 0\). Find \(\operatorname { Var } ( X )\).
    \includegraphics[max width=\textwidth, alt={}, center]{ff3433b0-baab-45e3-845e-56a794739bba-11_63_1547_447_347}
Question 7
View details
7 The heights, in centimetres, of adult females in Litania have mean \(\mu\) and standard deviation \(\sigma\). It is known that in 2004 the values of \(\mu\) and \(\sigma\) were 163.21 and 6.95 respectively. The government claims that the value of \(\mu\) this year is greater than it was in 2004. In order to test this claim a researcher plans to carry out a hypothesis test at the \(1 \%\) significance level. He records the heights of a random sample of 300 adult females in Litania this year and finds the value of the sample mean.
  1. State the probability of a Type I error.
    \includegraphics[max width=\textwidth, alt={}]{ff3433b0-baab-45e3-845e-56a794739bba-12_74_1577_557_322} ........................................................................................................................................ You should assume that the value of \(\sigma\) after 2004 remains at 6.95 .
  2. Given that the value of \(\mu\) this year is actually 164.91 , find the probability of a Type II error.
    If you use the following page to complete the answer to any question, the question number must be clearly shown.