5.05a Sample mean distribution: central limit theorem

222 questions

Sort by: Default | Easiest first | Hardest first
AQA S1 2007 January Q6
17 marks Moderate -0.3
6 When Monica walks to work from home, she uses either route A or route B.
  1. Her journey time, \(X\) minutes, by route A may be assumed to be normally distributed with a mean of 37 and a standard deviation of 8 . Determine:
    1. \(\mathrm { P } ( X < 45 )\);
    2. \(\mathrm { P } ( 30 < X < 45 )\).
  2. Her journey time, \(Y\) minutes, by route B may be assumed to be normally distributed with a mean of 40 and a standard deviation of \(\sigma\). Given that \(\mathrm { P } ( Y > 45 ) = 0.12\), calculate the value of \(\sigma\).
  3. If Monica leaves home at 8.15 am to walk to work hoping to arrive by 9.00 am , state, with a reason, which route she should take.
  4. When Monica travels to work from home by car, her journey time, \(W\) minutes, has a mean of 18 and a standard deviation of 12 . Estimate the probability that, for a random sample of 36 journeys to work from home by car, Monica's mean time is more than 20 minutes.
  5. Indicate where, if anywhere, in this question you needed to make use of the Central Limit Theorem.
AQA S1 2005 June Q6
12 marks Standard +0.3
6 On arrival at a business centre, all visitors are required to register at the reception desk. An analysis of the register, for a random sample of 100 days, results in the following information on the number, \(X\), of visitors per day.
Number of visitors per dayNumber of days
1-1013
11-2033
21-2517
26-3012
31-358
36-405
41-505
51-1007
Total100
  1. Calculate an estimate of:
    1. \(\mu\), the mean number of visitors per day;
    2. \(\sigma\), the standard deviation of the number of visitors per day.
  2. Give a reason, based upon the data provided, why \(X\) is unlikely to be normally distributed.
    1. Give a reason why \(\bar { X }\), the mean of a random sample of 100 observations on \(X\), may be assumed to be normally distributed.
    2. State, in terms of \(\mu\) and \(\sigma\), the mean and variance of \(\bar { X }\).
  3. Hence construct a \(99 \%\) confidence interval for \(\mu\).
  4. The receptionist claims that she registers on average more than 30 visitors per day, and frequently registers more than 50 visitors on any one day. Comment on each of these two claims.
AQA S1 2015 June Q7
12 marks Moderate -0.3
7
  1. The weight of a sack of mixed dog biscuits can be modelled by a normal distribution with a mean of 10.15 kg and a standard deviation of 0.3 kg . A pet shop purchases 12 such sacks that can be considered to be a random sample.
    Calculate the probability that the mean weight of the 12 sacks is less than 10 kg .
  2. The weight of dry cat food in a pouch can also be modelled by a normal distribution. The contents, \(x\) grams, of each of a random sample of 40 pouches were weighed. Subsequent analysis of these weights gave $$\bar { x } = 304.6 \quad \text { and } \quad s = 5.37$$
    1. Construct a \(99 \%\) confidence interval for the mean weight of dry cat food in a pouch. Give the limits to one decimal place.
    2. Comment, with justification, on each of the following two claims. Claim 1: The mean weight of dry cat food in a pouch is more than 300 grams.
      Claim 2: All pouches contain more than 300 grams of dry cat food.
      [0pt] [4 marks]
      \includegraphics[max width=\textwidth, alt={}]{6fbb8891-e6de-42fe-a195-ea643552fdcf-24_2288_1705_221_155}
AQA Further Paper 3 Statistics 2020 June Q2
1 marks Easy -1.2
2 Jamie is conducting a hypothesis test on a random variable which has a normal distribution with standard deviation 1 The hypotheses are $$\begin{aligned} & \mathrm { H } _ { 0 } : \mu = 5 \\ & \mathrm { H } _ { 1 } : \mu > 5 \end{aligned}$$ He takes a random sample of size 4
The mean of his sample is 6
He uses a 5\% level of significance.
Before Jamie conducted the test, what was the probability that he would make a Type I error? Circle your answer.
[0pt] [1 mark] \(0.0228 \quad 0.0456 \quad 0.0500 \quad 0.1587\)
AQA Further Paper 3 Statistics 2020 June Q5
7 marks Challenging +1.2
5 Emily claims that the average number of runners per minute passing a shop during a long distance run is 8 Emily conducts a hypothesis test to investigate her claim.
5
  1. State the hypotheses for Emily's test. 5
  2. Emily counts the number of runners, \(X\), passing the shop in a randomly chosen minute. The critical region for Emily's test is \(X \leq 2\) or \(X \geq 14\) During a randomly chosen minute, Emily counts 3 runners passing the shop.
    Determine the outcome of Emily's hypothesis test.
    5
  3. The actual average number of runners per minute passing the shop is 7 Find the power of Emily's hypothesis test, giving your answer to three significant figures.
WJEC Further Unit 5 2022 June Q5
13 marks Standard +0.3
5. A laboratory carrying out screening for a certain blood disorder claims that the average time taken for test results to be returned is 38 hours. A reporter for a national newspaper suspects that the results take longer, on average, to be returned than claimed by the laboratory. The reporter finds the time, \(x\) hours, for 50 randomly selected results, in order to conduct a hypothesis test. The following summary statistics were obtained. $$\sum x = 2163 \quad \sum x ^ { 2 } = 98508$$
  1. Calculate the \(p\)-value for the reporter's hypothesis test, and complete the test using a \(5 \%\) level of significance. Hence write a headline for the reporter to use.
  2. Explain the relevance or otherwise of the Central Limit Theorem to your answer in part (a).
  3. Briefly explain why a random sample is preferable to taking a batch of 50 consecutive results.
  4. On another occasion, the reporter took a different random sample of 10 results.
    1. State, with a reason, what type of hypothesis test the reporter should use on this occasion.
    2. State one assumption required to carry out this test.
Pre-U Pre-U 9795/2 2010 June Q10
11 marks Challenging +1.2
10 A box contains a large number, \(n\), of identical dice, which are thought to be biased. The probability that one of these dice will show a six in a single roll is \(p\). The \(n\) dice are rolled many times and the number of sixes obtained in each trial is recorded. In \(4.01 \%\) of these trials 56 or more dice showed a six. In \(10.56 \%\) of these trials 37 or fewer dice showed a six. Using a suitable normal approximation, find the values of \(n\) and \(p\).
Pre-U Pre-U 9795/2 2012 June Q2
9 marks Standard +0.3
2 The independent random variables \(X\) and \(Y\) have normal distributions where \(X \sim \mathrm {~N} \left( \mu , \sigma ^ { 2 } \right)\) and \(Y \sim \mathrm {~N} \left( 3 \mu , 4 \sigma ^ { 2 } \right)\). Two random samples each of size \(n\) are taken, one from each of these normal populations.
  1. Show that \(a \bar { X } + b \bar { Y }\) is an unbiased estimator of \(\mu\) provided that \(a + 3 b = 1\), where \(a\) and \(b\) are constants and \(\bar { X }\) and \(\bar { Y }\) are the respective sample means. In the remainder of the question assume that \(a \bar { X } + b \bar { Y }\) is an unbiased estimator of \(\mu\).
  2. Show that \(\operatorname { Var } ( a \bar { X } + b \bar { Y } )\) can be written as \(\frac { \sigma ^ { 2 } } { n } \left( 1 - 6 b + 13 b ^ { 2 } \right)\).
  3. The value of the constant \(b\) can be varied. Find the value of \(b\) that gives the minimum of \(\operatorname { Var } ( a \bar { X } + b \bar { Y } )\), and hence find the minimum of \(\operatorname { Var } ( a \bar { X } + b \bar { Y } )\) in terms of \(\sigma\) and \(n\).
Pre-U Pre-U 9795/2 2013 June Q1
3 marks Moderate -0.3
1 A company hires out narrowboats on a canal. It may be assumed that demands to hire a narrowboat occur independently and randomly at a constant mean rate of 25 per week. Using a suitable normal approximation, find
  1. the probability that 15 or fewer narrowboats are hired out during a certain week,
  2. the number of narrowboats that the company needs to have available for a week in order that the probability of running out of boats is 0.05 or less.
Pre-U Pre-U 9795/2 2013 June Q2
9 marks Moderate -0.3
2
  1. The heights of boys in Year 9 are normally distributed with mean 156 cm and standard deviation 8 cm . The heights of girls in Year 10 are, independently, normally distributed with mean 160 cm and standard deviation 7 cm . Find the probability that the mean height of a random sample of 9 boys in Year 9 exceeds the mean height of a random sample of 16 girls in Year 10.
  2. State why the distributions of the sample means are normally distributed.
Pre-U Pre-U 9795/2 2013 November Q5
Standard +0.3
5 The random variable \(X\) has a binomial distribution with parameters \(n\) and \(p\), where \(p > 0.5\). A random sample of \(4 n\) observations of \(X\) is taken and \(\bar { X }\) denotes the sample mean. It is given that \(\mathrm { E } ( \bar { X } ) = 180\) and \(\operatorname { Var } ( \bar { X } ) = 0.0225\).
  1. Find
    1. the values of \(p\) and \(n\),
    2. \(\mathrm { P } ( \bar { X } < 179.8 )\),
    3. the value of \(a\) for which \(\mathrm { P } ( 180 - a < \bar { X } < 180 + a ) = 0.99\), giving your answer correct to 2 decimal places.
    4. State how you have used the Central Limit Theorem in part (i).
Pre-U Pre-U 9795/2 2015 June Q5
11 marks Challenging +1.8
5 Each year a college has a large fixed number, \(n\), of places to fill. The probability, \(p\), that a randomly chosen student comes from abroad is constant. Using a suitable normal approximation and applying a continuity correction, it is calculated that the probability of more than 60 students coming from abroad is 0.0187 and the probability of fewer than 40 students coming from abroad is 0.0783 . Find the values of \(n\) and \(p\).
Pre-U Pre-U 9795/2 2016 June Q5
10 marks Challenging +1.2
5 The random variable \(R\) has the distribution \(\mathrm { B } ( n , p )\).
  1. State two conditions that \(n\) and \(p\) must satisfy if the distribution of \(R\) can be well approximated by a normal distribution. Assume now that these conditions hold. Using the normal approximation, it is given that \(\mathrm { P } ( R < 25 ) = 0.8282\) and \(\mathrm { P } ( R \geqslant 28 ) = 0.0393\), correct to 4 decimal places.
  2. Find the mean and standard deviation of the approximating normal distribution.
  3. Hence find the value of \(p\) and the value of \(n\).
Pre-U Pre-U 9795/2 2018 June Q6
Challenging +1.3
6 In a certain city there are \(N\) taxis. Each taxi displays a different licensing number which is an integer in the range 1 to \(N\). A visitor to the city attempts to estimate the value of \(N\), assuming that the licensing number of each taxi observed is equally likely to be any integer from 1 to \(N\) inclusive.
  1. The visitor observes one randomly chosen licensing number, \(X\). Using standard results for \(\sum _ { r = 1 } ^ { n } r\) and \(\sum _ { r = 1 } ^ { n } r ^ { 2 }\), show that \(\mathrm { E } ( X ) = \frac { 1 } { 2 } ( N + 1 )\) and \(\operatorname { Var } ( X ) = \frac { 1 } { 12 } \left( N ^ { 2 } - 1 \right)\). The mean of 40 independent observations of \(X\) is denoted by \(A\).
  2. Find an unbiased estimator \(E _ { 1 }\) of \(N\) based on \(A\), and state the approximate distribution of \(E _ { 1 }\), giving the value(s) of any parameter(s). \(B\) is another random variable based on a random sample of 40 independent observations of \(X\). It is given that \(\mathrm { E } ( B ) = \frac { 40 } { 27 } N\) and that \(\operatorname { Var } ( B ) = \alpha N ^ { 2 }\) where \(\alpha\) is a constant.
  3. Find an unbiased estimator \(E _ { 2 }\) of \(N\) based on \(B\), and determine the set of values of \(\alpha\) for which \(\operatorname { Var } \left( E _ { 2 } \right) > \operatorname { Var } \left( E _ { 1 } \right)\) for all values of \(N\).
CAIE S2 2021 June Q2
4 marks Moderate -0.8
The time, in minutes, taken by students to complete a test has the distribution \(\text{N}(125, 36)\).
  1. Find the probability that the mean time taken to complete the test by a random sample of 40 students is less than 123 minutes. [3]
  2. Explain whether it was necessary to use the Central Limit theorem in the solution to part (a). [1]
CAIE S2 2022 November Q5
6 marks Moderate -0.8
\(X\) is a random variable with distribution B(10, 0.2). A random sample of 160 values of \(X\) is taken.
  1. Find the approximate distribution of the sample mean, including the values of the parameters. [3]
  2. Hence find the probability that the sample mean is less than 1.8. [3]
CAIE S2 2023 November Q1
3 marks Moderate -0.5
A random variable \(X\) has the distribution N(410, 400). Find the probability that the mean of a random sample of 36 values of \(X\) is less than 405. [3]
CAIE S2 2023 November Q7
12 marks Standard +0.3
A biologist wishes to test whether the mean concentration \(\mu\), in suitable units, of a certain pollutant in a river is below the permitted level of 0.5. She measures the concentration, \(x\), of the pollutant at 50 randomly chosen locations in the river. The results are summarised below. \(n = 50 \quad \Sigma x = 23.0 \quad \Sigma x^2 = 13.02\)
  1. Carry out a test at the 5% significance level of the null hypothesis \(\mu = 0.5\) against the alternative hypothesis \(\mu < 0.5\). [7]
Later, a similar test is carried out at the 5% significance level using another sample of size 50 and the same hypotheses as before. You should assume that the standard deviation is unchanged.
  1. Given that, in fact, the value of \(\mu\) is 0.4, find the probability of a Type II error. [5]
CAIE S2 2011 June Q4
8 marks Moderate -0.3
A doctor wishes to investigate the mean fat content in low-fat burgers. He takes a random sample of 15 burgers and sends them to a laboratory where the mass, in grams, of fat in each burger is determined. The results are as follows. \(9 \quad 7 \quad 8 \quad 9 \quad 6 \quad 11 \quad 7 \quad 9 \quad 8 \quad 9 \quad 8 \quad 10 \quad 7 \quad 9 \quad 9\) Assume that the mass, in grams, of fat in low-fat burgers is normally distributed with mean \(\mu\) and that the population standard deviation is 1.3.
  1. Calculate a 99\% confidence interval for \(\mu\). [4]
  2. Explain whether it was necessary to use the Central Limit theorem in the calculation in part (i). [2]
  3. The manufacturer claims that the mean mass of fat in burgers of this type is 8 g. Use your answer to part (i) to comment on this claim. [2]
CAIE S2 2016 June Q6
9 marks Standard +0.3
\(X\) and \(Y\) are independent random variables with distributions \(\mathrm{Po}(1.6)\) and \(\mathrm{Po}(2.3)\) respectively.
  1. Find \(\mathrm{P}(X + Y = 4)\). [3]
A random sample of 75 values of \(X\) is taken.
  1. State the approximate distribution of the sample mean, \(\overline{X}\), including the values of the parameters. [2]
  2. Hence find the probability that the sample mean is more than 1.7. [3]
  3. Explain whether the Central Limit theorem was needed to answer part (ii). [1]
CAIE S2 2002 November Q5
8 marks Standard +0.3
\(X\) and \(Y\) are independent random variables each having a Poisson distribution. \(X\) has mean 2.5 and \(Y\) has mean 3.1.
  1. Find P\((X + Y > 3)\). [4]
  2. A random sample of 80 values of \(X\) is taken. Find the probability that the sample mean is less than 2.4. [4]
Edexcel S2 2010 June Q1
5 marks Easy -1.8
Explain what you understand by
  1. a population, [1]
  2. a statistic. [1]
A researcher took a sample of 100 voters from a certain town and asked them who they would vote for in an election. The proportion who said they would vote for Dr Smith was 35\%.
  1. State the population and the statistic in this case. [2]
  2. Explain what you understand by the sampling distribution of this statistic. [1]
Edexcel S3 2015 June Q6
13 marks Standard +0.3
As part of an investigation, a random sample was taken of 50 footballers who had completed an obstacle course in the early morning. The time taken by each of these footballers to complete the obstacle course, \(x\) minutes, was recorded and the results are summarised by $$\sum x = 1570 \quad \text{and} \quad \sum x^2 = 49467.58$$
  1. Find unbiased estimates for the mean and variance of the time taken by footballers to complete the obstacle course in the early morning. [4]
An independent random sample was taken of 50 footballers who had completed the same obstacle course in the late afternoon. The time taken by each of these footballers to complete the obstacle course, \(y\) minutes, was recorded and the results are summarised as $$\bar{y} = 30.9 \quad \text{and} \quad s_y^2 = 3.03$$
  1. Test, at the 5\% level of significance, whether or not the mean time taken by footballers to complete the obstacle course in the early morning, is greater than the mean time taken by footballers to complete the obstacle course in the late afternoon. State your hypotheses clearly. [7]
  2. Explain the relevance of the Central Limit Theorem to the test in part (b). [1]
  3. State an assumption you have made in carrying out the test in part (b). [1]
Edexcel S3 2015 June Q7
5 marks Moderate -0.3
A fair six-sided die is labelled with the numbers 1, 2, 3, 4, 5 and 6. The die is rolled 40 times and the score, \(S\), for each roll is recorded.
  1. Find the mean and the variance of \(S\). [2]
  2. Find an approximation for the probability that the mean of the 40 scores is less than 3 [3]
Edexcel S3 Q2
9 marks Standard +0.3
A random sample of 100 classical CDs produced by a record company had a mean playing time of 70.6 minutes and a standard deviation of 9.1 minutes. An independent random sample of 80 CDs produced by a different company had a mean playing time of 67.2 minutes with a standard deviation of 8.4 minutes.
  1. Using a 1\% level of significance, test whether or not there is a difference in the mean playing times of the CDs produced by these two companies. State your hypotheses clearly. [8]
  2. State an assumption you made in carrying out the test in part (a). [1]