5.05b Unbiased estimates: of population mean and variance

259 questions

Sort by: Default | Easiest first | Hardest first
Edexcel S3 2022 June Q5
9 marks Standard +0.3
A random sample of two observations \(X _ { 1 }\) and \(X _ { 2 }\) is taken from a population with unknown mean \(\mu\) and unknown variance \(\sigma ^ { 2 }\)
  1. Explain why \(\frac { X _ { 1 } - \mu } { \sigma }\) is not a statistic.
  2. Explain what you understand by an unbiased estimator for \(\mu\) Two estimators for \(\mu\) are \(U _ { 1 }\) and \(U _ { 2 }\) where $$U _ { 1 } = 3 X _ { 1 } - 2 X _ { 2 } \quad \text { and } \quad U _ { 2 } = \frac { X _ { 1 } + 3 X _ { 2 } } { 4 }$$
  3. Show that both \(U _ { 1 }\) and \(U _ { 2 }\) are unbiased estimators for \(\mu\) The most efficient estimator among a group of unbiased estimators is the one with the smallest variance.
  4. By finding the variance of \(U _ { 1 }\) and the variance of \(U _ { 2 }\) state, giving a reason, the most efficient estimator for \(\mu\) from these two estimators.
Edexcel S3 2023 June Q3
9 marks Moderate -0.8
A random sample of 2 observations, \(X _ { 1 }\) and \(X _ { 2 }\), is taken from a population with unknown mean \(\mu\) and unknown variance \(\sigma ^ { 2 }\)
  1. Explain why \(\frac { X _ { 1 } - X _ { 2 } } { \sigma }\) is not a statistic. $$S = \frac { 3 } { 5 } X _ { 1 } + \frac { 5 } { 7 } X _ { 2 }$$
  2. Show that \(S\) is a biased estimator of \(\mu\)
  3. Hence find the bias, in terms of \(\mu\), when \(S\) is used as an estimator of \(\mu\) Given that \(Y = a X _ { 1 } + b X _ { 2 }\) is an unbiased estimator of \(\mu\), where \(a\) and \(b\) are constants,
  4. find an equation, in terms of \(a\) and \(b\), that must be satisfied.
  5. Using your answer to part (d), show that \(\operatorname { Var } ( Y ) = \left( 2 a ^ { 2 } - 2 a + 1 \right) \sigma ^ { 2 }\)
Edexcel S3 2020 October Q1
4 marks Standard +0.3
  1. The random variable \(X\) has the discrete uniform distribution
$$\mathrm { P } ( X = x ) = \frac { 1 } { \alpha } \quad \text { for } x = 1,2 , \ldots , \alpha$$ The mean of a random sample of size \(n\), taken from this distribution, is denoted by \(\bar { X }\)
  1. Show that \(2 \bar { X }\) is a biased estimator of \(\alpha\) A random sample of 6 observations of \(X\) is taken and the results are given below. $$\begin{array} { l l l l l l } 8 & 7 & 3 & 7 & 2 & 9 \end{array}$$
  2. Use the sample mean to estimate the value of \(\alpha\)
Edexcel S3 2020 October Q5
12 marks Standard +0.3
5. A greengrocer is investigating the weights of two types of orange, type \(A\) and type \(B\). She believes that on average type \(A\) oranges weigh greater than 5 grams more than type \(B\) oranges. She collects a random sample of 40 type \(A\) oranges and 32 type \(B\) oranges and records the weight, \(x\) grams, of each orange. The table shows a summary of her data.
\(n\)\(\bar { x }\)\(\sum x ^ { 2 }\)
Type \(A\) oranges40140.4790258
Type \(B\) oranges32134.7581430
  1. Calculate unbiased estimates for the variance of the weights of the population of type \(A\) oranges and the variance of the weights of the population of type \(B\) oranges.
  2. Test, at the \(5 \%\) level of significance, the greengrocer's belief. You should state the hypotheses and the critical value used for this test.
  3. Explain how you have used the fact that the sample sizes are large in your answer to part (b).
Edexcel S3 2021 October Q6
12 marks Standard +0.3
6. Amala believes that the resting heart rate is lower in men who exercise regularly compared to men who do not exercise regularly. She measures the resting heart rate, \(h\), of a random sample of 50 men who exercise regularly and a random sample of 40 men who do not exercise regularly. Her results are summarised in the table below.
\cline { 2 - 6 } \multicolumn{1}{c|}{}
Sample
size
\(\sum \boldsymbol { h }\)\(\sum \boldsymbol { h } ^ { 2 }\)
Unbiased
estimate of
the mean
Unbiased
estimate of
the variance
Exercise regularly503270214676\(\alpha\)\(\beta\)
Do not exercise
regularly
40283220166070.829.6
  1. Calculate the value of \(\alpha\) and the value of \(\beta\)
  2. Test, at the \(5 \%\) level of significance, whether there is evidence to support Amala's belief. State your hypotheses clearly.
  3. Explain the significance of the central limit theorem to the test in part (b).
  4. State two assumptions you have made in carrying out the test in part (b).
Edexcel S3 2018 Specimen Q6
13 marks Standard +0.3
6. As part of an investigation, a random sample was taken of 50 footballers who had completed an obstacle course in the early morning. The time taken by each of these footballers to complete the obstacle course, \(x\) minutes, was recorded and the results are summarised by $$\sum x = 1570 \quad \text { and } \quad \sum x ^ { 2 } = 49467.58$$
  1. Find unbiased estimates for the mean and variance of the time taken by footballers to complete the obstacle course in the early morning. An independent random sample was taken of 50 footballers who had completed the same obstacle course in the late afternoon. The time taken by each of these footballers to complete the obstacle course, \(y\) minutes, was recorded and the results are summarised as $$\bar { y } = 30.9 \quad \text { and } \quad s _ { y } ^ { 2 } = 3.03$$
  2. Test, at the \(5 \%\) level of significance, whether or not the mean time taken by footballers to complete the obstacle course in the early morning, is greater than the mean time taken by footballers to complete the obstacle course in the late afternoon. State your hypotheses clearly.
  3. Explain the relevance of the Central Limit Theorem to the test in part (b).
  4. State an assumption you have made in carrying out the test in part (b).
Edexcel S3 2006 January Q3
12 marks Moderate -0.3
3. The drying times of paint can be assumed to be normally distributed. A paint manufacturer paints 10 test areas with a new paint. The following drying times, to the nearest minute, were recorded. $$82 , \quad 98 , \quad 140 , \quad 110 , \quad 90 , \quad 125 , \quad 150 , \quad 130 , \quad 70 , \quad 110 .$$
  1. Calculate unbiased estimates for the mean and the variance of the population of drying times of this paint. Given that the population standard deviation is 25 ,
  2. find a 95\% confidence interval for the mean drying time of this paint. Fifteen similar sets of tests are done and the \(95 \%\) confidence interval is determined for each set.
  3. Estimate the expected number of these 15 intervals that will enclose the true value of the population mean \(\mu\).
Edexcel S3 2004 June Q4
10 marks Moderate -0.3
4. Kylie regularly travels from home to visit a friend. On 10 randomly selected occasions the journey time \(x\) minutes was recorded. The results are summarised as follows. $$\Sigma x = 753 , \quad \Sigma x ^ { 2 } = 57455 .$$
  1. Calculate unbiased estimates of the mean and the variance of the population of journey times. After many journeys, a random sample of 100 journeys gave a mean of 74.8 minutes and a variance of 84.6 minutes \({ } ^ { 2 }\).
  2. Calculate a 95\% confidence interval for the mean of the population of journey times.
  3. Write down two assumptions you made in part (b).
Edexcel S3 2008 June Q1
8 marks Moderate -0.8
  1. Some biologists were studying a large group of wading birds. A random sample of 36 were measured and the wing length, \(x \mathrm {~mm}\) of each wading bird was recorded. The results are summarised as follows
$$\sum x = 6046 \quad \sum x ^ { 2 } = 1016338$$
  1. Calculate unbiased estimates of the mean and the variance of the wing lengths of these birds. Given that the standard deviation of the wing lengths of this particular type of bird is actually 5.1 mm ,
  2. find a \(99 \%\) confidence interval for the mean wing length of the birds from this group.
Edexcel S3 2013 June Q7
13 marks Moderate -0.5
  1. Lambs are born in a shed on Mill Farm. The birth weights, \(x \mathrm {~kg}\), of a random sample of 8 newborn lambs are given below.
$$\begin{array} { l l l l l l l l } 4.12 & 5.12 & 4.84 & 4.65 & 3.55 & 3.65 & 3.96 & 3.40 \end{array}$$
  1. Calculate unbiased estimates of the mean and variance of the birth weight of lambs born on Mill Farm. A further random sample of 32 lambs is chosen and the unbiased estimates of the mean and variance of the birth weight of lambs from this sample are 4.55 and 0.25 respectively.
  2. Treating the combined sample of 40 lambs as a single sample, estimate the standard error of the mean. The owner of Mill Farm researches the breed of lamb and discovers that the population of birth weights is normally distributed with standard deviation 0.67 kg .
  3. Calculate a \(95 \%\) confidence interval for the mean birth weight of this breed of lamb using your combined sample mean.
Edexcel S3 2014 June Q5
13 marks Standard +0.3
5. A student believes that there is a difference in the mean lengths of English and French films. He goes to the university video library and randomly selects a sample of 120 English films and a sample of 70 French films. He notes the length, \(x\) minutes, of each of the films in his samples. His data are summarised in the table below.
\(\Sigma x\)\(\Sigma x ^ { 2 }\)\(s ^ { 2 }\)\(n\)
English films1065095690998.5120
French films651061584915170
  1. Verify that the unbiased estimate of the variance, \(s ^ { 2 }\), of the lengths of English films is 98.5 minutes \({ } ^ { 2 }\)
  2. Stating your hypotheses clearly, test, at the 1\% level of significance, whether or not the mean lengths of English and French films are different.
  3. Explain the significance of the Central Limit Theorem to the test in part (b).
  4. The university video library contained 724 English films and 473 French films. Explain how the student could have taken a stratified sample of 190 of these films.
Edexcel S3 2014 June Q6
8 marks Standard +0.3
6. A random sample \(X _ { 1 } , X _ { 2 } , \ldots , X _ { n }\) is taken from a population with mean \(\mu\).
  1. Show that \(\bar { X } = \frac { 1 } { n } \left( X _ { 1 } + X _ { 2 } + \ldots + X _ { n } \right)\) is an unbiased estimator of the population mean \(\mu\). A company produces small jars of coffee. Five jars of coffee were taken at random and weighed. The weights, in grams, were as follows $$\begin{array} { l l l l l } 197 & 203 & 205 & 201 & 195 \end{array}$$
  2. Calculate unbiased estimates of the population mean and variance of the weights of the jars produced by the company. It is known from previous results that the weights are normally distributed with standard deviation 4.8 g . The manager is going to take a second random sample. He wishes to ensure that there is at least a \(95 \%\) probability that the estimate of the population mean is within 1.25 g of its true value.
  3. Find the minimum sample size required.
Edexcel S3 2015 June Q3
11 marks Moderate -0.8
3. A nursery has 16 staff and 40 children on its records. In preparation for an outing the manager needs an estimate of the mean weight of the people on its records and decides to take a stratified sample of size 14 .
  1. Describe how this stratified sample should be taken. The weights, \(x \mathrm {~kg}\), of each of the 14 people selected are summarised as $$\sum x = 437 \text { and } \sum x ^ { 2 } = 26983$$
  2. Find unbiased estimates of the mean and the variance of the weights of all the people on the nursery's records.
  3. Estimate the standard error of the mean. The estimates of the standard error of the mean for the staff and for the children are 5.11 and 1.10 respectively.
  4. Comment on these values with reference to your answer to part (c) and give a reason for any differences.
Edexcel S3 2017 June Q5
10 marks Moderate -0.3
5. Paul takes the company bus to work. According to the bus timetable he should arrive at work at 0831. Paul believes the bus is not reliable and often arrives late. Paul decides to test the arrival time of the bus and carries out a survey. He records the values of the random variable $$X = \text { number of minutes after } 0831 \text { when the bus arrives. }$$ His results are summarised below. $$n = 15 \quad \sum x = 60 \quad \sum x ^ { 2 } = 1946$$
  1. Calculate unbiased estimates of the mean, \(\mu\), and the variance of \(X\). Using the mean of Paul's sample and given \(X \sim \mathrm {~N} \left( \mu , 10 ^ { 2 } \right)\)
    1. calculate a 95\% confidence interval for the mean arrival time at work for this company bus.
    2. State an assumption you made about the values in the sample obtained by Paul.
  2. Comment on Paul's belief. Justify your answer.
Edexcel S3 2018 June Q3
10 marks Standard +0.3
A random sample of repair times, in hours, was taken for an electronic component. The 4 observed times are shown below.
1.3
1.7
1.4
1.8
  1. Calculate unbiased estimates of the mean and the variance of the population of repair times for this electronic component. The population standard deviation of the repair times for this electronic component is known to be 0.5 hours. An estimate of the population mean is required to be within 0.1 hours of its true value with a probability of at least 0.99
  2. Find the minimum sample size required.
AQA S1 2012 January Q7
14 marks Standard +0.3
7 A random sample of 50 full-time university employees was selected as part of a higher education salary survey. The annual salary in thousands of pounds, \(x\), of each employee was recorded, with the following summarised results. $$\sum x = 2290.0 \quad \text { and } \quad \sum ( x - \bar { x } ) ^ { 2 } = 28225.50$$ Also recorded was the fact that 6 of the 50 salaries exceeded \(\pounds 60000\).
    1. Calculate values for \(\bar { x }\) and \(s\), where \(s ^ { 2 }\) denotes the unbiased estimate of \(\sigma ^ { 2 }\).
    2. Hence show why the annual salary, \(X\), of a full-time university employee is unlikely to be normally distributed. Give numerical support for your answer.
    1. Indicate why the mean annual salary, \(\bar { X }\), of a random sample of 50 full-time university employees may be assumed to be normally distributed.
    2. Hence construct a \(99 \%\) confidence interval for the mean annual salary of full-time university employees.
  1. It is claimed that the annual salaries of full-time university employees have an average which exceeds \(\pounds 55000\) and that more than \(25 \%\) of such salaries exceed £60000. Comment on each of these two claims.
AQA S2 2005 June Q3
8 marks Moderate -0.3
3 The heights, in metres, of a random sample of 10 students attending Higrade School are recorded below. \(\begin{array} { l l l l l l l l l } 1.76 & 1.59 & 1.54 & 1.62 & 1.49 & 1.52 & 1.56 & 1.47 & 1.75 \end{array} 1.50\) Assume that the heights of students attending Higrade School are normally distributed.
  1. Calculate unbiased estimates for the mean and variance of the heights of students attending Higrade School.
    (3 marks)
  2. Construct a 90\% confidence interval for the mean height of students attending Higrade School.
    (5 marks)
AQA S3 2009 June Q7
17 marks Standard +0.8
7 The daily number of customers visiting a small arts and crafts shop may be modelled by a Poisson distribution with a mean of 24 .
  1. Using a distributional approximation, estimate the probability that there was a total of at most 150 customers visiting the shop during a given 6-day period.
  2. The shop offers a picture framing service. The daily number of requests, \(Y\), for this service may be assumed to have a Poisson distribution. Prior to the shop advertising this service in the local free newspaper, the mean value of \(Y\) was 2. Following the advertisement, the shop received a total of 17 requests for the service during a period of 5 days.
    1. Using a Poisson distribution, carry out a test, at the \(10 \%\) level of significance, to investigate the claim that the advertisement increased the mean daily number of requests for the shop's picture framing service.
    2. Determine the critical value of \(Y\) for your test in part (b)(i).
    3. Hence, assuming that the advertisement increased the mean value of \(Y\) to 3, determine the power of your test in part (b)(i).
AQA S3 2010 June Q3
7 marks Standard +0.8
3
The weekly number of hits, \(S\), on Sam's website may be modelled by a Poisson distribution with parameter \(\lambda _ { S }\). The weekly number of hits, \(T\), on Tina's website may be modelled by a Poisson distribution with parameter \(\lambda _ { T }\).
During a period of 40 weeks, the number of hits on Sam's website was 940.
During a period of 60 weeks, the number of hits on Tina's website was 1560.
Assuming that \(S\) and \(T\) are independent random variables, investigate, at the \(2 \%\) level of significance, Tina's claim that the mean weekly number of hits on her website is greater than that on Sam's website.
(7 marks)

\hline QUESTION PART REFERENCE &
\hline & \(\_\_\_\_\) \hline \multicolumn{2}{|c|}{}
\hline & \(\_\_\_\_\) \hline \multicolumn{2}{|c|}{}
\hline \multicolumn{2}{|c|}{}
\hline & \includegraphics[max width=\textwidth, alt={}]{b855b5b3-097e-4894-aaec-d77f515949b0-06_72_1707_1510_154}
\hline \multicolumn{2}{|c|}{}
\hline \multicolumn{2}{|c|}{}
\hline &
\hline & \(\_\_\_\_\) \hline \multicolumn{2}{|c|}{}
\hline & \(\_\_\_\_\) \hline \multicolumn{2}{|c|}{}
\hline & \(\_\_\_\_\) \hline \multicolumn{2}{|c|}{}
\hline & \includegraphics[max width=\textwidth, alt={}]{b855b5b3-097e-4894-aaec-d77f515949b0-06_107_1709_2445_152}
\hline
\includegraphics[max width=\textwidth, alt={}]{b855b5b3-097e-4894-aaec-d77f515949b0-07_2484_1709_223_153}
AQA S3 2010 June Q6
18 marks Standard +0.8
6
  1. A district council claimed that more than 80 per cent of the complaints that it received about the delivery of its services were answered to the satisfaction of complainants before reaching formal status. An analysis of a random sample of 175 complaints revealed that 28 reached formal status.
    1. Construct an approximate \(95 \%\) confidence interval for the proportion of complaints that reach formal status.
    2. Hence comment on the council's claim.
  2. The district council also claimed that less than 40 per cent of all formal complaints were due to a failing in the delivery of its services. An analysis of the 50 formal complaints received during 2007/08 showed that 16 were due to a failing in the delivery of its services.
    1. Using an exact test, investigate the council's claim at the \(10 \%\) level of significance. The 50 formal complaints received during 2007/08 may be assumed to be a random sample.
    2. Determine the critical value for your test in part (b)(i).
    3. In fact, only 25 per cent of all formal complaints were due to a failing in the delivery of the council's services. Determine the probability of a Type II error for a test of the council's claim at the \(10 \%\) level of significance and based on the analysis of a random sample of 50 formal complaints.
      (4 marks)
      \includegraphics[max width=\textwidth, alt={}]{b855b5b3-097e-4894-aaec-d77f515949b0-15_2484_1709_223_153}
      \includegraphics[max width=\textwidth, alt={}]{b855b5b3-097e-4894-aaec-d77f515949b0-16_2484_1712_223_153}
      \includegraphics[max width=\textwidth, alt={}]{b855b5b3-097e-4894-aaec-d77f515949b0-17_2484_1709_223_153}
AQA S3 2011 June Q1
7 marks Moderate -0.3
1 A consumer report claimed that more than 25 per cent of visitors to a theme park were dissatisfied with the catering facilities provided. In a survey, 375 visitors who had used the catering facilities were interviewed independently, and 108 of them stated that they were dissatisfied with the catering facilities provided.
  1. Test, at the \(2 \%\) level of significance, the consumer report's claim.
  2. State an assumption about the 375 visitors that was necessary in order for the hypothesis test in part (a) to be valid.
AQA S3 2012 June Q4
6 marks Standard +0.8
4 The manager of a medical centre suspects that patients using repeat prescriptions were requesting, on average, more items during 2011 than during 2010. The mean number of items on a repeat prescription during 2010 was 2.6.
An analysis of a random sample of 250 repeat prescriptions during 2011 showed a total of 688 items requested. The number of items requested on a repeat prescription may be modelled by a Poisson distribution. Use a distributional approximation to investigate, at the \(5 \%\) level of significance, the manager's suspicion.
AQA S3 2012 June Q5
10 marks Standard +0.3
5 A random sample of 125 people was selected from a council's electoral roll. Of these, 68 were in favour of a proposed local building plan.
  1. Construct an approximate 98\% confidence interval for the percentage of people on the council's electoral roll who were in favour of the proposal.
  2. Calculate, to the nearest 5, an estimate of the minimum sample size necessary in order that an approximate \(98 \%\) confidence interval for the percentage of people on the council's electoral roll who were in favour of the proposal has a width of at most 10 per cent.
AQA S3 2013 June Q7
15 marks Standard +0.3
7 It is claimed that the proportion, \(P\), of people who prefer cooked fresh garden peas to cooked frozen garden peas is greater than 0.50 .
  1. In an attempt to investigate this claim, a sample of 50 people were each given an unlabelled portion of cooked fresh garden peas and an unlabelled portion of cooked frozen garden peas to taste. After tasting each portion, the people were each asked to state which of the two portions they preferred. Of the 50 people sampled, 29 preferred the cooked fresh garden peas. Assuming that the 50 people may be considered to constitute a random sample, use a binomial distribution and the \(10 \%\) level of significance to investigate the claim.
    (6 marks)
  2. It was then decided to repeat the tasting in part (a) but to involve a sample of 500 , rather than 50, people. Of the 500 people sampled, 271 preferred the cooked fresh garden peas.
    1. Assuming that the 500 people may be considered to constitute a random sample, use an approximation to the distribution of the sample proportion, \(\widehat { P }\), and the \(10 \%\) level of significance to again investigate the claim.
    2. The critical value of \(\widehat { P }\) for the test in part (b)(i) is 0.529 , correct to three significant figures. It is also given that, in fact, 55 per cent of people prefer cooked fresh garden peas. Estimate the power for a test of the claim that \(P > 0.50\) based on a random sample of 500 people and using the \(10 \%\) level of significance.
      (5 marks)
AQA S3 2014 June Q7
4 marks Challenging +1.2
7
  1. The random variable \(X\) has a Poisson distribution with parameter \(\lambda\).
    1. Prove, from first principles, that \(\mathrm { E } ( X ) = \lambda\).
    2. Given that \(\mathrm { E } \left( X ^ { 2 } - X \right) = \lambda ^ { 2 }\), deduce that \(\operatorname { Var } ( X ) = \lambda\).
  2. The number of faults in a 100-metre ball of nylon string may be modelled by a Poisson distribution with parameter \(\lambda\).
    1. An analysis of one ball of string, selected at random, showed 15 faults. Using an exact test, investigate the claim that \(\lambda > 10\). Use the \(5 \%\) level of significance.
    2. A subsequent analysis of a random sample of 20 balls of string showed a total of 241 faults.
      (A) Using an approximate test, re-investigate the claim that \(\lambda > 10\). Use the \(5 \%\) level of significance.
      (B) Determine the critical value of the total number of faults for the test in part (b)(ii)(A).
      (C) Given that, in fact, \(\lambda = 12\), estimate the probability of a Type II error for a test of the claim that \(\lambda > 10\) based upon a random sample of 20 balls of string and using the \(5 \%\) level of significance.
      [0pt] [4 marks] \includegraphics[max width=\textwidth, alt={}, center]{d5852425-9340-4aae-82da-e3bf6772a0de-22_2490_1728_219_141} \includegraphics[max width=\textwidth, alt={}, center]{d5852425-9340-4aae-82da-e3bf6772a0de-23_2490_1719_217_150} \includegraphics[max width=\textwidth, alt={}, center]{d5852425-9340-4aae-82da-e3bf6772a0de-24_2489_1728_221_141}