F-test for equality of variances

A question is this type if and only if it involves testing whether two population variances are equal using the F-distribution, typically comparing two independent samples from normal populations.

15 questions · Standard +0.7

Sort by: Default | Easiest first | Hardest first
Edexcel S4 2006 January Q7
16 marks Standard +0.3
7. A psychologist gives a test to students from two different schools, \(A\) and \(B\). A group of 9 students is randomly selected from school \(A\) and given instructions on how to do the test.
A group of 7 students is randomly selected from school \(B\) and given the test without the instructions. The table shows the time taken, to the nearest second, to complete the test by the two groups.
\(A\)111212131415161717
\(B\)8101113131414
Stating your hypotheses clearly,
  1. test at the \(10 \%\) significance level, whether or not the variance of the times taken to complete the test by students from school \(A\) is the same as the variance of the times taken to complete the test by students from school \(B\). (You may assume that times taken for each school are normally distributed.)
  2. test at the \(5 \%\) significance level, whether or not the mean time taken to complete the test by students from school \(A\) is greater than the mean time taken to complete the test by students from school \(B\).
  3. Why does the result to part (a) enable you to carry out the test in part (b)?
  4. Give one factor that has not been taken into account in your analysis.
Edexcel S4 2005 June Q2
6 marks Standard +0.3
2. The standard deviation of the length of a random sample of 8 fence posts produced by a timber yard was 8 mm . A second timber yard produced a random sample of 13 fence posts with a standard deviation of 14 mm .
  1. Test, at the \(10 \%\) significance level, whether or not there is evidence that the lengths of fence posts produced by these timber yards differ in variability. State your hypotheses clearly.
  2. State an assumption you have made in order to carry out the test in part (a).
Edexcel S4 2006 June Q2
12 marks Challenging +1.2
2. The weights, in grams, of apples are assumed to follow a normal distribution. The weights of apples sold by a supermarket have variance \(\sigma _ { s } { } ^ { 2 }\). A random sample of 4 apples from the supermarket had weights $$\text { 114, 100, 119, } 123 .$$
  1. Find a 95\% confidence interval for \(\sigma _ { s } ^ { 2 }\). The weights of apples sold on a market stall have variance \(\sigma _ { M } ^ { 2 }\). A second random sample of 7 apples was taken from the market stall. The sample variance \(s _ { M } ^ { 2 }\) of the apples was 318.8.
  2. Stating your hypotheses clearly test, at the \(1 \%\) levcel of significnace, whether or not there is evidence that \(\sigma _ { M } ^ { 2 } > \sigma _ { s } ^ { 2 }\).
Edexcel S4 2008 June Q2
17 marks Standard +0.3
  1. A large number of students are split into two groups \(A\) and \(B\). The students sit the same test but under different conditions. Group A has music playing in the room during the test, and group B has no music playing during the test. Small samples are then taken from each group and their marks recorded. The marks are normally distributed.
The marks are as follows:
Sample from Group \(A\)424035373443424449
Sample from Group \(B\)40443847383733
  1. Stating your hypotheses clearly, and using a \(10 \%\) level of significance, test whether or not there is evidence of a difference between the variances of the marks of the two groups.
  2. State clearly an assumption you have made to enable you to carry out the test in part (a).
  3. Use a two tailed test, with a \(5 \%\) level of significance, to determine if the playing of music during the test has made any difference in the mean marks of the two groups. State your hypotheses clearly.
  4. Write down what you can conclude about the effect of music on a student's performance during the test.
Edexcel S4 2012 June Q3
5 marks Standard +0.8
  1. The sample variance of the lengths of a random sample of 9 paving slabs sold by a builders' merchant is \(36 \mathrm {~mm} ^ { 2 }\). The sample variance of the lengths of a random sample of 11 paving slabs sold by a second builders' merchant is \(225 \mathrm {~mm} ^ { 2 }\). Test at the \(10 \%\) significance level whether or not there is evidence that the lengths of paving slabs sold by these builders' merchants differ in variability. State your hypotheses clearly.
    (You may assume the lengths of paving slabs are normally distributed.)
  2. A newspaper runs a daily Sudoku. A random sample of 10 people took the following times, in minutes, to complete the Sudoku.
\section*{\(\begin{array} { l l l l l l l l l l } 5.0 & 4.5 & 4.7 & 5.3 & 5.2 & 4.1 & 5.3 & 4.8 & 5.5 & 4.6 \end{array}\)} Given that the times to complete the Sudoku follow a normal distribution,
  1. calculate a 95\% confidence interval for
    1. the mean,
    2. the variance,
      of the times taken by people to complete the Sudoku. The newspaper requires the average time needed to complete the Sudoku to be 5 minutes with a standard deviation of 0.7 minutes.
  2. Comment on whether or not the Sudoku meets this requirement. Give a reason for your answer.
Edexcel S4 2013 June Q4
15 marks Challenging +1.2
4. A company carries out an investigation into the strengths of rods from two different suppliers, Ardo and Bards. Independent random samples of rods were taken from each supplier and the force, \(x \mathrm { kN }\), needed to break each rod was recorded. The company wrote the results on a piece of paper but unfortunately spilt ink on it so some of the results can not be seen.
The paper with the results on is shown below. \includegraphics[max width=\textwidth, alt={}, center]{4f096806-33da-453f-a4c1-12be20d1a96d-08_435_1522_541_244}
    1. Use the data from Ardo to calculate an unbiased estimate, \(s _ { A } ^ { 2 }\), of the variance.
    2. Hence find an unbiased estimate, \(s _ { B } ^ { 2 }\), of the variance for the sample of 9 values from Bards.
  1. Stating your hypotheses clearly, test at the \(10 \%\) level of significance whether or not there is a difference in variability of strength between the rods from Ardo and the rods from Bards.
    (You may assume the two samples come from independent normal distributions.)
  2. Use a \(5 \%\) level of significance to test whether the mean strength of rods from Bards is more than 0.9 kN greater than the mean strength of rods from Ardo.
    (6)
Edexcel S4 2018 June Q4
17 marks Challenging +1.2
  1. A glue supplier claims that Goglue is stronger than Tackfast. A company is presently using Tackfast but agrees to change to Goglue if, at the 5\% significance level,
  • the standard deviation of the force required for Goglue to fail is not greater than the standard deviation of the force required for Tackfast to fail and
  • the mean force required for Goglue to fail is more than 4 newtons greater than the mean force for Tackfast to fail.
A series of trials is carried out, using Goglue and Tackfast, and the glues are tested to destruction. The force, \(x\) newtons, at which each glue fails is recorded.
Sample size \(( n )\)Sample mean \(( \bar { x } )\)Standard deviation \(( s )\)
Tackfast \(( T )\)65.270.31
Goglue \(( G )\)510.120.66
It can be assumed that the force at which each glue fails is normally distributed.
  1. Test, at the \(5 \%\) level of significance, whether or not there is evidence that the standard deviation of the force required for Goglue to fail is greater than the standard deviation of the force required for the Tackfast to fail. State your hypotheses clearly. The supplier claims that the mean force required for its Goglue to fail is more than 4 newtons greater than the mean force required for Tackfast to fail.
  2. Stating your hypotheses clearly and using a \(5 \%\) level of significance, test the supplier's claim.
  3. Show that, at the \(5 \%\) level of significance, the supplier's claim will be accepted if \(\bar { X } _ { G } - \bar { X } _ { T } > 4.55\), where \(\bar { X } _ { G }\) and \(\bar { X } _ { T }\) are the mean forces required for Goglue to fail and Tackfast to fail respectively. Later, it was found that an error had been made when recording the results for Goglue. This resulted in all the forces recorded for Goglue being 0.5 newtons more than they should have been. The results for Tackfast were correct.
  4. Explain whether or not this information affects the decision about which glue the supplier decides to use.
Edexcel S4 Q5
16 marks Standard +0.8
5. An educational researcher is testing the effectiveness of a new method of teaching a topic in mathematics. A random sample of 10 children were taught by the new method and a second random sample of 9 children, of similar age and ability, were taught by the conventional method. At the end of the teaching, the same test was given to both groups of children. The marks obtained by the two groups are summarised in the table below.
New methodConventional method
Mean \(( \bar { x } )\)82.378.2
Standard deviation \(( s )\)3.55.7
Number of students \(( n )\)109
  1. Stating your hypotheses clearly and using a \(5 \%\) level of significance, investigate whether or not
    1. the variance of the marks of children taught by the conventional method is greater than that of children taught by the new method,
    2. the mean score of children taught by the conventional method is lower than the mean score of those taught by the new method.
      [0pt] [In each case you should give full details of the calculation of the test statistics.]
  2. State any assumptions you made in order to carry out these tests.
  3. Find a 95\% confidence interval for the common variance of the marks of the two groups.
Edexcel FS2 2019 June Q3
8 marks Standard +0.8
3 Yin grows two varieties of potato, plant \(A\) and plant \(B\). A random sample of each variety of potato is taken and the yield, \(x \mathrm {~kg}\), produced by each plant is measured. The following statistics are obtained from the data.
Number of plants\(\sum x\)\(\sum x ^ { 2 }\)
\(A\)25194.71637.37
\(B\)26227.52031.19
  1. Stating your hypotheses clearly, test, at the \(10 \%\) significance level, whether or not the variances of the yields of the two varieties of potato are the same.
  2. State an assumption you have made in order to carry out the test in part (a).
Edexcel FS2 2020 June Q1
6 marks Standard +0.3
1 Gina receives a large number of packages from two companies, \(A\) and \(B\). She believes that the variance of the weights of packages from company \(A\) is greater than the variance of the weights of packages from company \(B\). Gina takes a random sample of 7 packages from company \(A\) and an independent random sample of 10 packages from company \(B\). Her results are summarised below $$\bar { a } = 300 \quad \mathrm {~S} _ { a a } = 145496 \quad \bar { b } = 233.4 \quad \mathrm {~S} _ { b b } = 56364.4$$ [You may assume that the weights of packages from the two companies are normally distributed.]
Test Gina's belief. Use a \(5 \%\) level of significance and state your hypotheses clearly.
Edexcel FS2 2021 June Q6
15 marks Challenging +1.2
  1. Elsa is collecting information on the wingspan of two different species of butterfly, Ringlet and Meadow Brown. She takes a random sample of each type of butterfly. The wingspans, \(w \mathrm {~cm}\), are summarised in the table below. The wingspans of Ringlet and Meadow Brown butterflies each follow normal distributions.
Number of
butterflies
\(\sum w\)\(\sum w ^ { 2 }\)
Ringlet841021032
Meadow Brown629414426
  1. Test, at the \(2 \%\) level of significance, whether or not there is evidence that the variance of the wingspans of Ringlet butterflies is different from the variance of the wingspans of Meadow Brown butterflies. You should state your hypotheses clearly. The \(k \%\) confidence interval for the variance of the wingspans of Meadow Brown butterflies is (1.194, 48.54)
  2. Find the value of \(k\)
  3. Calculate a \(95 \%\) confidence interval for the difference between the mean wingspan of the Ringlet butterfly and the mean wingspan of the Meadow Brown butterfly.
Edexcel FS2 2022 June Q5
8 marks Standard +0.8
  1. The concentration of an air pollutant is measured in micrograms \(/ \mathrm { m } ^ { 3 }\)
Samples of air were taken at two different sites and the concentration of this particular air pollutant was recorded. For Site \(A\) the summary statistics are shown below.
\cline { 2 - 3 } \multicolumn{1}{c|}{}number of samples\(S _ { A } ^ { 2 }\)
Site \(A\)136.39
For Site \(B\) there were 9 samples of air taken.
A test of the hypothesis \(\mathrm { H } _ { 0 } : \sigma _ { A } ^ { 2 } = \sigma _ { B } ^ { 2 }\) against the hypothesis \(\mathrm { H } _ { 1 } : \sigma _ { A } ^ { 2 } \neq \sigma _ { B } ^ { 2 }\) is carried out using a \(2 \%\) level of significance.
  1. State a necessary assumption required to carry out the test. Given that the assumption in part (a) holds,
  2. find the set of values of \(s _ { B } ^ { 2 }\) that would lead to the null hypothesis being rejected,
  3. find a 99\% confidence interval for the variance of the concentration of the air pollutant at Site A.
Edexcel FS2 2023 June Q3
8 marks Challenging +1.2
  1. Two machines, \(A\) and \(B\), are used to fill bottles of water. The amount of water dispensed by each machine is normally distributed.
Samples are taken from each machine and the amount of water, \(x \mathrm { ml }\), dispensed in each bottle is recorded. The table shows the summary statistics for Machine \(A\).
\cline { 2 - 4 } \multicolumn{1}{c|}{}Sample size\(\sum x\)\(\sum x ^ { 2 }\)
Machine \(A\)92268571700
  1. Find a 95\% confidence interval for the variance of the amount of water dispensed in each bottle by Machine \(A\). For Machine \(B\), a random sample of 11 bottles is taken. The sample variance of the amount of water dispensed in bottles is \(12.7 \mathrm { ml } ^ { 2 }\)
  2. Test, at the \(10 \%\) level of significance, whether there is evidence that the variances of the amounts of water dispensed in bottles by the two machines are different. You should state the hypotheses and the critical value used.
Edexcel S4 Q1
6 marks Standard +0.3
  1. A beach is divided into two areas \(A\) and \(B\). A random sample of pebbles is taken from each of the two areas and the length of each pebble is measured. A sample of size 26 is taken from area \(A\) and the unbiased estimate for the population variance is \(s _ { A } ^ { 2 } = 0.495 \mathrm {~mm} ^ { 2 }\). A sample of size 25 is taken from area \(B\) and the unbiased estimate for the population variance is \(s _ { B } ^ { 2 } = 1.04 \mathrm {~mm} ^ { 2 }\).
    1. Stating your hypotheses clearly test, at the \(10 \%\) significance level, whether or not there is a difference in variability of pebble length between area \(A\) and area \(B\).
    2. State the assumption you have made about the populations of pebble lengths in order to carry out the test.
      (1)
    3. A random sample of 10 mustard plants had the following heights, in mm , after 4 days growth.
    $$5.0,4.5,4.8,5.2,4.3,5.1,5.2,4.9,5.1,5.0$$ Those grown previously had a mean height of 5.1 mm after 4 days. Using a \(2.5 \%\) significance level, test whether or not the mean height of these plants is less than that of those grown previously.
    (You may assume that the height of mustard plants after 4 days follows a normal distribution.)
    (9)
    3. A train company claims that the probability \(p\) of one of its trains arriving late is \(10 \%\). A regular traveller on the company's trains believes that the probability is greater than \(10 \%\) and decides to test this by randomly selecting 12 trains and recording the number \(X\) of trains that were late. The traveller sets up the hypotheses \(\mathrm { H } _ { 0 } : p = 0.1\) and \(\mathrm { H } _ { 1 } : p > 0.1\) and accepts the null hypothesis if \(x \leq 2\).
  2. Find the size of the test.
  3. Show that the power function of the test is $$1 - ( 1 - p ) ^ { 10 } \left( 1 + 10 p + 55 p ^ { 2 } \right)$$
  4. Calculate the power of the test when
    1. \(p = 0.2\),
    2. \(p = 0.6\).
  5. Comment on your results from part (c).
    4. A random sample of 15 tomatoes is taken and the weight \(x\) grams of each tomato is found. The results are summarised by \(\sum x = 208\) and \(\sum x ^ { 2 } = 2962\).
  6. Assuming that the weights of the tomatoes are normally distributed, calculate the \(90 \%\) confidence interval for the variance \(\sigma ^ { 2 }\) of the weights of the tomatoes.
  7. State with a reason whether or not the confidence interval supports the assertion \(\sigma ^ { 2 } = 3\).
    5. (a) Define
    1. a Type I error,
    2. a Type II error. A small aviary, that leaves the eggs with the parent birds, rears chicks at an average rate of 5 per year. In order to increase the number of chicks reared per year it is decided to remove the eggs from the aviary as soon as they are laid and put them in an incubator. At the end of the first year of using an incubator 7 chicks had been successfully reared.
  8. Assuming that the number of chicks reared per year follows a Poisson distribution test, at the \(5 \%\) significance level, whether or not there is evidence of an increase in the number of chicks reared per year. State your hypotheses clearly.
  9. Calculate the probability of the Type I error for this test.
  10. Given that the true average number of chicks reared per year when the eggs are hatched in an incubator is 8, calculate the probability of a Type II error.
    6. A random sample of three independent variables \(X _ { 1 } , X _ { 2 }\) and \(X _ { 3 }\) is taken from a distribution with mean \(\mu\) and variance \(\sigma ^ { 2 }\).
  11. Show that \(\frac { 2 } { 3 } X _ { 1 } - \frac { 1 } { 2 } X _ { 2 } + \frac { 5 } { 6 } X _ { 3 }\) is an unbiased estimator for \(\mu\). An unbiased estimator for \(\mu\) is given by \(\hat { \mu } = a X _ { 1 } + b X _ { 2 }\) where \(a\) and \(b\) are constants.
  12. Show that \(\operatorname { Var } ( \hat { \mu } ) = \left( 2 a ^ { 2 } - 2 a + 1 \right) \sigma ^ { 2 }\).
  13. Hence determine the value of \(a\) and the value of \(b\) for which \(\hat { \mu }\) has minimum variance.
    7. Two methods of extracting juice from an orange are to be compared. Eight oranges are halved. One half of each orange is chosen at random and allocated to Method \(A\) and the other half is allocated to Method \(B\). The amounts of juice extracted, in ml , are given in the table.
    \cline { 2 - 9 } \multicolumn{1}{c|}{}Orange
    \cline { 2 - 9 } \multicolumn{1}{c|}{}12345678
    Method \(A\)2930262526222328
    Method \(B\)2725282423262225
    One statistician suggests performing a two-sample \(t\)-test to investigate whether or not there is a difference between the mean amounts of juice extracted by the two methods.
  14. Stating your hypotheses clearly and using a \(5 \%\) significance level, carry out this test.
    (You may assume \(\bar { x } _ { A } = 26.125 , s _ { A } ^ { 2 } = 7.84 , \bar { x } _ { B } = 25 , s _ { B } ^ { 2 } = 4\) and \(\sigma _ { A } ^ { 2 } = \sigma _ { B } ^ { 2 }\) ) Another statistician suggests analysing these data using a paired \(t\)-test.
  15. Using a \(5 \%\) significance level, carry out this test.
  16. State which of these two tests you consider to be more appropriate. Give a reason for your choice.
    (1) \section*{END} \section*{Advanced/Advanced Subsidiary} Wednesday 16 June 2004 - Afternoon Time: \(\mathbf { 1 }\) hour \(\mathbf { 3 0 }\) minutes Answer Book (AB16)
    Nil
    Graph Paper (ASG2)
    Mathematical Formulae (Lilac) Candidates may use any calculator EXCEPT those with the facility for symbolic algebra, differentiation and/or integration. Thus candidates may NOT use calculators such as the Texas Instruments TI 89, TI 92, Casio CFX 9970G, Hewlett Packard HP 48G. In the boxes on the answer book, write the name of the examining body (Edexcel), your centre number, candidate number, the unit title (Statistics S4), the paper reference (6686), your surname, other name and signature.
    Values from the statistical tables should be quoted in full. When a calculator is used, the answer should be given to an appropriate degree of accuracy. A booklet 'Mathematical Formulae and Statistical Tables' is provided.
    Full marks may be obtained for answers to ALL questions.
    This paper has seven questions. You must ensure that your answers to parts of questions are clearly labelled.
    You must show sufficient working to make your methods clear to the Examiner. Answers without working may gain no credit.
    1. The random variable \(X\) has an \(F\)-distribution with 8 and 12 degrees of freedom.
    Find \(\mathrm { P } \left( \frac { 1 } { 5.67 } < X < 2.85 \right)\).
Edexcel S4 2002 June Q5
13 marks Standard +0.3
5. The times, \(x\) seconds, taken by the competitors in the 100 m freestyle events at a school swimming gala are recorded. The following statistics are obtained from the data.
No. of competitorsSample Mean \(\bar { x }\)\(\sum x ^ { 2 }\)
Girls883.1055746
Boys788.9056130
Following the gala a proud parent claims that girls are faster swimmers than boys. Assuming that the times taken by the competitors are two independent random samples from normal distributions,
  1. test, at the \(10 \%\) level of significance, whether or not the variances of the two distributions are the same. State your hypotheses clearly.
  2. Stating your hypotheses clearly, test the parent's claim. Use a \(5 \%\) level of significance.