2.01c Sampling techniques: simple random, opportunity, etc

167 questions

Sort by: Default | Easiest first | Hardest first
OCR MEI S3 2006 June Q3
18 marks Moderate -0.3
3 An employer has commissioned an opinion polling organisation to undertake a survey of the attitudes of staff to proposed changes in the pension scheme. The staff are categorised as management, professional and administrative, and it is thought that there might be considerable differences of opinion between the categories. There are 60,140 and 300 staff respectively in the categories. The budget for the survey allows for a sample of 40 members of staff to be selected for in-depth interviews.
  1. Explain why it would be unwise to select a simple random sample from all the staff.
  2. Discuss whether it would be sensible to consider systematic sampling.
  3. What are the advantages of stratified sampling in this situation?
  4. State the sample sizes in each category if stratified sampling with as nearly as possible proportional allocation is used. The opinion polling organisation needs to estimate the average wealth of staff in the categories, in terms of property, savings, investments and so on. In a random sample of 11 professional staff, the sample mean is \(\pounds 345818\) and the sample standard deviation is \(\pounds 69241\).
  5. Assuming the underlying population is Normally distributed, test at the \(5 \%\) level of significance the null hypothesis that the population mean is \(\pounds 300000\) against the alternative hypothesis that it is greater than \(\pounds 300000\). Provide also a two-sided \(95 \%\) confidence interval for the population mean.
    [0pt] [10]
OCR MEI S4 2006 June Q4
24 marks Standard +0.3
4 An experiment is carried out to compare five industrial paints, A, B, C, D, E, that are intended to be used to protect exterior surfaces in polluted urban environments. Five different types of surface (I, II, III, IV, V) are to be used in the experiment, and five specimens of each type of surface are available. Five different external locations ( \(1,2,3,4,5\) ) are used in the experiment. The paints are applied to the specimens of the surfaces which are then left in the locations for a period of six months. At the end of this period, a "score" is given to indicate how effective the paint has been in protecting the surface.
  1. Name a suitable experimental design for this trial and give an example of an experimental layout. Initial analysis of the data indicates that any differences between the types of surface are negligible, as also are any differences between the locations. It is therefore decided to analyse the data by one-way analysis of variance.
  2. State the usual model, including the accompanying distributional assumptions, for the one-way analysis of variance. Interpret the terms in the model.
  3. The data for analysis are as follows. Higher scores indicate better performance.
    Paint APaint BPaint CPaint DPaint E
    6466596564
    5868567852
    7376696956
    6070607261
    6771637158
    [The sum of these data items is 1626 and the sum of their squares is 106838 .]
    Construct the usual one-way analysis of variance table. Carry out the appropriate test, using a 5\% significance level. Report briefly on your conclusions.
    [0pt] [12]
OCR MEI S4 2007 June Q4
24 marks Standard +0.8
4 An agricultural company conducts a trial of five fertilisers (A, B, C, D, E) in an experimental field at its research station. The fertilisers are applied to plots of the field according to a completely randomised design. The yields of the crop from the plots, measured in a standard unit, are analysed by the one-way analysis of variance, from which it appears that there are no real differences among the effects of the fertilisers. A statistician notes that the residual mean square in the analysis of variance is considerably larger than had been anticipated from knowledge of the general behaviour of the crop, and therefore suspects that there is some inadequacy in the design of the trial.
  1. Explain briefly why the statistician should be suspicious of the design.
  2. Explain briefly why an inflated residual leads to difficulty in interpreting the results of the analysis of variance, in particular that the null hypothesis is more likely to be accepted erroneously. Further investigation indicates that the soil at the west side of the experimental field is naturally more fertile than that at the east side, with a consistent 'fertility gradient' from west to east.
  3. What experimental design can accommodate this feature? Provide a simple diagram of the experimental field indicating a suitable layout. The company decides to conduct a new trial in its glasshouse, where experimental conditions can be controlled so that a completely randomised design is appropriate. The yields are as follows.
    Fertiliser AFertiliser BFertiliser CFertiliser DFertiliser E
    23.626.018.829.017.7
    18.235.316.737.216.5
    32.430.523.032.612.8
    20.831.428.331.420.4
    [The sum of these data items is 502.6 and the sum of their squares is 13610.22 .]
  4. Construct the usual one-way analysis of variance table. Carry out the appropriate test, using a \(5 \%\) significance level. Report briefly on your conclusions.
  5. State the assumptions about the distribution of the experimental error that underlie your analysis in part (iv).
OCR S2 2013 January Q3
8 marks Moderate -0.8
3 A factory produces 9000 music DVDs each day. A random sample of 100 such DVDs is obtained.
  1. Explain how to obtain this sample using random numbers.
  2. Given that \(24 \%\) of the DVDs produced by the factory are classical, use a suitable approximation to find the probability that, in the sample of 100 DVDs, fewer than 20 are classical.
OCR S2 2015 June Q5
8 marks Standard +0.3
5
  1. State an advantage of using random numbers in selecting samples.
  2. It is known that in analysing the digits in large sets of financial records, the probability that the leading digit is 1 is 0.25 . A random sample of 18 leading digits from a certain large set of financial records is obtained and it is found that 8 of the leading digits are 1 s . Test, at the \(5 \%\) significance level, whether the probability that the leading digit is 1 in this set of records is greater than 0.25 .
OCR S2 2011 January Q8
11 marks Moderate -0.3
8 A company has 3600 employees, of whom \(22.5 \%\) live more than 30 miles from their workplace. A random sample of 40 employees is obtained.
  1. Use a suitable approximation, which should be justified, to find the probability that more than 5 of the employees in the sample live more than 30 miles from their workplace.
  2. Describe how to use random numbers to select a sample of 40 from a population of 3600 employees.
OCR S2 2009 June Q4
7 marks Moderate -0.8
4 A survey is to be carried out to draw conclusions about the proportion \(p\) of residents of a town who support the building of a new supermarket. It is proposed to carry out the survey by interviewing a large number of people in the high street of the town, which attracts a large number of tourists.
  1. Give two different reasons why this proposed method is inappropriate.
  2. Suggest a good method of carrying out the survey.
  3. State two statistical properties of your survey method that would enable reliable conclusions about \(p\) to be drawn.
OCR S2 2011 June Q1
3 marks Easy -1.8
1 In Fisher Avenue there are 263 houses, numbered 1 to 263. Explain how to obtain a random sample of 20 of these houses.
OCR S2 2012 June Q1
2 marks Easy -1.8
1 In one day's production, a machine produces 1000 CDs . Explain how to take a random sample of 15 CDs chosen from one day's production.
OCR S2 2013 June Q1
4 marks Moderate -0.8
1 It is required to select a random sample of 30 pupils from a school with 853 pupils. A student suggests the following method.
"Give each pupil sequentially a three-digit number from 001 to 853 . Use a calculator to generate random three-digit numbers from 0.000 to 0.999 inclusive, multiply the answer by 853 , add 1 and round off to the nearest whole number. Select the corresponding pupil, and repeat as necessary".
  1. Determine which pupil would be picked for each of the following calculator outputs: $$0.103 , \quad 0.104 , \quad 0.105 , \quad 0.106 , \quad 0.107$$
  2. Use your answers to part (i) to show that this method is biased, and suggest an improvement.
OCR MEI S3 2009 January Q4
18 marks Standard +0.3
4
  1. Explain the meaning of 'opportunity sampling'. Give one reason why it might be used and state one disadvantage of using it. A market researcher is conducting an 'on-street' survey in a busy city centre, for which he needs to stop and interview 100 people. For each interview the researcher counts the number of people he has to ask until one agrees to be interviewed. The data collected are as follows.
    No. of people asked1234567 or more
    Frequency261917131186
    A model for these data is proposed as follows, where \(p\) (assumed constant throughout) is the probability that a person asked agrees to be interviewed, and \(q = 1 - p\).
    No. of people asked1234567 or more
    Probability\(p\)\(p q\)\(p q ^ { 2 }\)\(p q ^ { 3 }\)\(p q ^ { 4 }\)\(p q ^ { 5 }\)\(q ^ { 6 }\)
  2. Verify that these probabilities add to 1 whatever the value of \(p\).
  3. Initially it is thought that on average 1 in 4 people asked agree to be interviewed. Test at the \(10 \%\) level of significance whether it is reasonable to suppose that the model applies with \(p = 0.25\).
  4. Later an estimate of \(p\) obtained from the data is used in the analysis. The value of the test statistic (with no combining of cells) is found to be 9.124 . What is the outcome of this new test? Comment on your answer in relation to the outcome of the test in part (iii).
OCR MEI S3 2011 January Q2
18 marks Standard +0.3
2
    1. What is stratified sampling? Why would it be used?
    2. A local authority official wishes to conduct a survey of households in the borough. He decides to select a stratified sample of 2000 households using Council Tax property bands as the strata. At the time of the survey there are 79368 households in the borough. The table shows the numbers of households in the different tax bands.
      Tax bandA - BC - DE - FG - H
      Number of households322983321197394120
      Calculate the number of households that the official should choose from each stratum in order to obtain his sample of 2000 households so that each stratum is represented proportionally.
    1. What assumption needs to be made when using a Wilcoxon single sample test?
    2. As part of an investigation into trends in local authority spending, one of the categories of expenditure considered was 'Highways and the Environment'. For a random sample of 10 local authorities, the percentages of their total expenditure spent on Highways and the Environment in 1999 and then in 2009 are shown in the table.
      Local authorityABCDEFGHIJ
      19999.608.408.679.329.899.357.918.089.618.55
      20098.948.427.878.4110.1710.118.319.769.549.67
      Use a Wilcoxon test, with a significance level of \(10 \%\), to determine whether there appears to be any change to the average percentage of total expenditure spent on Highways and the Environment between 1999 and 2009.
OCR MEI S3 2013 January Q2
18 marks Moderate -0.3
2 A particular species of reed that grows up to 2 metres in length is used for thatching. The lengths in metres of the reeds when harvested are modelled by the random variable \(X\) which has the following probability density function, \(\mathrm { f } ( x )\). $$f ( x ) = \begin{cases} \frac { 3 } { 16 } \left( 4 x - x ^ { 2 } \right) & \text { for } 0 \leqslant x \leqslant 2 \\ 0 & \text { elsewhere } \end{cases}$$
  1. Sketch \(\mathrm { f } ( x )\).
  2. Show that \(\mathrm { E } ( X ) = \frac { 5 } { 4 }\) and find the standard deviation of the lengths of the harvested reeds.
  3. Find the standard error of the mean length for a random sample of 100 reeds. Once the harvested reeds have been collected, any that are shorter than 1 metre are discarded.
  4. Find the proportion of reeds that should be discarded according to the model.
  5. Reeds are harvested from a large area which is divided into several reed beds. A sample of the harvested reeds is required for quality control. How might the method of cluster sampling be used to obtain it?
OCR MEI S3 2009 June Q3
17 marks Standard +0.3
3 A company which employs 600 staff wishes to improve its image by introducing new uniforms for the staff to wear. The human resources manager would like to obtain the views of the staff. She decides to do this by means of a systematic sample of \(10 \%\) of the staff.
  1. How should she go about obtaining such a sample, ensuring that all members of staff are equally likely to be selected? Explain whether this constitutes a simple random sample. At a later stage in the process, the choice of uniform has been reduced to two possibilities. Twelve members of staff are selected to take part in deciding which of the two uniforms to adopt. Each of the twelve assesses each uniform for comfort, appearance and practicality, giving it a total score out of 10. The scores are as follows.
    Staff member123456789101112
    Uniform A4.22.610.09.08.22.85.07.42.86.810.09.8
    Uniform B5.05.21.42.82.26.47.47.86.81.23.47.6
    A Wilcoxon signed rank test is to be used to decide whether there is any evidence of a preference for one of the uniforms.
  2. Explain why this test is appropriate in these circumstances and state the hypotheses that should be used.
  3. Carry out the test at the \(5 \%\) significance level.
OCR MEI S4 Q4
12 marks Standard +0.8
4 An experiment is carried out to compare five industrial paints, A, B, C, D, E, that are intended to be used to protect exterior surfaces in polluted urban environments. Five different types of surface (I, II, III, IV, V) are to be used in the experiment, and five specimens of each type of surface are available. Five different external locations ( \(1,2,3,4,5\) ) are used in the experiment. The paints are applied to the specimens of the surfaces which are then left in the locations for a period of six months. At the end of this period, a "score" is given to indicate how effective the paint has been in protecting the surface.
  1. Name a suitable experimental design for this trial and give an example of an experimental layout. Initial analysis of the data indicates that any differences between the types of surface are negligible, as also are any differences between the locations. It is therefore decided to analyse the data by one-way analysis of variance.
  2. State the usual model, including the accompanying distributional assumptions, for the one-way analysis of variance. Interpret the terms in the model.
  3. The data for analysis are as follows. Higher scores indicate better performance. The underlying distributions of strengths are assumed to be Normal for both suppliers, with variances 2.45 for supplier A and 1.40 for supplier B.
  4. Test at the \(5 \%\) level of significance whether it is reasonable to assume that the mean strengths from the two suppliers are equal.
  5. Provide a two-sided 90\% confidence interval for the true mean difference.
  6. Show that the test procedure used in part (i), with samples of sizes 7 and 5 and a \(5 \%\) significance level, leads to acceptance of the null hypothesis of equal means if \(- 1.556 < \bar { x } - \bar { y } < 1.556\), where \(\bar { x }\) and \(\bar { y }\) are the observed sample means from suppliers A and B . Hence find the probability of a Type II error for this test procedure if in fact the true mean strength from supplier A is 2.0 units more than that from supplier B.
  7. A manager suggests that the Wilcoxon rank sum test should be used instead, comparing the median strengths for the samples of sizes 7 and 5 . Give one reason why this suggestion might be sensible and two why it might not.
OCR MEI S4 2009 June Q4
24 marks Standard +0.3
4
  1. Describe, with the aid of a specific example, an experimental situation for which a Latin square design is appropriate, indicating carefully the features which show that a completely randomised or randomised blocks design would be inappropriate.
  2. The model for the one-way analysis of variance may be written, in a customary notation, as $$x _ { i j } = \mu + \alpha _ { i } + e _ { i j }$$ State the distributional assumptions underlying \(e _ { i j }\) in this model. What is the interpretation of the term \(\alpha _ { i }\) ?
  3. An experiment for comparing 5 treatments is carried out, with a total of 20 observations. A partial one-way analysis of variance table for the analysis of the results is as follows.
    Source of variationSums of squaresDegrees of freedomMean squaresMean square ratio
    Between treatments
    Residual68.76
    Total161.06
    Copy and complete the table, and carry out the appropriate test using a \(1 \%\) significance level.
OCR MEI D1 2012 June Q5
16 marks Moderate -0.3
5 Each morning I reach into my box of tea bags and, without looking, randomly choose a bag. The bags are manufactured in pairs, which can be separated along a perforated line. So when I choose a bag it might be attached to another, in which case I have to separate them and return the other bag to the box. Alternatively, it might be a single bag, having been separated on an earlier day. I only use one tea bag per day, and the box always gets thoroughly shaken during the day as things are moved around in the kitchen. You are to simulate this process, starting with 5 double bags and 0 single bags in the box. You are to use single-digit random numbers in your simulation.
  1. On day 2 there will be 4 double bags and 1 single bag in the box, 9 bags in total. Give a rule for simulating whether I choose a single bag or a double bag, assuming that I am equally likely to choose any of the 9 bags. Use single-digit random numbers in your simulation rule.
  2. On day 3 there will either be 4 double bags or 3 double bags and 2 single bags in the box. Give a rule for simulating what sort of bag I choose in the second of these cases. Use single-digit random numbers in your simulation rule.
  3. Using the random digits in your answer book, simulate what happens on days 2,3 and 4 , briefly explaining your simulations. Give an estimate of the probability that I choose a single bag on day 5 .
  4. Using the random digits in your answer book, carry out 4 more simulations and record the results.
  5. Using your 5 simulations, estimate the probability that I choose a single bag on day 5 .
    [0pt] [Question 6 is printed overleaf.]
OCR MEI D1 2014 June Q2
8 marks Easy -1.2
2 Honor either has coffee or tea at breakfast. On one third of days she chooses coffee, otherwise she has tea. She can never remember what she had the day before.
  1. Construct a simulation rule, using one-digit random numbers, to model Honor's choices of breakfast drink.
  2. Using the one-digit random numbers in your answer book, simulate Honor's choice of breakfast drink for 10 days. Honor also has either coffee or tea at the end of her evening meal, but she does remember what she had for breakfast, and her choice depends on it. If she had coffee at breakfast then the probability of her having coffee again is 0.55 . If she had tea for breakfast, then the probability of her having tea again is 0.15 .
  3. Construct a simulation rule, using two-digit random numbers, to model Honor's choice of evening drink given that she had coffee at breakfast. Construct a simulation rule, using two-digit random numbers, to model Honor's choice of evening drink given that she had tea at breakfast.
  4. Using your breakfast simulation from part (ii), and the two-digit random numbers in your answer book, simulate Honor's choice of evening drink for 10 days.
  5. Use your results from parts (ii) and (iv) to estimate the proportion of Honor's drinks, breakfast and evening meal combined, which are coffee. \section*{Question 3 begins on page 4}
OCR MEI D1 2015 June Q6
16 marks Moderate -0.8
6 Adrian and Kleo like to go out for meals, sometimes to a French restaurant, and sometimes to a Greek restaurant. If their last meal out was at the French restaurant, then the probability of their next meal out being at the Greek restaurant is 0.7 , whilst the probability of it being at the French restaurant is 0.3 . If their last meal out was at the Greek restaurant, then the probability of their next meal out being at the French restaurant is 0.6 , whilst the probability of it being at the Greek restaurant is 0.4 .
  1. Construct two simulation rules, each using single-digit random numbers, to model their choices of where to eat.
  2. Their last meal out was at the Greek restaurant. Use the random digits printed in your answer book to simulate their choices for the next 10 of their meals out. Hence estimate the proportion of their meals out which are at the French restaurant, and the proportion which are at the Greek restaurant. Adrian and Kleo find a Hungarian restaurant which they like. The probabilities of where they eat next are now given in the following table.
    \backslashbox{last meal out}{next meal out}FrenchGreekHungarian
    French\(\frac { 1 } { 5 }\)\(\frac { 3 } { 5 }\)\(\frac { 1 } { 5 }\)
    Greek\(\frac { 1 } { 2 }\)\(\frac { 3 } { 10 }\)\(\frac { 1 } { 5 }\)
    Hungarian\(\frac { 1 } { 3 }\)\(\frac { 1 } { 3 }\)\(\frac { 1 } { 3 }\)
  3. Construct simulation rules, each using single-digit random numbers, to model this new situation.
  4. Their last meal out was at the Greek restaurant. Use the random digits printed in your answer book to simulate their choices for the next 10 of their meals out. Hence estimate the proportion of their meals out which are at each restaurant. \section*{END OF QUESTION PAPER}
OCR MEI D1 2016 June Q1
8 marks Moderate -0.8
1 Pierre knows that, if he gambles, he will lose money in the long run. Nicolas tries to convince him that this is not the case. Pierre stakes a sum of money in a casino game. If he wins then he gets back his stake plus the same amount again. If he loses then he loses his stake. Nicolas says that Pierre can guarantee to win by repeatedly playing the game, even though the probability of winning an individual game is less than 0.5 . His idea is that Pierre should bet in the first game with a stake of \(\pounds 100\). If he wins then he stops, as he will have won \(\pounds 100\). If he loses then he plays again with a stake of \(\pounds 200\). If he wins then he has lost \(\pounds 100\) and won \(\pounds 200\). This gives a total gain of \(\pounds 100\), and he stops. If he loses then he plays again with a stake of \(\pounds 400\). If he wins this time he has lost \(\pounds 100\) and \(\pounds 200\) and won \(\pounds 400\). This gives a total gain of \(\pounds 100\), and he stops. Nicolas's advice is that Pierre simply has to continue in this way, doubling his stake every time that he loses, until he eventually wins. Nicolas says that this guarantees that Pierre will win \(\pounds 100\). You are to simulate what might happen if Pierre tries this strategy in a casino game in which the probability of him winning an individual game is 0.4 , and in which he has \(\pounds 1000\) available.
  1. Give an efficient rule for using 1-digit random numbers to simulate the outcomes of individual games, given that the probability of Pierre winning an individual game is 0.4 .
  2. Explain why at most three random digits are needed for one simulation of Nicolas's strategy, given that Pierre is starting with \(\pounds 1000\).
  3. Simulate five applications of Nicolas's strategy, using the five sets of three 1-digit random numbers in your answer book.
  4. Summarise the results of your simulations, giving your mean result.
OCR H240/02 2018 June Q10
9 marks Standard +0.3
10 A certain forest contains only trees of a particular species. Dipak wished to take a random sample of 5 trees from the forest. He numbered the trees from 1 to 784. Then, using his calculator, he generated the random digits 14781049 . Using these digits, Dipak formed 5 three-digit numbers. He took the first, second and third digits, followed by the second, third and fourth digits and so on. In this way he obtained the following list of numbers for his sample. $$\begin{array} { l l l l l } 147 & 478 & 781 & 104 & 49 \end{array}$$
  1. Explain why Dipak omitted the number 810 from his list.
  2. Explain why Dipak's sample is not random. The mean height of all trees of this species is known to be 4.2 m . Dipak wishes to test whether the mean height of trees in the forest is less than 4.2 m . He now uses a correct method to choose a random sample of 50 trees and finds that their mean height is 4.0 m . It is given that the standard deviation of trees in the forest is 0.8 m .
  3. Carry out the test at the \(2 \%\) significance level.
OCR H240/02 2021 November Q11
2 marks Moderate -0.8
11 Zac is planning to write a report on the music preferences of the students at his college. There is a large number of students at the college.
  1. State one reason why Zac might wish to obtain information from a sample of students, rather than from all the students.
  2. Amaya suggests that Zac should use a sample that is stratified by school year. Give one advantage of this method as compared with random sampling, in this context. Zac decides to take a random sample of 60 students from his college. He asks each student how many hours per week, on average, they spend listening to music during term. From his results he calculates the following statistics.
    Mean
    Standard
    deviation
    Median
    Lower
    quartile
    Upper
    quartile
    21.04.2020.518.022.9
  3. Sundip tells Zac that, during term, she spends on average 30 hours per week listening to music. Discuss briefly whether this value should be considered an outlier.
  4. Layla claims that, during term, each student spends on average 20 hours per week listening to music. Zac believes that the true figure is higher than 20 hours. He uses his results to carry out a hypothesis test at the 5\% significance level. Assume that the time spent listening to music is normally distributed with standard deviation 4.20 hours. Carry out the test.
OCR H240/02 Q9
4 marks Moderate -0.8
9 The diagram below shows some "Cycle to work" data taken from the 2001 and 2011 UK censuses. The diagram shows the percentages, by age group, of male and female workers in England and Wales, excluding London, who cycled to work in 2001 and 2011. \includegraphics[max width=\textwidth, alt={}, center]{f2f45d6c-cfdc-455b-ab08-597b06a69f36-10_951_1635_559_207} The following questions refer to the workers represented by the graphs in the diagram.
  1. A researcher is going to take a sample of men and a sample of women and ask them whether or not they cycle to work. Why would it be more important to stratify the sample of men? A research project followed a randomly chosen large sample of the group of male workers who were aged 30-34 in 2001.
  2. Does the diagram suggest that the proportion of this group who cycled to work has increased or decreased from 2001 to 2011?
    Justify your answer.
  3. Write down one assumption that you have to make about these workers in order to draw this conclusion.
Edexcel AS Paper 2 2019 June Q1
5 marks Easy -1.2
  1. A sixth form college has 84 students in Year 12 and 56 students in Year 13
The head teacher selects a stratified sample of 40 students, stratified by year group.
  1. Describe how this sample could be taken. The head teacher is investigating the relationship between the amount of sleep, s hours, that each student had the night before they took an aptitude test and their performance in the test, \(p\) marks.
    For the sample of 40 students, he finds the equation of the regression line of \(p\) on \(s\) to be $$p = 26.1 + 5.60 s$$
  2. With reference to this equation, describe the effect that an extra 0.5 hours of sleep may have, on average, on a student's performance in the aptitude test.
  3. Describe one limitation of this regression model.
Edexcel AS Paper 2 2021 November Q3
4 marks Easy -1.8
  1. Helen is studying one of the qualitative variables from the large data set for Heathrow from 2015.
She started with the data from 3rd May and then took every 10th reading.
There were only 3 different outcomes with the following frequencies
Outcome\(A\)\(B\)\(C\)
Frequency1621
  1. State the sampling technique Helen used.
  2. From your knowledge of the large data set
    1. suggest which variable was being studied,
    2. state the name of outcome \(A\). George is also studying the same variable from the large data set for Heathrow from 2015. He started with the data from 5th May and then took every 10th reading and obtained the following
      Outcome\(A\)\(B\)\(C\)
      Frequency1611
      Helen and George decided they should examine all of the data for this variable for Heathrow from 2015 and obtained the following
      Outcome\(A\)\(B\)\(C\)
      Frequency155263
  3. State what inference Helen and George could reliably make from their original samples about the outcomes of this variable at Heathrow, for the period covered by the large data set in 2015.