5.06b Fit prescribed distribution: chi-squared test

136 questions

Sort by: Default | Easiest first | Hardest first
CAIE Further Paper 4 2022 June Q4
8 marks Standard +0.3
4 A scientist is investigating the numbers of a particular type of butterfly in a certain region. He claims that the numbers of these butterflies found per square metre can be modelled by a Poisson distribution with mean 2.5. He takes a random sample of 120 areas, each of one square metre, and counts the number of these butterflies in each of these areas. The following table shows the observed frequencies together with some of the expected frequencies using the scientist's Poisson distribution.
Number per square metre0123456\(\geqslant 7\)
Observed frequency1220363213610
Expected frequency9.8524.6330.7825.65\(p\)8.023.34\(q\)
  1. Find the values of \(p\) and \(q\), correct to 2 decimal places.
  2. Carry out a goodness of fit test, at the \(10 \%\) significance level, to test the scientist's claim.
CAIE Further Paper 4 2023 June Q3
9 marks Standard +0.8
3 A random sample of 50 values of the continuous random variable \(X\) was taken. These values are summarised in the following table.
Interval\(1 \leqslant x < 1.5\)\(1.5 \leqslant x < 2\)\(2 \leqslant x < 2.5\)\(2.5 \leqslant x < 3\)\(3 \leqslant x < 3.5\)\(3.5 \leqslant x \leqslant 4\)
Observed frequency338111312
It is required to test the goodness of fit of the distribution with probability density function \(f\) given by $$f ( x ) = \begin{cases} \frac { 1 } { 24 } \left( \frac { 4 } { x ^ { 2 } } + x ^ { 2 } \right) & 1 \leqslant x \leqslant 4 \\ 0 & \text { otherwise } \end{cases}$$ The expected frequencies, correct to 4 decimal places, are given in the following table.
Interval\(1 \leqslant x < 1.5\)\(1.5 \leqslant x < 2\)\(2 \leqslant x < 2.5\)\(2.5 \leqslant x < 3\)\(3 \leqslant x < 3.5\)\(3.5 \leqslant x \leqslant 4\)
Expected frequency4.4271\(a\)6.12858.4549\(b\)14.9678
  1. Show that \(a = 4.6007\) and find the value of \(b\).
  2. Carry out a goodness of fit test, at the \(10 \%\) significance level, to test whether f is a satisfactory model for the data.
CAIE Further Paper 4 2020 November Q3
7 marks Standard +0.3
3 Apples are sold in bags of 5. Based on her previous experience, Freya claims that the probability of any apple weighing more than 100 grams is 0.35 , independently of other apples in the bag. The apples in a random sample of 150 bags are checked and the number, \(x\), in each bag weighing more than 100 grams is recorded. The results are shown in the following table.
\(x\)012345
Frequency12394637124
Carry out a goodness of fit test at the \(5 \%\) significance level and hence comment on Freya's claim.
CAIE Further Paper 4 2021 November Q3
8 marks Standard +0.3
3 A supermarket sells pears in packs of 8 . Some of the pears in a pack may not be ripe, and the supermarket manager claims that the number of unripe pears in a pack can be modelled by the distribution \(\mathrm { B } ( 8,0.15 )\). A random sample of 150 packs was selected and the number of unripe pears in each pack was recorded. The following table shows the observed frequencies together with some of the expected frequencies using the manager's binomial distribution.
Number of unripe pears per pack012345\(\geqslant 6\)
Observed frequency35484315630
Expected frequency40.874\(p\)35.64112.5792.7750.392\(q\)
  1. Find the values of \(p\) and \(q\).
  2. Carry out a goodness of fit test, at the \(5 \%\) significance level, to test whether the manager's claim is justified.
CAIE Further Paper 4 2022 November Q2
8 marks Standard +0.8
2 An organisation runs courses to train students to become engineers. These students are taught in groups of 8 . The director of the organisation claims that on average \(60 \%\) of the students in a group achieve a pass. A random sample of 150 groups of 8 students is chosen. The following table shows the observed frequencies together with some of the expected frequencies using the appropriate binomial distribution.
Number of passes per group012345678
Observed frequency00824453626101
Expected frequency\(p\)1.1806.19318.57934.836\(q\)\(r\)13.4372.519
  1. Find the values of \(p , q\) and \(r\) giving your answers correct to 3 decimal places.
  2. Carry out a goodness of fit test, at the \(10 \%\) significance level, to test whether there is evidence to reject the director's claim.
CAIE Further Paper 4 2024 November Q3
8 marks Standard +0.3
3 A statistician believes that the number of telephone calls received by an advice centre in a 10 -minute interval can be modelled by the Poisson distribution \(\mathrm { Po } ( 1.9 )\). The number of calls received in a randomly chosen 10-minute interval was recorded on each of 100 days. The results are summarised in the table, together with some of the expected frequencies corresponding to the distribution \(\operatorname { Po } ( 1.9 )\).
Number of calls0123456 or more
Observed frequency101835211141
Expected frequency14.95728.41826.9971.322
  1. Complete the table.
  2. Carry out a goodness of fit test, at the \(10 \%\) significance level, to determine whether the statistician's belief is reasonable. \includegraphics[max width=\textwidth, alt={}, center]{e2a45d19-7d48-4aa5-93f9-6ef90f99d7c4-07_2726_35_97_20}
CAIE Further Paper 4 2024 November Q3
10 marks Standard +0.3
3 Rosie sows 5 seeds in each of 150 plant pots. The number of seeds that germinate is recorded for each pot. The results are summarised in the following table.
Number of seeds that germinate012345
Number of pots12404335164
Rosie suggests that the number of seeds that germinate follows the binomial distribution \(\mathrm { B } ( 5 , p )\).
  1. Use Rosie's results to show that \(p = 0.42\).
  2. Carry out a goodness of fit test, at the \(10 \%\) significance level, to test whether the distribution \(\mathrm { B } ( 5,0.42 )\) is a good fit for the data. \includegraphics[max width=\textwidth, alt={}, center]{b9cbf607-4f40-41bb-8374-6b2c39f945ac-06_2720_38_109_2010} \includegraphics[max width=\textwidth, alt={}, center]{b9cbf607-4f40-41bb-8374-6b2c39f945ac-07_2726_35_97_20}
CAIE Further Paper 4 2024 November Q3
8 marks Standard +0.3
3 A statistician believes that the number of telephone calls received by an advice centre in a 10 -minute interval can be modelled by the Poisson distribution \(\mathrm { Po } ( 1.9 )\). The number of calls received in a randomly chosen 10-minute interval was recorded on each of 100 days. The results are summarised in the table, together with some of the expected frequencies corresponding to the distribution \(\operatorname { Po } ( 1.9 )\).
Number of calls0123456 or more
Observed frequency101835211141
Expected frequency14.95728.41826.9971.322
  1. Complete the table.
  2. Carry out a goodness of fit test, at the \(10 \%\) significance level, to determine whether the statistician's belief is reasonable. \includegraphics[max width=\textwidth, alt={}, center]{8b2a13d7-62f4-45a7-84c5-7d5bc870b8ce-07_2726_35_97_20}
OCR S3 2007 June Q5
9 marks Standard +0.3
5 A music store sells both upright and grand pianos. Grand pianos are sold at random times and at a constant average weekly rate \(\lambda\). The probability that in one week no grand pianos are sold is 0.45 .
  1. Show that \(\lambda = 0.80\), correct to 2 decimal places. Upright pianos are sold, independently, at random times and at a constant average weekly rate \(\mu\). During a period of 100 weeks the store sold 180 upright pianos.
  2. Calculate the probability that the total number of pianos sold in a randomly chosen week will exceed 3.
  3. Calculate the probability that over a period of 3 weeks the store sells a total of 6 pianos during the first week and a total of 4 pianos during the next fortnight.
OCR S3 2011 June Q4
9 marks Standard +0.3
4 An experiment by Lord Rutherford at Cambridge in 1909 involved measuring the numbers of \(\alpha\)-particles emitted during radioactive decay. The following table shows emissions during 2608 intervals of 7.5 seconds.
Number of particles emitted, \(x\)012345678910\(\geqslant 11\)
Frequency572033835255324082731394527106
It is given that the mean number of particles emitted per interval, calculated from the data, is 3.87 , correct to 3 significant figures.
  1. Find the contribution to the \(\chi ^ { 2 }\) value of the frequency of 273 corresponding to \(x = 6\) in a goodness of fit test for a Poisson distribution.
  2. Given that no cells need to be combined, state why the number of degrees of freedom is 10 .
  3. Given also that the calculated value of \(\chi ^ { 2 }\) is 13.0 , correct to 3 significant figures, carry out the test at the 10\% significance level.
OCR MEI S3 2007 January Q4
18 marks Standard +0.3
4
  1. An amateur weather forecaster has been keeping records of air pressure, measured in atmospheres. She takes the measurement at the same time every day using a barometer situated in her garden. A random sample of 100 of her observations is summarised in the table below. The corresponding expected frequencies for a Normal distribution, with its two parameters estimated by sample statistics, are also shown in the table.
    Pressure ( \(a\) atmospheres)Observed frequencyFrequency as given by Normal model
    \(a \leqslant 0.98\)41.45
    \(0.98 < a \leqslant 0.99\)65.23
    \(0.99 < a \leqslant 1.00\)913.98
    \(1.00 < a \leqslant 1.01\)1523.91
    \(1.01 < a \leqslant 1.02\)3726.15
    \(1.02 < a \leqslant 1.03\)2118.29
    \(1.03 < a\)810.99
    Carry out a test at the \(5 \%\) level of significance of the goodness of fit of the Normal model. State your conclusion carefully and comment on your findings.
  2. The forecaster buys a new digital barometer that can be linked to her computer for easier recording of observations. She decides that she wishes to compare the readings of the new barometer with those of the old one. For a random sample of 10 days, the readings (in atmospheres) of the two barometers are shown below.
    DayABCDEFGHIJ
    Old0.9921.0051.0011.0111.0260.9801.0201.0251.0421.009
    New0.9851.0031.0021.0141.0220.9881.0301.0161.0471.025
    Use an appropriate Wilcoxon test to examine at the \(10 \%\) level of significance whether there is any reason to suppose that, on the whole, readings on the old and new barometers do not agree.
OCR MEI S3 2006 June Q1
18 marks Standard +0.3
1 Design engineers are simulating the load on a particular part of a complex structure. They intend that the simulated load, measured in a convenient unit, should be given by the random variable \(X\) having probability density function $$f ( x ) = 12 x ^ { 3 } - 24 x ^ { 2 } + 12 x , \quad 0 \leqslant x \leqslant 1 .$$
  1. Find the mean and the mode of \(X\).
  2. Find the cumulative distribution function \(\mathrm { F } ( x )\) of \(X\). $$\text { Verify that } \mathrm { F } \left( \frac { 1 } { 4 } \right) = \frac { 67 } { 256 } , \mathrm {~F} \left( \frac { 1 } { 2 } \right) = \frac { 11 } { 16 } \text { and } \mathrm { F } \left( \frac { 3 } { 4 } \right) = \frac { 243 } { 256 } .$$ The engineers suspect that the process for generating simulated loads might not be working as intended. To investigate this, they generate a random sample of 512 loads. These are recorded in a frequency distribution as follows.
    Load \(x\)\(0 \leqslant x \leqslant \frac { 1 } { 4 }\)\(\frac { 1 } { 4 } < x \leqslant \frac { 1 } { 2 }\)\(\frac { 1 } { 2 } < x \leqslant \frac { 3 } { 4 }\)\(\frac { 3 } { 4 } < x \leqslant 1\)
    Frequency12620913146
  3. Use a suitable statistical procedure to assess the goodness of fit of \(X\) to these data. Discuss your conclusions briefly.
OCR MEI S4 2006 June Q4
24 marks Standard +0.3
4 An experiment is carried out to compare five industrial paints, A, B, C, D, E, that are intended to be used to protect exterior surfaces in polluted urban environments. Five different types of surface (I, II, III, IV, V) are to be used in the experiment, and five specimens of each type of surface are available. Five different external locations ( \(1,2,3,4,5\) ) are used in the experiment. The paints are applied to the specimens of the surfaces which are then left in the locations for a period of six months. At the end of this period, a "score" is given to indicate how effective the paint has been in protecting the surface.
  1. Name a suitable experimental design for this trial and give an example of an experimental layout. Initial analysis of the data indicates that any differences between the types of surface are negligible, as also are any differences between the locations. It is therefore decided to analyse the data by one-way analysis of variance.
  2. State the usual model, including the accompanying distributional assumptions, for the one-way analysis of variance. Interpret the terms in the model.
  3. The data for analysis are as follows. Higher scores indicate better performance.
    Paint APaint BPaint CPaint DPaint E
    6466596564
    5868567852
    7376696956
    6070607261
    6771637158
    [The sum of these data items is 1626 and the sum of their squares is 106838 .]
    Construct the usual one-way analysis of variance table. Carry out the appropriate test, using a 5\% significance level. Report briefly on your conclusions.
    [0pt] [12]
OCR MEI S4 2007 June Q4
24 marks Standard +0.8
4 An agricultural company conducts a trial of five fertilisers (A, B, C, D, E) in an experimental field at its research station. The fertilisers are applied to plots of the field according to a completely randomised design. The yields of the crop from the plots, measured in a standard unit, are analysed by the one-way analysis of variance, from which it appears that there are no real differences among the effects of the fertilisers. A statistician notes that the residual mean square in the analysis of variance is considerably larger than had been anticipated from knowledge of the general behaviour of the crop, and therefore suspects that there is some inadequacy in the design of the trial.
  1. Explain briefly why the statistician should be suspicious of the design.
  2. Explain briefly why an inflated residual leads to difficulty in interpreting the results of the analysis of variance, in particular that the null hypothesis is more likely to be accepted erroneously. Further investigation indicates that the soil at the west side of the experimental field is naturally more fertile than that at the east side, with a consistent 'fertility gradient' from west to east.
  3. What experimental design can accommodate this feature? Provide a simple diagram of the experimental field indicating a suitable layout. The company decides to conduct a new trial in its glasshouse, where experimental conditions can be controlled so that a completely randomised design is appropriate. The yields are as follows.
    Fertiliser AFertiliser BFertiliser CFertiliser DFertiliser E
    23.626.018.829.017.7
    18.235.316.737.216.5
    32.430.523.032.612.8
    20.831.428.331.420.4
    [The sum of these data items is 502.6 and the sum of their squares is 13610.22 .]
  4. Construct the usual one-way analysis of variance table. Carry out the appropriate test, using a \(5 \%\) significance level. Report briefly on your conclusions.
  5. State the assumptions about the distribution of the experimental error that underlie your analysis in part (iv).
OCR S3 2014 June Q2
6 marks Standard +0.3
2 In a study of the inheritance of skin colouration in corn snakes, a researcher found 865 snakes with black and orange bodies, 320 snakes with black bodies, 335 snakes with orange bodies and 112 snakes with bodies of other colours. Theory predicts that snakes of these colours should occur in the ratios \(9 : 3 : 3 : 1\). Test, at the \(5 \%\) significance level, whether these experimental results are compatible with theory.
OCR S3 2015 June Q6
13 marks Standard +0.3
6 In each of 38 randomly selected weeks of the English Premier Football League there were 10 matches. Table 1 summarises the number of home wins in 10 matches, \(X\), and the corresponding number of weeks. \begin{table}[h]
Number of home wins012345678910
Number of weeks01288971200
\captionsetup{labelformat=empty} \caption{Table 1}
\end{table} A researcher investigates whether \(X\) can be modelled by the distribution \(\mathrm { B } ( 10 , p )\). He calculates the expected frequencies using a value of \(p\) obtained from the sample mean.
  1. Show that \(p = 0.45\). Table 2 shows the observed and expected number of weeks. \begin{table}[h]
    Number of home wins012345678910Totals
    Observed number of weeks0128897120038
    Expected number of weeks0.0960.7882.8996.3269.0588.8936.0642.8350.8700.1580.01338
    \captionsetup{labelformat=empty} \caption{Table 2
  2. Show how the value of 2.835 for 7 home wins is obtained.}
\end{table} The researcher carries out a test, at the \(5 \%\) significance level, of whether the distribution \(\mathrm { B } ( 10 , p )\) fits the data.
  • Explain why it is necessary to combine classes.
  • Carry out the test.
  • CAIE Further Paper 4 2020 Specimen Q2
    7 marks Challenging +1.2
    2 Each of 200 identically biased dice is thrown repeatedly until an even number is obtained. The number of throws needed is recorded and the results are summarised in the following table.
    Number of throws123456\(\geqslant 7\)
    Frequency12643223510
    Carry out a goodness of fit test, at the \(5\%\) significance level, to test whether \(\operatorname{Geo}(0.6)\) is a satisfactory model for the data.
    OCR S3 2009 January Q8
    14 marks Standard +0.3
    8 A soft drinks factory produces lemonade which is sold in packs of 6 bottles. As part of the factory's quality control, random samples of 75 packs are examined at regular intervals. The number of underfilled bottles in a pack of 6 bottles is denoted by the random variable \(X\). The results of one quality control check are shown in the following table.
    Number of underfilled bottles0123
    Number of packs442083
    A researcher assumes that \(X \sim \mathrm {~B} ( 3 , p )\).
    1. By finding the sample mean, show that an estimate of \(p\) is 0.2 .
    2. Show that, at the \(5 \%\) significance level, there is evidence that this binomial distribution does not fit the data.
    3. Another researcher suggests that the goodness of fit test should be for \(\mathrm { B } ( 6 , p )\). She finds that the corresponding value of \(\chi ^ { 2 }\) is 2.74 , correct to 3 significant figures. Given that the number of degrees of freedom is the same as in part (ii), state the conclusion of the test at the same significance level.
    OCR S3 2009 June Q7
    14 marks Standard +0.3
    7 In 1761, James Short took measurements of the parallax of the sun based on the transit of Venus. The mean and standard deviation of a random sample of 50 of these measurements are 8.592 and 0.7534 respectively, in suitable units.
    1. Show that if \(X \sim \mathrm {~N} \left( 8.592,0.7534 ^ { 2 } \right)\), then $$\mathrm { P } ( X \leqslant 8.084 ) = \mathrm { P } ( 8.084 < X \leqslant 8.592 ) = \mathrm { P } ( 8.592 < X \leqslant 9.100 ) = \mathrm { P } ( X > 9.100 ) = 0.25 \text {. }$$ The following table summarises the 50 measurements using these intervals.
      Measurement \(( x )\)\(x \leqslant 8.084\)\(8.084 < x \leqslant 8.592\)\(8.592 < x \leqslant 9.100\)\(x > 9.100\)
      Frequency822119
    2. Carry out a test, at the \(\frac { 1 } { 2 } \%\) significance level, of whether a normal distribution fits the data.
    3. Obtain a 99\% confidence interval for the mean of all similar parallax measurements.
    OCR S3 2010 June Q5
    10 marks Standard +0.3
    5 A random variable \(X\) is believed to have (cumulative) distribution function given by $$\mathrm { F } ( x ) = \begin{cases} 0 & x < 0 , \\ 1 - \mathrm { e } ^ { - x ^ { 2 } } & x \geqslant 0 . \end{cases}$$ In order to test this, a random sample of 150 observations of \(X\) were taken, and their values are summarised in the following grouped frequency table.
    Values\(0 \leqslant x < 0.5\)\(0.5 \leqslant x < 1\)\(1 \leqslant x < 1.5\)\(1.5 \leqslant x < 2\)\(x \geqslant 2\)
    Frequency415032234
    The expected frequencies, correct to 1 decimal place, corresponding to the above distribution, are 33.2, 61.6 and 39.4 respectively for the first 3 cells.
    1. Find the expected frequencies for the last 2 cells.
    2. Carry out a goodness of fit test at the \(2 \frac { 1 } { 2 } \%\) significance level.
    OCR S3 2012 June Q7
    16 marks Standard +0.3
    7 A study was carried out into whether patients suffering from a certain respiratory disorder would benefit from particular treatments. Each of 90 patients who agreed to take part was given one of three treatments \(A\), \(B\) or \(C\) as shown in the table.
    Treatment\(A\)\(B\)\(C\)
    Number in group312534
    1. It is claimed that each patient was equally likely to have been given any of the treatments. Test at the \(5 \%\) significance level whether the numbers given each treatment are consistent with this claim.
    2. After 3 months the numbers of patients showing improvement for treatments \(A , B\) and \(C\) were 14, 18 and 25 respectively. By setting up a \(2 \times 3\) contingency table, test whether the outcome is dependent on the treatment. Use a \(5 \%\) significance level.
    3. If one of the treatments is abandoned, explain briefly which it should be. \section*{THERE ARE NO QUESTIONS WRITTEN ON THIS PAGE}
    OCR MEI S3 2010 January Q1
    17 marks Standard +0.3
    1 Coastal wildlife wardens are monitoring populations of herring gulls. Herring gulls usually lay 3 eggs per nest and the wardens wish to model the number of eggs per nest that hatch. They assume that the situation can be modelled by the binomial distribution \(\mathrm { B } ( 3 , p )\) where \(p\) is the probability that an egg hatches. A random sample of 80 nests each containing 3 eggs has been observed with the following results.
    Number of eggs hatched0123
    Number of nests7232921
    1. Initially it is assumed that the value of \(p\) is \(\frac { 1 } { 2 }\). Test at the \(5 \%\) level of significance whether it is reasonable to suppose that the model applies with \(p = \frac { 1 } { 2 }\).
    2. The model is refined by estimating \(p\) from the data. Find the mean of the observed data and hence an estimate of \(p\).
    3. Using the estimated value of \(p\), the value of the test statistic \(X ^ { 2 }\) turns out to be 2.3857 . Is it reasonable to suppose, at the \(5 \%\) level of significance, that this refined model applies?
    4. Discuss the reasons for the different outcomes of the tests in parts (i) and (iii).
    OCR MEI S3 2011 January Q3
    18 marks Standard +0.3
    3 The masses, in kilograms, of a random sample of 100 chickens on sale in a large supermarket were recorded as follows.
    Mass \(( m \mathrm {~kg} )\)\(m < 1.6\)\(1.6 \leqslant m < 1.8\)\(1.8 \leqslant m < 2.0\)\(2.0 \leqslant m < 2.2\)\(2.2 \leqslant m < 2.4\)\(2.4 \leqslant m < 2.6\)\(2.6 \leqslant m\)
    Frequency2830421152
    1. Assuming that the first and last classes are the same width as the other classes, calculate an estimate of the sample mean and show that the corresponding estimate of the sample standard deviation is 0.2227 kg . A Normal distribution using the mean and standard deviation found in part (i) is to be fitted to these data. The expected frequencies for the classes are as follows.
      Mass \(( m \mathrm {~kg} )\)\(m < 1.6\)\(1.6 \leqslant m < 1.8\)\(1.8 \leqslant m < 2.0\)\(2.0 \leqslant m < 2.2\)\(2.2 \leqslant m < 2.4\)\(2.4 \leqslant m < 2.6\)\(2.6 \leqslant m\)
      Expected
      frequency
      2.1710.92\(f\)33.8519.225.130.68
    2. Use the Normal distribution to find \(f\).
    3. Carry out a goodness of fit test of this Normal model using a significance level of 5\%.
    4. Discuss the outcome of the test with reference to the contributions to the test statistic and to the possibility of other significance levels.
    OCR MEI S3 2012 January Q3
    18 marks Standard +0.3
    3
    1. A medical researcher is looking into the delay, in years, between first and second myocardial infarctions (heart attacks). The following table shows the results for a random sample of 225 patients.
      Delay (years)\(0 -\)\(1 -\)\(2 -\)\(3 -\)\(4 - 10\)
      Number of patients160401393
      The mean of this sample is used to construct a model which gives the following expected frequencies.
      Delay (years)\(0 -\)\(1 -\)\(2 -\)\(3 -\)\(4 - 10\)
      Number of patients142.2352.3219.257.084.12
      Carry out a test, using a \(2.5 \%\) level of significance, of the goodness of fit of the model to the data.
    2. A further piece of research compares the incidence of myocardial infarction in men aged 55 to 70 with that in women aged 55 to 70 . Incidence is measured by the number of infarctions per 10000 of the population. For a random sample of 8 health authorities across the UK, the following results for the year 2010 were obtained.
      Health authorityABCDEFGH
      Incidence in men4756155145545032
      Incidence in women3630304754552727
      A Wilcoxon paired sample test, using the hypotheses \(\mathrm { H } _ { 0 } : m = 0\) and \(\mathrm { H } _ { 1 } : m \neq 0\) where \(m\) is the population median difference, is to be carried out to investigate whether there is any difference between men and women on the whole.
      1. Explain why a paired test is being used in this context.
      2. Carry out the test using a \(10 \%\) level of significance.
    OCR MEI S3 2013 January Q4
    18 marks Moderate -0.3
    4
    1. At a college, two examiners are responsible for marking, independently, the students' projects. Each examiner awards a mark out of 100 to each project. There is some concern that the examiners' marks do not agree, on average. Consequently a random sample of 12 projects is selected and the marks awarded to them are compared.
      1. Describe how a random sample of projects should be chosen.
      2. The marks given for the projects in the sample are as follows.
        Project123456789101112
        Examiner A583772786777624180606570
        Examiner B734774717896542797736066
        Carry out a test at the \(10 \%\) level of significance of the hypotheses \(\mathrm { H } _ { 0 } : m = 0 , \mathrm { H } _ { 1 } : m \neq 0\), where \(m\) is the population median difference.
    2. A calculator has a built-in random number function which can be used to generate a list of random digits. If it functions correctly then each digit is equally likely to be generated. When it was used to generate 100 random digits, the frequencies of the digits were as follows.
      Digit0123456789
      Frequency681114129155146
      Use a goodness of fit test, with a significance level of \(10 \%\), to investigate whether the random number function is generating digits with equal probability.