Edexcel S3 (Statistics 3) 2024 January

Question 1
View details
  1. Chen is treating vines to prevent fungus appearing. One month after the treatment, Chen monitors the vines to see if fungus is present.
The contingency table shows information about the type of treatment for a sample of 150 vines and whether or not fungus is present.
\multirow{2}{*}{}Type of treatment
NoneSulphurCopper sulphate
No fungus present205548
Fungus present1089
Test, at the \(5 \%\) level of significance, whether or not there is any association between the type of treatment and the presence of fungus.
Show your working clearly, stating your hypotheses, expected frequencies, test statistic and critical value.
Question 2
View details
  1. A company has 800 employees.
The manager of the company is going to take a sample of 80 employees.
  1. Explain how this sample can be taken using systematic sampling. The company has offices in London, Edinburgh and Cardiff. The table shows the number of employees in each city.
    CityLondonEdinburghCardiff
    Number of employees430250120
    The president of the company is going to take a sample of 100 employees to determine the average time employees spend in front of a computer each week.
  2. Explain how this sample can be taken using stratified sampling.
  3. Explain an advantage of using stratified sampling rather than simple random sampling.
Question 3
View details
  1. The table shows the annual tea consumption, \(t\) (kg/person), and population, \(p\) (millions), for a random sample of 7 European countries.
CountryABCDEFG
Annual tea consumption, \(\boldsymbol { t }\) (kg/person)0.270.150.420.061.940.780.44
Population, \(\boldsymbol { p }\) (millions)5.45.8910.267.917.18.7
$$\text { (You may use } \mathrm { S } _ { t t } = 2.486 \quad \mathrm {~S} _ { p p } = 3026.234 \quad \mathrm {~S} _ { p t } = 83.634 \text { ) }$$ Angela suggests using the product moment correlation coefficient to calculate the correlation between annual tea consumption and population.
  1. Use Angela's suggestion to test, at the \(5 \%\) level of significance, whether or not there is evidence of any correlation between annual tea consumption and population. State your hypotheses clearly and the critical value used. Johan suggests using Spearman's rank correlation coefficient to calculate the correlation between the rank of annual tea consumption and the rank of population.
  2. Calculate Spearman's rank correlation coefficient between the rank of annual tea consumption and the rank of population.
  3. Use Johan's suggestion to test, at the \(5 \%\) level of significance, whether or not there is evidence of a positive correlation between annual tea consumption and population.
    State your hypotheses clearly and the critical value used.
Question 4
View details
  1. The number of jobs sent to a printer per hour in a small office is recorded for 120 hours. The results are summarised in the following table.
Number of jobs012345
Frequency2434282185
  1. Show that the mean number of jobs sent to the printer per hour for these data is 1.75 The office manager believes that the number of jobs sent to the printer per hour can be modelled using a Poisson distribution. The office manager uses the mean given in part (a) to calculate the expected frequencies for this model. Some of the results are given in the following table.
    Number of jobs012345 or more
    Expected frequency20.8536.4931.93\(r\)\(s\)3.95
  2. Show that the value of \(s\) is 8.15 to 2 decimal places.
  3. Find the value of \(r\) to 2 decimal places. The value of \(\sum \frac { \left( O _ { i } - E _ { i } \right) ^ { 2 } } { E _ { i } }\) for the first four frequencies in the table is 1.43
  4. Test, at the \(5 \%\) level of significance, whether or not the number of jobs sent to the printer per hour can be modelled using a Poisson distribution. Show your working clearly, stating your hypotheses, test statistic and critical value.
Question 5
View details
  1. A professor claims that undergraduates studying History have a typing speed of more than 15 words per minute faster than undergraduates studying Maths.
A sample is taken of 38 undergraduates studying History and 45 undergraduates studying Maths. The typing speed, \(x\) words per minute, of each undergraduate is recorded. The results are summarised in the table below.
\(n\)\(\bar { x }\)\(s ^ { 2 }\)
Undergraduates studying History3856.327.2
Undergraduates studying Maths4539.818.5
  1. Use a suitable test, at the \(5 \%\) level of significance, to investigate the professor's claim.
    State clearly your hypotheses, test statistic and critical value.
  2. State two assumptions you have made in carrying out the test in part (a).
Question 6
View details
  1. A random sample of 8 three-month-old golden retriever dogs is taken.
The heights of the golden retrievers are recorded.
Using this sample, a 95\% confidence interval for the mean height, in cm, of three-month-old golden retrievers is found to be \(( 45.72,53.88 )\)
  1. Find a 99\% confidence interval for the mean height. You may assume that the heights are normally distributed with known population standard deviation. Some summary statistics for the weights, \(x \mathrm {~kg}\), of this sample are given below. $$\sum x = 91.2 \quad \sum x ^ { 2 } = 1145.16 \quad n = 8$$
  2. Calculate unbiased estimates of the mean and the variance of the weights of three-month-old golden retrievers. A further random sample of 24 three-month-old golden retrievers is taken. The unbiased estimates of the mean and the variance of the weights, in kg , from this sample are found to be 10.8 and 17.64 respectively.
  3. Estimate the standard error of the mean weight for the combined sample of 32 three-month-old golden retrievers.
Question 7
View details
  1. Small containers and large containers are independently filled with fruit juice.
The amounts of fruit juice in small containers are normally distributed with mean 180 ml and standard deviation 4.5 ml The amounts of fruit juice in large containers are normally distributed with mean 330 ml and standard deviation 6.7 ml The random variable \(W\) represents the total amount of fruit juice in a random sample of 2 small containers minus the amount of fruit juice in 1 randomly selected large container.
\(W \sim \mathrm {~N} ( a , b )\) where \(a\) and \(b\) are positive constants.
  1. Find the value of \(a\) and the value of \(b\)
  2. Find the probability that a randomly chosen large container of fruit juice contains more than 1.8 times the amount of fruit juice in a randomly chosen small container. A random sample of 3 small containers of fruit juice is taken.
  3. Find the probability that the first container of fruit juice in this sample contains at least 5 ml more than the mean amount of fruit juice in all 3 small containers.