Edexcel S3 (Statistics 3) 2024 June

Question 1
View details
  1. The names of the 400 employees of a company are listed alphabetically in a book.
The chairperson of the company wishes to select a sample of 8 employees.
The chairperson numbers the employees from 001 to 400
  1. Describe how the list of numbers can be used to select a systematic sample of 8 employees.
  2. State one disadvantage of systematic sampling in this case.
  3. Write down the probability that the sample includes both the first name (employee 001) and the last name (employee 400) in the list.
Question 2
View details
  1. Aarush is asked to estimate the price of 7 kettles and rank them in order of decreasing price.
Aarush's order of decreasing price is \(D A F C B G E\)
The actual prices of the 7 kettles are shown in the table below.
Kettle\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)
Price (£)99.9914.9934.9749.9919.9729.998.99
  1. Calculate Spearman's rank correlation coefficient between Aarush's order and the actual order. Use a rank of 1 for the highest priced kettle.
    Show your working clearly.
  2. Using a \(5 \%\) level of significance, test whether or not there is evidence to suggest that Aarush is able to rank kettles in order of decreasing price. You should state your hypotheses and critical value.
  3. Explain why Aarush did not use the product moment correlation coefficient in this situation. Aarush discovered that kettle A's price was recorded incorrectly and should have been \(\pounds 49.99\) rather than \(\pounds 99.99\)
  4. Explain what effect this has on the rankings for the price.
Question 3
View details
  1. The volume of water in a bottle has a normal distribution with unknown mean, \(\mu\) millilitres, and known standard deviation, \(\sigma\) millilitres.
A random sample of 150 of the bottles of water gave a 95\% confidence interval for \(\mu\) of
(327.84, 329.76)
  1. Using the confidence interval given, test whether or not \(\mu = 328\) State your hypotheses clearly and write down the significance level you have used. A second random sample, of 200 of these bottles of water, had a mean volume of 328 millilitres.
  2. Calculate a 98\% confidence interval for \(\mu\) based on this second sample. You must show all steps in your working.
    (Solutions relying entirely on calculator technology are not acceptable.) Using five different random samples of 200 of these bottles of water, five \(98 \%\) confidence intervals for \(\mu\) are to be found.
  3. Calculate the probability that more than 3 of these intervals will contain \(\mu\)
Question 4
View details
  1. The manager of a company making ice cream believes that the proportions of people in the population who prefer vanilla, chocolate, strawberry and other are in the ratio \(10 : 5 : 2 : 3\)
The manager takes a random sample of 400 customers and records their age and favourite ice cream flavour. The results are shown in the table below.
\multirow{2}{*}{}Ice cream flavour
VanillaChocolateStrawberryOtherTotal
\multirow{3}{*}{Age}Child95251325158
Teenager57201736130
Adult36501016112
Total188954077400
  1. Use the data in the table to test, at the \(5 \%\) level of significance, the manager's belief. You should state your hypotheses, test statistic, critical value and conclusion clearly. A researcher wants to investigate whether or not there is a relationship between the age of a customer and their favourite ice cream flavour. In order to test whether favourite ice cream flavour and age are related, the researcher plans to carry out a \(\chi ^ { 2 }\) test.
  2. Use the table to calculate expected frequencies for the group
    1. teenagers whose favourite ice cream flavour is vanilla,
    2. adults whose favourite ice cream flavour is chocolate.
  3. Write down the number of degrees of freedom for this \(\chi ^ { 2 }\) test.
Question 5
View details
  1. A manager of a large company is investigating the time it takes the company's employees to complete a task.
The manager believes that the mean time for full-time employees to complete the task is more than a minute quicker than the mean time for part-time employees to complete the task. The manager collects a random sample of 605 full-time employees and 45 part-time employees and records the times, \(t\) minutes, it takes each employee to complete the task. The results are summarised in the table below.
\(n\)\(\bar { t }\)\(s ^ { 2 }\)
Full-time employees6055.69
Part-time employees457.04
  1. Test, at the \(5 \%\) level of significance, the manager's claim. You should state your hypotheses, test statistic, critical value and conclusion clearly.
  2. State two assumptions you have made in carrying out the test in part (a) The company increases the size of the sample of part-time employees to 46 The time taken to complete the task by the extra employee is 8 minutes.
  3. Find an unbiased estimate of the variance for the sample of 46 part-time employees.
Question 6
View details
  1. The weights of bags of carrots, \(C \mathrm {~kg}\), are such that \(C \sim \mathrm {~N} \left( 1.2,0.03 ^ { 2 } \right)\)
Three bags of carrots are selected at random.
  1. Calculate the probability that their total weight is more than 3.5 kg . The weights of bags of potatoes, \(R \mathrm {~kg}\), are such that \(R \sim \mathrm {~N} \left( 2.3,0.03 ^ { 2 } \right)\)
    Two bags of potatoes are selected at random.
  2. Calculate the probability that the difference in their weights is more than 0.05 kg . The weights of trays, \(T \mathrm {~kg}\), are such that \(T \sim \mathrm {~N} \left( 2.5 , \sqrt { 0.1 } ^ { 2 } \right)\)
    The random variable \(G\) represents the total weight, in kg, of a single tray packed with 10 bags of potatoes where \(G\) and \(T\) are independent.
  3. Calculate \(\mathrm { P } ( G < 2 T + 20 )\)
Question 7
View details
  1. The continuous random variable \(D\) is uniformly distributed over the interval \([ x - 1 , x + 5 ]\) where \(x\) is a constant.
A random sample of \(n\) observations of \(D\) is taken, where \(n\) is large.
  1. Use the Central Limit Theorem to find an approximate distribution for \(\bar { D }\) Give your answer in terms of \(n\) and \(x\) where appropriate. The \(n\) observations of \(D\) have a sample mean of 24.6
    Given that the lower bound of the \(99 \%\) confidence interval for \(x\) is 22.101 to 3 decimal places,
  2. find the value of \(n\) Show your working clearly.