Edexcel S1 (Statistics 1) 2022 June

Question 1
View details
  1. The company Seafield requires contractors to record the number of hours they work each week. A random sample of 38 weeks is taken and the number of hours worked per week by contractor Kiana is summarised in the stem and leaf diagram below.
StemLeaf
144455566999(11)
212233444\(w\)9(10)
32344567779(10)
41123(4)
519(2)
64(1)
Key : 3|2 means 32 The quartiles for this distribution are summarised in the table below.
\(Q _ { 1 }\)\(Q _ { 2 }\)\(Q _ { 3 }\)
\(x\)26\(y\)
  1. Find the values of \(w , x\) and \(y\) Kiana is looking for outliers in the data. She decides to classify as outliers any observations greater than $$Q _ { 3 } + 1.0 \times \left( Q _ { 3 } - Q _ { 1 } \right)$$
  2. Showing your working clearly, identify any outliers that Kiana finds.
  3. Draw a box plot for these data in the space provided on the grid opposite.
  4. Use the formula $$\text { skewness } = \frac { \left( Q _ { 3 } - Q _ { 2 } \right) - \left( Q _ { 2 } - Q _ { 1 } \right) } { \left( Q _ { 3 } - Q _ { 1 } \right) }$$ to find the skewness of these data. Give your answer to 2 significant figures. Kiana's new employer, Landacre, wishes to know the average number of hours per week she worked during her employment at Seafield to help calculate the cost of employing her.
  5. Explain why Landacre might prefer to know Kiana's mean, rather than median, number of hours worked per week. Turn over for a spare grid if you need to redraw your box plot.
Question 2
View details
  1. Stuart is investigating the relationship between Gross Domestic Product (GDP) and the size of the population for a particular country.
    He takes a random sample of 9 years and records the size of the population, \(t\) millions, and the GDP, \(g\) billion dollars for each of these years.
The data are summarised as $$n = 9 \quad \sum t = 7.87 \quad \sum g = 144.84 \quad \sum g ^ { 2 } = 3624.41 \quad S _ { t t } = 1.29 \quad S _ { t g } = 40.25$$
  1. Calculate the product moment correlation coefficient between \(t\) and \(g\)
  2. Give an interpretation of your product moment correlation coefficient.
  3. Find the equation of the least squares regression line of \(g\) on \(t\) in the form \(g = a + b t\)
  4. Give an interpretation of the value of \(b\) in your regression line.
    1. Use the regression line from part (c) to estimate the GDP, in billions of dollars, for a population of 7000000
    2. Comment on the reliability of your answer in part (i). Give a reason, in context, for your answer. Using the regression line from part (c), Stuart estimates that for a population increase of \(x\) million there will be an increase of 0.1 billion dollars in GDP.
  5. Find the value of \(x\)
Question 3
View details
  1. Gill buys a bag of logs to use in her stove. The lengths, \(l \mathrm {~cm}\), of the 88 logs in the bag are summarised in the table below.
Length \(( \boldsymbol { l } )\)Frequency \(( \boldsymbol { f } )\)
\(15 < l \leqslant 20\)19
\(20 < l \leqslant 25\)35
\(25 < l \leqslant 27\)16
\(27 < l \leqslant 30\)15
\(30 < l \leqslant 40\)3
A histogram is drawn to represent these data.
The bar representing logs with length \(27 < l \leqslant 30\) has a width of 1.5 cm and a height of 4 cm .
  1. Calculate the width and height of the bar representing log lengths of \(20 < l \leqslant 25\)
  2. Use linear interpolation to estimate the median of \(l\) The maximum length of log Gill can use in her stove is 26 cm .
    Gill estimates, using linear interpolation, that \(x\) logs from the bag will fit into her stove.
  3. Show that \(x = 62\) Gill randomly selects 4 logs from the bag.
  4. Using \(x = 62\), find the probability that all 4 logs will fit into her stove. The weights, \(W\) grams, of the logs in the bag are coded using \(y = 0.5 w - 255\) and summarised by $$n = 88 \quad \sum y = 924 \quad \sum y ^ { 2 } = 12862$$
  5. Calculate
    1. the mean of \(W\)
    2. the variance of \(W\)
Question 4
View details
  1. The events \(H\) and \(W\) are such that
$$\mathrm { P } ( H ) = \frac { 3 } { 8 } \quad \mathrm { P } ( H \cup W ) = \frac { 3 } { 4 }$$ Given that \(H\) and \(W\) are independent,
  1. show that \(\mathrm { P } ( W ) = \frac { 3 } { 5 }\) The event \(N\) is such that $$\mathrm { P } ( N ) = \frac { 1 } { 15 } \quad \mathrm { P } ( H \cap N ) = \mathrm { P } ( N )$$
  2. Find \(\mathrm { P } \left( N ^ { \prime } \mid H \right)\) Given that \(W\) and \(N\) are mutually exclusive,
  3. draw a Venn diagram to represent the events \(H , W\) and \(N\) giving the exact probabilities of each region in the Venn diagram.
Question 5
View details
  1. A red spinner is designed so that the score \(R\) is given by the following probability distribution.
\(r\)23456
\(\mathrm { P } ( R = r )\)0.250.30.150.10.2
  1. Show that \(\mathrm { E } \left( R ^ { 2 } \right) = 15.8\) Given also that \(\mathrm { E } ( R ) = 3.7\)
  2. find the standard deviation of \(R\), giving your answer to 2 decimal places. A yellow spinner is designed so that the score \(Y\) is given by the probability distribution in the table below. The cumulative distribution function \(\mathrm { F } ( y )\) is also given.
    \(y\)23456
    \(\mathrm { P } ( Y = y )\)0.10.20.1\(a\)\(b\)
    \(\mathrm {~F} ( y )\)0.10.30.4\(c\)\(d\)
  3. Write down the value of \(d\) Given that \(\mathrm { E } ( Y ) = 4.55\)
  4. find the value of \(c\) Pabel and Jessie play a game with these two spinners.
    Pabel uses the red spinner.
    Jessie uses the yellow spinner.
    They take turns to spin their spinner.
    The winner is the first person whose spinner lands on the number 2 and the game ends. Jessie spins her spinner first.
  5. Find the probability that Jessie wins on her second spin.
  6. Calculate the probability that, in a game, the score on Pabel's first spin is the same as the score on Jessie’s first spin.
Question 6
View details
  1. A manufacturer fills bottles with oil. The volume of oil in a bottle, \(V \mathrm { ml }\), is normally distributed with \(V \sim \mathrm {~N} \left( 100,2.5 ^ { 2 } \right)\)
    1. Find \(\mathrm { P } ( V > 104.9 )\)
    2. In a pack of 150 bottles, find the expected number of bottles containing more than 104.9 ml
    3. Find the value of \(v\), to 2 decimal places, such that \(\mathrm { P } ( V > v \mid V < 104.9 ) = 0.2801\)