Questions — SPS SPS FM Statistics (95 questions)

Browse by board
AQA AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further AS Paper 1 Further AS Paper 2 Discrete Further AS Paper 2 Mechanics Further AS Paper 2 Statistics Further Paper 1 Further Paper 2 Further Paper 3 Discrete Further Paper 3 Mechanics Further Paper 3 Statistics M1 M2 M3 Paper 1 Paper 2 Paper 3 S1 S2 S3 CAIE FP1 FP2 Further Paper 1 Further Paper 2 Further Paper 3 Further Paper 4 M1 M2 P1 P2 P3 S1 S2 Edexcel AEA AS Paper 1 AS Paper 2 C1 C12 C2 C3 C34 C4 CP AS CP1 CP2 D1 D2 F1 F2 F3 FD1 FD1 AS FD2 FD2 AS FM1 FM1 AS FM2 FM2 AS FP1 FP1 AS FP2 FP2 AS FP3 FS1 FS1 AS FS2 FS2 AS M1 M2 M3 M4 M5 P1 P2 P3 P4 PMT Mocks Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 OCR AS Pure C1 C2 C3 C4 D1 D2 FD1 AS FM1 AS FP1 FP1 AS FP2 FP3 FS1 AS Further Additional Pure Further Additional Pure AS Further Discrete Further Discrete AS Further Mechanics Further Mechanics AS Further Pure Core 1 Further Pure Core 2 Further Pure Core AS Further Statistics Further Statistics AS H240/01 H240/02 H240/03 M1 M2 M3 M4 Mechanics 1 PURE Pure 1 S1 S2 S3 S4 Stats 1 OCR MEI AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further Extra Pure Further Mechanics A AS Further Mechanics B AS Further Mechanics Major Further Mechanics Minor Further Numerical Methods Further Pure Core Further Pure Core AS Further Pure with Technology Further Statistics A AS Further Statistics B AS Further Statistics Major Further Statistics Minor M1 M2 M3 M4 Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 SPS SPS ASFM SPS ASFM Mechanics SPS ASFM Pure SPS ASFM Statistics SPS FM SPS FM Mechanics SPS FM Pure SPS FM Statistics SPS SM SPS SM Mechanics SPS SM Pure SPS SM Statistics WJEC Further Unit 1 Further Unit 2 Further Unit 3 Further Unit 4 Further Unit 5 Further Unit 6 Unit 1 Unit 2 Unit 3 Unit 4
SPS SPS FM Statistics 2021 June Q1
  1. Employees at a company were asked how long their average commute to work was. The table below gives information about their answers.
Time taken ( \(t\) minutes)Number of people
\(0 < t \leq 10\)\(x\)
\(10 < t \leq 20\)30
\(20 < t \leq 30\)35
\(30 < t \leq 50\)28
\(50 < t \leq 90\)12
The company estimates that the mean time for employees commuting to work is 28 minutes. Work out the value of \(x\), showing your working clearly.
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 June Q2
2. Events \(A\) and \(B\) are such that \(P ( A \cup B ) = 0.95 , P ( A \cap B ) = 0.6\) and \(P ( A \mid B ) = 0.75\).
i. Find \(P ( B )\).
ii. Find \(P ( A )\).
iii. Show that the events \(A ^ { \prime }\) and \(B\) are independent.
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 June Q3
3. The letters of the word CHAFFINCH are written on cards.
i. In how many ways can the letters be rearranged with no restrictions.
ii. In how many difference ways can the letters be rearranged if the vowels are to have at least one consonant between them.
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 June Q4
4. The weights of sacks of potatoes are normally distributed. It is known that one in five sacks weigh more than 6 kg and three in five sacks weigh more than 5.5 kg .
i. Find the mean and standard deviation of the weights of potato sacks.
ii. The sacks are put into crates, with twelve sacks going into each crate. What is the probability that a given crate contains two or more sacks that weigh more than 6 kg ? You must explain your reasoning clearly in this question.
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 June Q5
5. Eleven students in a class sit a Mathematics exam and their average score is \(67 \%\) with a standard deviation of \(12 \%\). One student from the class is absent and sits the paper later, achieving a score of \(85 \%\).
i. Find the mean score for the whole class and the standard deviation for the whole class.
ii. Comment, with justification, on whether the score for the paper sat later should be considered as an outlier.
[0pt] [BLANK PAGE] \section*{6. Only two airlines fly daily into an airport.} AMP Air has 70 flights per day and Volt Air has 65 flights per day.
Passengers flying with AMP Air have an \(18 \%\) probability of losing their luggage and passengers flying with Volt Air have a \(23 \%\) probability of losing their luggage. You overhear a passenger in the airport complaining about her luggage being lost.
Find the exact probability that she travelled with Volt Air, giving your answer as a rational number.
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 June Q7
7. A continuous random variable \(X\) has probability density function \(f\) given by $$f ( x ) = \left\{ \begin{array} { c } \frac { x ^ { 2 } } { a } + b , \quad 0 \leq x \leq 4
0 \quad \text { otherwise } \end{array} \right.$$ where \(a\) and \(b\) are positive constants. It is given that \(P ( X \geq 2 ) = 0.75\).
i. Show that \(a = 32\) and \(b = \frac { 1 } { 12 }\).
ii. Find \(E ( X )\).
iii. Find \(P ( X > E ( X ) \mid X > 2 )\)
[0pt] [BLANK PAGE]
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 May Q1
1. The random variable \(X\) denotes the yield, in kilograms per acre, of a certain crop. Under the standard treatment it is known that \(\mathrm { E } ( X ) = 38.4\). Under a new treatment, the yields of 50 randomly chosen regions can be summarised as $$n = 50 , \quad \sum x = 1834.0 , \quad \sum x ^ { 2 } = 70027.37 .$$ Test at the \(1 \%\) level whether there has been a change in the mean crop yield.
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 May Q2
2. The weights of bananas sold by a supermarket are modelled by a Normal distribution with mean 205 g and standard deviation 11 g . When a banana is peeled the change in its weight is modelled as being a reduction of \(35 \%\).
a) Find the probability that the weight of a randomly selected peeled banana is at most 150 g . Andy makes smoothies. Each smoothie is made using 2 peeled bananas and 20 strawberries from the supermarket, all the items being randomly chosen. The weight of a strawberry is modelled by a Normal distribution with mean 22.5 g and standard deviation 2.7 g .
b) Find the probability that the total weight of a smoothie is less than 700 g .
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 May Q3
3. A discrete random variable \(X\) has the distribution \(\mathrm { U } ( 11 )\). The mean of 50 observations of \(X\) is denoted by \(\bar { X }\).
Use an approximate method (continuity correction is not required), which should be justified, to find \(P ( \bar { X } \leq 6.10 )\).
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 May Q4
10 marks
4. A university course was taught by two different professors. Students could choose whether to attend the lectures given by Professor \(Q\) or the lectures given by Professor \(R\). At the end of the course all the students took the same examination. The examination marks of a random sample of 30 students taught by Professor \(Q\) and a random sample of 24 students taught by Professor \(R\) were ranked. The sum of the ranks of the students taught by Professor \(Q\) was 726 . Test at the \(5 \%\) significance level whether there is a difference in the ranks of the students taught by the two professors.
[0pt] [10]
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 May Q5
5. The continuous random variable \(T\) has cumulative distribution function $$\mathrm { F } ( t ) = \begin{cases} 0 & t < 0 ,
1 - \mathrm { e } ^ { - 0.25 t } & t \geqslant 0 . \end{cases}$$
  1. Find the cumulative distribution function of \(2 T\).
  2. Show that, for constant \(k , \quad \mathrm { E } \left( \mathrm { e } ^ { k T } \right) = \frac { 1 } { 1 - 4 k }\). You should state with a reason the range of values of \(k\) for which this result is valid.
  3. \(T\) is the time before a certain event occurs. Show that the probability that no event occurs between time \(T = 0\) and time \(T = \theta\) is the same as the probability that the value of a random variable with the distribution \(\operatorname { Po } ( \lambda )\) is 0 , for a certain value of \(\lambda\). You should state this value of \(\lambda\) in terms of \(\theta\).
    [0pt] [BLANK PAGE]
    [0pt] [BLANK PAGE]
    [0pt] [BLANK PAGE]
    [0pt] [BLANK PAGE]
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2021 January Q1
1 marks
1. Alan's journey time to work can be modelled by a normal distribution with standard deviation 6 minutes. Alan measures the journey time to work for a random sample of 5 journeys. The mean of the 5 journey times is 36 minutes.
  1. Construct a \(95 \%\) confidence interval for Alan's mean journey time to work, giving your values to one decimal place.
  2. Alan claims that his mean journey time to work is 30 minutes. State, with a reason, whether or not the confidence interval found in part (a) supports Alan's claim.
    [0pt] [1 mark]
    \includegraphics[max width=\textwidth, alt={}, center]{db093bfd-a08d-4554-ba5c-5204b6045d0e-2_344_1657_1025_246} \section*{2.} Indre works on reception in an office and deals with all the telephone calls that arrive. Calls arrive randomly and, in a 4-hour morning shift, there are on average 80 calls.
  3. Using a suitable model, find the probability of more than 4 calls arriving in a particular 20-minute period one morning. Indre is allowed 20 minutes of break time during each 4-hour morning shift, which she can take in 5-minute periods. When she takes a break, a machine records details of any call in the office that Indre has missed. One morning Indre took her break time in 4 periods of 5 minutes each.
  4. Find the probability that in exactly 3 of these periods there were no calls. On another occasion Indre took 1 break of 5 minutes and 1 break of 15 minutes.
  5. Find the probability that Indre missed exactly 1 call in each of these 2 breaks.
SPS SPS FM Statistics 2021 January Q3
3. A large field of wheat is split into 8 plots of equal area. Each plot is treated with a different amount of fertiliser, \(f\) grams \(/ \mathrm { m } ^ { 2 }\). The yield of wheat, \(w\) tonnes, from each plot is recorded. The results are summarised below. $$\sum f = 28 \quad \sum w = 303 \quad \sum w ^ { 2 } = 13447 \quad \mathrm {~S} _ { f f } = 42 \quad \mathrm {~S} _ { f w } = 269.5$$
  1. Calculate the product moment correlation coefficient between \(f\) and \(w\)
  2. Interpret the value of your product moment correlation coefficient.
  3. Find the equation of the regression line of \(w\) on \(f\) in the form \(w = a + b f\)
  4. Using your equation, estimate the decrease in yield when the amount of fertiliser decreases by 0.5 grams \(/ \mathrm { m } ^ { 2 }\)
SPS SPS FM Statistics 2021 January Q4
4. The continuous random variable \(X\) has cumulative distribution function given by $$\mathrm { F } ( x ) = \left\{ \begin{array} { c c } 0 & x \leqslant 0
k \left( x ^ { 3 } - \frac { 3 } { 8 } x ^ { 4 } \right) & 0 < x \leqslant 2
1 & x > 2 \end{array} \right.$$ where \(k\) is a constant.
  1. Show that \(k = \frac { 1 } { 2 }\)
  2. Showing your working clearly, use calculus to find
    1. \(\mathrm { E } ( X )\)
    2. the mode of \(X\)
SPS SPS FM Statistics 2021 January Q5
9 marks
5. A shopkeeper sells chocolate bars which are described by the manufacturer as having an average mass of 45 grams. The shopkeeper claims that the mass of the chocolate bars, \(X\) grams, is getting smaller on average. A random sample of 6 chocolate bars is taken and their masses in grams are measured. The results are $$\sum x = 246 \quad \text { and } \quad \sum x ^ { 2 } = 10198$$ Investigate the shopkeeper's claim using the \(5 \%\) level of significance.
State any assumptions that you make.
[0pt] [9 marks]
SPS SPS FM Statistics 2021 January Q6
6. A spinner can land on red or blue. When the spinner is spun, there is a probability of \(\frac { 1 } { 3 }\) that it lands on blue. The spinner is spun repeatedly. The random variable \(B\) represents the number of the spin when the spinner first lands on blue.
  1. Find (i) \(\mathrm { P } ( B = 4 )\)
    (ii) \(\mathrm { P } ( B \leqslant 5 )\)
  2. Find \(\mathrm { E } \left( B ^ { 2 } \right)\) Steve invites Tamara to play a game with this spinner.
    Tamara must choose a colour, either red or blue.
    Steve will spin the spinner repeatedly until the spinner first lands on the colour Tamara has chosen. The random variable \(X\) represents the number of the spin when this occurs. If Tamara chooses red, her score is \(\mathrm { e } ^ { X }\)
    If Tamara chooses blue, her score is \(X ^ { 2 }\)
  3. State, giving your reasons and showing any calculations you have made, which colour you would recommend that Tamara chooses.
SPS SPS FM Statistics 2021 January Q7
7. Nine athletes, \(A , B , C , D , E , F , G , H\) and \(I\), competed in both the 100 m sprint and the long jump. After the two events the positions of each athlete were recorded and Spearman's rank correlation coefficient was calculated and found to be 0.85 The piece of paper the positions were recorded on was mislaid. Although some of the athletes agreed their positions, there was some disagreement between athletes \(B , C\) and \(D\) over their long jump results. The table shows the results that are agreed to be correct.
Athlete\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)
Position in 100 m sprint467928315
Position in long jump549312
Given that there were no tied ranks,
  1. find the correct positions of athletes \(B , C\) and \(D\) in the long jump. You must show your working clearly and give reasons for your answers.
  2. Without recalculating the coefficient, explain how Spearman's rank correlation coefficient would change if athlete \(H\) was disqualified from both the 100 m sprint and the long jump.
SPS SPS FM Statistics 2020 October Q1
  1. \(\mathrm { E } ( a X + b Y + c ) = a \mathrm { E } ( X ) + b \mathrm { E } ( Y ) + c\),
  2. if \(X\) and \(Y\) are independent then \(\operatorname { Var } ( a X + b Y + c ) = a ^ { 2 } \operatorname { Var } ( X ) + b ^ { 2 } \operatorname { Var } ( Y )\).
\section*{Discrete distributions} \(X\) is a random variable taking values \(x _ { i }\) in a discrete distribution with \(\mathrm { P } \left( X = x _ { i } \right) = p _ { i }\)
Expectation: \(\mu = \mathrm { E } ( X ) = \sum x _ { i } p _ { i }\)
Variance: \(\sigma ^ { 2 } = \operatorname { Var } ( X ) = \sum \left( x _ { i } - \mu \right) ^ { 2 } p _ { i } = \sum x _ { i } ^ { 2 } p _ { i } - \mu ^ { 2 }\)
\(P ( X = x )\)E \(( X )\)\(\operatorname { Var } ( X )\)
Binomial \(\mathrm { B } ( n , p )\)\(\binom { n } { x } p ^ { x } ( 1 - p ) ^ { n - x }\)\(n p\)\(n p ( 1 - p )\)
Uniform distribution over \(1,2 , \ldots , n , \mathrm { U } ( n )\)\(\frac { 1 } { n }\)\(\frac { n + 1 } { 2 }\)\(\frac { 1 } { 12 } \left( n ^ { 2 } - 1 \right)\)
Geometric distribution Geo(p)\(( 1 - p ) ^ { x - 1 } p\)\(\frac { 1 } { p }\)\(\frac { 1 - p } { p ^ { 2 } }\)
Poisson \(\operatorname { Po } ( \lambda )\)\(e ^ { - \lambda } \frac { \lambda ^ { x } } { x ! }\)\(\lambda\)\(\lambda\)
\section*{Continuous distributions} \(X\) is a continuous random variable with probability density function (p.d.f.) \(\mathrm { f } ( x )\)
Expectation: \(\mu = \mathrm { E } ( X ) = \int x \mathrm { f } ( x ) \mathrm { d } x\)
Variance: \(\sigma ^ { 2 } = \operatorname { Var } ( X ) = \int ( x - \mu ) ^ { 2 } \mathrm { f } ( x ) \mathrm { d } x = \int x ^ { 2 } \mathrm { f } ( x ) \mathrm { d } x - \mu ^ { 2 }\)
Cumulative distribution function \(\mathrm { F } ( x ) = \mathrm { P } ( X \leq x ) = \int _ { - \infty } ^ { x } \mathrm { f } ( t ) \mathrm { d } t\)
p.d.f.E ( \(X\) )\(\operatorname { Var } ( X )\)
Continuous uniform distribution over [ \(a , b\) ]\(\frac { 1 } { b - a }\)\(\frac { 1 } { 2 } ( a + b )\)\(\frac { 1 } { 12 } ( b - a ) ^ { 2 }\)
Exponential\(\lambda \mathrm { e } ^ { - \lambda x }\)\(\frac { 1 } { \lambda }\)\(\frac { 1 } { \lambda ^ { 2 } }\)
Normal \(N \left( \mu , \sigma ^ { 2 } \right)\)\(\frac { 1 } { \sigma \sqrt { 2 \pi } } \mathrm { e } ^ { - \frac { 1 } { 2 } \left( \frac { x - \mu } { \sigma } \right) ^ { 2 } }\)\(\mu\)\(\sigma ^ { 2 }\)
\section*{Percentage points of the normal distribution} If \(Z\) has a normal distribution with mean 0 and variance 1 then, for each value of \(p\), the table gives the value of \(z\) such that \(P ( Z \leq z ) = p\).
\(p\)0.750.900.950.9750.990.9950.99750.9990.9995
\(z\)0.6741.2821.6451.9602.3262.5762.8073.0903.291
  1. The random variable \(X\) is uniformly distributed over the interval \([ - 1,5 ]\).
    a. Sketch the probability density function \(f ( x )\) of \(X\).
    b. State the value of \(\mathrm { P } ( X = 2 )\)
Find
c. \(\mathrm { E } ( X )\)
d. \(\operatorname { Var } ( X )\)
SPS SPS FM Statistics 2020 October Q2
2. The independent random variables \(X\) and \(Y\) are such that \(\mathrm { E } ( X ) = 20 , \mathrm { E } ( Y ) = 10\), \(\operatorname { Var } ( X ) = 5\) and \(\operatorname { Var } ( Y ) = 4\). Find:
a. \(\mathrm { E } ( 2 X - Y )\)
b. \(\operatorname { Var } ( 2 X - Y )\)
SPS SPS FM Statistics 2020 October Q3
3. A survey is carried out into the length of time for which customers wait for a response on a telephone helpline. A statistician who is analysing the results of the survey starts by modelling the waiting time, \(x\) minutes, by an exponential distribution with probability density function $$f ( x ) = \left\{ \begin{array} { c c } \lambda e ^ { - \lambda x } & x \geq 0
0 & x < 0 \end{array} \right.$$ The mean waiting time is found to be 5 minutes.
a. State the value of \(\lambda\).
b. Use the model to calculate the probability that a customer has to wait longer that 20 minutes for a response.
SPS SPS FM Statistics 2020 October Q4
4. The number of A-grades, \(X\), achieved in total by students at Lowkey School in their Mathematical examinations each year can be modelled by a Poisson distribution with a mean of 3 .
a. Determine the probability that, during a 5 -year period, students at Lowkey School achieve a total of more than 18 A-grades in their Mathematics examinations.
b. The number of A-grades, \(Y\), achieved in total by students at Lowkey School in their English examinations each year can be modelled by a Poisson distribution with mean of 7 .
i. Determine the probability that, during a year, students at Lowkey School achieve a total of fewer than 15 A-grades in their Mathematics and English examinations.
ii. What assumption did you make in answering part (b)(i)?
[0pt] [Total 7 marks]
SPS SPS FM Statistics 2020 October Q5
5. The continuous random variable \(X\) has probability density function given by $$f ( x ) = \begin{cases} \frac { 4 } { 3 x ^ { 3 } } & 1 \leq x < 2
\frac { 1 } { 12 } x & 2 \leq x \leq 4
0 & \text { otherwise } \end{cases}$$ a) Find the upper quartile of \(X\).
b) Find \(\mathrm { P } \left( \frac { 1 } { 2 } < X \leq 3 \right)\)
c) Find the value of \(a\) for which \(\mathrm { E } \left( X ^ { 2 } \right) = a \mathrm { E } ( X )\).
SPS SPS FM Statistics 2020 October Q6
6. The continuous random variable \(X\) has (cumulative) distribution function given by $$F ( x ) = \left\{ \begin{array} { c c } 0 & x < 1
1 - \frac { 1 } { x ^ { 4 } } & x \geq 1 \end{array} \right.$$ a. Show that the probability density function of \(Y\), where \(Y = \frac { 1 } { X ^ { 2 } }\), is given by $$g ( y ) = \left\{ \begin{array} { c c } 2 y & 0 < y \leq 1
0 & \text { otherwise } \end{array} \right.$$ b. Find \(\mathrm { E } ( \sqrt [ 3 ] { Y } )\).
SPS SPS FM Statistics 2020 October Q7
7. The partially completed table below summarises the times taken by 120 job applicants to complete a task.
Time, \(t\) (minutes)\(5 < t \leq 7\)\(7 < t \leq 10\)\(10 < t \leq 14\)\(14 < t \leq 18\)\(18 < t \leq 30\)
Frequency102351
A histogram is drawn. The bar representing the \(5 < t \leq 7\) has a width of 1 cm and a height of 5 cm .
  1. Given that the bar representing the group \(14 < t \leq 18\) has a height of 4 cm , find the frequency of this group.
    (2)
  2. Showing your working, estimate the mean time taken by the 120 job applicants.
    (3) The lower quartile of the times is 9.6 minutes and the upper quartile of the times is 15.5 minutes.
    For these data, an outlier is classified as any value greater than \(Q _ { 3 } + 1.5 \times\) IQR .
  3. Showing your working, explain whether or not any of the times taken by these 120 job applicants might be classified as outliers.
    (2) Candidates with the fastest \(5 \%\) of times for the task are given interviews.
  4. Estimate the time taken by a job applicant, below which they might be given an interview.
    (2)
SPS SPS FM Statistics 2020 October Q8
8. A company has a customer services call centre. The company believes that the time taken to complete a call to the call centre may be modelled by a normal distribution with mean 16 minutes and standard deviation \(\sigma\) minutes. Given that \(10 \%\) of the calls take longer than 22 minutes,
  1. show that, to 3 significant figures, the value of \(\sigma\) is 4.68
  2. Calculate the percentage of calls that take less than 13 minutes. A supervisor in the call centre claims that the mean call time is less than 16 minutes. He collects data on his own call times.
    • \(20 \%\) of the supervisor's calls take more than 17 minutes to complete.
    • \(10 \%\) of the supervisor's calls take less than 8 minutes to complete.
    Assuming that the time the supervisor takes to complete a call may be modelled by a normal distribution,
  3. estimate the mean and the standard deviation of the time taken by the supervisor to complete a call.
  4. State, giving a reason, whether or not the calculations in part (c) support the supervisor's claim. \section*{9.} A fast food company has a scratchcard competition. It has ordered scratchcards for the competition and requested that \(45 \%\) of the scratchcards be winning scratchcards. A random sample of 20 of the scratchcards is collected from each of 8 of the fast food company's stores. Assuming that \(45 \%\) of the scratchcards are winning scratchcards, calculate the probability that in at least 2 of the 8 stores, 12 or more of the scratchcards are winning scratchcards.
    [0pt] [Total 5 marks] END OF TEST