Questions — SPS SPS FM Statistics (95 questions)

Browse by board
AQA AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further AS Paper 1 Further AS Paper 2 Discrete Further AS Paper 2 Mechanics Further AS Paper 2 Statistics Further Paper 1 Further Paper 2 Further Paper 3 Discrete Further Paper 3 Mechanics Further Paper 3 Statistics M1 M2 M3 Paper 1 Paper 2 Paper 3 S1 S2 S3 CAIE FP1 FP2 Further Paper 1 Further Paper 2 Further Paper 3 Further Paper 4 M1 M2 P1 P2 P3 S1 S2 Edexcel AEA AS Paper 1 AS Paper 2 C1 C12 C2 C3 C34 C4 CP AS CP1 CP2 D1 D2 F1 F2 F3 FD1 FD1 AS FD2 FD2 AS FM1 FM1 AS FM2 FM2 AS FP1 FP1 AS FP2 FP2 AS FP3 FS1 FS1 AS FS2 FS2 AS M1 M2 M3 M4 M5 P1 P2 P3 P4 PMT Mocks Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 OCR AS Pure C1 C2 C3 C4 D1 D2 FD1 AS FM1 AS FP1 FP1 AS FP2 FP3 FS1 AS Further Additional Pure Further Additional Pure AS Further Discrete Further Discrete AS Further Mechanics Further Mechanics AS Further Pure Core 1 Further Pure Core 2 Further Pure Core AS Further Statistics Further Statistics AS H240/01 H240/02 H240/03 M1 M2 M3 M4 Mechanics 1 PURE Pure 1 S1 S2 S3 S4 Stats 1 OCR MEI AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further Extra Pure Further Mechanics A AS Further Mechanics B AS Further Mechanics Major Further Mechanics Minor Further Numerical Methods Further Pure Core Further Pure Core AS Further Pure with Technology Further Statistics A AS Further Statistics B AS Further Statistics Major Further Statistics Minor M1 M2 M3 M4 Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 SPS SPS ASFM SPS ASFM Mechanics SPS ASFM Pure SPS ASFM Statistics SPS FM SPS FM Mechanics SPS FM Pure SPS FM Statistics SPS SM SPS SM Mechanics SPS SM Pure SPS SM Statistics WJEC Further Unit 1 Further Unit 2 Further Unit 3 Further Unit 4 Further Unit 5 Further Unit 6 Unit 1 Unit 2 Unit 3 Unit 4
SPS SPS FM Statistics 2024 September Q6
6. A television company believes that the proportion of households that can receive Channel C is 0.35 .
  1. In a random sample of 14 households it is found that 2 can receive Channel C. Test, at the \(2.5 \%\) significance level, whether there is evidence that the proportion of households that can receive Channel C is less than 0.35 .
  2. On another occasion the test is carried out again, with the same hypotheses and significance level as in part (i), but using a new sample, of size \(n\). It is found that no members of the sample can receive Channel C. Find the largest value of \(n\) for which the null hypothesis is not rejected. Show all relevant working.
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2024 September Q7
7. The continuous random variable \(X\) has the following probability density function: $$f ( x ) = \begin{cases} a + b x & 0 \leqslant x \leqslant 2
0 & \text { otherwise } \end{cases}$$ where \(a\) and \(b\) are constants.
  1. Show that \(2 a + 2 b = 1\).
  2. It is given that \(\mathrm { E } ( X ) = \frac { 11 } { 9 }\). Use this information to find a second equation connecting \(a\) and \(b\), and hence find the values of \(a\) and \(b\).
  3. Determine whether the median of \(X\) is greater than, less than, or equal to \(\mathrm { E } ( X )\).
    [0pt] [BLANK PAGE]
    [0pt] [BLANK PAGE]
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2025 January Q1
6 marks
  1. \(\mathrm { E } ( a X + b Y + c ) = a \mathrm { E } ( X ) + b \mathrm { E } ( Y ) + c\),
  2. if \(X\) and \(Y\) are independent then \(\operatorname { Var } ( a X + b Y + c ) = a ^ { 2 } \operatorname { Var } ( X ) + b ^ { 2 } \operatorname { Var } ( Y )\).
\section*{Non-parametric tests} Goodness-of-fit test and contingency tables: \(\sum \frac { \left( O _ { i } - E _ { i } \right) ^ { 2 } } { E _ { i } } \sim \chi _ { v } ^ { 2 }\)
Approximate distributions for large samples
Wilcoxon Signed Rank test: \(T \sim \mathrm {~N} \left( \frac { 1 } { 4 } n ( n + 1 ) , \frac { 1 } { 24 } n ( n + 1 ) ( 2 n + 1 ) \right)\)
Wilcoxon Rank Sum test (samples of sizes \(m\) and \(n\), with \(m \leq n\) ): $$W \sim \mathrm {~N} \left( \frac { 1 } { 2 } m ( m + n + 1 ) , \frac { 1 } { 12 } m n ( m + n + 1 ) \right)$$ \section*{Percentage points of the normal distribution} If \(Z\) has a normal distribution with mean 0 and variance 1 then, for each value of \(p\), the table gives the value of \(z\) such that \(P ( Z \leq z ) = p\). Test whether there is evidence, at the \(1 \%\) significance level, that the judges agree with each another. The marks given by the two judges to the 7 randomly chosen contestants were as follows, where \(x\) is an integer.
Contestant\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)
Judge 164656778798086
Judge 2616378808190\(x\)
(b) Use the value \(r _ { s } = \frac { 27 } { 28 }\) to determine the range of possible values of \(x\).
(c) Give a reason why it might be preferable to use the product moment correlation coefficient rather than Spearman's rank correlation coefficient in this context. Total: \(\_\_\_\_\) / 9 marks \section*{Question 7} A circle, centre \(O\), has radius \(x \mathrm {~cm}\), where \(x\) is an observation from the random variable \(X\) which has a uniform distribution on \([ 0 , \pi ]\)
(a) Find the probability that the area of the circle is greater than \(10 \mathrm {~cm} ^ { 2 }\)
(b) State, giving a reason, whether the median area of the circle is greater or less than \(10 \mathrm {~cm} ^ { 2 }\) The triangle \(O A B\) is drawn inside the circle with \(O A\) and \(O B\) as radii of length \(x \mathrm {~cm}\) and angle \(A O B\) is \(x\) radians.
(c) Use algebraic integration to find the expected value of the area of triangle \(O A B\). Give your answer as an exact value. Detailed working is required and calculator integration is not allowed.
[0pt] [6] Total: \(\_\_\_\_\) / 10 marks End of Paper
SPS SPS FM Statistics 2026 January Q1
1. At a wine-tasting competition, two judges give marks out of 100 to 7 wines as follows.
Wine\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)
Judge I86.387.587.688.889.489.990.5
Judge II85.388.182.787.789.089.491.5
A spectator claims that there is a high level of agreement between the rank orders of the marks given by the two judges. Test the spectator's claim at the \(1 \%\) significance level.
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2026 January Q2
2. At a toy factory, wooden blocks of approximate heights \(20 \mathrm {~mm} , 30 \mathrm {~mm}\) and 50 mm are made in red, yellow and green respectively. The heights of the blocks in mm are modelled by independent random variables which are Normally distributed with means and standard deviations as shown in the table.
ColourMeanStandard deviation
Red200.8
Yellow300.9
Green501.2
In parts (a), (b) and (c), the blocks are selected randomly and independently of one another.
  1. Find the probability that the height of a red block is less than 19 mm .
  2. A tower is made of 15 blocks stacked on top of each other consisting of 5 red blocks, 5 yellow blocks and 5 green blocks. Determine the probability that the tower is at least 495 mm high.
  3. Determine the probability that a tower made of 3 red blocks will be at least 1 mm higher than a tower made of 2 yellow blocks.
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2026 January Q3
4 marks
3. A student is investigating the relationship between different electricity generation methods and cost of electricity in a particular country. The student first checks whether there is any correlation between the cost per unit of electricity, \(x\) euros, and the amount of electricity being generated by wind, \(y \mathrm { GW }\). The data from 30 observations are summarised as follows.
\(n = 30 \quad \sum x = 2.219 \quad \sum y = 357.7 \quad \sum x ^ { 2 } = 0.2368 \quad \sum y ^ { 2 } = 4648 \quad \sum x y = 25.01\)
  1. In this question you must show detailed reasoning. Determine the product moment correlation coefficient.
  2. Carry out a hypothesis test at the \(5 \%\) level to investigate whether there is any correlation between the cost per unit of electricity and the amount of electricity generated by wind.
    [0pt] [4]
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2026 January Q4
4. The numbers of customers arriving at a ticket desk between 8 a.m. and 9 a.m. on a Monday morning and on a Tuesday morning are denoted by \(X\) and \(Y\) respectively. It is given that \(X \sim \operatorname { Po } ( 17 )\) and \(Y \sim \operatorname { Po } ( 14 )\).
  1. Find
    (a) \(\mathrm { P } ( X + Y ) > 40\),
    (b) \(\operatorname { Var } ( 2 X - Y )\).
  2. State a necessary assumption for your calculations in part (i) to be valid.
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2026 January Q5
5. The random variable \(X\) has the distribution \(\mathrm { N } \left( \mu , 3 ^ { 2 } \right)\). A random sample of 9 observations of \(X\) produced the following values. $$\begin{array} { l l l l l l l l l } 6 & 2 & 3 & 6 & 8 & 11 & 12 & 5 & 10 \end{array}$$
  1. Find a \(90 \%\) confidence interval for \(\mu\).
  2. Explain what is meant by a \(90 \%\) confidence interval in this context.
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2026 January Q6
6. A survey is carried out into the length of time for which customers wait for a response on a telephone helpline. A statistician who is analysing the results of the survey starts by modelling the waiting time, \(x\) minutes, by an exponential distribution with probability density function (PDF) $$\mathrm { f } ( x ) = \begin{cases} \lambda \mathrm { e } ^ { - \lambda x } & x \geqslant 0
0 & x < 0 \end{cases}$$
  1. In this question you must show detailed reasoning. The mean waiting time is found to be 5.0 minutes. Show that \(\lambda = 0.2\).
    ii) Use the model to calculate the probability that a customer has to wait longer than 20 minutes for a response. In practice it is found that no customer waits for more than 15 minutes for a response. The statistician constructs an improved model to incorporate this fact.
    iii) Sketch the following on the same axis.
    (a) the PDF of the model using the exponential distribution,
    (b) a possible PDF for the improved model.
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2026 January Q7
10 marks
7. A machine is designed to make paper with mean thickness 56.80 micrometres. The thicknesses, \(x\) micrometres, of a random sample of 300 sheets are summarised by $$n = 300 , \quad \Sigma x = 17085.0 , \quad \Sigma x ^ { 2 } = 973847.0 .$$ Test, at the \(10 \%\) significance level, whether the machine is producing paper of the designed thickness.
[0pt] [10]
[0pt] [BLANK PAGE]
SPS SPS FM Statistics 2026 January Q8
8. Each of the 7 letters in the word DIVIDED is printed on a separate card. The cards are arranged in a row.
  1. How many different arrangements of the letters are possible?
  2. In how many of these arrangements are all three Ds together? The 7 cards are now shuffled and 2 cards are selected at random, without replacement.
  3. Find the probability that at least one of these 2 cards has D printed on it.
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2026 January Q9
9 A continuous random variable \(X\) has probability density function given by the following function, - where \(a\) is a constant.
\(\mathrm { f } ( x ) = \left\{ \begin{array} { l l } \frac { 2 x } { a ^ { 2 } } & 0 \leqslant x \leqslant a ,
0 & \text { otherwise. } \end{array} \right\}\)
The expected value of \(X\) is 4 .
  1. Show that \(a = 6\). Five independent observations of \(X\) are obtained, and the largest of them is denoted by \(M\).
  2. Find the cumulative distribution function of \(M\).
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2024 April Q1
1. 200 candidates took each of two examination papers. The diagram shows the cumulative frequency graphs for their marks.
\includegraphics[max width=\textwidth, alt={}, center]{5b55a372-2cc8-454e-a10a-cabdc9801421-04_1091_1484_429_285}
  1. State, with a reason, which of the two papers was the easier one.
  2. The minimum mark for grade A , the top grade, on Paper 1 was 10 marks lower than the minimum mark for grade A on Paper 2. Given that 25 candidates gained grade A in Paper 1, find the number of candidates who gained grade A in Paper 2.
  3. The mean and standard deviation of the marks on Paper 1 were 36.5 and 28.2 respectively. Later, a marking error was discovered and it was decided to add 1 mark to each of the 200 marks on Paper 1. State the mean and standard deviation of the new marks on Paper 1.
    [0pt] [BLANK PAGE]
SPS SPS FM Statistics 2024 April Q2
2. In this question you must show detailed reasoning.
A disease that affects trees shows no visible evidence for the first few years after the tree is infected. A test has been developed to determine whether a particular tree has the disease. A positive result to the test suggests that the tree has the disease. However, the test is not \(100 \%\) reliable, and a researcher uses the following model.
  • If the tree has the disease, the probability of a positive result is 0.95 .
  • If the tree does not have the disease, the probability of a positive result is 0.1 .
    1. It is known that in a certain county, \(A , 35 \%\) of the trees have the disease. A tree in county \(A\) is chosen at random and is tested.
Given that the result is positive, determine the probability that this tree has the disease. A forestry company wants to determine what proportion of trees in another county, \(B\), have the disease. They choose a large random sample of trees in county \(B\). Each tree in the sample is tested and it is found that the result is positive for \(43 \%\) of these trees.
  • By carrying out a calculation, determine an estimate of the proportion of trees in county \(B\) that have the disease.
    [0pt] [BLANK PAGE]
  • SPS SPS FM Statistics 2024 April Q3
    3. A group of 8 people, including Kathy, David and Harpreet, are planning a theatre trip.
    1. Four of the group are chosen at random, without regard to order, to carry the refreshments. Find the probability that these 4 people include Kathy and David but not Harpreet.
    2. The 8 people sit in a row. Kathy and David sit next to each other and Harpreet sits at the left-hand end of the row. How many different arrangements of the 8 people are possible?
      [0pt] [BLANK PAGE]
    SPS SPS FM Statistics 2024 April Q4
    4. A market researcher wants to interview people who watched a particular television programme. Audience research data used by the broadcaster indicates that \(12 \%\) of the adult population watched this programme. This figure is used to model the situation.
    The researcher asks people in a shopping centre, one at a time, if they watched the programme. You should assume that these people form a random sample of the adult population.
    1. Find the probability that the fifth person the researcher asks is the first to have watched the programme.
    2. Find the probability that the researcher has to ask at least 10 people in order to find one who watched the programme.
    3. Find the probability that the twentieth person the researcher asks is the third to have watched the programme.
    4. Find how many people the researcher would have to ask to ensure that there is a probability of at least 0.95 that at least one of them watched the programme.
      [0pt] [BLANK PAGE]
    SPS SPS FM Statistics 2024 April Q5
    5. The random variable \(H\) has the distribution \(\mathrm { N } \left( \mu , \sigma ^ { 2 } \right)\). It is given that \(\mathrm { P } ( H < 105.0 ) = 0.2420\) and \(\mathrm { P } ( H > 110.0 ) = 0.6915\). Find the values of \(\mu\) and \(\sigma\), giving your answers to a suitable degree of accuracy.
    SPS SPS FM Statistics 2024 April Q6
    6. For the events \(A\) and \(B\), $$\mathrm { P } \left( A \cap B ^ { \prime } \right) = 0.32 , \quad \mathrm { P } \left( A ^ { \prime } \cap B \right) = 0.11 \quad \text { and } \quad \mathrm { P } ( A \cup B ) = 0.65$$
    1. Find \(\mathrm { P } \left( A \mid B ^ { \prime } \right)\).
    2. Determine whether or not \(A\) and \(B\) are independent.
    SPS SPS FM Statistics 2024 April Q7
    7. An online shopping company takes orders through its website. On average \(80 \%\) of orders from the website are delivered within 24 hours. The quality controller selects 10 orders at random to check when they are delivered.
    1. Find the probability that
      (A) exactly 8 of these orders are delivered within 24 hours,
      (B) at least 8 of these orders are delivered within 24 hours. The company changes its delivery method. The quality controller suspects that the changes will mean that fewer than \(80 \%\) of orders will be delivered within 24 hours. A random sample of 18 orders is checked and it is found that 12 of them arrive within 24 hours.
    2. Write down suitable hypotheses and carry out a test at the \(5 \%\) significance level to determine whether there is any evidence to support the quality controller's suspicion.
    3. A statistician argues that it is possible that the new method could result in either better or worse delivery times. Therefore it would be better to carry out a 2 -tail test at the \(5 \%\) significance level. State the alternative hypothesis for this test. Assuming that the sample size is still 18 , find the critical region for this test, showing all of your calculations.
      [0pt] [BLANK PAGE]
    SPS SPS FM Statistics 2024 April Q18
    18 \({ } ^ { \text {th } }\) April 2024} \date{} \DeclareUnicodeCharacter{25A1}{\ifmmode\square\else{\(\square\)}\fi} \begin{document} \maketitle Name: □
    □ Instructions
    • Answer all the questions.
    • Use black or blue ink. Pencil may be used for graphs and diagrams only.
    • There are blank pages at the end of the paper for additional working. You must clearly indicate when you have moved onto additional pages on the question itself. Make sure to include the question number.
    • You are permitted to use a scientific or graphical calculator in this paper.
    • Where appropriate, your answer should be supported with working. Marks might be given for using a correct method, even if your answer is wrong.
    • Give non-exact numerical answers correct to 3 significant figures unless a different degree of accuracy is specified in the question.
    • The acceleration due to gravity is denoted by \(g \mathrm {~ms} ^ { - 2 }\). When a numerical value is needed use \(g = 9.8\) unless a different value is specified in the question.
    Information
    • The total mark for this paper is \(\mathbf { 6 0 }\) marks.
    • The marks for each question are shown in brackets.
    • You are reminded of the need for clear presentation in your answers.
    • You have \(\mathbf { 6 0 }\) minutes for this paper.
    \section*{Formulae
    A Level Mathematics A (H240)} Arithmetic series
    \(S _ { n } = \frac { 1 } { 2 } n ( a + l ) = \frac { 1 } { 2 } n \{ 2 a + ( n - 1 ) d \}\) \section*{Geometric series} \(S _ { n } = \frac { a \left( 1 - r ^ { n } \right) } { 1 - r }\)
    \(S _ { \infty } = \frac { a } { 1 - r }\) for \(| r | < 1\) \section*{Binomial series} \(( a + b ) ^ { n } = a ^ { n } + { } ^ { n } \mathrm { C } _ { 1 } a ^ { n - 1 } b + { } ^ { n } \mathrm { C } _ { 2 } a ^ { n - 2 } b ^ { 2 } + \ldots + { } ^ { n } \mathrm { C } _ { r } a ^ { n - r } b ^ { r } + \ldots + b ^ { n } \quad ( n \in \mathbb { N } )\)
    where \({ } ^ { n } \mathrm { C } _ { r } = { } _ { n } \mathrm { C } _ { r } = \binom { n } { r } = \frac { n ! } { r ! ( n - r ) ! }\) $$( 1 + x ) ^ { n } = 1 + n x + \frac { n ( n - 1 ) } { 2 ! } x ^ { 2 } + \ldots + \frac { n ( n - 1 ) \ldots ( n - r + 1 ) } { r ! } x ^ { r } + \ldots \quad ( | x | < 1 , n \in \mathbb { R } )$$ \section*{Differentiation}
    \(\mathrm { f } ( x )\)\(\mathrm { f } ^ { \prime } ( x )\)
    \(\tan k x\)\(k \sec ^ { 2 } k x\)
    \(\sec x\)\(\sec x \tan x\)
    \(\cot x\)\(- \operatorname { cosec } ^ { 2 } x\)
    \(\operatorname { cosec } x\)\(- \operatorname { cosec } x \cot x\)
    Quotient rule \(y = \frac { u } { v } , \frac { \mathrm {~d} y } { \mathrm {~d} x } = \frac { v \frac { \mathrm {~d} u } { \mathrm {~d} x } - u \frac { \mathrm {~d} v } { \mathrm {~d} x } } { v ^ { 2 } }\) \section*{Differentiation from first principles} \(\mathrm { f } ^ { \prime } ( x ) = \lim _ { h \rightarrow 0 } \frac { \mathrm { f } ( x + h ) - \mathrm { f } ( x ) } { h }\) \section*{Integration} \(\int \frac { \mathrm { f } ^ { \prime } ( x ) } { \mathrm { f } ( x ) } \mathrm { d } x = \ln | \mathrm { f } ( x ) | + c\)
    \(\int \mathrm { f } ^ { \prime } ( x ) ( \mathrm { f } ( x ) ) ^ { n } \mathrm {~d} x = \frac { 1 } { n + 1 } ( \mathrm { f } ( x ) ) ^ { n + 1 } + c\)
    Integration by parts \(\int u \frac { \mathrm {~d} v } { \mathrm {~d} x } \mathrm {~d} x = u v - \int v \frac { \mathrm {~d} u } { \mathrm {~d} x } \mathrm {~d} x\) \section*{Small angle approximations} \(\sin \theta \approx \theta , \cos \theta \approx 1 - \frac { 1 } { 2 } \theta ^ { 2 } , \tan \theta \approx \theta\) where \(\theta\) is measured in radians \section*{Trigonometric identities} $$\begin{aligned} & \sin ( A \pm B ) = \sin A \cos B \pm \cos A \sin B
    & \cos ( A \pm B ) = \cos A \cos B \mp \sin A \sin B
    & \tan ( A \pm B ) = \frac { \tan A \pm \tan B } { 1 \mp \tan A \tan B } \quad \left( A \pm B \neq \left( k + \frac { 1 } { 2 } \right) \pi \right) \end{aligned}$$ \section*{Numerical methods} Trapezium rule: \(\int _ { a } ^ { b } y \mathrm {~d} x \approx \frac { 1 } { 2 } h \left\{ \left( y _ { 0 } + y _ { n } \right) + 2 \left( y _ { 1 } + y _ { 2 } + \ldots + y _ { n - 1 } \right) \right\}\), where \(h = \frac { b - a } { n }\)
    The Newton-Raphson iteration for solving \(\mathrm { f } ( x ) = 0 : x _ { n + 1 } = x _ { n } - \frac { \mathrm { f } \left( x _ { n } \right) } { \mathrm { f } ^ { \prime } \left( x _ { n } \right) }\) \section*{Probability} \(\mathrm { P } ( A \cup B ) = \mathrm { P } ( A ) + \mathrm { P } ( B ) - \mathrm { P } ( A \cap B )\)
    \(\mathrm { P } ( A \cap B ) = \mathrm { P } ( A ) \mathrm { P } ( B \mid A ) = \mathrm { P } ( B ) \mathrm { P } ( A \mid B ) \quad\) or \(\quad \mathrm { P } ( A \mid B ) = \frac { \mathrm { P } ( A \cap B ) } { \mathrm { P } ( B ) }\) \section*{Standard deviation} \(\sqrt { \frac { \sum ( x - \bar { x } ) ^ { 2 } } { n } } = \sqrt { \frac { \sum x ^ { 2 } } { n } - \bar { x } ^ { 2 } }\) or \(\sqrt { \frac { \sum f ( x - \bar { x } ) ^ { 2 } } { \sum f } } = \sqrt { \frac { \sum f x ^ { 2 } } { \sum f } - \bar { x } ^ { 2 } }\) \section*{The binomial distribution} If \(X \sim \mathrm {~B} ( n , p )\) then \(\mathrm { P } ( X = x ) = \binom { n } { x } p ^ { x } ( 1 - p ) ^ { n - x }\), mean of \(X\) is \(n p\), variance of \(X\) is \(n p ( 1 - p )\) \section*{Hypothesis test for the mean of a normal distribution} If \(X \sim \mathrm {~N} \left( \mu , \sigma ^ { 2 } \right)\) then \(\bar { X } \sim \mathrm {~N} \left( \mu , \frac { \sigma ^ { 2 } } { n } \right)\) and \(\frac { \bar { X } - \mu } { \sigma / \sqrt { n } } \sim \mathrm {~N} ( 0,1 )\) \section*{Percentage points of the normal distribution} If \(Z\) has a normal distribution with mean 0 and variance 1 then, for each value of \(p\), the table gives the value of \(z\) such that \(\mathrm { P } ( Z \leqslant z ) = p\).
    \(p\)0.750.900.950.9750.990.9950.99750.9990.9995
    \(z\)0.6741.2821.6451.9602.3262.5762.8073.0903.291
    \section*{Kinematics} Motion in a straight line
    \(v = u + a t\)
    \(s = u t + \frac { 1 } { 2 } a t ^ { 2 }\)
    \(s = \frac { 1 } { 2 } ( u + v ) t\)
    \(v ^ { 2 } = u ^ { 2 } + 2 a s\)
    \(s = v t - \frac { 1 } { 2 } a t ^ { 2 }\) Motion in two dimensions
    \(\mathbf { v } = \mathbf { u } + \mathbf { a } t\)
    \(\mathbf { s } = \mathbf { u } t + \frac { 1 } { 2 } \mathbf { a } t ^ { 2 }\)
    \(\mathbf { s } = \frac { 1 } { 2 } ( \mathbf { u } + \mathbf { v } ) t\)
    \(\mathbf { s } = \mathbf { v } t - \frac { 1 } { 2 } \mathbf { a } t ^ { 2 }\)
    1. 200 candidates took each of two examination papers. The diagram shows the cumulative frequency graphs for their marks.
    \includegraphics[max width=\textwidth, alt={}, center]{5b55a372-2cc8-454e-a10a-cabdc9801421-04_1091_1484_429_285}
    1. State, with a reason, which of the two papers was the easier one.
    2. The minimum mark for grade A , the top grade, on Paper 1 was 10 marks lower than the minimum mark for grade A on Paper 2. Given that 25 candidates gained grade A in Paper 1, find the number of candidates who gained grade A in Paper 2.
    3. The mean and standard deviation of the marks on Paper 1 were 36.5 and 28.2 respectively. Later, a marking error was discovered and it was decided to add 1 mark to each of the 200 marks on Paper 1. State the mean and standard deviation of the new marks on Paper 1.
      [0pt] [BLANK PAGE]
      2. In this question you must show detailed reasoning.
      A disease that affects trees shows no visible evidence for the first few years after the tree is infected. A test has been developed to determine whether a particular tree has the disease. A positive result to the test suggests that the tree has the disease. However, the test is not \(100 \%\) reliable, and a researcher uses the following model.
      • If the tree has the disease, the probability of a positive result is 0.95 .
      • If the tree does not have the disease, the probability of a positive result is 0.1 .
        (a) It is known that in a certain county, \(A , 35 \%\) of the trees have the disease. A tree in county \(A\) is chosen at random and is tested.
      Given that the result is positive, determine the probability that this tree has the disease. A forestry company wants to determine what proportion of trees in another county, \(B\), have the disease. They choose a large random sample of trees in county \(B\). Each tree in the sample is tested and it is found that the result is positive for \(43 \%\) of these trees.
      (b) By carrying out a calculation, determine an estimate of the proportion of trees in county \(B\) that have the disease.
      [0pt] [BLANK PAGE]
      3. A group of 8 people, including Kathy, David and Harpreet, are planning a theatre trip.
    4. Four of the group are chosen at random, without regard to order, to carry the refreshments. Find the probability that these 4 people include Kathy and David but not Harpreet.
    5. The 8 people sit in a row. Kathy and David sit next to each other and Harpreet sits at the left-hand end of the row. How many different arrangements of the 8 people are possible?
      [0pt] [BLANK PAGE]
      4. A market researcher wants to interview people who watched a particular television programme. Audience research data used by the broadcaster indicates that \(12 \%\) of the adult population watched this programme. This figure is used to model the situation.
      The researcher asks people in a shopping centre, one at a time, if they watched the programme. You should assume that these people form a random sample of the adult population.
      (a) Find the probability that the fifth person the researcher asks is the first to have watched the programme.
      (b) Find the probability that the researcher has to ask at least 10 people in order to find one who watched the programme.
      (c) Find the probability that the twentieth person the researcher asks is the third to have watched the programme.
      (d) Find how many people the researcher would have to ask to ensure that there is a probability of at least 0.95 that at least one of them watched the programme.
      [0pt] [BLANK PAGE]
      5. The random variable \(H\) has the distribution \(\mathrm { N } \left( \mu , \sigma ^ { 2 } \right)\). It is given that \(\mathrm { P } ( H < 105.0 ) = 0.2420\) and \(\mathrm { P } ( H > 110.0 ) = 0.6915\). Find the values of \(\mu\) and \(\sigma\), giving your answers to a suitable degree of accuracy.
      6. For the events \(A\) and \(B\), $$\mathrm { P } \left( A \cap B ^ { \prime } \right) = 0.32 , \quad \mathrm { P } \left( A ^ { \prime } \cap B \right) = 0.11 \quad \text { and } \quad \mathrm { P } ( A \cup B ) = 0.65$$
    6. Find \(\mathrm { P } \left( A \mid B ^ { \prime } \right)\).
    7. Determine whether or not \(A\) and \(B\) are independent.
      7. An online shopping company takes orders through its website. On average \(80 \%\) of orders from the website are delivered within 24 hours. The quality controller selects 10 orders at random to check when they are delivered.
    8. Find the probability that
      (A) exactly 8 of these orders are delivered within 24 hours,
      (B) at least 8 of these orders are delivered within 24 hours. The company changes its delivery method. The quality controller suspects that the changes will mean that fewer than \(80 \%\) of orders will be delivered within 24 hours. A random sample of 18 orders is checked and it is found that 12 of them arrive within 24 hours.
    9. Write down suitable hypotheses and carry out a test at the \(5 \%\) significance level to determine whether there is any evidence to support the quality controller's suspicion.
    10. A statistician argues that it is possible that the new method could result in either better or worse delivery times. Therefore it would be better to carry out a 2 -tail test at the \(5 \%\) significance level. State the alternative hypothesis for this test. Assuming that the sample size is still 18 , find the critical region for this test, showing all of your calculations.
      [0pt] [BLANK PAGE]
      8. The continuous random variable \(X\) has probability density function \(\mathrm { f } ( x ) = \begin{cases} k x ^ { n } & 0 \leqslant x \leqslant 1 ,
      0 & \text { otherwise, } \end{cases}\) where \(k\) is a constant and \(n\) is a parameter whose value is positive. It is given that the median of \(X\) is 0.8816 correct to 4 decimal places.
      Ten independent observations of \(X\) are obtained.
      Find the expected number of observations that are less than 0.8 .
      [0pt] [BLANK PAGE]
      [0pt] [BLANK PAGE]