Identify outliers using mean and standard deviation

Question asks to show a value is an outlier using a criterion based on mean ± k×standard deviation (typically k=2 or k=3), not the IQR rule.

6 questions · Moderate -0.7

2.02g Calculate mean and standard deviation2.02h Recognize outliers
Sort by: Default | Easiest first | Hardest first
OCR MEI S1 2007 June Q3
8 marks Moderate -0.8
3 The marks \(x\) scored by a sample of 56 students in an examination are summarised by $$n = 56 , \quad \Sigma x = 3026 , \quad \Sigma x ^ { 2 } = 178890 .$$
  1. Calculate the mean and standard deviation of the marks.
  2. The highest mark scored by any of the 56 students in the examination was 93 . Show that this result may be considered to be an outlier.
  3. The formula \(y = 1.2 x - 10\) is used to scale the marks. Find the mean and standard deviation of the scaled marks.
OCR MEI S1 Q2
8 marks Moderate -0.8
2 The marks \(x\) scored by a sample of 56 students in an examination are summarised by $$n = 56 , \quad \Sigma x = 3026 , \quad \Sigma x ^ { 2 } = 178890 .$$
  1. Calculate the mean and standard deviation of the marks.
  2. The highest mark scored by any of the 56 students in the examination was 93. Show that this result may be considered to be an outlier.
  3. The formula \(y = 1.2 x - 10\) is used to scale the marks. Find the mean and standard deviation of the scaled marks.
Pre-U Pre-U 9794/3 2017 June Q1
5 marks Moderate -0.8
1 Levels of nitrogen dioxide in the atmosphere are being monitored at the side of a road in a busy city centre. A sample of 18 measurements taken (in suitable units) is as follows. $$\begin{array} { l l l l l l l l l l l l l l l l l l } 83 & 44 & 95 & 92 & 98 & 63 & 69 & 76 & 19 & 91 & 70 & 91 & 74 & 65 & 62 & 70 & 95 & 108 \end{array}$$
  1. Find the mean and standard deviation of the sample.
  2. Hence identify, with justification, any possible outliers.
OCR MEI S1 Q4
7 marks Moderate -0.8
A sprinter runs many 100-metre trials, and the time, \(x\) seconds, for each is recorded. A sample of eight of these times is taken, as follows. 10.53 \quad 10.61 \quad 10.04 \quad 10.49 \quad 10.63 \quad 10.55 \quad 10.47 \quad 10.63
  1. Calculate the sample mean, \(\bar{x}\), and sample standard deviation, \(s\), of these times. [3]
  2. Show that the time of 10.04 seconds may be regarded as an outlier. [2]
  3. Discuss briefly whether or not the time of 10.04 seconds should be discarded. [2]
AQA AS Paper 2 Specimen Q17
6 marks Moderate -0.8
The table below is an extract from the Large Data Set.
MakeRegionEngine sizeMassCO2CO
VAUXHALLSouth West139811631180.463
VOLKSWAGENLondon99910551060.407
VAUXHALLSouth West12481225850.141
BMWSouth West297916351940.139
TOYOTASouth West199516501230.274
BMWSouth West297902440.447
FORDSouth West159601650.518
TOYOTASouth West12991050144
VAUXHALLLondon139813611400.695
FORDNorth West495117992990.621
    1. Calculate the standard deviation of the engine sizes in the table. [1 mark]
    2. The mean of the engine sizes is 2084 Any value more than 2 standard deviations from the mean can be identified as an outlier. Using this definition of an outlier, show that the sample of engine sizes has exactly one outlier. Fully justify your answer. [3 marks]
  1. Rajan calculates the mean of the masses of the cars in this extract and states that it is 1094 kg. Use your knowledge of the Large Data Set to suggest what error Rajan is likely to have made in his calculation. [1 mark]
  2. Rajan claims there is an error in the data recorded in the table for one of the Toyotas from the South West, because there is no value for its carbon monoxide emissions. Use your knowledge of the Large Data Set to comment on Rajan's claim. [1 mark]
AQA Paper 3 2019 June Q12
6 marks Moderate -0.3
Amelia decides to analyse the heights of members of her school rowing club. The heights of a random sample of 10 rowers are shown in the table below.
RowerJessNellLivNeveAnnToriMayaKathDarcyJen
Height (cm)162169172156146161159164157160
  1. Any value more than 2 standard deviations from the mean may be regarded as an outlier. Verify that Ann's height is an outlier. Fully justify your answer. [4 marks]
  2. Amelia thinks she may have written down Ann's height incorrectly. If Ann's height were discarded, state with a reason what, if any, difference this would make to the mean and standard deviation. [2 marks]