Identify outliers using IQR rule

Questions ask whether specific given values are outliers, using the criterion that an outlier lies below Q₁ - 1.5×IQR or above Q₃ + 1.5×IQR. The quartiles are either provided or must be calculated from raw data.
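As a minimal sketch of this criterion, assuming the quartiles are already known, the two fences can be computed as below. The function names `iqr_fences` and `is_outlier` and the example quartiles are illustrative only and are not taken from any of the listed questions.

```python
def iqr_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences: Q1 - k*IQR and Q3 + k*IQR."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def is_outlier(value, q1, q3, k=1.5):
    """Flag a value that lies strictly outside the fences."""
    lower, upper = iqr_fences(q1, q3, k)
    return value < lower or value > upper


# Hypothetical quartiles, chosen only to illustrate the arithmetic.
print(iqr_fences(20, 30))       # (5.0, 45.0)
print(is_outlier(50, 20, 30))   # True
```

Some mark schemes treat a value exactly on a fence differently, so the strict inequalities here are an assumption.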

12 questions · Moderate -1.0

2.02f Measures of average and spread · 2.02h Recognize outliers
OCR MEI S1 2005 January Q2
7 marks Moderate -0.8
2 A sprinter runs many 100-metre trials, and the time, \(x\) seconds, for each is recorded. A sample of eight of these times is taken, as follows. $$\begin{array} { l l l l l l l l } 10.53 & 10.61 & 10.04 & 10.49 & 10.63 & 10.55 & 10.47 & 10.63 \end{array}$$
  1. Calculate the sample mean, \(\bar { x }\), and sample standard deviation, \(s\), of these times.
  2. Show that the time of 10.04 seconds may be regarded as an outlier.
  3. Discuss briefly whether or not the time of 10.04 seconds should be discarded.
OCR MEI S1 2005 June Q1
5 marks Moderate -0.8
1 At a certain stage of a football league season, the numbers of goals scored by a sample of 20 teams in the league were as follows. \(\begin{array} { l l l l l l l l l l l l l l l l l l l l l } 22 & 23 & 23 & 23 & 26 & 28 & 28 & 30 & 31 & 33 & 33 & 34 & 35 & 35 & 36 & 36 & 37 & 46 & 49 & 49 \end{array}\)
  1. Calculate the sample mean and sample variance, \(s ^ { 2 }\), of these data.
  2. The three teams with the most goals appear to be well ahead of the other teams. Determine whether or not any of these three pieces of data may be considered outliers.
OCR MEI S1 Q4
5 marks Moderate -0.8
4 At a certain stage of a football league season, the numbers of goals scored by a sample of 20 teams in the league were as follows. \(\begin{array} { l l l l l l l l l l l l l l l l l l l l } 22 & 23 & 23 & 23 & 26 & 28 & 28 & 30 & 31 & 33 & 33 & 34 & 35 & 35 & 36 & 36 & 37 & 46 & 49 & 49 \end{array}\)
  1. Calculate the sample mean and sample variance, \(s ^ { 2 }\), of these data.
  2. The three teams with the most goals appear to be well ahead of the other teams. Determine whether or not any of these three pieces of data may be considered outliers.
OCR MEI S1 2016 June Q1
7 marks Moderate -0.8
1 The stem and leaf diagram illustrates the weights in grams of 20 house sparrows.
25 | 0
26 | 0 5 8
27 | 7 9
28 | 1 4 5
29 | 0 0 2
30 | 7 7
31 | 6
32 | 0 4 7
33 | 3 3
Key: 27 | 7 represents 27.7 grams
  1. Find the median and interquartile range of the data.
  2. Determine whether there are any outliers.
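One way to check both parts is sketched below. It assumes the stem-and-leaf diagram reads as the 20 weights listed in `weights`, and it takes each quartile as the median of the lower or upper half of the sorted data. That is one common convention; other quartile conventions give slightly different fences.

```python
from statistics import median

# Weights in grams, read off the stem-and-leaf diagram (assumed reading).
weights = [25.0, 26.0, 26.5, 26.8, 27.7, 27.9, 28.1, 28.4, 28.5, 29.0,
           29.0, 29.2, 30.7, 30.7, 31.6, 32.0, 32.4, 32.7, 33.3, 33.3]

data = sorted(weights)
half = len(data) // 2
q1 = median(data[:half])     # median of the lower ten values
q3 = median(data[-half:])    # median of the upper ten values
iqr = q3 - q1

lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr
outliers = [w for w in data if w < lower or w > upper]

print("median:", median(data), "IQR:", round(iqr, 2))
print("fences:", round(lower, 2), round(upper, 2))
print("outliers:", outliers)
```

Under these assumptions every weight lies inside the fences, so the list of outliers comes back empty.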
Edexcel S1 2003 January Q4
16 marks Easy -1.2
4. A restaurant owner is concerned about the amount of time customers have to wait before being served. He collects data on the waiting times, to the nearest minute, of 20 customers. These data are listed below.
15,14,16,15,17,16,15,14,15,16,
17,16,15,14,16,17,15,25,18,16
  1. Find the median and inter-quartile range of the waiting times. An outlier is an observation that falls either \(1.5 \times\) (inter-quartile range) above the upper quartile or \(1.5 \times\) (inter-quartile range) below the lower quartile.
  2. Draw a boxplot to represent these data, clearly indicating any outliers.
  3. Find the mean of these data.
  4. Comment on the skewness of these data. Justify your answer.
Edexcel S1 2009 January Q4
14 marks Moderate -0.8
4. In a study of how students use their mobile telephones, the phone usage of a random sample of 11 students was examined for a particular week. The total length of calls, \(y\) minutes, for the 11 students were $$17,23,35,36,51,53,54,55,60,77,110$$
  1. Find the median and quartiles for these data. A value that is greater than \(Q _ { 3 } + 1.5 \times \left( Q _ { 3 } - Q _ { 1 } \right)\) or smaller than \(Q _ { 1 } - 1.5 \times \left( Q _ { 3 } - Q _ { 1 } \right)\) is defined as an outlier.
  2. Show that 110 is the only outlier.
  3. Using the graph paper on page 15 draw a box plot for these data indicating clearly the position of the outlier. The value of 110 is omitted.
  4. Show that \(S _ { y y }\) for the remaining 10 students is 2966.9 These 10 students were each asked how many text messages, \(x\), they sent in the same week. The values of \(S _ { x x }\) and \(S _ { x y }\) for these 10 students are \(S _ { x x } = 3463.6\) and \(S _ { x y } = - 18.3\).
  5. Calculate the product moment correlation coefficient between the number of text messages sent and the total length of calls for these 10 students. A parent believes that a student who sends a large number of text messages will spend fewer minutes on calls.
  6. Comment on this belief in the light of your calculation in part (e).
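The outlier check in this question can be sketched in Python as follows. The sketch assumes that `statistics.quantiles` with `method="exclusive"` matches the quartile convention intended here; for 11 values it picks the 3rd, 6th and 9th ordered values.

```python
from statistics import quantiles

# Total call lengths in minutes for the 11 students.
calls = [17, 23, 35, 36, 51, 53, 54, 55, 60, 77, 110]

q1, q2, q3 = quantiles(calls, n=4, method="exclusive")
iqr = q3 - q1
lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr
outliers = [y for y in calls if y < lower or y > upper]

print(q1, q2, q3)      # 35.0 53.0 60.0
print(lower, upper)    # -2.5 97.5
print(outliers)        # [110]
```

The next largest value, 77, lies below the upper fence of 97.5, which is why 110 is the only value flagged.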
Edexcel S1 2003 November Q6
16 marks Moderate -0.8
6. A travel agent sells holidays from his shop. The price, in \(\pounds\), of 15 holidays sold on a particular day are shown below.
299   1050   2315   999   485
350   169    1015   650   830
99    2100   689    550   475
For these data, find
  1. the mean and the standard deviation,
  2. the median and the inter-quartile range. An outlier is an observation that falls either more than \(1.5 \times\) (inter-quartile range) above the upper quartile or more than \(1.5 \times\) (inter-quartile range) below the lower quartile.
  3. Determine if any of the prices are outliers. The travel agent also sells holidays from a website on the Internet. On the same day, he recorded the price, \(\pounds x\), of each of 20 holidays sold on the website. The cheapest holiday sold was \(\pounds 98\), the most expensive was \(\pounds 2400\) and the quartiles of these data were \(\pounds 305 , \pounds 1379\) and \(\pounds 1805\). There were no outliers.
  4. On graph paper, and using the same scale, draw box plots for the holidays sold in the shop and the holidays sold on the website.
  5. Compare and contrast sales from the shop and sales from the website.
Pre-U Pre-U 9794/3 2016 Specimen Q3
5 marks Easy -1.2
3 The table shows fuel economy figures in miles per gallon (mpg) for some new cars.
Car  A   B   C   D   E   F   G   H   I   J   K   L   M   N   O
Mpg  57  40  34  33  11  17  30  27  31  20  35  24  26  23  32
  1. Find the median and quartiles for the mpg of these fifteen cars.
  2. Use the values in part (i) to identify any cars for which the mpg is an outlier.
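A short sketch of how the flagged cars could be identified is given below. It assumes the table reads as fifteen two-digit mpg values for cars A to O, and that the quartiles are the 4th, 8th and 12th ordered values, which is what `method="exclusive"` gives for 15 items.

```python
from statistics import quantiles

cars = "ABCDEFGHIJKLMNO"
values = [57, 40, 34, 33, 11, 17, 30, 27, 31, 20, 35, 24, 26, 23, 32]
mpg = dict(zip(cars, values))

q1, med, q3 = quantiles(values, n=4, method="exclusive")
iqr = q3 - q1
lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr

# Keep the car label with each flagged value.
flagged = {car: v for car, v in mpg.items() if v < lower or v > upper}

print(q1, med, q3)     # 23.0 30.0 34.0
print(lower, upper)    # 6.5 50.5
print(flagged)         # {'A': 57}
```

The lowest figure, 11 mpg for car E, is above the lower fence of 6.5 and so is not flagged under this convention.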
AQA Paper 3 2023 June Q15
11 marks Moderate -0.8
  1. A random sample of eight cars was selected from the Large Data Set. The masses of these cars, in kilograms, were as follows: 950, 989, 1247, 1415, 1506, 1680, 1833, 2040. It is given that, for the population of cars in the Large Data Set, lower quartile = 1167, median = 1393 and upper quartile = 1570.
    1. It was decided to remove any of the masses which fall outside the following interval: median \(- 1.5 \times\) interquartile range \(\leq\) mass \(\leq\) median \(+ 1.5 \times\) interquartile range. Show that only one of the eight masses in the sample should be removed. [3 marks]
    2. Write down the statistical name for the mass that should be removed in part (a)(i). [1 mark]
  2. The table shows the probability distribution of the number of previous owners, \(N\), for a sample of cars taken from the Large Data Set.
    \(n\)          0     1     2     3     4     5     6 or more
    \(P(N = n)\)   0.14  0.37  0.9k  0.25  0.4k  1.7k  0
    Find the value of \(P(1 \leq N < 5)\) [4 marks]
  3. An expert team is investigating whether there have been any changes in CO₂ emissions from all cars taken from the Large Data Set. The team decided to collect a quota sample of 200 cars to reflect the different years and the different makes of cars in the Large Data Set.
    1. Using your knowledge of the Large Data Set, explain how the team can collect this sample. [2 marks]
    2. Describe one disadvantage of quota sampling. [1 mark]
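The arithmetic for the first part of this question, on the car masses, can be sketched as follows. The interval here is centred on the median, as the question states, rather than on the quartiles. Variable names are illustrative.

```python
masses = [950, 989, 1247, 1415, 1506, 1680, 1833, 2040]

# Population values given in the question.
lower_quartile, median_mass, upper_quartile = 1167, 1393, 1570

iqr = upper_quartile - lower_quartile    # 403
low = median_mass - 1.5 * iqr            # 788.5
high = median_mass + 1.5 * iqr           # 1997.5

removed = [m for m in masses if not (low <= m <= high)]
print(low, high, removed)                # 788.5 1997.5 [2040]
```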
WJEC Unit 2 2018 June Q06
10 marks Moderate -0.8
Basel is a keen learner of languages. He finds a website on which a large number of language tutors offer their services. Basel records the cost, in dollars, of a one hour lesson from a random sample of tutors. He puts the data into a computer program which gives the following summary statistics.
Cost per 1 hour lesson
Min.    : 10.0
1st Qu. : 16.0
Median  : 17.2
Mean    : 19.8
3rd Qu. : 21.0
Max.    : 40.0
  1. Showing all calculations, comment on any outliers for the cost of a one hour lesson with a language tutor. [4]
  2. Describe the skewness of the data and explain what it means in this context. [2]
Dafydd is also a keen learner of languages. He takes his own random sample of the cost, in dollars, for a one hour lesson. He produces the following box plot.
[Figure: box plot of the costs in Dafydd's sample]
    1. What will happen to the mean if the outlier is removed?
    2. What will happen to the median if the outlier is removed? [2]
  1. Compare and contrast the distributions of the cost of one hour language lessons for Dafydd's sample and Basel's sample. [2]
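For Basel's summary statistics, only the extremes are available, so a check can only say whether the minimum or maximum lies beyond a fence. A minimal sketch, assuming the usual 1.5×IQR fences are the intended criterion:

```python
# Summary statistics reported for Basel's sample.
summary = {"min": 10.0, "q1": 16.0, "median": 17.2,
           "mean": 19.8, "q3": 21.0, "max": 40.0}

iqr = summary["q3"] - summary["q1"]      # 5.0
lower = summary["q1"] - 1.5 * iqr        # 8.5
upper = summary["q3"] + 1.5 * iqr        # 28.5

has_low_outlier = summary["min"] < lower
has_high_outlier = summary["max"] > upper
print(lower, upper, has_low_outlier, has_high_outlier)
# 8.5 28.5 False True
```

The maximum of 40.0 exceeds the upper fence, so there is at least one high outlier. How many there are cannot be determined from the summary alone.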
Pre-U Pre-U 9794/3 2019 Specimen Q3
5 marks Easy -1.8
The table shows fuel economy figures in miles per gallon (mpg) for some new cars.
Car  A   B   C   D   E   F   G   H   I   J   K   L   M   N   O
Mpg  57  40  34  33  11  17  30  27  31  20  35  24  26  23  32
  1. Find the median and quartiles for the mpg of these 15 cars. [2]
  2. Use the values in part (a) to identify any cars for which the mpg is an outlier. [3]
Pre-U Pre-U 9794/3 2020 Specimen Q3
5 marks Easy -1.3
The table shows fuel economy figures in miles per gallon (mpg) for some new cars.
Car  A   B   C   D   E   F   G   H   I   J   K   L   M   N   O
Mpg  57  40  34  33  11  17  30  27  31  20  35  24  26  23  32
  1. Find the median and quartiles for the mpg of these 15 cars. [2]
  2. Use the values in part (a) to identify any cars for which the mpg is an outlier. [3]