2.02f Measures of average and spread

447 questions

OCR MEI S1 2007 January Q2
7 marks Easy -1.8
2 The numbers of absentees per day from Mrs Smith's reception class over a period of 50 days are summarised below.
Number of absentees | 0 | 1 | 2 | 3 | 4 | 5 | 6 | \(> 6\)
Frequency | 8 | 15 | 11 | 8 | 3 | 4 | 1 | 0
  1. Illustrate these data by means of a vertical line chart.
  2. Calculate the mean and root mean square deviation of these data.
  3. There are 30 children in Mrs Smith's class altogether. Find the mean and root mean square deviation of the number of children who are present during the 50 days.
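As a numerical check on parts 2 and 3, the calculation can be sketched as below. It assumes the usual convention that root mean square deviation uses divisor \(n\), and the variable names are illustrative only.

```python
from math import sqrt

# Frequency table from the question: number of absentees -> number of days.
# The "> 6" class has frequency 0, so it contributes nothing and is omitted.
absentees = {0: 8, 1: 15, 2: 11, 3: 8, 4: 3, 5: 4, 6: 1}

n = sum(absentees.values())  # 50 days
mean_absent = sum(x * f for x, f in absentees.items()) / n

# Mean square deviation: divisor n (assumed convention), not n - 1.
msd = sum(f * (x - mean_absent) ** 2 for x, f in absentees.items()) / n
rmsd_absent = sqrt(msd)

# Present = 30 - absent, so the mean becomes 30 - mean,
# while the spread is unchanged by the reflection and shift.
class_size = 30
mean_present = class_size - mean_absent
rmsd_present = rmsd_absent

print(round(mean_absent, 2), round(rmsd_absent, 3))
print(round(mean_present, 2), round(rmsd_present, 3))
```

The point of part 3 is that no recalculation from the table is needed: a linear change of variable moves the mean and leaves the root mean square deviation alone.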
OCR MEI S1 2007 January Q3
6 marks Easy -1.8
3 The times taken for 480 university students to travel from their accommodation to lectures are summarised below.
Time (\(t\) minutes) | \(0 \leqslant t < 5\) | \(5 \leqslant t < 10\) | \(10 \leqslant t < 20\) | \(20 \leqslant t < 30\) | \(30 \leqslant t < 40\) | \(40 \leqslant t < 60\)
Frequency | 34 | 153 | 188 | 73 | 27 | 5
  1. Illustrate these data by means of a histogram.
  2. Identify the type of skewness of the distribution.
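Because the class widths are unequal, the histogram in part 1 needs frequency densities rather than raw frequencies. A minimal sketch of that step, with illustrative variable names:

```python
# Class boundaries (minutes) and frequencies from the table.
boundaries = [0, 5, 10, 20, 30, 40, 60]
frequencies = [34, 153, 188, 73, 27, 5]

# Frequency density = frequency / class width.
widths = [hi - lo for lo, hi in zip(boundaries, boundaries[1:])]
densities = [f / w for f, w in zip(frequencies, widths)]

for lo, hi, d in zip(boundaries, boundaries[1:], densities):
    print(f"{lo:>2} <= t < {hi:<2}  frequency density = {d}")
```

Bar heights are the densities, so the area of each bar is proportional to its frequency.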
OCR MEI S1 2007 January Q6
18 marks Moderate -0.3
6 The birth weights in grams of a random sample of 1000 babies are displayed in the cumulative frequency diagram below. \includegraphics[max width=\textwidth, alt={}, center]{05b96db3-93c7-4921-a1c6-c7b2f8952a8f-4_1264_1553_486_296}
  1. Use the diagram to estimate the median and interquartile range of the data.
  2. Use your answers to part (i) to estimate the number of outliers in the sample.
  3. Should these outliers be excluded from any further analysis? Briefly explain your answer.
  4. Any baby whose weight is below the 10th percentile is selected for careful monitoring. Use the diagram to determine the range of weights of the babies who are selected.
\(12 \%\) of new-born babies require some form of special care. A maternity unit has 17 new-born babies. You may assume that these 17 babies form an independent random sample.
  5. Find the probability that
    (A) exactly 2 of these 17 babies require special care,
    (B) more than 2 of the 17 babies require special care.
  6. On 100 independent occasions the unit has 17 babies. Find the expected number of occasions on which there would be more than 2 babies who require special care.
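Parts 5 and 6 can be checked with a binomial model, taking the number of babies needing special care as \(X \sim \mathrm{B}(17, 0.12)\) as the question's independence assumption suggests. The helper `binomial_pmf` is a hypothetical name written for this sketch.

```python
from math import comb

def binomial_pmf(k: int, n: int, p: float) -> float:
    """P(X = k) for X ~ B(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

n, p = 17, 0.12

# (A) exactly 2 babies require special care
p_exactly_2 = binomial_pmf(2, n, p)

# (B) more than 2: complement of 0, 1 or 2
p_more_than_2 = 1 - sum(binomial_pmf(k, n, p) for k in range(3))

# Part 6: expected number of occasions out of 100
expected_occasions = 100 * p_more_than_2

print(round(p_exactly_2, 4), round(p_more_than_2, 4), round(expected_occasions, 1))
```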
OCR MEI S1 2008 January Q1
7 marks Easy -1.8
1 Alice carries out a survey of the 28 students in her class to find how many text messages each sent on the previous day. Her results are shown in the stem and leaf diagram.
0 | 0 0 1 1 3 5 7 7 7 8 8
1 | 0 1 2 3 3 4 4 6 9
2 | 0 1 3 3 7
3 | 5 7
4 |
5 | 8
Key: 2 | 3 represents 23
  1. Find the mode and median of the number of text messages.
  2. Identify the type of skewness of the distribution.
  3. Alice is considering whether to use the mean or the median as a measure of central tendency for these data.
    (A) In view of the skewness of the distribution, state whether Alice should choose the mean or the median.
    (B) What other feature of the distribution confirms Alice's choice?
  4. The mean number of text messages is 14.75. If each message costs 10 pence, find the total cost of all of these messages.
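A short check of parts 1 and 4, reading the diagram as stems 0 to 5 with the leaves listed in the code. That reading is an assumption, though it gives 28 values and a mean of 14.75, matching the question.

```python
from statistics import median, mode

# Stem-and-leaf diagram as read from the question: stem -> leaves.
stem_and_leaf = {
    0: [0, 0, 1, 1, 3, 5, 7, 7, 7, 8, 8],
    1: [0, 1, 2, 3, 3, 4, 4, 6, 9],
    2: [0, 1, 3, 3, 7],
    3: [5, 7],
    4: [],
    5: [8],
}

# Expand to raw values using the key "2 | 3 represents 23".
messages = [10 * stem + leaf
            for stem, leaves in stem_and_leaf.items()
            for leaf in leaves]

modal_value = mode(messages)
median_value = median(messages)

# Part 4: total messages = mean * n, each costing 10 pence.
total_cost_pence = 10 * sum(messages)

print(modal_value, median_value, total_cost_pence)
```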
OCR MEI S1 2008 January Q4
8 marks Moderate -0.8
4 A company is searching for oil reserves. The company has purchased the rights to make test drillings at four sites. It investigates these sites one at a time but, if oil is found, it does not proceed to any further sites. At each site, there is probability 0.2 of finding oil, independently of all other sites. The random variable \(X\) represents the number of sites investigated. The probability distribution of \(X\) is shown below.
\(r\) | 1 | 2 | 3 | 4
\(\mathrm { P } ( X = r )\) | 0.2 | 0.16 | 0.128 | 0.512
  1. Find the expectation and variance of \(X\).
  2. It costs \(\pounds 45000\) to investigate each site. Find the expected total cost of the investigation.
  3. Draw a suitable diagram to illustrate the distribution of \(X\).
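Parts 1 and 2 follow from \(\mathrm{E}(X) = \sum r\,\mathrm{P}(X = r)\) and \(\operatorname{Var}(X) = \mathrm{E}(X^2) - [\mathrm{E}(X)]^2\). The sketch below also rebuilds the table from the 0.2 success probability as a consistency check; the variable names are illustrative.

```python
r_values = [1, 2, 3, 4]
probabilities = [0.2, 0.16, 0.128, 0.512]

# Consistency check: drilling stops at the first success, and the
# fourth site is reached whenever the first three all fail.
p_oil = 0.2
derived = [p_oil, 0.8 * p_oil, 0.8**2 * p_oil, 0.8**3]

expectation = sum(r * p for r, p in zip(r_values, probabilities))
e_x_squared = sum(r * r * p for r, p in zip(r_values, probabilities))
variance = e_x_squared - expectation**2

# Part 2: cost is 45000 per site, so E(cost) = 45000 * E(X).
cost_per_site = 45_000
expected_cost = cost_per_site * expectation

print(round(expectation, 3), round(variance, 4), round(expected_cost))
```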
OCR MEI S1 2008 January Q6
18 marks Easy -1.2
6 The maximum temperatures \(x\) degrees Celsius recorded during each month of 2005 in Cambridge are given in the table below.
Jan | Feb | Mar | Apr | May | Jun | Jul | Aug | Sep | Oct | Nov | Dec
9.2 | 7.1 | 10.7 | 14.2 | 16.6 | 21.8 | 22.0 | 22.6 | 21.1 | 17.4 | 10.1 | 7.8
These data are summarised by \(n = 12 , \Sigma x = 180.6 , \Sigma x ^ { 2 } = 3107.56\).
  1. Calculate the mean and standard deviation of the data.
  2. Determine whether there are any outliers.
  3. The formula \(y = 1.8 x + 32\) is used to convert degrees Celsius to degrees Fahrenheit. Find the mean and standard deviation of the 2005 maximum temperatures in degrees Fahrenheit.
  4. In New York, the monthly maximum temperatures are recorded in degrees Fahrenheit. In 2005 the mean was 63.7 and the standard deviation was 16.0. Briefly compare the maximum monthly temperatures in Cambridge and New York in 2005.
The total numbers of hours of sunshine recorded in Cambridge during the month of January for each of the last 48 years are summarised below.
Hours \(h\) | \(70 \leqslant h < 100\) | \(100 \leqslant h < 110\) | \(110 \leqslant h < 120\) | \(120 \leqslant h < 150\) | \(150 \leqslant h < 170\) | \(170 \leqslant h < 190\)
Number of years | 6 | 8 | 10 | 11 | 10 | 3
  5. Draw a cumulative frequency graph for these data.
  6. Use your graph to estimate the 90th percentile.
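For parts 1 to 3, the summary statistics are enough. The sketch below assumes two conventions that the question does not state: standard deviation with divisor \(n - 1\), and outliers defined as values more than 2 standard deviations from the mean.

```python
from math import sqrt

# Summary statistics given in the question (degrees Celsius).
n, sum_x, sum_x2 = 12, 180.6, 3107.56
temps_c = [9.2, 7.1, 10.7, 14.2, 16.6, 21.8,
           22.0, 22.6, 21.1, 17.4, 10.1, 7.8]

mean_c = sum_x / n
s_xx = sum_x2 - sum_x**2 / n
sd_c = sqrt(s_xx / (n - 1))  # assumed divisor n - 1

# Assumed outlier rule: more than 2 standard deviations from the mean.
lower, upper = mean_c - 2 * sd_c, mean_c + 2 * sd_c
outliers = [t for t in temps_c if t < lower or t > upper]

# Part 3: y = 1.8x + 32 scales the spread by 1.8; the +32 only shifts the mean.
mean_f = 1.8 * mean_c + 32
sd_f = 1.8 * sd_c

print(round(mean_c, 2), round(sd_c, 2), outliers)
print(round(mean_f, 2), round(sd_f, 2))
```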
OCR MEI S1 2005 June Q1
5 marks Moderate -0.8
1 At a certain stage of a football league season, the numbers of goals scored by a sample of 20 teams in the league were as follows. \(\begin{array} { l l l l l l l l l l l l l l l l l l l l l } 22 & 23 & 23 & 23 & 26 & 28 & 28 & 30 & 31 & 33 & 33 & 34 & 35 & 35 & 36 & 36 & 37 & 46 & 49 & 49 \end{array}\)
  1. Calculate the sample mean and sample variance, \(s ^ { 2 }\), of these data.
  2. The three teams with the most goals appear to be well ahead of the other teams. Determine whether or not any of these three pieces of data may be considered outliers.
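Both parts can be checked with Python's `statistics` module, whose `variance` and `stdev` use divisor \(n - 1\). The outlier test used here, more than 2 standard deviations above the mean, is an assumed convention rather than one stated in the question.

```python
from statistics import mean, stdev, variance

goals = [22, 23, 23, 23, 26, 28, 28, 30, 31, 33,
         33, 34, 35, 35, 36, 36, 37, 46, 49, 49]

sample_mean = mean(goals)
sample_variance = variance(goals)  # divisor n - 1

# Assumed rule: a value is an outlier if it exceeds mean + 2 * s.
threshold = sample_mean + 2 * stdev(goals)
top_three = sorted(goals)[-3:]
outliers = [g for g in top_three if g > threshold]

print(sample_mean, round(sample_variance, 2), round(threshold, 2), outliers)
```

Under this rule the largest values sit just inside the threshold, so the conclusion is sensitive to the convention chosen.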
OCR MEI S1 2005 June Q2
8 marks Easy -1.3
2 Answer part (i) of this question on the insert provided.
A taxi driver operates from a taxi rank at a main railway station in London. During one particular week he makes 120 journeys, the lengths of which are summarised in the table.
Length (\(x\) miles) | \(0 < x \leqslant 1\) | \(1 < x \leqslant 2\) | \(2 < x \leqslant 3\) | \(3 < x \leqslant 4\) | \(4 < x \leqslant 6\) | \(6 < x \leqslant 10\)
Number of journeys | 38 | 30 | 21 | 14 | 9 | 8
  1. On the insert, draw a cumulative frequency diagram to illustrate the data.
  2. Use your graph to estimate the median length of journey and the quartiles. Hence find the interquartile range.
  3. State the type of skewness of the distribution of the data.
OCR MEI S1 2005 June Q3
8 marks Easy -1.2
3 Jeremy is a computing consultant who sometimes works at home. The number, \(X\), of days that Jeremy works at home in any given week is modelled by the probability distribution $$\mathrm { P } ( X = r ) = \frac { 1 } { 40 } r ( r + 1 ) \quad \text { for } r = 1,2,3,4 .$$
  1. Verify that \(\mathrm { P } ( X = 4 ) = \frac { 1 } { 2 }\).
  2. Calculate \(\mathrm { E } ( X )\) and \(\operatorname { Var } ( X )\).
  3. Jeremy works for 45 weeks each year. Find the expected number of weeks during which he works at home for exactly 2 days.
OCR MEI S1 2006 June Q1
8 marks Easy -1.8
1 Every day, George attempts the quiz in a national newspaper. The quiz always consists of 7 questions. In the first 25 days of January, the numbers of questions George answers correctly each day are summarised in the table below.
Number correct | 1 | 2 | 3 | 4 | 5 | 6 | 7
Frequency | 1 | 2 | 3 | 3 | 4 | 7 | 5
  1. Draw a vertical line chart to illustrate the data.
  2. State the type of skewness shown by your diagram.
  3. Calculate the mean and the mean squared deviation of the data.
  4. How many correct answers would George need to average over the next 6 days if he is to achieve an average of 5 correct answers for all 31 days of January?
OCR MEI S1 2006 June Q3
7 marks Moderate -0.8
3 The score, \(X\), obtained on a given throw of a biased, four-faced die is given by the probability distribution $$\mathrm { P } ( X = r ) = k r ( 8 - r ) \text { for } r = 1,2,3,4 .$$
  1. Show that \(k = \frac { 1 } { 50 }\).
  2. Calculate \(\mathrm { E } ( X )\) and \(\operatorname { Var } ( X )\).
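Part 1 rests on the probabilities summing to 1, which fixes \(k\). A sketch using exact fractions, so that the results can be compared directly with a hand calculation (names are illustrative):

```python
from fractions import Fraction

scores = [1, 2, 3, 4]

# Unnormalised weights r(8 - r) for each score.
weights = [r * (8 - r) for r in scores]

# The probabilities k * r(8 - r) must sum to 1, so k = 1 / (sum of weights).
k = Fraction(1, sum(weights))
probabilities = [k * w for w in weights]

expectation = sum(r * p for r, p in zip(scores, probabilities))
variance = sum(r * r * p for r, p in zip(scores, probabilities)) - expectation**2

print(k, expectation, variance)
```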
OCR MEI S1 2007 June Q5
6 marks Easy -1.8
5 A GCSE geography student is investigating a claim that global warming is causing summers in Britain to have more rainfall. He collects rainfall data from a local weather station for 2001 and 2006. The vertical line chart shows the number of days per week on which some rainfall was recorded during the 22 weeks of summer 2001. \includegraphics[max width=\textwidth, alt={}, center]{5e4f3310-b96e-43db-9b6d-61da3270db06-4_720_1557_443_296} Number of days per week with rain recorded in summer 2001
  1. Show that the median of the data is 4, and find the interquartile range.
  2. For summer 2006 the median is 3 and the interquartile range is also 3. The student concludes that the data demonstrate that global warming is causing summer rainfall to decrease rather than increase. Is this a valid conclusion from the data? Give two brief reasons to justify your answer.
OCR MEI S1 2008 June Q7
20 marks Moderate -0.8
7 The histogram shows the age distribution of people living in Inner London in 2001. \includegraphics[max width=\textwidth, alt={}, center]{be764df3-ff20-415d-9c5c-10edabf350de-5_814_1383_349_379} Data sourced from the 2001 Census, \href{http://www.statistics.gov.uk}{www.statistics.gov.uk}
  1. State the type of skewness shown by the distribution.
  2. Use the histogram to estimate the number of people aged under 25.
  3. The table below shows the cumulative frequency distribution.
    Age | 20 | 30 | 40 | 50 | 65 | 100
    Cumulative frequency (thousands) | 660 | 1240 | 1810 | \(a\) | 2490 | 2770
    (A) Use the histogram to find the value of \(a\).
    (B) Use the table to calculate an estimate of the median age of these people.
The ages of people living in Outer London in 2001 are summarised below.
Age (\(x\) years) | \(0 \leqslant x < 20\) | \(20 \leqslant x < 30\) | \(30 \leqslant x < 40\) | \(40 \leqslant x < 50\) | \(50 \leqslant x < 65\) | \(65 \leqslant x < 100\)
Frequency (thousands) | 1120 | 650 | 770 | 590 | 680 | 610
  4. Illustrate these data by means of a histogram.
  5. Make two brief comments on the differences between the age distributions of the populations of Inner London and Outer London.
  6. The data given in the table for Outer London are used to calculate the following estimates. Mean 38.5, median 35.7, midrange 50, standard deviation 23.7, interquartile range 34.4.
    The final group in the table assumes that the maximum age of any resident is 100 years. These estimates are to be recalculated, based on a maximum age of 105, rather than 100. For each of the five estimates, state whether it would increase, decrease or be unchanged.
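The median estimate in part 3(B) is a linear interpolation within the class that contains the middle of the distribution. The sketch below uses only the Inner London cumulative frequencies up to age 40, since the unknown \(a\) is not needed. It assumes a cumulative frequency of 0 at age 0 and takes the median position as half the total; `interpolate_quantile` is a hypothetical helper.

```python
def interpolate_quantile(upper_bounds, cum_freqs, target):
    """Linearly interpolate the value at a target cumulative frequency."""
    points = list(zip(upper_bounds, cum_freqs))
    for (x0, c0), (x1, c1) in zip(points, points[1:]):
        if c0 <= target <= c1:
            return x0 + (target - c0) / (c1 - c0) * (x1 - x0)
    raise ValueError("target lies outside the tabulated range")

# Inner London cumulative frequencies (thousands), with an assumed (0, 0) start.
ages = [0, 20, 30, 40]
cum_freq_thousands = [0, 660, 1240, 1810]
total_thousands = 2770

median_age = interpolate_quantile(ages, cum_freq_thousands, total_thousands / 2)
print(round(median_age, 1))
```

The same helper would give quartile estimates by changing the target to a quarter or three quarters of the total, provided the table covers that range.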
OCR MEI S1 Q1
8 marks Easy -1.2
1 At a tourist information office the numbers of people seeking information each hour over the course of a 12-hour day are shown below. $$\begin{array} { l l l l l l l l l l l l } 6 & 25 & 38 & 39 & 31 & 18 & 35 & 31 & 33 & 15 & 21 & 28 \end{array}$$
  1. Construct a sorted stem and leaf diagram to represent these data.
  2. State the type of skewness suggested by your stem and leaf diagram.
  3. For these data find the median, the mean and the mode. Comment on the usefulness of the mode in this case.
OCR MEI S1 Q2
19 marks Moderate -0.5
2 The box and whisker plot below summarises the weights in grams of the 20 chocolates in a box. \includegraphics[max width=\textwidth, alt={}, center]{452a52c9-b1fa-4b98-a85d-a34ba0f84a9d-1_290_1186_1099_452}
  1. Find the interquartile range of the data and hence determine whether there are any outliers at either end of the distribution.
Ben buys a box of these chocolates each weekend. The chocolates all look the same on the outside, but 7 of them have orange centres, 6 have cherry centres, 4 have coffee centres and 3 have lemon centres. One weekend, each of Ben's 3 children eats one of the chocolates, chosen at random.
  2. Calculate the probabilities of the following events.
    \(A\): all 3 chocolates have orange centres
    \(B\): all 3 chocolates have the same centres
  3. Find \(\mathrm { P } ( A \mid B )\) and \(\mathrm { P } ( B \mid A )\).
The following weekend, Ben buys an identical box of chocolates and again each of his 3 children eats one of the chocolates, chosen at random.
  4. Find the probability that, on both weekends, the 3 chocolates that they eat all have orange centres.
  5. Ben likes all of the chocolates except those with cherry centres. On another weekend he is the first of his family to eat some of the chocolates. Find the probability that he has to select more than 2 chocolates before he finds one that he likes.
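The probability parts can be checked by counting selections without replacement. This sketch uses exact fractions. It reads part 5 as "the first two chocolates chosen are both cherry", which is an interpretation of the wording, and it treats the two weekends as independent.

```python
from fractions import Fraction
from math import comb

centres = {"orange": 7, "cherry": 6, "coffee": 4, "lemon": 3}
total = sum(centres.values())  # 20 chocolates
ways = comb(total, 3)          # unordered choices of 3 chocolates

# A: all three orange.  B: all three share a centre.
p_a = Fraction(comb(centres["orange"], 3), ways)
p_b = Fraction(sum(comb(c, 3) for c in centres.values()), ways)

# A is a subset of B, so P(A and B) = P(A).
p_a_given_b = p_a / p_b
p_b_given_a = p_a / p_a

# Part 4: the same event on two independent weekends.
p_both_weekends = p_a**2

# Part 5 (interpretation): the first two chocolates are both cherry.
cherry = centres["cherry"]
p_more_than_two = Fraction(cherry, total) * Fraction(cherry - 1, total - 1)

print(p_a, p_b, p_a_given_b, p_b_given_a, p_both_weekends, p_more_than_two)
```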
OCR MEI S1 Q3
8 marks Easy -1.8
3 The ages, \(x\) years, of the senior members of a running club are summarised in the table below.
Age \(( x )\) | \(20 \leqslant x < 30\) | \(30 \leqslant x < 40\) | \(40 \leqslant x < 50\) | \(50 \leqslant x < 60\) | \(60 \leqslant x < 70\) | \(70 \leqslant x < 80\) | \(80 \leqslant x < 90\)
Frequency | 10 | 30 | 42 | 23 | 9 | 5 | 1
  1. Draw a cumulative frequency diagram to illustrate the data.
  2. Use your diagram to estimate the median and interquartile range of the data.
OCR MEI S1 Q4
17 marks Moderate -0.8
4 The weights, \(w\) grams, of a random sample of 60 carrots of variety A are summarised in the table below.
Weight | \(30 \leqslant w < 50\) | \(50 \leqslant w < 60\) | \(60 \leqslant w < 70\) | \(70 \leqslant w < 80\) | \(80 \leqslant w < 90\)
Frequency | 11 | 10 | 18 | 14 | 7
  1. Draw a histogram to illustrate these data.
  2. Calculate estimates of the mean and standard deviation of \(w\).
  3. Use your answers to part (ii) to investigate whether there are any outliers.
The weights, \(x\) grams, of a random sample of 50 carrots of variety B are summarised as follows. $$n = 50 \quad \sum x = 3624.5 \quad \sum x ^ { 2 } = 265416$$
  4. Calculate the mean and standard deviation of \(x\).
  5. Compare the central tendency and variation of the weights of varieties A and B.
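For variety A the estimates come from class midpoints, and for variety B from the given sums. The sketch below assumes a standard deviation with divisor \(n - 1\) and, for part 3, outlier limits of mean \(\pm\) 2 standard deviations. Both are conventions not stated in the question, and `mean_and_sd` is a hypothetical helper.

```python
from math import sqrt

def mean_and_sd(n, sum_x, sum_x2):
    """Mean and standard deviation (divisor n - 1) from summary totals."""
    mean = sum_x / n
    s_xx = sum_x2 - n * mean**2
    return mean, sqrt(s_xx / (n - 1))

# Variety A: (lower bound, upper bound, frequency) for each class.
classes_a = [(30, 50, 11), (50, 60, 10), (60, 70, 18), (70, 80, 14), (80, 90, 7)]
midpoints = [(lo + hi) / 2 for lo, hi, _ in classes_a]
freqs = [f for _, _, f in classes_a]

n_a = sum(freqs)
sum_x_a = sum(f * m for f, m in zip(freqs, midpoints))
sum_x2_a = sum(f * m * m for f, m in zip(freqs, midpoints))
mean_a, sd_a = mean_and_sd(n_a, sum_x_a, sum_x2_a)

# Assumed outlier limits for part 3.
lower_a, upper_a = mean_a - 2 * sd_a, mean_a + 2 * sd_a

# Variety B: summary totals given in the question.
mean_b, sd_b = mean_and_sd(50, 3624.5, 265416)

print(round(mean_a, 2), round(sd_a, 2), round(lower_a, 2), round(upper_a, 2))
print(round(mean_b, 2), round(sd_b, 2))
```

The variety A figures are estimates only, because every carrot in a class is treated as if it weighed the class midpoint.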
OCR MEI S1 Q1
18 marks Moderate -0.3
1 The heights \(x \mathrm {~cm}\) of 100 boys in Year 7 at a school are summarised in the table below.
Height | \(125 \leqslant x \leqslant 140\) | \(140 < x \leqslant 145\) | \(145 < x \leqslant 150\) | \(150 < x \leqslant 160\) | \(160 < x \leqslant 170\)
Frequency | 25 | 29 | 24 | 18 | 4
  1. Estimate the number of boys who have heights of at least 155 cm.
  2. Calculate an estimate of the median height of the 100 boys.
  3. Draw a histogram to illustrate the data.
The histogram below shows the heights of 100 girls in Year 7 at the same school. \includegraphics[max width=\textwidth, alt={}, center]{ab4d5ab1-e3b7-495f-9142-d37df7e712de-1_868_1361_1015_381}
  4. How many more girls than boys had heights exceeding 160 cm?
  5. Calculate an estimate of the mean height of the 100 girls.
OCR MEI S1 Q2
18 marks Moderate -0.8
2 The engine sizes \(x \mathrm {~cm} ^ { 3 }\) of a sample of 80 cars are summarised in the table below.
Engine size \(x\) | \(500 \leqslant x \leqslant 1000\) | \(1000 < x \leqslant 1500\) | \(1500 < x \leqslant 2000\) | \(2000 < x \leqslant 3000\) | \(3000 < x \leqslant 5000\)
Frequency | 7 | 22 | 26 | 18 | 7
  1. Draw a histogram to illustrate the distribution.
  2. A student claims that the midrange is \(2750 \mathrm {~cm} ^ { 3 }\). Discuss briefly whether he is likely to be correct.
  3. Calculate estimates of the mean and standard deviation of the engine sizes. Explain why your answers are only estimates.
  4. Hence investigate whether there are any outliers in the sample.
  5. A vehicle duty of \(\pounds 1000\) is proposed for all new cars with engine size greater than \(2000 \mathrm {~cm} ^ { 3 }\). Assuming that this sample of cars is representative of all new cars in Britain and that there are 2.5 million new cars registered in Britain each year, calculate an estimate of the total amount of money that this vehicle duty would raise in one year.
  6. Why in practice might your estimate in part (v) turn out to be too high?
OCR MEI S1 Q3
19 marks Moderate -0.3
3 The birth weights of 200 lambs from crossbred sheep are illustrated by the cumulative frequency diagram below. \includegraphics[max width=\textwidth, alt={}, center]{ab4d5ab1-e3b7-495f-9142-d37df7e712de-3_919_1144_430_476}
  1. Estimate the percentage of lambs with birth weight over 6 kg.
  2. Estimate the median and interquartile range of the data.
  3. Use your answers to part (ii) to show that there are very few, if any, outliers. Comment briefly on whether any outliers should be disregarded in analysing these data.
The box and whisker plot shows the birth weights of 100 lambs from Welsh Mountain sheep. \includegraphics[max width=\textwidth, alt={}, center]{ab4d5ab1-e3b7-495f-9142-d37df7e712de-3_321_1610_1818_293}
  4. Use appropriate measures to compare briefly the central tendencies and variations of the weights of the two types of lamb.
  5. The weight of the largest Welsh Mountain lamb was originally recorded as 6.5 kg, but then corrected. If this error had not been corrected, how would this have affected your answers to part (iv)? Briefly explain your answer.
  6. One lamb of each type is selected at random. Estimate the probability that the birth weight of both lambs is at least 3.9 kg .
OCR MEI S1 Q3
18 marks Moderate -0.8
3 The heating quality of the coal in a sample of 50 sacks is measured in suitable units. The data are summarised below.
Heating quality \(( x )\) | \(9.1 \leqslant x \leqslant 9.3\) | \(9.3 < x \leqslant 9.5\) | \(9.5 < x \leqslant 9.7\) | \(9.7 < x \leqslant 9.9\) | \(9.9 < x \leqslant 10.1\)
Frequency | 5 | 7 | 15 | 16 | 7
  1. Draw a cumulative frequency diagram to illustrate these data.
  2. Use the diagram to estimate the median and interquartile range of the data.
  3. Show that there are no outliers in the sample.
  4. Three of these 50 sacks are selected at random. Find the probability that
    (A) in all three, the heating quality \(x\) is more than 9.5,
    (B) in at least two, the heating quality \(x\) is more than 9.5.
OCR MEI S1 Q3
6 marks Easy -1.8
3 A GCSE geography student is investigating a claim that global warming is causing summers in Britain to have more rainfall. He collects rainfall data from a local weather station for 2001 and 2006. The vertical line chart shows the number of days per week on which some rainfall was recorded during the 22 weeks of summer 2001. \includegraphics[max width=\textwidth, alt={}, center]{c7cb0f6b-7b6b-4c52-8287-7efc6bd70247-3_804_1557_547_337}
  1. Show that the median of the data is 4, and find the interquartile range.
  2. For summer 2006 the median is 3 and the interquartile range is also 3. The student concludes that the data demonstrate that global warming is causing summer rainfall to decrease rather than increase. Is this a valid conclusion from the data? Give two brief reasons to justify your answer.
OCR MEI S1 Q4
7 marks Easy -1.8
4 The numbers of absentees per day from Mrs Smith's reception class over a period of 50 days are summarised below.
Number of absentees | 0 | 1 | 2 | 3 | 4 | 5 | 6 | \(> 6\)
Frequency | 8 | 15 | 11 | 8 | 3 | 4 | 1 | 0
  1. Illustrate these data by means of a vertical line chart.
  2. Calculate the mean and root mean square deviation of these data.
  3. There are 30 children in Mrs Smith's class altogether. Find the mean and root mean square deviation of the number of children who are present during the 50 days.
OCR MEI S1 Q1
18 marks Moderate -0.3
1 The birth weights in grams of a random sample of 1000 babies are displayed in the cumulative frequency diagram below. \includegraphics[max width=\textwidth, alt={}, center]{088972e9-bfcd-429c-9145-af274a4c0a58-1_1268_1548_472_335}
  1. Use the diagram to estimate the median and interquartile range of the data.
  2. Use your answers to part (i) to estimate the number of outliers in the sample.
  3. Should these outliers be excluded from any further analysis? Briefly explain your answer.
  4. Any baby whose weight is below the 10th percentile is selected for careful monitoring. Use the diagram to determine the range of weights of the babies who are selected.
\(12 \%\) of new-born babies require some form of special care. A maternity unit has 17 new-born babies. You may assume that these 17 babies form an independent random sample.
  5. Find the probability that
    (A) exactly 2 of these 17 babies require special care,
    (B) more than 2 of the 17 babies require special care.
  6. On 100 independent occasions the unit has 17 babies. Find the expected number of occasions on which there would be more than 2 babies who require special care.
OCR MEI S1 Q2
6 marks Easy -1.3
2 The times taken, in minutes, by 80 people to complete a crossword puzzle are summarised by the box and whisker plot below. \includegraphics[max width=\textwidth, alt={}, center]{088972e9-bfcd-429c-9145-af274a4c0a58-2_163_857_436_642}
  1. Write down the range and the interquartile range of the times.
  2. Determine whether any of the times can be regarded as outliers.
  3. Describe the shape of the distribution of the times.