2.02i Select/critique data presentation

68 questions

Sort by: Default | Easiest first | Hardest first
OCR PURE Q9
8 marks Easy -1.8
9 A researcher is studying changes in behaviour in travelling to work by people who live outside London, between 2001 and 2011. He chooses the 15 Local Authorities (LAs) outside London with the largest decreases in the percentage of people driving to work, and arranges these in descending order. The table shows the changes in percentages from 2001 to 2011 in various travel categories, for these Local Authorities.
Local AuthorityWork mainly at or from homeUnderground, metro, light rail, tramTrainBus, minibus or coachDriving a car or vanPassenger in a car or vanBicycleOn foot
Brighton and Hove3.20.11.50.8-8.2-1.52.12.3
Cambridge2.20.01.61.2-7.4-1.03.10.6
Elmbridge2.90.44.10.2-6.6-0.70.3-0.3
Oxford2.00.00.6-0.4-5.2-1.12.22.1
Epsom and Ewell1.60.43.91.1-5.2-0.90.0-0.6
Watford0.72.03.10.4-4.5-1.20.0-0.1
Tandridge3.30.24.0-0.1-4.5-1.10.0-1.3
Mole Valley3.30.11.90.3-4.4-0.70.2-0.3
St Albans2.30.33.4-0.3-4.3-1.20.3-0.2
Chiltern2.91.41.40.1-4.2-0.6-0.2-0.8
Exeter0.70.01.0-0.6-4.2-1.51.73.4
Woking2.10.13.70.0-4.2-1.3-0.10.0
Reigate and Banstead1.80.13.20.6-4.1-1.00.1-0.2
Waverley4.30.12.5-0.5-3.9-0.9-0.3-0.9
Guildford2.70.12.40.2-3.6-1.20.0-0.3
  1. Explain why these LAs are not necessarily the 15 LAs with the largest decreases in the percentage of people driving to work.
  2. The researcher wants to talk to those LAs outside London which have been most successful in encouraging people to change to cycling or walking to work.
    Suggest four LAs that he should talk to and why.
  3. The researcher claims that Waverley is the LA outside London which has had the largest increase in the number of people working mainly at or from home.
    Does the data support his claim? Explain your answer.
  4. Which two categories have replaced driving to work for the highest percentages of workers in these LAs? Support your answer with evidence from the table.
  5. The researcher suggested that there would be strong correlation between the decrease in the percentage driving to work and the increase in percentage working mainly at or from home. Without calculation, use data from the table to comment briefly on this suggestion.
OCR MEI Paper 2 2024 June Q14
8 marks Moderate -0.8
14 The pre-release material contains medical data for 103 women and 97 men.
The boxplot represents the weights in kg of 101 of the women from the pre-release material. \includegraphics[max width=\textwidth, alt={}, center]{8e48bbd3-2166-49e7-8906-833261f331ca-09_421_1232_735_244}
  1. Use your knowledge of the pre-release material to give a reason why the weights of all 103 women were not included in the diagram.
  2. Determine the range of values in which any outliers lie.
  3. Use your knowledge of the pre-release material to explain whether these outliers should be removed from any further analysis of the data.
  4. The median weight of men in the sample was found to be 79.9 kg . Explain what may be inferred by comparing the median weight of men with the median weight of women. Further analysis of the weights of both men and women is carried out. The table shows some of the results.
    meanstandard deviation
    men82.69 kg19.98 kg
    women72.5 kg19.95 kg
  5. Use the information in the table to make two inferences about the distribution of the weights of men compared with the distribution of the weights of women.
Edexcel S1 2005 January Q2
14 marks Easy -1.8
2. The number of caravans on Seaview caravan site on each night in August last year is summarised in the following stem and leaf diagram.
Caravans110 means 10Totals
10(2)
218(4)
30347(8)
41588(9)
5267(5)
62(3)
  1. Find the three quartiles of these data. During the same month, the least number of caravans on Northcliffe caravan site was 31. The maximum number of caravans on this site on any night that month was 72 . The three quartiles for this site were 38,45 and 52 respectively.
  2. On graph paper and using the same scale, draw box plots to represent the data for both caravan sites. You may assume that there are no outliers.
  3. Compare and contrast these two box plots.
  4. Give an interpretation to the upper quartiles of these two distributions.
Edexcel S1 2001 June Q6
16 marks Easy -1.2
6. Three swimmers Alan, Diane and Gopal record the number of lengths of the swimming pool they swim during each practice session over several weeks. The stem and leaf diagram below shows the results for Alan.
Lengths20 means 20
20122\(( 4 )\)
255667789\(( 7 )\)
3012224\(( 5 )\)
3566679\(( 5 )\)
401333333444\(( 10 )\)
45556667788999\(( 12 )\)
5000\(( 3 )\)
  1. Find the three quartiles for Alan's results. The table below summarises the results for Diane and Gopal.
    DianeGopal
    Smallest value3525
    Lower quartile3734
    Median4242
    Upper quartile5350
    Largest value6557
  2. Using the same scale and on the same sheet of graph paper draw box plots to represent the data for Alan, Diane and Gopal.
  3. Compare and contrast the three box plots.
Edexcel S1 2002 June Q6
14 marks Moderate -0.3
6. The labelling on bags of garden compost indicates that the bags weigh 20 kg . The weights of a random sample of 50 bags are summarised in the table below.
Weight in kgFrequency
14.6-14.81
14.8-18.00
18.0-18.55
18.5-20.06
20.0-20.222
20.2-20.415
20.4-21.01
  1. On graph paper, draw a histogram of these data.
  2. Using the coding \(y = 10\) (weight in \(\mathrm { kg } - 14\) ), find an estimate for the mean and standard deviation of the weight of a bag of compost.
    [0pt] [Use \(\Sigma f y ^ { 2 } = 171\) 503.75]
  3. Using linear interpolation, estimate the median. The company that produces the bags of compost wants to improve the accuracy of the labelling. The company decides to put the average weight in kg on each bag.
  4. Write down which of these averages you would recommend the company to use. Give a reason for your answer.
Edexcel S1 2004 June Q4
19 marks Easy -1.3
4. The attendance at college of a group of 18 students was recorded for a 4-week period. The number of students actually attending each of 16 classes are shown below.
18181717
16171618
18141718
15171816
    1. Calculate the mean and the standard deviation of the number of students attending these classes.
    2. Express the mean as a percentage of the 18 students in the group. In the same 4-week period, the attendance of a different group of 20, students is shown below.
      20161819
      15141415
      18151617
      16181514
  1. Construct a back-to-back stem and leaf diagram to represent the attendance in both groups.
  2. Find the mode, median and inter-quartile range for each group of students. The mean percentage attendance and standard deviation for the second group of students are 81.25 and 1.82 respectively.
  3. Compare and contrast the attendance of these 2 groups of students.
Edexcel S1 2005 June Q2
16 marks Moderate -0.8
2. The following table summarises the distances, to the nearest km , that 134 examiners travelled to attend a meeting in London.
Distance (km)Number of examiners
41-454
46-5019
51-6053
61-7037
71-9015
91-1506
  1. Give a reason to justify the use of a histogram to represent these data.
  2. Calculate the frequency densities needed to draw a histogram for these data.
    (DO NOT DRAW THE HISTOGRAM)
  3. Use interpolation to estimate the median \(Q _ { 2 }\), the lower quartile \(Q _ { 1 }\), and the upper quartile \(Q _ { 3 }\) of these data. The mid-point of each class is represented by \(x\) and the corresponding frequency by \(f\). Calculations then give the following values $$\Sigma f _ { x } = 8379.5 \quad \text { and } \quad \Sigma f _ { x ^ { 2 } } = 557489.75$$
  4. Calculate an estimate of the mean and an estimate of the standard deviation for these data. One coefficient of skewness is given by $$\frac { Q _ { 3 } - 2 Q _ { 2 } + Q _ { 1 } } { Q _ { 3 } - Q _ { 1 } }$$
  5. Evaluate this coefficient and comment on the skewness of these data.
  6. Give another justification of your comment in part (e).
Edexcel S1 2005 June Q4
10 marks Easy -1.2
4. Aeroplanes fly from City \(A\) to City \(B\). Over a long period of time the number of minutes delay in take-off from City \(A\) was recorded. The minimum delay was 5 minutes and the maximum delay was 63 minutes. A quarter of all delays were at most 12 minutes, half were at most 17 minutes and \(75 \%\) were at most 28 minutes. Only one of the delays was longer than 45 minutes. An outlier is an observation that falls either \(1.5 \times\) (interquartile range) above the upper quartile or \(1.5 \times\) (interquartile range) below the lower quartile.
  1. On the graph paper opposite draw a box plot to represent these data.
  2. Comment on the distribution of delays. Justify your answer.
  3. Suggest how the distribution might be interpreted by a passenger who frequently flies from City \(A\) to City \(B\). \includegraphics[max width=\textwidth, alt={}, center]{9698650f-ef85-468d-a703-1b40df7f9d02-07_1190_1487_278_223}
Edexcel S1 2006 June Q1
15 marks Easy -1.8
  1. (a) Describe the main features and uses of a box plot.
Children from schools \(A\) and \(B\) took part in a fun run for charity. The times, to the nearest minute, taken by the children from school \(A\) are summarised in Figure 1. \begin{figure}[h]
\captionsetup{labelformat=empty} \caption{Figure 1} \includegraphics[alt={},max width=\textwidth]{c8bade79-a39a-4055-bfae-928f5338fdfc-02_398_1045_946_461}
\end{figure} (b) (i) Write down the time by which \(75 \%\) of the children in school \(A\) had completed the run.
(ii) State the name given to this value.
(c) Explain what you understand by the two crosses ( X ) on Figure 1.
For school \(B\) the least time taken by any of the children was 25 minutes and the longest time was 55 minutes. The three quartiles were 30,37 and 50 respectively.
(d) Draw a box plot to represent the data from school \(B\). \includegraphics[max width=\textwidth, alt={}, center]{c8bade79-a39a-4055-bfae-928f5338fdfc-03_798_1196_580_372}
(e) Compare and contrast these two box plots.
Edexcel S1 2013 June Q4
14 marks Moderate -0.8
4. The following table summarises the times, \(t\) minutes to the nearest minute, recorded for a group of students to complete an exam.
Time (minutes) \(t\)\(11 - 20\)\(21 - 25\)\(26 - 30\)\(31 - 35\)\(36 - 45\)\(46 - 60\)
Number of students f628816131110
$$\text { [You may use } \sum \mathrm { f } t ^ { 2 } = 134281.25 \text { ] }$$
  1. Estimate the mean and standard deviation of these data.
  2. Use linear interpolation to estimate the value of the median.
  3. Show that the estimated value of the lower quartile is 18.6 to 3 significant figures.
  4. Estimate the interquartile range of this distribution.
  5. Give a reason why the mean and standard deviation are not the most appropriate summary statistics to use with these data. The person timing the exam made an error and each student actually took 5 minutes less than the times recorded above. The table below summarises the actual times.
    Time (minutes) \(t\)\(6 - 15\)\(16 - 20\)\(21 - 25\)\(26 - 30\)\(31 - 40\)\(41 - 55\)
    Number of students f628816131110
  6. Without further calculations, explain the effect this would have on each of the estimates found in parts (a), (b), (c) and (d).
Edexcel S1 2014 June Q5
12 marks Moderate -0.8
  1. The table shows the time, to the nearest minute, spent waiting for a taxi by each of 80 people one Sunday afternoon.
Waiting time
(in minutes)
Frequency
\(2 - 4\)15
\(5 - 6\)9
76
824
\(9 - 10\)14
\(11 - 15\)12
  1. Write down the upper class boundary for the \(2 - 4\) minute interval. A histogram is drawn to represent these data. The height of the tallest bar is 6 cm .
  2. Calculate the height of the second tallest bar.
  3. Estimate the number of people with a waiting time between 3.5 minutes and 7 minutes.
  4. Use linear interpolation to estimate the median, the lower quartile and the upper quartile of the waiting times.
  5. Describe the skewness of these data, giving a reason for your answer.
Edexcel S1 2014 June Q1
9 marks Moderate -0.8
  1. A random sample of 35 homeowners was taken from each of the villages Greenslax and Penville and their ages were recorded. The results are summarised in the back-to-back stem and leaf diagram below.
TotalsGreenslaxPenvilleTotals
(2)8725567889(7)
(3)98731112344569(11)
(4)4440401247(5)
(5)66522500555(5)
(7)865421162566(4)
(8)8664311705(2)
(5)984328(0)
(1)499(1)
Key: 7 | 3 | 1 means 37 years for Greenslax and 31 years for Penville
Some of the quartiles for these two distributions are given in the table below.
GreenslaxPenville
Lower quartile, \(Q _ { 1 }\)\(a\)31
Median, \(Q _ { 2 }\)6439
Upper quartile, \(Q _ { 3 }\)\(b\)55
  1. Find the value of \(a\) and the value of \(b\). An outlier is a value that falls either $$\begin{aligned} & \text { more than } 1.5 \times \left( Q _ { 3 } - Q _ { 1 } \right) \text { above } Q _ { 3 } \\ & \text { or more than } 1.5 \times \left( Q _ { 3 } - Q _ { 1 } \right) \text { below } Q _ { 1 } \end{aligned}$$
  2. On the graph paper opposite draw a box plot to represent the data from Penville. Show clearly any outliers.
  3. State the skewness of each distribution. Justify your answers. \includegraphics[max width=\textwidth, alt={}, center]{8270bcae-494c-4248-8229-a72e9e84eab0-03_930_1237_1800_367}
Edexcel S1 2014 June Q6
11 marks Moderate -0.3
6. The times, in seconds, spent in a queue at a supermarket by 85 randomly selected customers, are summarised in the table below.
Time (seconds)Number of customers, \(f\)
0-302
30-6010
60-7017
70-8025
80-10025
100-1506
A histogram was drawn to represent these data. The \(30 - 60\) group was represented by a bar of width 1.5 cm and height 1 cm .
  1. Find the width and the height of the \(70 - 80\) group.
  2. Use linear interpolation to estimate the median of this distribution. Given that \(x\) denotes the midpoint of each group in the table and $$\sum f x = 6460 \quad \sum f x ^ { 2 } = 529400$$
  3. calculate an estimate for
    1. the mean,
    2. the standard deviation,
      for the above data. One measure of skewness is given by $$\text { coefficient of skewness } = \frac { 3 ( \text { mean } - \text { median } ) } { \text { standard deviation } }$$
  4. Evaluate this coefficient and comment on the skewness of these data.
Edexcel S1 2015 June Q1
14 marks Easy -1.2
  1. Each of 60 students was asked to draw a \(20 ^ { \circ }\) angle without using a protractor. The size of each angle drawn was measured. The results are summarised in the box plot below. \includegraphics[max width=\textwidth, alt={}, center]{9626e3ce-35d6-41b5-a0bd-1185f38b9e36-02_371_1040_340_461}
    1. Find the range for these data.
    2. Find the interquartile range for these data.
    The students were then asked to draw a \(70 ^ { \circ }\) angle.
    The results are summarised in the table below.
    Angle, \(\boldsymbol { a }\), (degrees)Number of students
    \(55 \leqslant a < 60\)6
    \(60 \leqslant a < 65\)15
    \(65 \leqslant a < 70\)13
    \(70 \leqslant a < 75\)11
    \(75 \leqslant a < 80\)8
    \(80 \leqslant a < 85\)7
  2. Use linear interpolation to estimate the size of the median angle drawn. Give your answer to 1 decimal place.
  3. Show that the lower quartile is \(63 ^ { \circ }\) For these data, the upper quartile is \(75 ^ { \circ }\), the minimum is \(55 ^ { \circ }\) and the maximum is \(84 ^ { \circ }\) An outlier is an observation that falls either more than \(1.5 \times\) (interquartile range) above the upper quartile or more than \(1.5 \times\) (interquartile range) below the lower quartile.
    1. Show that there are no outliers for these data.
    2. Draw a box plot for these data on the grid on page 3.
  4. State which angle the students were more accurate at drawing. Give reasons for your answer.
    (3) \includegraphics[max width=\textwidth, alt={}, center]{9626e3ce-35d6-41b5-a0bd-1185f38b9e36-03_378_1059_2067_447}
Edexcel S1 Q2
11 marks Easy -1.3
2. A botany student counted the number of daisies in each of 42 randomly chosen areas of 1 m by 1 m in a large field. The results are summarised in the following stem and leaf diagram.
Number of daisies\(1 \mid 1\) means 11
11223444(7)
15567899(7)
200133334(8)
25567999(7)
3001244(6)
366788(5)
413(2)
  1. Write down the modal value of these data.
  2. Find the median and the quartiles of these data.
  3. On graph paper and showing your scale clearly, draw a box plot to represent these data.
  4. Comment on the skewness of this distribution. The student moved to another field and collected similar data from that field.
  5. Comment on how the student might summarise both sets of raw data before drawing box plots.
    (1 mark)
Edexcel S1 2003 November Q6
16 marks Moderate -0.8
6. A travel agent sells holidays from his shop. The price, in \(\pounds\), of 15 holidays sold on a particular day are shown below.
29910502315999485
3501691015650830
992100689550475
For these data, find
  1. the mean and the standard deviation,
  2. the median and the inter-quartile range. An outlier is an observation that falls either more than \(1.5 \times\) (inter-quartile range) above the upper quartile or more than \(1.5 \times\) (inter-quartile range) below the lower quartile.
  3. Determine if any of the prices are outliers. The travel agent also sells holidays from a website on the Internet. On the same day, he recorded the price, \(\pounds x\), of each of 20 holidays sold on the website. The cheapest holiday sold was \(\pounds 98\), the most expensive was \(\pounds 2400\) and the quartiles of these data were \(\pounds 305 , \pounds 1379\) and \(\pounds 1805\). There were no outliers.
  4. On graph paper, and using the same scale, draw box plots for the holidays sold in the shop and the holidays sold on the website.
  5. Compare and contrast sales from the shop and sales from the website. \section*{END}
AQA S1 2011 June Q1
11 marks Easy -1.3
1 The number of matches in each of a sample of 85 boxes is summarised in the table.
Number of matchesNumber of boxes
Less than 2391
239-2431
244-2462
2473
2484
2496
25010
25113
25216
25320
2545
255-2593
More than 2591
Total85
  1. For these data:
    1. state the modal value;
    2. determine values for the median and the interquartile range.
  2. Given that, on investigation, the 2 extreme values in the above table are 227 and 271 :
    1. calculate the range;
    2. calculate estimates of the mean and the standard deviation.
  3. For the numbers of matches in the 85 boxes, suggest, with a reason, the most appropriate measure of spread.
AQA S1 2016 June Q2
8 marks Moderate -0.8
2 A small chapel was open to visitors for 55 days during the summer of 2015. The table summarises the daily numbers of visitors.
Number of visitorsNumber of days
20 or fewer1
212
223
236
248
2510
2613
277
282
291
30 or more2
Total55
  1. For these data:
    1. state the modal value;
    2. find values for the median and the interquartile range.
  2. Name one measure of average and one measure of spread that cannot be calculated exactly from the data in the table.
    [0pt] [2 marks]
  3. Reference to the raw data revealed that the 3 unknown exact values in the table were 13,37 and 58. Making use of this additional information, together with the data in the table, calculate the value of each of the two measures that you named in part (b).
    [0pt] [3 marks]
Edexcel S1 Q5
12 marks Easy -1.3
5. For a project, a student asked 40 people to draw two straight lines with what they thought was an angle of \(75 ^ { \circ }\) between them, using just a ruler and a pencil. She then measured the size of the angles that had been drawn and her data are summarised in this stem and leaf diagram.
Angle( \(6 \mid 4\) means \(64 ^ { \circ }\) )Totals
41(1)
4(0)
5024(3)
5589(3)
611334(5)
655789(5)
7011233444(9)
75667799(7)
801134(5)
856(2)
  1. Find the median and quartiles of these data. Given that any values outside of the limits \(\mathrm { Q } _ { 1 } - 1.5 \left( \mathrm { Q } _ { 3 } - \mathrm { Q } _ { 1 } \right)\) and \(\mathrm { Q } _ { 3 } + 1.5 \left( \mathrm { Q } _ { 3 } - \mathrm { Q } _ { 1 } \right)\) are to be regarded as outliers,
  2. determine if there are any outliers in these data,
  3. draw a box plot representing these data on graph paper,
  4. describe the skewness of the distribution and suggest a reason for it.
OCR H240/02 2018 September Q10
6 marks Easy -1.8
10 The table shows information, derived from the 2011 UK census, about the percentage of employees who used various methods of travel to work in four Local Authorities.
Local AuthorityUnderground, metro, light rail or tramTrainBusDriveWalk or cycle
A0.3\%4.5\%17\%52.8\%11\%
B0.2\%1.7\%1.7\%63.4\%11\%
C35.2\%3.0\%12\%11.7\%16\%
D8.9\%1.4\%9\%54.7\%10\%
One of the Local Authorities is a London borough and two are metropolitan boroughs, not in London.
  1. Which one of the Local Authorities is a London borough? Give a reason for your answer.
  2. Which two of the Local Authorities are metropolitan boroughs outside London? In each case give a reason for your answer.
  3. Describe one difference between the public transport available in the two metropolitan boroughs, as suggested by the table.
  4. Comment on the availability of public transport in Local Authority B as suggested by the table.
OCR H240/02 2022 June Q10
10 marks Easy -1.8
10 The table shows the age structure of usual residents of 18 Local Authorities (LAs) in the North West region of the UK in 2011.
Local AuthorityAge 0 to 17Age 18 to 24Age 25 to 64Age 65 and over
A26.20\%9.06\%51.81\%12.92\%
B23.32\%8.99\%52.32\%15.37\%
C22.24\%8.96\%52.56\%16.23\%
D22.67\%8.10\%53.27\%15.96\%
E20.70\%7.77\%54.77\%16.76\%
F18.14\%6.51\%51.13\%24.21\%
G18.96\%14.20\%48.51\%18.33\%
H19.06\%14.79\%52.12\%14.04\%
I25.15\%9.04\%51.16\%14.65\%
J22.93\%8.81\%52.22\%16.04\%
K21.48\%13.98\%50.82\%13.73\%
L23.98\%9.20\%52.26\%14.56\%
M21.67\%11.19\%52.94\%14.19\%
N17.82\%6.01\%51.93\%24.23\%
O22.83\%7.30\%53.86\%16.01\%
P21.76\%8.28\%54.03\%15.93\%
Q21.42\%8.43\%53.90\%16.25\%
R18.61\%7.33\%49.35\%24.71\%
\section*{Percentage of residents}
  1. Without reference to any other columns, explain how you would use only the columns for the age ranges 0 to 17 and 18 to 24 to decide whether an LA might be one of the following.
    1. An LA that includes a university
    2. An LA that attracts young couples to live
    3. An LA that attracts retired people to live
  2. Using your answers to part (a), identify the following.
    1. Four LAs that might include a university
    2. Three LAs that might be attractive to retired people
  3. Explain why your answer to part (b)(ii), based only on the columns for the age ranges 0 to 17 and 18 to 24, may not be reliable.
  4. The lower quartile, median and upper quartile of the percentages in the column "Age 65 and over" are \(14.56 \% , 15.99 \%\) and \(16.76 \%\) respectively. Use this information to comment on your answers to part (b)(ii) and part (c). In a magazine article, a councillor plans to describe a typical LA in the North West region. He wants to quote the average percentage of residents aged 65 or over.
  5. The mean of the percentages in the column "Age 65 and over" is \(16.90 \%\). Use this information, and the information given in part (d), to explain whether the median or the mean better represents the data in the column "Age 65 and over".
AQA AS Paper 2 2021 June Q15
3 marks Easy -1.8
15
The number of hours of sunshine and the daily maximum temperature were recorded over a 9-day period in June at an English seaside town. A scatter diagram representing the recorded data is shown below. \includegraphics[max width=\textwidth, alt={}, center]{f87d1b36-26db-4a0b-b9ec-d7d82a396aba-20_872_1511_488_264} One of the points on the scatter diagram is an error. 15
    1. Write down the letter that identifies this point.
      15
      1. (ii) Suggest one possible action that could be taken to deal with this error.
        15
    2. It is claimed that the scatter diagram proves that longer hours of sunshine cause
      higher maximum daily temperatures. Comment on the validity of this claim.
      [0pt] [1 mark]
Pre-U Pre-U 9794/2 2016 Specimen Q10
15 marks Standard +0.8
10 A curve has equation $$y = \mathrm { e } ^ { a x } \cos b x$$ where \(a\) and \(b\) are constants.
  1. Show that, at any stationary points on the curve, \(\tan b x = \frac { a } { b }\).
  2. \includegraphics[max width=\textwidth, alt={}, center]{1c957cfe-bead-41d9-8985-479e876e1616-4_620_896_959_333} Values of related quantities \(x\) and \(y\) were measured in an experiment and plotted on a graph of \(y\) against \(x\), as shown in the diagram. Two of the points, labelled \(A\) and \(B\), have coordinates \(( 0,1 )\) and \(( 0.2 , - 0.8 )\) respectively. A third point labelled C has coordinates ( \(0.3,0.04\) ). Attempts were then made to find the equation of a curve which fitted closely to these three points, and two models were proposed. In the first model the equation is \(y = \mathrm { e } ^ { - x } \cos 15 x\).
    In the second model the equation is \(y = f \cos ( \lambda x ) + \mathrm { g }\), where the constants \(f , \lambda\), and \(g\) are chosen to give a maximum precisely at the point \(A ( 0,1 )\) and a minimum precisely at the point \(B ( 0.2 , - 0.8 )\). By calculating suitable values evaluate the suitability of the two models.
WJEC Unit 2 2022 June Q2
Standard +0.3
The probability distribution for \(X\), the lifetime of a light bulb, in hours, is given below.
\(X\)\(256 \leqslant x < 259\)\(259 \leqslant x < 262\)\(262 \leqslant x < 265\)\(265 \leqslant x < 267\)\(267 \leqslant x < 300\)
Probability0.050.250.450.200.05
a) Suppose that a random sample of 40 light bulbs is tested, and a histogram is drawn of their lifetimes. Calculate the expected height of the bar for the interval \(262 \leqslant x < 265\).
b) Now suppose that the last two intervals are changed to \(265 \leqslant x < 268\) and \(268 \leqslant x < 300\). Explain why it is not possible to tell what will happen to the expected heights of the last two bars.
c) Celyn collects a different random sample of 40 light bulbs to test. She draws a histogram of their lifetimes and finds that it is different to the histogram referred to in part (a). Should Celyn be concerned that the two histograms are different?
Edexcel S1 2002 November Q7
18 marks Moderate -0.8
The following stem and leaf diagram shows the aptitude scores \(x\) obtained by all the applicants for a particular job.
Aptitude score\(3|1\) means 31
31 2 9(3)
42 4 6 8 9(5)
51 3 3 5 6 7 9(7)
60 1 3 3 3 5 6 8 8 9(10)
71 2 2 2 4 5 5 5 6 8 8 8 8 9(14)
80 1 2 3 5 8 8 9(8)
90 1 2(3)
  1. Write down the modal aptitude score. [1]
  2. Find the three quartiles for these data. [3]
Outliers can be defined to be outside the limits \(Q_1 - 1.0(Q_3 - Q_1)\) and \(Q_3 + 1.0(Q_3 - Q_1)\).
  1. On a graph paper, draw a box plot to represent these data. [7]
For these data, \(\Sigma x = 3363\) and \(\Sigma x^2 = 238305\).
  1. Calculate, to 2 decimal places, the mean and the standard deviation for these data. [3]
  2. Use two different methods to show that these data are negatively skewed. [4]