Data representation - Question Types

Draw histogram then perform calculations

The question provides a frequency table with unequal class widths, asks the student to draw a histogram, and then requires additional statistical work such as finding quartiles, medians, or other summary statistics from the histogram or data.

38

12.8% of questions

Show example »

2 The table summarises the lengths in centimetres of 104 dragonflies.

Length $( \mathrm { cm } )$	$2.0 - 3.5$	$3.5 - 4.5$	$4.5 - 5.5$	$5.5 - 7.0$	$7.0 - 9.0$
Frequency	8	25	28	31	12

State which class contains the upper quartile.
Draw a histogram, on graph paper, to represent the data.

View full question →

Draw cumulative frequency graph from frequency table

Questions that provide data in frequency form (not cumulative) and require students to first calculate cumulative frequencies before drawing the graph.

18

6.0% of questions

Show example »

2 The manager of a company noted the times spent in 80 meetings. The results were as follows.

Time ( $t$ minutes)	$0 < t \leqslant 15$	$15 < t \leqslant 30$	$30 < t \leqslant 60$	$60 < t \leqslant 90$	$90 < t \leqslant 120$
Number of meetings	4	7	24	38	7

Draw a cumulative frequency graph and use this to estimate the median time and the interquartile range.

View full question →

Calculate frequency density from frequency

Questions where students must calculate frequency density values given frequencies and class widths, typically to complete a table before drawing a histogram.

18

6.0% of questions

Show example »

3. The variable $x$ was measured to the nearest whole number. Forty observations are given in the table below.

$x$	$10 - 15$	$16 - 18$	$19 -$
Frequency	15	9	16

A histogram was drawn and the bar representing the $10 - 15$ class has a width of 2 cm and a height of 5 cm . For the $16 - 18$ class find

the width,
the height
of the bar representing this class.

View full question →

Estimate mean and standard deviation from frequency table

Questions that provide a grouped frequency table directly (with or without midpoints pre-calculated) and ask the student to calculate estimates of mean and/or standard deviation.

17

5.7% of questions

Show example »

2 The table summarises the diameters, $d$ millimetres, of a random sample of 60 new cricket balls to be used in junior cricket.

View full question →

State advantages of diagram types

A question is this type if and only if it asks the student to give advantages or disadvantages of using specific types of statistical diagrams (e.g., box plot vs stem-and-leaf).

11

3.7% of questions

Show example »

1 A study of the ages of car drivers in a certain country produced the results shown in the table. \begin{table}[h]

\captionsetup{labelformat=empty} \caption{Percentage of drivers in each age group}

	Young	Middle-aged	Elderly
Males	40	35	25
Females	20	70	10

\end{table} Illustrate these results diagrammatically.

View full question →

Draw cumulative frequency graph from cumulative frequency table

Questions that provide data already in cumulative frequency form and ask students to draw the cumulative frequency graph or curve directly.

11

3.7% of questions

Show example »

2 In a recent survey, 640 people were asked about the length of time each week that they spent watching television. The median time was found to be 20 hours, and the lower and upper quartiles were 15 hours and 35 hours respectively. The least amount of time that anyone spent was 3 hours, and the greatest amount was 60 hours.

On graph paper, show these results using a fully labelled cumulative frequency graph.
Use your graph to estimate how many people watched more than 50 hours of television each week.

View full question →

Estimate single values from cumulative frequency graph

Questions that ask students to read off and estimate specific statistical measures (median, quartiles, percentiles) or single frequency values from a cumulative frequency graph.

11

3.7% of questions

Show example »

4 Fig. 4 shows a cumulative frequency diagram for the time spent revising mathematics by year 11 students at a certain school during a week in the summer term. \begin{figure}[h]

\includegraphics[alt={},max width=\textwidth]{cea67565-8074-4703-8e1a-09b98e380baf-05_554_1070_737_242} \captionsetup{labelformat=empty} \caption{Fig. 4}

\end{figure}

Use the diagram to estimate the median time spent revising mathematics by these students. [1] A teacher comments that $90 \%$ of the students spent less than an hour revising mathematics during this week.
Determine whether the information in the diagram supports this comment.

View full question →

Describe shape or skewness of distribution

A question is this type if and only if it asks the student to identify or name the type of skewness (positive, negative) from a graph or summary statistics.

10

3.4% of questions

Show example »

12 The box plot below shows summary data for the number of minutes late that buses arrived at a rural bus stop.
\includegraphics[max width=\textwidth, alt={}, center]{6ad3bac9-bf08-443d-8be2-b0c26209ffe8-17_515_1614_1384_212} Identify which term best describes the distribution of this data.
Circle your answer.
[0pt] [1 mark]
negatively skewed
normal
positively skewed
symmetrical

View full question →

Calculate range and interquartile range

A question is this type if and only if it asks the student to find the range and/or interquartile range from raw data, a box plot, or summary statistics.

10

3.4% of questions

Show example »

1 The times taken, in minutes, by 80 people to complete a crossword puzzle are summarised by the box and whisker plot below.
\includegraphics[max width=\textwidth, alt={}, center]{acb05873-e441-4b95-9732-6ebd5ae79fa6-2_147_848_507_612}

Write down the range and the interquartile range of the times.
Determine whether any of the times can be regarded as outliers.
Describe the shape of the distribution of the times.

View full question →

Compare distributions using stem-and-leaf

A question is this type if and only if it provides a back-to-back stem-and-leaf diagram and asks for comparison or advantages of this representation.

9

3.0% of questions

Show example »

12
12
The box plot below summarises the $\mathrm { CO } _ { 2 }$ emissions, in $\mathrm { g } / \mathrm { km }$, for cars in the Large Data Set from the London and North West regions.
London
39
119142168
346
North West

View full question →

Use linear interpolation for median or quartiles

A question is this type if and only if it asks the student to estimate the median, quartiles, or percentiles from grouped data using interpolation within a class interval.

9

3.0% of questions

Show example »

The histogram summarises the heights of 256 seedlings two weeks after they were planted.
\includegraphics[max width=\textwidth, alt={}, center]{08e3b0b0-2155-4b37-83e3-343c317ca10c-06_1242_1810_287_132}
1. Use linear interpolation to estimate the median height of the seedlings.
  (4)
Chris decides to model the frequency density for these 256 seedlings by a curve with equation $$y = k x ( 8 - x ) \quad 0 \leqslant x \leqslant 8$$ where $k$ is a constant.
Find the value of $k$ Using this model,
write down the median height of the seedlings.

View full question →

Draw histogram from frequency table

The question provides a frequency table with unequal class widths and explicitly asks the student to draw a histogram, with no additional statistical calculations required beyond the histogram construction itself.

9

3.0% of questions

Show example »

1 The number of minutes of recorded music on a sample of 100 CDs is summarised below.

Time ( $t$ minutes)	$40 \leqslant t < 45$	$45 \leqslant t < 50$	$50 \leqslant t < 60$	$60 \leqslant t < 70$	$70 \leqslant t < 90$
Number of CDs	26	18	31	16	9

Illustrate the data by means of a histogram.
Identify two features of the distribution.

View full question →

Estimate mean and standard deviation from histogram

Questions that provide a histogram (rather than a frequency table) and require the student to first extract frequency data from the histogram before calculating estimates of mean and/or standard deviation.

9

3.0% of questions

Show example »

4 The following histogram summarises the times, in minutes, taken by 190 people to complete a race.
\includegraphics[max width=\textwidth, alt={}, center]{df246a50-157b-49f7-bba0-f9b86960b8b9-2_1210_1125_1251_513}

Show that 75 people took between 200 and 250 minutes to complete the race.
Calculate estimates of the mean and standard deviation of the times of the 190 people.
Explain why your answers to part (ii) are estimates.

View full question →

Outliers from box plot or summary statistics

Questions where the five-number summary or quartiles are directly given or read from a box plot, and the student applies the 1.5×IQR rule to determine outliers.

9

3.0% of questions

Show example »

1 A business analyst collects data about the distribution of hourly wages, in $\pounds$, of shop-floor workers at a factory. These data are illustrated in the box and whisker plot.
\includegraphics[max width=\textwidth, alt={}, center]{56f1bd5c-4b45-4e36-a324-e7e0edbb5bdd-1_206_1420_505_397}

Name the type of skewness of the distribution.
Find the interquartile range and hence show that there are no outliers at the lower end of the distribution, but there is at least one outlier at the upper end.
Suggest possible reasons why this may be the case.

View full question →

Direct frequency calculation from histogram

Questions that ask students to calculate the frequency (number of items) in one or more specific class intervals by reading frequency density from the histogram and multiplying by class width.

8

2.7% of questions

Show example »

5. Figure 2 is a histogram showing the distribution of the time taken in minutes, $t$, by a group of people to swim 500 m . \begin{figure}[h]

\captionsetup{labelformat=empty} \caption{Figure 2} \includegraphics[alt={},max width=\textwidth]{a58a2c41-3a53-41cc-b80f-84adb04a5f5c-11_547_1120_333_374}

\end{figure} a Find the probability that a person chosen at random from the group takes longer than 18 minutes.

View full question →

Inappropriate graph type selection

Questions asking students to explain why the chosen type of graph or diagram is inappropriate for the data being represented (e.g., wrong chart type, bipartite graph used incorrectly, radar diagrams for inappropriate comparisons).

8

2.7% of questions

Show example »

7. The table shows information, derived from the 2011 UK census, about the percentage of employees who used various methods of travel to work in four Local Authorities.

Local Authority	Underground, metro, light rail or tram	Train	Bus	Drive	Walk or cycle
A	0.3\%	4.5\%	17\%	52.8\%	11\%
B	0.2\%	1.7\%	1.7\%	63.4\%	11\%
C	35.2\%	3.0\%	12\%	11.7\%	16\%
D	8.9\%	1.4\%	9\%	54.7\%	10\%

One of the Local Authorities is a London borough and two are metropolitan boroughs, not in London.

Which one of the Local Authorities is a London borough? Give a reason for your answer.
Which two of the Local Authorities are metropolitan boroughs outside London? In each case give a reason for your answer.
Describe one difference between the public transport available in the two metropolitan boroughs, as suggested by the table.
[0pt] [BLANK PAGE]

View full question →

Misinterpretation of data in graphs

Questions asking students to critique misleading conclusions or statements drawn from graphs, where the graph itself may be correct but the interpretation or accompanying text is flawed.

8

2.7% of questions

Show example »

Keith is studying the variable Daily Mean Wind Direction, in degrees, from the large data set.

Keith summarised the data for Camborne from 1987 into 4 directions $A , B , C$ and $D$ representing North, South, East and West in some order.

Direction	$A$	$B$	$C$	$D$
Frequency	22	48	56	58

Using your knowledge of the large data set state, giving a reason, which direction $A$ represents. The entry for Hurn on 27th September 1987 was 999
State, giving a reason, what Keith should do with this value.

View full question →

Complete frequency table from histogram only

Questions that provide a histogram and ask students to construct or complete a frequency table by reading frequency densities and calculating frequencies from the histogram bars.

8

2.7% of questions

Show example »

2 The histogram shows the amount of money, in pounds, spent by the customers at a supermarket on a particular day.
\includegraphics[max width=\textwidth, alt={}, center]{c7cb0f6b-7b6b-4c52-8287-7efc6bd70247-2_985_1473_470_379}

Express the data in the form of a grouped frequency table.
Use your table to estimate the total amount of money spent by customers on that day.

View full question →

Draw back-to-back stem-and-leaf diagram

A question is this type if and only if it provides two data sets and asks the student to represent them using a back-to-back stem-and-leaf diagram.

7

2.3% of questions

Show example »

6 Emly at a atcula cmay a $b$ ty $t$ duc abc $t$ cmay dcd $t t$
$t u$ ac day at ay tm bt am ad m F a adm aml mly
$t u m b u a b c t y a b$
a $g$ t llg tabl
$k g u a c$ daym am $t m T$
duc xtm ad all mly $t k$
ad $t$ ya at $t$ tduct xtm

Emly
$B$
$A t$

a ad aml tt t tt at t gcac ll t t ulat ma umb
$u$ abc a dcad ll g t tduct xtm

View full question →

Total sample size from histogram

Questions that require students to find the total number of items in the entire sample by calculating frequencies across all bars and summing them, often using a given frequency in one class to establish the scale.

7

2.3% of questions

Show example »

7 Kareem bought some tomatoes. He recorded the mass of each tomato and displayed the results in a histogram, which is shown below.
\includegraphics[max width=\textwidth, alt={}, center]{57007d39-abb0-475e-9ed8-03021fa1273b-05_1273_1849_363_109} Determine how many tomatoes Kareem bought.
[0pt] [2] Answer all the questions.
Section B (77 marks)

View full question →

Outliers from cumulative frequency diagram

Questions where quartiles must first be estimated from a cumulative frequency diagram before applying the 1.5×IQR rule to identify outliers.

7

2.3% of questions

Show example »

6 The birth weights in grams of a random sample of 1000 babies are displayed in the cumulative frequency diagram below.
\includegraphics[max width=\textwidth, alt={}, center]{05b96db3-93c7-4921-a1c6-c7b2f8952a8f-4_1264_1553_486_296}

Use the diagram to estimate the median and interquartile range of the data.
Use your answers to part (i) to estimate the number of outliers in the sample.
Should these outliers be excluded from any further analysis? Briefly explain your answer.
Any baby whose weight is below the 10th percentile is selected for careful monitoring. Use the diagram to determine the range of weights of the babies who are selected.
$12 \%$ of new-born babies require some form of special care. A maternity unit has 17 new-born babies. You may assume that these 17 babies form an independent random sample.
Find the probability that
(A) exactly 2 of these 17 babies require special care,
(B) more than 2 of the 17 babies require special care.
On 100 independent occasions the unit has 17 babies. Find the expected number of occasions on which there would be more than 2 babies who require special care.

View full question →

Draw box plot from summary statistics

A question is this sub-type if and only if it provides pre-calculated summary statistics (minimum, quartiles, median, maximum) and asks the student to draw a box-and-whisker plot directly from these values.

7

2.3% of questions

Show example »

The students in a class were each asked to write down how many CDs they owned. The student with the least number of CDs had 14 and all but one of the others owned 60 or fewer. The remaining student owned 65 . The quartiles for the class were 30,34 and 42 respectively.

Outliers are defined to be any values outside the limits of $1.5 \left( Q _ { 3 } - Q _ { 1 } \right)$ below the lower quartile or above the upper quartile. On graph paper draw a box plot to represent these data, indicating clearly any outliers.
(7 marks)

View full question →

Draw box plot from raw data

A question is this sub-type if and only if it provides raw data or a stem-and-leaf diagram and asks the student to first calculate the summary statistics (quartiles, median, etc.) before drawing the box-and-whisker plot.

6

2.0% of questions

Show example »

1 The number of passengers getting off the 11.45 am train at a railway station on each of 35 days is summarised as follows.

View full question →

Compare or interpret cumulative frequency graphs

Questions that require students to compare two or more cumulative frequency graphs, interpret features, or make judgments about which distribution has certain properties.

5

1.7% of questions

Show example »

1. 200 candidates took each of two examination papers. The diagram shows the cumulative frequency graphs for their marks.
\includegraphics[max width=\textwidth, alt={}, center]{5b55a372-2cc8-454e-a10a-cabdc9801421-04_1091_1484_429_285}

State, with a reason, which of the two papers was the easier one.
The minimum mark for grade A , the top grade, on Paper 1 was 10 marks lower than the minimum mark for grade A on Paper 2. Given that 25 candidates gained grade A in Paper 1, find the number of candidates who gained grade A in Paper 2.
The mean and standard deviation of the marks on Paper 1 were 36.5 and 28.2 respectively. Later, a marking error was discovered and it was decided to add 1 mark to each of the 200 marks on Paper 1. State the mean and standard deviation of the new marks on Paper 1.
[0pt] [BLANK PAGE]

View full question →

Critique given sampling method

A question is this sub-type if and only if it presents a specific sampling method already chosen or implemented and asks the student to identify flaws, give reasons why it is unsatisfactory, or explain why it fails to produce a random/representative sample.

5

1.7% of questions

Show example »

2. A video rental shop needs to find out whether or not videos have been rewound when they are returned; it will do this by taking a sample of returned videos

State one advantage and one disadvantage of taking a sample.
Suggest a suitable sampling frame.
Describe the sampling units.
Criticise the sampling method of looking at just one particular shelf of videos.

View full question →

Describe or suggest sampling method

A question is this sub-type if and only if it asks the student to describe how to implement a valid sampling method, suggest an appropriate sampling technique for a given context, or identify suitable sampling frames/units without critiquing a pre-existing flawed method.

5

1.7% of questions

Show example »

A random sample is to be taken from the A-level results obtained by the final-year students in a Sixth Form College. Suggest
1. suitable sampling units,
2. a suitable sampling frame.
3. Would it be advisable simply to use the results of all those doing A-level Maths?
Explain your answer.

View full question →

Complete frequency table and histogram together

Questions that provide both a partially completed histogram AND a partially completed frequency table, requiring students to use information from both to fill in missing values in the table.

5

1.7% of questions

Show example »

6 The times taken by 57 athletes to run 100 metres are summarised in the following cumulative frequency table.

Time (seconds)	$< 10.0$	$< 10.5$	$< 11.0$	$< 12.0$	$< 12.5$	$< 13.5$
Cumulative frequency	0	4	10	40	49	57

State how many athletes ran 100 metres in a time between 10.5 and 11.0 seconds.
Draw a histogram on graph paper to represent the times taken by these athletes to run 100 metres.
Calculate estimates of the mean and variance of the times taken by these athletes.

View full question →

Select random sample using random numbers

A question is this type if and only if it provides random number tables or calculator output and asks the student to identify sample members.

4

1.3% of questions

Show example »

1 A residents' association has 654 members, numbered from 1 to 654 . The secretary wishes to send a questionnaire to a random sample of members. In order to choose the members for the sample she uses a table of random numbers. The first line in the table is as follows. $$\begin{array} { l l l l l l } 1096 & 4357 & 3765 & 0431 & 0928 & 9264 \end{array}$$ The numbers of the first two members in the sample are 109 and 643. Find the numbers of the next three members in the sample.

View full question →

Calculate using histogram bar dimensions

Questions where students must use the physical dimensions (height and width) of histogram bars to find frequency density or frequency values, requiring understanding that area represents frequency.

4

1.3% of questions

Show example »

4 The weights in grams of a number of stones, measured correct to the nearest gram, are represented in the following table.

Weight (grams)	$1 - 10$	$11 - 20$	$21 - 25$	$26 - 30$	$31 - 50$	$51 - 70$
Frequency	$2 x$	$4 x$	$3 x$	$5 x$	$4 x$	$x$

A histogram is drawn with a scale of 1 cm to 1 unit on the vertical axis, which represents frequency density. The $1 - 10$ rectangle has height 3 cm .

Calculate the value of $x$ and the height of the 51-70 rectangle.
Calculate an estimate of the mean weight of the stones.

View full question →

Identify and compare sampling techniques

A question is this sub-type if and only if it asks the student to name/state the sampling method being used, compare advantages/disadvantages of different sampling techniques, or explain differences between sampling approaches such as census versus sample or stratified versus simple random sampling.

4

1.3% of questions

Show example »

(a) Briefly describe the difference between a census and a sample survey.
(b) Illustrate the difference by considering the case of a village council which has to decide whether or not to build a new village hall.

Given that the council decides to use a sample survey,
(c) suggest suitable sampling units.

View full question →

Misleading visual representation features

Questions asking students to identify why a graph or chart is misleading due to visual design choices (e.g., truncated axes, inappropriate scales, misleading pictograms, or distorted visual elements).

4

1.3% of questions

Show example »

12 A student plots the scatter diagram below showing the mass in kilograms against the $\mathrm { CO } _ { 2 }$ emissions in grams per kilogram for a sample of cars in the Large Data Set.
\includegraphics[max width=\textwidth, alt={}, center]{7cca79eb-fd09-4ec0-8a1d-a7a38ca73f7a-16_947_1445_794_296} Their teacher tells them to remove an error to clean the data.
Identify the data point which should be removed.
Circle your answer below.
[0pt] [1 mark]
A
B
C
D

View full question →

Estimate percentages or proportions from graphs

A question is this type if and only if it asks the student to estimate what percentage or proportion of data falls in a certain range using a cumulative frequency graph or histogram.

3

1.0% of questions

Show example »

1. \begin{figure}[h]

\includegraphics[alt={},max width=\textwidth]{d62e5a00-cd23-417f-b244-8b3e24da4aa2-02_849_1271_246_303} \captionsetup{labelformat=empty} \caption{Figure 1}

\end{figure} The histogram in Figure 1 shows the times taken to complete a crossword by a random sample of students. The number of students who completed the crossword in more than 15 minutes is 78
Estimate the percentage of students who took less than 11 minutes to complete the crossword.

View full question →

Draw box plot from cumulative frequency

A question is this sub-type if and only if it provides a cumulative frequency graph or curve and asks the student to read off or estimate the summary statistics from the graph before drawing the box-and-whisker plot.

2

0.7% of questions

Show example »

5
\includegraphics[max width=\textwidth, alt={}, center]{b72ace6b-d3d4-401d-bffe-403c9127f2a8-3_1157_1001_258_573} The cumulative frequency graph shows the annual salaries, in thousands of euros, of a random sample of 500 adults with jobs, in France. It has been plotted using grouped data. You may assume that the lowest salary is 5000 euros and the highest salary is 80000 euros.

On graph paper, draw a box-and-whisker plot to illustrate these salaries.
Comment on the salaries of the people in this sample.
An 'outlier' is defined as any data value which is more than 1.5 times the interquartile range above the upper quartile, or more than 1.5 times the interquartile range below the lower quartile.
(a) How high must a salary be in order to be classified as an outlier?
(b) Show that none of the salaries is low enough to be classified as an outlier.

View full question →

Outliers from raw data

Questions where raw data values are provided and the student must calculate quartiles and IQR from the data before applying the outlier rule.

1

0.3% of questions

Show example »

6. The number of bags of potato crisps sold per day in a bar was recorded over a two-week period. The results are shown below. $$20,15,10,30,33,40,5,11,13,20,25,42,31,17$$

Calculate the mean of these data.
Draw a stem and leaf diagram to represent these data.
Find the median and the quartiles of these data. An outlier is an observation that falls either $1.5 \times$ (interquartile range) above the upper quartile or $1.5 \times$ (interquartile range) below the lower quartile.
Determine whether or not any items of data are outliers.
On graph paper draw a box plot to represent these data. Show your scale clearly.
Comment on the skewness of the distribution of bags of crisps sold per day. Justify your answer.

View full question →

Calculate frequency from frequency density

Questions where students must work backwards from given frequency density values to find missing frequencies, often using information from a histogram or partial table.

0

0.0% of questions

Unclassified

Questions not yet assigned to a type.

1

0.3% of questions

Show 1 unclassified »

18 ${ } ^ { \text {th } }$ April 2024} \date{} \DeclareUnicodeCharacter{25A1}{\ifmmode\square\else{$\square$}\fi} \begin{document} \maketitle Name: □
□ Instructions

Answer all the questions.
Use black or blue ink. Pencil may be used for graphs and diagrams only.
There are blank pages at the end of the paper for additional working. You must clearly indicate when you have moved onto additional pages on the question itself. Make sure to include the question number.
You are permitted to use a scientific or graphical calculator in this paper.
Where appropriate, your answer should be supported with working. Marks might be given for using a correct method, even if your answer is wrong.
Give non-exact numerical answers correct to 3 significant figures unless a different degree of accuracy is specified in the question.
The acceleration due to gravity is denoted by $g \mathrm {~ms} ^ { - 2 }$. When a numerical value is needed use $g = 9.8$ unless a different value is specified in the question.

Information

The total mark for this paper is $\mathbf { 6 0 }$ marks.
The marks for each question are shown in brackets.
You are reminded of the need for clear presentation in your answers.
You have $\mathbf { 6 0 }$ minutes for this paper.

\section*{Formulae
A Level Mathematics A (H240)} Arithmetic series
$S _ { n } = \frac { 1 } { 2 } n ( a + l ) = \frac { 1 } { 2 } n \{ 2 a + ( n - 1 ) d \}$ \section*{Geometric series} $S _ { n } = \frac { a \left( 1 - r ^ { n } \right) } { 1 - r }$
$S _ { \infty } = \frac { a } { 1 - r }$ for $| r | < 1$ \section*{Binomial series} $( a + b ) ^ { n } = a ^ { n } + { } ^ { n } \mathrm { C } _ { 1 } a ^ { n - 1 } b + { } ^ { n } \mathrm { C } _ { 2 } a ^ { n - 2 } b ^ { 2 } + \ldots + { } ^ { n } \mathrm { C } _ { r } a ^ { n - r } b ^ { r } + \ldots + b ^ { n } \quad ( n \in \mathbb { N } )$
where ${ } ^ { n } \mathrm { C } _ { r } = { } _ { n } \mathrm { C } _ { r } = \binom { n } { r } = \frac { n ! } { r ! ( n - r ) ! }$ $$( 1 + x ) ^ { n } = 1 + n x + \frac { n ( n - 1 ) } { 2 ! } x ^ { 2 } + \ldots + \frac { n ( n - 1 ) \ldots ( n - r + 1 ) } { r ! } x ^ { r } + \ldots \quad ( | x | < 1 , n \in \mathbb { R } )$$ \section*{Differentiation}

$\mathrm { f } ( x )$	$\mathrm { f } ^ { \prime } ( x )$
$\tan k x$	$k \sec ^ { 2 } k x$
$\sec x$	$\sec x \tan x$
$\cot x$	$- \operatorname { cosec } ^ { 2 } x$
$\operatorname { cosec } x$	$- \operatorname { cosec } x \cot x$

Quotient rule $y = \frac { u } { v } , \frac { \mathrm {~d} y } { \mathrm {~d} x } = \frac { v \frac { \mathrm {~d} u } { \mathrm {~d} x } - u \frac { \mathrm {~d} v } { \mathrm {~d} x } } { v ^ { 2 } }$ \section*{Differentiation from first principles} $\mathrm { f } ^ { \prime } ( x ) = \lim _ { h \rightarrow 0 } \frac { \mathrm { f } ( x + h ) - \mathrm { f } ( x ) } { h }$ \section*{Integration} $\int \frac { \mathrm { f } ^ { \prime } ( x ) } { \mathrm { f } ( x ) } \mathrm { d } x = \ln | \mathrm { f } ( x ) | + c$
$\int \mathrm { f } ^ { \prime } ( x ) ( \mathrm { f } ( x ) ) ^ { n } \mathrm {~d} x = \frac { 1 } { n + 1 } ( \mathrm { f } ( x ) ) ^ { n + 1 } + c$
Integration by parts $\int u \frac { \mathrm {~d} v } { \mathrm {~d} x } \mathrm {~d} x = u v - \int v \frac { \mathrm {~d} u } { \mathrm {~d} x } \mathrm {~d} x$ \section*{Small angle approximations} $\sin \theta \approx \theta , \cos \theta \approx 1 - \frac { 1 } { 2 } \theta ^ { 2 } , \tan \theta \approx \theta$ where $\theta$ is measured in radians \section*{Trigonometric identities} $$\begin{aligned} & \sin ( A \pm B ) = \sin A \cos B \pm \cos A \sin B
& \cos ( A \pm B ) = \cos A \cos B \mp \sin A \sin B
& \tan ( A \pm B ) = \frac { \tan A \pm \tan B } { 1 \mp \tan A \tan B } \quad \left( A \pm B \neq \left( k + \frac { 1 } { 2 } \right) \pi \right) \end{aligned}$$ \section*{Numerical methods} Trapezium rule: $\int _ { a } ^ { b } y \mathrm {~d} x \approx \frac { 1 } { 2 } h \left\{ \left( y _ { 0 } + y _ { n } \right) + 2 \left( y _ { 1 } + y _ { 2 } + \ldots + y _ { n - 1 } \right) \right\}$, where $h = \frac { b - a } { n }$
The Newton-Raphson iteration for solving $\mathrm { f } ( x ) = 0 : x _ { n + 1 } = x _ { n } - \frac { \mathrm { f } \left( x _ { n } \right) } { \mathrm { f } ^ { \prime } \left( x _ { n } \right) }$ \section*{Probability} $\mathrm { P } ( A \cup B ) = \mathrm { P } ( A ) + \mathrm { P } ( B ) - \mathrm { P } ( A \cap B )$
$\mathrm { P } ( A \cap B ) = \mathrm { P } ( A ) \mathrm { P } ( B \mid A ) = \mathrm { P } ( B ) \mathrm { P } ( A \mid B ) \quad$ or $\quad \mathrm { P } ( A \mid B ) = \frac { \mathrm { P } ( A \cap B ) } { \mathrm { P } ( B ) }$ \section*{Standard deviation} $\sqrt { \frac { \sum ( x - \bar { x } ) ^ { 2 } } { n } } = \sqrt { \frac { \sum x ^ { 2 } } { n } - \bar { x } ^ { 2 } }$ or $\sqrt { \frac { \sum f ( x - \bar { x } ) ^ { 2 } } { \sum f } } = \sqrt { \frac { \sum f x ^ { 2 } } { \sum f } - \bar { x } ^ { 2 } }$ \section*{The binomial distribution} If $X \sim \mathrm {~B} ( n , p )$ then $\mathrm { P } ( X = x ) = \binom { n } { x } p ^ { x } ( 1 - p ) ^ { n - x }$, mean of $X$ is $n p$, variance of $X$ is $n p ( 1 - p )$ \section*{Hypothesis test for the mean of a normal distribution} If $X \sim \mathrm {~N} \left( \mu , \sigma ^ { 2 } \right)$ then $\bar { X } \sim \mathrm {~N} \left( \mu , \frac { \sigma ^ { 2 } } { n } \right)$ and $\frac { \bar { X } - \mu } { \sigma / \sqrt { n } } \sim \mathrm {~N} ( 0,1 )$ \section*{Percentage points of the normal distribution} If $Z$ has a normal distribution with mean 0 and variance 1 then, for each value of $p$, the table gives the value of $z$ such that $\mathrm { P } ( Z \leqslant z ) = p$.

$p$	0.75	0.90	0.95	0.975	0.99	0.995	0.9975	0.999	0.9995
$z$	0.674	1.282	1.645	1.960	2.326	2.576	2.807	3.090	3.291

\section*{Kinematics} Motion in a straight line
$v = u + a t$
$s = u t + \frac { 1 } { 2 } a t ^ { 2 }$
$s = \frac { 1 } { 2 } ( u + v ) t$
$v ^ { 2 } = u ^ { 2 } + 2 a s$
$s = v t - \frac { 1 } { 2 } a t ^ { 2 }$ Motion in two dimensions
$\mathbf { v } = \mathbf { u } + \mathbf { a } t$
$\mathbf { s } = \mathbf { u } t + \frac { 1 } { 2 } \mathbf { a } t ^ { 2 }$
$\mathbf { s } = \frac { 1 } { 2 } ( \mathbf { u } + \mathbf { v } ) t$
$\mathbf { s } = \mathbf { v } t - \frac { 1 } { 2 } \mathbf { a } t ^ { 2 }$
1. 200 candidates took each of two examination papers. The diagram shows the cumulative frequency graphs for their marks.
\includegraphics[max width=\textwidth, alt={}, center]{5b55a372-2cc8-454e-a10a-cabdc9801421-04_1091_1484_429_285}

State, with a reason, which of the two papers was the easier one.
The minimum mark for grade A , the top grade, on Paper 1 was 10 marks lower than the minimum mark for grade A on Paper 2. Given that 25 candidates gained grade A in Paper 1, find the number of candidates who gained grade A in Paper 2.
The mean and standard deviation of the marks on Paper 1 were 36.5 and 28.2 respectively. Later, a marking error was discovered and it was decided to add 1 mark to each of the 200 marks on Paper 1. State the mean and standard deviation of the new marks on Paper 1.
[0pt] [BLANK PAGE]
2. In this question you must show detailed reasoning.
A disease that affects trees shows no visible evidence for the first few years after the tree is infected. A test has been developed to determine whether a particular tree has the disease. A positive result to the test suggests that the tree has the disease. However, the test is not $100 \%$ reliable, and a researcher uses the following model.
- If the tree has the disease, the probability of a positive result is 0.95 .
- If the tree does not have the disease, the probability of a positive result is 0.1 .
  (a) It is known that in a certain county, $A , 35 \%$ of the trees have the disease. A tree in county $A$ is chosen at random and is tested.
Given that the result is positive, determine the probability that this tree has the disease. A forestry company wants to determine what proportion of trees in another county, $B$, have the disease. They choose a large random sample of trees in county $B$. Each tree in the sample is tested and it is found that the result is positive for $43 \%$ of these trees.
(b) By carrying out a calculation, determine an estimate of the proportion of trees in county $B$ that have the disease.
[0pt] [BLANK PAGE]
3. A group of 8 people, including Kathy, David and Harpreet, are planning a theatre trip.
Four of the group are chosen at random, without regard to order, to carry the refreshments. Find the probability that these 4 people include Kathy and David but not Harpreet.
The 8 people sit in a row. Kathy and David sit next to each other and Harpreet sits at the left-hand end of the row. How many different arrangements of the 8 people are possible?
[0pt] [BLANK PAGE]
4. A market researcher wants to interview people who watched a particular television programme. Audience research data used by the broadcaster indicates that $12 \%$ of the adult population watched this programme. This figure is used to model the situation.
The researcher asks people in a shopping centre, one at a time, if they watched the programme. You should assume that these people form a random sample of the adult population.
(a) Find the probability that the fifth person the researcher asks is the first to have watched the programme.
(b) Find the probability that the researcher has to ask at least 10 people in order to find one who watched the programme.
(c) Find the probability that the twentieth person the researcher asks is the third to have watched the programme.
(d) Find how many people the researcher would have to ask to ensure that there is a probability of at least 0.95 that at least one of them watched the programme.
[0pt] [BLANK PAGE]
5. The random variable $H$ has the distribution $\mathrm { N } \left( \mu , \sigma ^ { 2 } \right)$. It is given that $\mathrm { P } ( H < 105.0 ) = 0.2420$ and $\mathrm { P } ( H > 110.0 ) = 0.6915$. Find the values of $\mu$ and $\sigma$, giving your answers to a suitable degree of accuracy.
6. For the events $A$ and $B$, $$\mathrm { P } \left( A \cap B ^ { \prime } \right) = 0.32 , \quad \mathrm { P } \left( A ^ { \prime } \cap B \right) = 0.11 \quad \text { and } \quad \mathrm { P } ( A \cup B ) = 0.65$$
Find $\mathrm { P } \left( A \mid B ^ { \prime } \right)$.
Determine whether or not $A$ and $B$ are independent.
7. An online shopping company takes orders through its website. On average $80 \%$ of orders from the website are delivered within 24 hours. The quality controller selects 10 orders at random to check when they are delivered.
Find the probability that
(A) exactly 8 of these orders are delivered within 24 hours,
(B) at least 8 of these orders are delivered within 24 hours. The company changes its delivery method. The quality controller suspects that the changes will mean that fewer than $80 \%$ of orders will be delivered within 24 hours. A random sample of 18 orders is checked and it is found that 12 of them arrive within 24 hours.
Write down suitable hypotheses and carry out a test at the $5 \%$ significance level to determine whether there is any evidence to support the quality controller's suspicion.
A statistician argues that it is possible that the new method could result in either better or worse delivery times. Therefore it would be better to carry out a 2 -tail test at the $5 \%$ significance level. State the alternative hypothesis for this test. Assuming that the sample size is still 18 , find the critical region for this test, showing all of your calculations.
[0pt] [BLANK PAGE]
8. The continuous random variable $X$ has probability density function $\mathrm { f } ( x ) = \begin{cases} k x ^ { n } & 0 \leqslant x \leqslant 1 ,
0 & \text { otherwise, } \end{cases}$ where $k$ is a constant and $n$ is a parameter whose value is positive. It is given that the median of $X$ is 0.8816 correct to 4 decimal places.
Ten independent observations of $X$ are obtained.
Find the expected number of observations that are less than 0.8 .
[0pt] [BLANK PAGE]
[0pt] [BLANK PAGE]

Length \(( \mathrm { cm } )\)	\(2.0 - 3.5\)	\(3.5 - 4.5\)	\(4.5 - 5.5\)	\(5.5 - 7.0\)	\(7.0 - 9.0\)
Frequency	8	25	28	31	12

Time ( \(t\) minutes)	\(0 < t \leqslant 15\)	\(15 < t \leqslant 30\)	\(30 < t \leqslant 60\)	\(60 < t \leqslant 90\)	\(90 < t \leqslant 120\)
Number of meetings	4	7	24	38	7

\(x\)	\(10 - 15\)	\(16 - 18\)	\(19 -\)
Frequency	15	9	16

Time ( \(t\) minutes)	\(40 \leqslant t < 45\)	\(45 \leqslant t < 50\)	\(50 \leqslant t < 60\)	\(60 \leqslant t < 70\)	\(70 \leqslant t < 90\)
Number of CDs	26	18	31	16	9

Weight (grams)	\(1 - 10\)	\(11 - 20\)	\(21 - 25\)	\(26 - 30\)	\(31 - 50\)	\(51 - 70\)
Frequency	\(2 x\)	\(4 x\)	\(3 x\)	\(5 x\)	\(4 x\)	\(x\)

Time (seconds)	\(< 10.0\)	\(< 10.5\)	\(< 11.0\)	\(< 12.0\)	\(< 12.5\)	\(< 13.5\)
Cumulative frequency	0	4	10	40	49	57

\(\mathrm { f } ( x )\)	\(\mathrm { f } ^ { \prime } ( x )\)
\(\tan k x\)	\(k \sec ^ { 2 } k x\)
\(\sec x\)	\(\sec x \tan x\)
\(\cot x\)	\(- \operatorname { cosec } ^ { 2 } x\)
\(\operatorname { cosec } x\)	\(- \operatorname { cosec } x \cot x\)

\(p\)	0.75	0.90	0.95	0.975	0.99	0.995	0.9975	0.999	0.9995
\(z\)	0.674	1.282	1.645	1.960	2.326	2.576	2.807	3.090	3.291

Direction	\(A\)	\(B\)	\(C\)	\(D\)
Frequency	22	48	56	58

Emly
\(B\)
\(A t\)