7. In a school there are 148 students in Years 12 and 13 studying Science, Humanities or Arts subjects. Of these students, 89 wear glasses and the others do not. There are 30 Science students of whom 18 wear glasses. The corresponding figures for the Humanities students are 68 and 44 respectively.
A student is chosen at random.
Find the probability that this student
- is studying Arts subjects,
- does not wear glasses, given that the student is studying Arts subjects.
Amongst the Science students, \(80 \%\) are right-handed. Corresponding percentages for Humanities and Arts students are 75\% and 70\% respectively.
A student is again chosen at random.
- Find the probability that this student is right-handed.
- Given that this student is right-handed, find the probability that the student is studying Science subjects.
Turn over
- (a) Describe the main features and uses of a box plot.
Children from schools \(A\) and \(B\) took part in a fun run for charity. The times, to the nearest minute, taken by the children from school \(A\) are summarised in Figure 1.
\begin{figure}[h]
\captionsetup{labelformat=empty}
\caption{Figure 1}
\includegraphics[alt={},max width=\textwidth]{3d4f7bfb-b235-418a-9411-a4d0b3188254-015_398_1045_946_461}
\end{figure} - Write down the time by which \(75 \%\) of the children in school \(A\) had completed the run.
- State the name given to this value.
- Explain what you understand by the two crosses ( X ) on Figure 1.
For school \(B\) the least time taken by any of the children was 25 minutes and the longest time was 55 minutes. The three quartiles were 30,37 and 50 respectively. - Draw a box plot to represent the data from school \(B\).
\includegraphics[max width=\textwidth, alt={}, center]{3d4f7bfb-b235-418a-9411-a4d0b3188254-016_798_1196_580_372} - Compare and contrast these two box plots.
2. Sunita and Shelley talk to one another once a week on the telephone. Over many weeks they recorded, to the nearest minute, the number of minutes spent in conversation on each occasion. The following table summarises their results.
Turn over
- As part of a statistics project, Gill collected data relating to the length of time, to the nearest minute, spent by shoppers in a supermarket and the amount of money they spent. Her data for a random sample of 10 shoppers are summarised in the table below, where \(t\) represents time and \(\pounds m\) the amount spent over \(\pounds 20\).
Turn over
- A young family were looking for a new 3 bedroom semi-detached house. A local survey recorded the price \(x\), in \(\pounds 1000\), and the distance \(y\), in miles, from the station of such houses. The following summary statistics were provided
$$S _ { x x } = 113573 , \quad S _ { y y } = 8.657 , \quad S _ { x y } = - 808.917$$ - Use these values to calculate the product moment correlation coefficient.
- Give an interpretation of your answer to part (a).
Another family asked for the distances to be measured in km rather than miles.
- State the value of the product moment correlation coefficient in this case.
2. The box plot in Figure 1 shows a summary of the weights of the luggage, in kg, for each musician in an orchestra on an overseas tour.
\begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{3d4f7bfb-b235-418a-9411-a4d0b3188254-045_346_1452_324_228}
\captionsetup{labelformat=empty}
\caption{Figure 1}
\end{figure}
The airline's recommended weight limit for each musician's luggage was 45 kg . Given that none of the musicians' luggage weighed exactly 45 kg , - state the proportion of the musicians whose luggage was below the recommended weight limit.
A quarter of the musicians had to pay a charge for taking heavy luggage.
- State the smallest weight for which the charge was made.
- Explain what you understand by the + on the box plot in Figure 1, and suggest an instrument that the owner of this luggage might play.
- Describe the skewness of this distribution. Give a reason for your answer.
One musician of the orchestra suggests that the weights of luggage, in kg, can be modelled by a normal distribution with quartiles as given in Figure 1.
- Find the standard deviation of this normal distribution.
3. A student is investigating the relationship between the price ( \(y\) pence) of 100 g of chocolate and the percentage ( \(x \%\) ) of cocoa solids in the chocolate.
The following data is obtained
Turn over
advancing learning, changing lives
- A personnel manager wants to find out if a test carried out during an employee's interview and a skills assessment at the end of basic training is a guide to performance after working for the company for one year.
The table below shows the results of the interview test of 10 employees and their performance after one year.
Turn over
advancing learning, changing lives
- A disease is known to be present in \(2 \%\) of a population. A test is developed to help determine whether or not someone has the disease.
Given that a person has the disease, the test is positive with probability 0.95
Given that a person does not have the disease, the test is positive with probability 0.03 - Draw a tree diagram to represent this information.
A person is selected at random from the population and tested for this disease.
- Find the probability that the test is positive.
A doctor randomly selects a person from the population and tests him for the disease. Given that the test is positive,
- find the probability that he does not have the disease.
- Comment on the usefulness of this test.
2. The age in years of the residents of two hotels are shown in the back to back stem and leaf diagram below.
Abbey Hotel \(8 | 5 | 0\) means 58 years in Abbey hotel and 50 years in Balmoral hotel Balmoral Hotel
Turn over
- A teacher is monitoring the progress of students using a computer based revision course. The improvement in performance, \(y\) marks, is recorded for each student along with the time, \(x\) hours, that the student spent using the revision course. The results for a random sample of 10 students are recorded below.
Turn over
advancing learning, changing lives
- The volume of a sample of gas is kept constant. The gas is heated and the pressure, \(p\), is measured at 10 different temperatures, \(t\). The results are summarised below.
\(\sum p = 445 \quad \sum p ^ { 2 } = 38125 \quad \sum t = 240 \quad \sum t ^ { 2 } = 27520 \quad \sum p t = 26830\) - Find \(\mathrm { S } _ { p p }\) and \(\mathrm { S } _ { p t }\).
Given that \(\mathrm { S } _ { t t } = 21760\), - calculate the product moment correlation coefficient.
- Give an interpretation of your answer to part (b).
2. On a randomly chosen day the probability that Bill travels to school by car, by bicycle or on foot is \(\frac { 1 } { 2 } , \frac { 1 } { 6 }\) and \(\frac { 1 } { 3 }\) respectively. The probability of being late when using these methods of travel is \(\frac { 1 } { 5 } , \frac { 2 } { 5 }\) and \(\frac { 1 } { 10 }\) respectively. - Draw a tree diagram to represent this information.
- Find the probability that on a randomly chosen day
- Bill travels by foot and is late,
- Bill is not late.
- Given that Bill is late, find the probability that he did not travel on foot.
3. The variable \(x\) was measured to the nearest whole number. Forty observations are given in the table below.
| \(x\) | \(10 - 15\) | \(16 - 18\) | \(19 -\) |
| Frequency | 15 | 9 | 16 |
A histogram was drawn and the bar representing the \(10 - 15\) class has a width of 2 cm and a height of 5 cm . For the \(16 - 18\) class find - the width,
- the height
of the bar representing this class.
4. A researcher measured the foot lengths of a random sample of 120 ten-year-old children. The lengths are summarised in the table below.
| Foot length, \(l\), (cm) | Number of children |
| \(10 \leqslant l < 12\) | 5 |
| \(12 \leqslant l < 17\) | 53 |
| \(17 \leqslant l < 19\) | 29 |
| \(19 \leqslant l < 21\) | 15 |
| \(21 \leqslant l < 23\) | 11 |
| \(23 \leqslant l < 25\) | 7 |
- Use interpolation to estimate the median of this distribution.
- Calculate estimates for the mean and the standard deviation of these data.
One measure of skewness is given by
$$\text { Coefficient of skewness } = \frac { 3 ( \text { mean } - \text { median } ) } { \text { standard deviation } }$$
- Evaluate this coefficient and comment on the skewness of these data.
Greg suggests that a normal distribution is a suitable model for the foot lengths of ten-year-old children.
- Using the value found in part (c), comment on Greg's suggestion, giving a reason for your answer.
5. The weight, \(w\) grams, and the length, \(l \mathrm {~mm}\), of 10 randomly selected newborn turtles are given in the table below.
| \(l\) | 49.0 | 52.0 | 53.0 | 54.5 | 54.1 | 53.4 | 50.0 | 51.6 | 49.5 | 51.2 |
| \(w\) | 29 | 32 | 34 | 39 | 38 | 35 | 30 | 31 | 29 | 30 |
$$\text { (You may use } \mathrm { S } _ { l l } = 33.381 \quad \mathrm {~S} _ { w l } = 59.99 \quad \mathrm {~S} _ { w w } = 120.1 \text { ) }$$ - Find the equation of the regression line of \(w\) on \(l\) in the form \(w = a + b l\).
- Use your regression line to estimate the weight of a newborn turtle of length 60 mm .
- Comment on the reliability of your estimate giving a reason for your answer.
6. The discrete random variable \(X\) has probability function
$$\mathrm { P } ( X = x ) = \left\{ \begin{array} { c l }
a ( 3 - x ) & x = 0,1,2
b & x = 3
\end{array} \right.$$ - Find \(\mathrm { P } ( X = 2 )\) and complete the table below.
| \(x\) | 0 | 1 | 2 | 3 |
| \(\mathrm { P } ( X = x )\) | \(3 a\) | \(2 a\) | | \(b\) |
Given that \(\mathrm { E } ( X ) = 1.6\) - Find the value of \(a\) and the value of \(b\).
Find
- \(\mathrm { P } ( 0.5 < X < 3 )\),
- \(\mathrm { E } ( 3 X - 2 )\).
- Show that the \(\operatorname { Var } ( X ) = 1.64\)
- Calculate \(\operatorname { Var } ( 3 X - 2 )\).
7. (a) Given that \(\mathrm { P } ( A ) = a\) and \(\mathrm { P } ( B ) = b\) express \(\mathrm { P } ( A \cup B )\) in terms of \(a\) and \(b\) when
- \(A\) and \(B\) are mutually exclusive,
- \(A\) and \(B\) are independent.
Two events \(R\) and \(Q\) are such that
\(\mathrm { P } \left( R \cap Q ^ { \prime } \right) = 0.15 , \quad \mathrm { P } ( Q ) = 0.35\) and \(\mathrm { P } ( R \mid Q ) = 0.1\)
Find the value of
- \(\mathrm { P } ( R \cup Q )\),
- \(\mathrm { P } ( R \cap Q )\),
- \(\mathrm { P } ( R )\).