OCR MEI S1 (Statistics 1) 2012 January

Question 1
View details
1 The mean daily maximum temperatures at a research station over a 12-month period, measured to the nearest degree Celsius, are given below.
JanFebMarAprMayJunJulAugSepOctNovDec
8152529313134363426158
  1. Construct a sorted stem and leaf diagram to represent these data, taking stem values of \(0,10 , \ldots\).
  2. Write down the median of these data.
  3. The mean of these data is 24.3 . Would the mean or the median be a better measure of central tendency of the data? Briefly explain your answer.
Question 2
View details
2 The hourly wages, \(\pounds x\), of a random sample of 60 employees working for a company are summarised as follows. $$n = 60 \quad \sum x = 759.00 \quad \sum x ^ { 2 } = 11736.59$$
  1. Calculate the mean and standard deviation of \(x\).
  2. The workers are offered a wage increase of \(2 \%\). Use your answers to part (i) to deduce the new mean and standard deviation of the hourly wages after this increase.
  3. As an alternative the workers are offered a wage increase of 25 p per hour. Write down the new mean and standard deviation of the hourly wages after this 25p increase.
Question 3
View details
3 Jimmy and Alan are playing a tennis match against each other. The winner of the match is the first player to win three sets. Jimmy won the first set and Alan won the second set. For each of the remaining sets, the probability that Jimmy wins a set is
  • 0.7 if he won the previous set,
  • 0.4 if Alan won the previous set.
It is not possible to draw a set.
  1. Draw a probability tree diagram to illustrate the possible outcomes for each of the remaining sets.
  2. Find the probability that Alan wins the match.
  3. Find the probability that the match ends after exactly four sets have been played.
Question 4
View details
4 In a food survey, a large number of people are asked whether they like tomato soup, mushroom soup, both or neither. One of these people is selected at random.
  • \(T\) is the event that this person likes tomato soup.
  • \(M\) is the event that this person likes mushroom soup.
You are given that \(\mathrm { P } ( T ) = 0.55 , \mathrm { P } ( M ) = 0.33\) and \(\mathrm { P } ( T \mid M ) = 0.80\).
  1. Use this information to show that the events \(T\) and \(M\) are not independent.
  2. Find \(\mathrm { P } ( T \cap M )\).
  3. Draw a Venn diagram showing the events \(T\) and \(M\), and fill in the probability corresponding to each of the four regions of your diagram.
Question 5
View details
5 A couple plan to have at least one child of each sex, after which they will have no more children. However, if they have four children of one sex, they will have no more children. You should assume that each child is equally likely to be of either sex, and that the sexes of the children are independent. The random variable \(X\) represents the total number of girls the couple have.
  1. Show that \(\mathrm { P } ( X = 1 ) = \frac { 11 } { 16 }\). The table shows the probability distribution of \(X\).
    \(r\)01234
    \(\mathrm { P } ( X = r )\)\(\frac { 1 } { 16 }\)\(\frac { 11 } { 16 }\)\(\frac { 1 } { 8 }\)\(\frac { 1 } { 16 }\)\(\frac { 1 } { 16 }\)
  2. Find \(\mathrm { E } ( X )\) and \(\operatorname { Var } ( X )\).
Question 6
View details
6 It is known that \(25 \%\) of students in a particular city are smokers. A random sample of 20 of the students is selected.
  1. (A) Find the probability that there are exactly 4 smokers in the sample.
    (B) Find the probability that there are at least 3 but no more than 6 smokers in the sample.
    (C) Write down the expected number of smokers in the sample. A new health education programme is introduced. This programme aims to reduce the percentage of students in this city who are smokers. After the programme has been running for a year, it is decided to carry out a hypothesis test to assess the effectiveness of the programme. A random sample of 20 students is selected.
  2. (A) Write down suitable null and alternative hypotheses for the test.
    (B) Explain why the alternative hypothesis has the form that it does.
  3. Find the critical region for the test at the \(5 \%\) level, showing all of your calculations.
  4. In fact there are 3 smokers in the sample. Complete the test, stating your conclusion clearly.
Question 7
View details
7 The birth weights of 200 lambs from crossbred sheep are illustrated by the cumulative frequency diagram below.
\includegraphics[max width=\textwidth, alt={}, center]{4b259fe3-73ef-419f-85ad-1a3b1e6ea56e-4_917_1146_367_447}
  1. Estimate the percentage of lambs with birth weight over 6 kg .
  2. Estimate the median and interquartile range of the data.
  3. Use your answers to part (ii) to show that there are very few, if any, outliers. Comment briefly on whether any outliers should be disregarded in analysing these data. The box and whisker plot shows the birth weights of 100 lambs from Welsh Mountain sheep.
    \includegraphics[max width=\textwidth, alt={}, center]{4b259fe3-73ef-419f-85ad-1a3b1e6ea56e-4_328_1616_1749_260}
  4. Use appropriate measures to compare briefly the central tendencies and variations of the weights of the two types of lamb.
  5. The weight of the largest Welsh Mountain lamb was originally recorded as 6.5 kg , but then corrected. If this error had not been corrected, how would this have affected your answers to part (iv)? Briefly explain your answer.
  6. One lamb of each type is selected at random. Estimate the probability that the birth weight of both lambs is at least 3.9 kg .