Edexcel S1 (Statistics 1) 2010 January

Question 1
View details
  1. A jar contains 2 red, 1 blue and 1 green bead. Two beads are drawn at random from the jar without replacement.
    1. In the space below, draw a tree diagram to illustrate all the possible outcomes and associated probabilities. State your probabilities clearly.
    2. Find the probability that a blue bead and a green bead are drawn from the jar.
    3. The 19 employees of a company take an aptitude test. The scores out of 40 are illustrated in the stem and leaf diagram below.
    \(2 \mid 6\) means a score of 26
    07\(( 1 )\)
    188\(( 2 )\)
    24468\(( 4 )\)
    32333459\(( 7 )\)
    400000\(( 5 )\)
    Find
  2. the median score,
  3. the interquartile range. The company director decides that any employees whose scores are so low that they are outliers will undergo retraining. An outlier is an observation whose value is less than the lower quartile minus 1.0 times the interquartile range.
  4. Explain why there is only one employee who will undergo retraining.
  5. On the graph paper on page 5, draw a box plot to illustrate the employees' scores. \includegraphics[max width=\textwidth, alt={}, center]{a0058e3c-046f-4271-aee4-33a74c719e2a-04_611_1596_2006_185}
Question 3
View details
3. The birth weights, in kg, of 1500 babies are summarised in the table below.
Weight (kg)Midpoint, xkgFrequency, f
0.0-1.00.501
1.0-2.01.506
2.0-2.52.2560
2.5-3.0280
3.0-3.53.25820
3.5-4.03.75320
4.0-5.04.5010
5.0-6.03
$$\text { [You may use } \sum \mathrm { f } x = 4841 \text { and } \sum \mathrm { f } x ^ { 2 } = 15889.5 \text { ] }$$
  1. Write down the missing midpoints in the table above.
  2. Calculate an estimate of the mean birth weight.
  3. Calculate an estimate of the standard deviation of the birth weight.
  4. Use interpolation to estimate the median birth weight.
  5. Describe the skewness of the distribution. Give a reason for your answer.
Question 4
View details
4. There are 180 students at a college following a general course in computing. Students on this course can choose to take up to three extra options. \begin{displayquote} 112 take systems support,
70 take developing software,
81 take networking,
35 take developing software and systems support,
28 take networking and developing software,
40 take systems support and networking,
4 take all three extra options.
  1. In the space below, draw a Venn diagram to represent this information. \end{displayquote} A student from the course is chosen at random. Find the probability that this student takes
  2. none of the three extra options,
  3. networking only. Students who want to become technicians take systems support and networking. Given that a randomly chosen student wants to become a technician,
  4. find the probability that this student takes all three extra options.
Question 5
View details
5. The probability function of a discrete random variable \(X\) is given by $$\mathrm { p } ( x ) = k x ^ { 2 } \quad x = 1,2,3$$ where \(k\) is a positive constant.
  1. Show that \(k = \frac { 1 } { 14 }\) Find
  2. \(\mathrm { P } ( X \geqslant 2 )\)
  3. \(\mathrm { E } ( X )\)
  4. \(\operatorname { Var } ( 1 - X )\)
Question 6
View details
  1. The blood pressures, \(p\) mmHg, and the ages, \(t\) years, of 7 hospital patients are shown in the table below.
PatientABCDEFG
\(t\)42744835562660
\(p\)981301208818280135
$$\left[ \sum t = 341 , \sum p = 833 , \sum t ^ { 2 } = 18181 , \sum p ^ { 2 } = 106397 , \sum t p = 42948 \right]$$
  1. Find \(S _ { p p } , S _ { t p }\) and \(S _ { t t }\) for these data.
  2. Calculate the product moment correlation coefficient for these data.
  3. Interpret the correlation coefficient.
  4. On the graph paper on page 17, draw the scatter diagram of blood pressure against age for these 7 patients.
  5. Find the equation of the regression line of \(p\) on \(t\).
  6. Plot your regression line on your scatter diagram.
  7. Use your regression line to estimate the blood pressure of a 40 year old patient. \begin{figure}[h]
    \captionsetup{labelformat=empty} \caption{Question 6 continued} \includegraphics[alt={},max width=\textwidth]{a0058e3c-046f-4271-aee4-33a74c719e2a-12_2071_1729_386_157}
    \end{figure}
Question 7
View details
  1. The heights of a population of women are normally distributed with mean \(\mu \mathrm { cm }\) and standard deviation \(\sigma \mathrm { cm }\). It is known that \(30 \%\) of the women are taller than 172 cm and \(5 \%\) are shorter than 154 cm .
    1. Sketch a diagram to show the distribution of heights represented by this information.
    2. Show that \(\mu = 154 + 1.6449 \sigma\).
    3. Obtain a second equation and hence find the value of \(\mu\) and the value of \(\sigma\).
    A woman is chosen at random from the population.
  2. Find the probability that she is taller than 160 cm .