6. A local authority is investigating the cost of reconditioning its incinerators. Data from 10 randomly chosen incinerators were collected. The variables monitored were the operating time \(x\) (in thousands of hours) since last reconditioning and the reconditioning cost \(y\) (in \(\pounds 1000\) ). None of the incinerators had been used for more than 3000 hours since last reconditioning.
The data are summarised below,
$$\Sigma x = 25.0 , \Sigma x ^ { 2 } = 65.68 , \Sigma y = 50.0 , \Sigma y ^ { 2 } = 260.48 , \Sigma x y = 130.64$$
- Find \(\mathrm { S } _ { x x } , \mathrm {~S} _ { x y } , \mathrm {~S} _ { y y }\).
- Calculate the product moment correlation coefficient between \(x\) and \(y\).
- Explain why this value might support the fitting of a linear regression model of the form \(y = a + b x\).
- Find the values of \(a\) and \(b\).
- Give an interpretation of \(a\).
- Estimate
- the reconditioning cost for an operating time of 2400 hours,
- the financial effect of an increase of 1500 hours in operating time.
- Suggest why the authority might be cautious about making a prediction of the reconditioning cost of an incinerator which had been operating for 4500 hours since its last reconditioning.
Materials required for examination
Answer Book (AB16)
Graph Paper (ASG2)
Mathematical Formulae (Lilac)
Items included with question papers
Nil
Paper Reference(s)
6683
\section*{Edexcel GCE
Statistics S1
(New Syllabus)
Advanced/Advanced Subsidiary
Tuesday 12 June 2001 - Afternoon
Time: 1 hour 30 minutes}
Candidates may use any calculator EXCEPT those with the facility for symbolic algebra, differentiation and/or integration. Thus candidates may NOT use calculators such as the Texas Instruments TI 89, TI 92, Casio CFX 9970G, Hewlett Packard HP 48G.
In the boxes on the answer book, write the name of the examining body (Edexcel), your centre number, candidate number, the unit title (Statistics S1), the paper reference (6683), your surname, other name and signature.
Values from the statistical tables should be quoted in full. When a calculator is used, the answer should be given to an appropriate degree of accuracy.
A booklet 'Mathematical Formulae and Statistical Tables' is provided.
Full marks may be obtained for answers to ALL questions.
This paper has seven questions. Pages 6, 7 and 8 are blank.
You must ensure that your answers to parts of questions are clearly labelled.
You must show sufficient working to make your methods clear to the Examiner. Answers without working may gain no credit.
- Each of the 25 students on a computer course recorded the number of minutes \(x\), to the nearest minute, spent surfing the internet during a given day. The results are summarised below.
$$\Sigma x = 1075 , \Sigma x ^ { 2 } = 46625$$ - Find \(\mu\) and \(\sigma\) for these data.
Two other students surfed the internet on the same day for 35 and 51 minutes respectively.
- Without further calculation, explain the effect on the mean of including these two students.
2. On a particular day in summer 1993 at 0800 hours the height above sea level, \(x\) metres, and the temperature, \(y ^ { \circ } \mathrm { C }\), were recorded in 10 Mediterranean towns. The following summary statistics were calculated from the results.
$$\Sigma x = 7300 , \Sigma x ^ { 2 } = 6599600 , S _ { x y } = - 13060 , S _ { y y } = 140.9 .$$ - Find \(S _ { x x }\).
- Calculate, to 3 significant figures, the product moment correlation coefficient between \(x\) and \(y\).
- Give an interpretation of your coefficient.
3. The continuous random variable \(Y\) is normally distributed with mean 100 and variance 256 . - Find \(\mathrm { P } ( Y < 80 )\).
- Find \(k\) such that \(\mathrm { P } ( 100 - k \leq Y \leq 100 + k ) = 0.516\).
4. The discrete random variable \(X\) has the probability function shown in the table below.
| \(x\) | - 2 | - 1 | 0 | 1 | 2 | 3 |
| \(\mathrm { P } ( X = x )\) | 0.1 | \(\alpha\) | 0.3 | 0.2 | 0.1 | 0.1 |
Find - \(\alpha\),
- \(\mathrm { P } ( - 1 < X \leq 2 )\),
- \(\mathrm { F } ( - 0.4 )\),
- \(\mathrm { E } ( 3 X + 4 )\),
- \(\operatorname { Var } ( 2 X + 3 )\).
5. A market researcher asked 100 adults which of the three newspapers \(A , B , C\) they read. The results showed that \(30 \operatorname { read } A , 26\) read \(B , 21\) read \(C , 5\) read both \(A\) and \(B\), 7 read both \(B\) and \(C\), 6 read both \(C\) and \(A\) and 2 read all three. - Draw a Venn diagram to represent these data.
One of the adults is then selected at random.
Find the probability that she reads - at least one of the newspapers,
- only \(A\),
- only one of the newspapers,
- \(A\) given that she reads only one newspaper.
6. Three swimmers Alan, Diane and Gopal record the number of lengths of the swimming pool they swim during each practice session over several weeks. The stem and leaf diagram below shows the results for Alan.
| Lengths | \(2 \mid 0\) means 20 | |
| 2 | 0122 | \(( 4 )\) |
| 2 | 5567789 | \(( 7 )\) |
| 3 | 01224 | \(( 5 )\) |
| 3 | 56679 | \(( 5 )\) |
| 4 | 0133333444 | \(( 10 )\) |
| 4 | 556667788999 | \(( 12 )\) |
| 5 | 000 | \(( 3 )\) |
- Find the three quartiles for Alan's results.
The table below summarises the results for Diane and Gopal.
| Diane | Gopal |
| Smallest value | 35 | 25 |
| Lower quartile | 37 | 34 |
| Median | 42 | 42 |
| Upper quartile | 53 | 50 |
| Largest value | 65 | 57 |
- Using the same scale and on the same sheet of graph paper draw box plots to represent the data for Alan, Diane and Gopal.
- Compare and contrast the three box plots.