Edexcel AS Paper 2 Specimen — Question 3

Exam BoardEdexcel
ModuleAS Paper 2 (AS Paper 2)
SessionSpecimen
TopicBivariate data
TypeAnalyze large data set correlations

  1. Pete is investigating the relationship between daily rainfall, \(w \mathrm {~mm}\), and daily mean pressure, \(p\) hPa , in Perth during 2015. He used the large data set to take a sample of size 12.
He obtained the following results.
\(p\)100710121013100910191010101010101013101110141022
\(w\)102.063.063.038.438.035.034.232.030.428.028.015
Pete drew the following scatter diagram for the values of \(w\) and \(p\) and calculated the quartiles.
Q 1Q 2Q 3
\(p\)10101011.51013.5
\(w\)29.234.650.7
\includegraphics[max width=\textwidth, alt={}]{b29b0411-8401-420b-9227-befe25c245d8-04_818_1081_989_477}
An outlier is a value which is more than 1.5 times the interquartile range above Q3 or more than 1.5 times the interquartile range below Q1.
  1. Show that the 3 points circled on the scatter diagram above are outliers.
    (2)
  2. Describe the effect of removing the 3 outliers on the correlation between daily rainfall and daily mean pressure in this sample.
    (1) John has also been studying the large data set and believes that the sample Pete has taken is not random.
  3. From your knowledge of the large data set, explain why Pete's sample is unlikely to be a random sample. John finds that the equation of the regression line of \(w\) on \(p\), using all the data in the large data set, is $$w = 1023 - 0.223 p$$
  4. Give an interpretation of the figure - 0.223 in this regression line. John decided to use the regression line to estimate the daily rainfall for a day in December when the daily mean pressure is 1011 hPa .
  5. Using your knowledge of the large data set, comment on the reliability of John's estimate.
    (Total for Question 3 is 6 marks)