Edexcel S1 2007 June — Question 3

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Year2007
SessionJune
TopicLinear regression
TypeCalculate y on x from raw data table

3. A student is investigating the relationship between the price ( \(y\) pence) of 100 g of chocolate and the percentage ( \(x \%\) ) of cocoa solids in the chocolate.
The following data is obtained
Chocolate brandABC\(D\)\(E\)\(F\)G\(H\)
\(x\) (\% cocoa)1020303540506070
\(y\) (pence)3555401006090110130
(You may use: \(\sum x = 315 , \sum x ^ { 2 } = 15225 , \sum y = 620 , \sum y ^ { 2 } = 56550 , \sum x y = 28750\) )
  1. On the graph paper on page 9 draw a scatter diagram to represent these data.
  2. Show that \(S _ { x y } = 4337.5\) and find \(S _ { x x }\). The student believes that a linear relationship of the form \(y = a + b x\) could be used to describe these data.
  3. Use linear regression to find the value of \(a\) and the value of \(b\), giving your answers to 1 decimal place.
  4. Draw the regression line on your scatter diagram. The student believes that one brand of chocolate is overpriced.
  5. Use the scatter diagram to
    1. state which brand is overpriced,
    2. suggest a fair price for this brand. Give reasons for both your answers.
      \includegraphics[max width=\textwidth, alt={}]{045e10d2-1766-4399-aa0a-5619dd0cce0f-06_2454_1485_282_228}
      The data on page 8 has been repeated here to help you
      Chocolate brandA\(B\)\(C\)D\(E\)\(F\)G\(H\)
      \(x\) (\% cocoa)1020303540506070
      \(y\) (pence)3555401006090110130
      (You may use: \(\sum x = 315 , \sum x ^ { 2 } = 15225 , \sum y = 620 , \sum y ^ { 2 } = 56550 , \sum x y = 28750\) )