Calculate from raw data

A question is this sub-type if and only if it provides raw data values and asks to calculate Sxx, Syy, or Sxy directly from those values.

3 questions · Moderate -0.6

5.08a Pearson correlation: calculate pmcc5.09c Calculate regression line5.09d Linear coding: effect on regression
Sort by: Default | Easiest first | Hardest first
Edexcel S1 2016 June Q1
12 marks Moderate -0.8
  1. The percentage oil content, \(p\), and the weight, \(w\) milligrams, of each of 10 randomly selected sunflower seeds were recorded. These data are summarised below.
$$\sum w ^ { 2 } = 41252 \quad \sum w p = 27557.8 \quad \sum w = 640 \quad \sum p = 431 \quad \mathrm {~S} _ { p p } = 2.72$$
  1. Find the value of \(\mathrm { S } _ { w w }\) and the value of \(\mathrm { S } _ { w p }\)
  2. Calculate the product moment correlation coefficient between \(p\) and \(w\)
  3. Give an interpretation of your product moment correlation coefficient. The equation of the regression line of \(p\) on \(w\) is given in the form \(p = a + b w\)
  4. Find the equation of the regression line of \(p\) on \(w\)
  5. Hence estimate the percentage oil content of a sunflower seed which weighs 60 milligrams.
Edexcel S1 2024 June Q4
13 marks Moderate -0.3
  1. A biologist is studying bears. The biologist records the length, \(d \mathrm {~cm}\), and the girth, \(g \mathrm {~cm}\), of 8 bears. The biologist summarises the data as follows
$$\begin{gathered} \sum d = 1456.8 \quad \sum g = 713.2 \quad \sum d g = 141978.84 \quad \sum g ^ { 2 } = 72675.98 \\ S _ { d d } = 16769.78 \end{gathered}$$
  1. Calculate the exact value of \(S _ { d g }\) and the exact value of \(S _ { g g }\)
  2. Calculate the value of the product moment correlation coefficient between \(d\) and \(g\)
  3. Show that the equation of the regression line of \(g\) on \(d\) can be written as $$g = - 42.3 + 0.722 d$$ where the values of the intercept and gradient are given to 3 significant figures.
  4. Give an interpretation, in context, of the gradient of the regression line. Using the equation of the regression line given in part (c)
    1. estimate the girth of a bear with a length of 2.5 metres,
    2. explain why an estimate for the girth of a bear with a length of 0.5 metres is not reliable. Using the regression line from part (c), the biologist estimates that for each \(x \mathrm {~cm}\) increase in the length of a bear there will be a 17.3 cm increase in the girth.
  5. Find the value of \(x\)
Edexcel S1 2018 Specimen Q1
12 marks Moderate -0.8
  1. The percentage oil content, \(p\), and the weight, \(w\) milligrams, of each of 10 randomly selected sunflower seeds were recorded. These data are summarised below.
$$\sum w ^ { 2 } = 41252 \quad \sum w p = 27557.8 \quad \sum w = 640 \quad \sum p = 431 \quad \mathrm {~S} _ { p p } = 2.72$$
  1. Find the value of \(\mathrm { S } _ { w w }\) and the value of \(\mathrm { S } _ { w p }\)
  2. Calculate the product moment correlation coefficient between \(p\) and \(w\)
  3. Give an interpretation of your product moment correlation coefficient. The equation of the regression line of \(p\) on \(w\) is given in the form \(p = a + b w\)
  4. Find the equation of the regression line of \(p\) on \(w\)
  5. Hence estimate the percentage oil content of a sunflower seed which weighs 60 milligrams. \(\_\_\_\_\) VAYV SIHI NI JIIIM ION OC
    VJYV SIHI NI JIIIM ION OC
    VJYV SIHI NI JLIYM ION OC