Calculate summary statistics (Sxx, Syy, Sxy)

A question is this type if and only if it asks to calculate the summary statistics Sxx, Syy, or Sxy from raw data or other given statistics.

8 questions

Edexcel S1 2011 January Q1
  1. A random sample of 50 salmon was caught by a scientist. He recorded the length \(l \mathrm {~cm}\) and weight \(w \mathrm {~kg}\) of each salmon.
The following summary statistics were calculated from these data.
\(\sum l = 4027 \quad \sum l ^ { 2 } = 327754.5 \quad \sum w = 357.1 \quad \sum l w = 29330.5 \quad S _ { w w } = 289.6\)
  1. Find \(S _ { l l }\) and \(S _ { l w }\)
  2. Calculate, to 3 significant figures, the product moment correlation coefficient between \(l\) and \(w\).
  3. Give an interpretation of your coefficient.
Edexcel S1 2012 January Q5
  1. The age, \(t\) years, and weight, \(w\) grams, of each of 10 coins were recorded. These data are summarised below.
$$\sum t ^ { 2 } = 2688 \quad \sum t w = 1760.62 \quad \sum t = 158 \quad \sum w = 111.75 \quad S _ { w w } = 0.16$$
  1. Find \(S _ { t t }\) and \(S _ { t w }\) for these data.
  2. Calculate, to 3 significant figures, the product moment correlation coefficient between \(t\) and \(w\).
  3. Find the equation of the regression line of \(w\) on \(t\) in the form \(w = a + b t\)
  4. State, with a reason, which variable is the explanatory variable.
  5. Using this model, estimate
    1. the weight of a coin which is 5 years old,
    2. the effect of an increase of 4 years in age on the weight of a coin. It was discovered that a coin in the original sample, which was 5 years old and weighed 20 grams, was a fake.
  6. State, without any further calculations, whether the exclusion of this coin would increase or decrease the value of the product moment correlation coefficient. Give a reason for your answer.
Edexcel S1 2013 January Q1
  1. A teacher asked a random sample of 10 students to record the number of hours of television, \(t\), they watched in the week before their mock exam. She then calculated their grade, \(g\), in their mock exam. The results are summarised as follows.
$$\sum t = 258 \quad \sum t ^ { 2 } = 8702 \quad \sum g = 63.6 \quad \mathrm {~S} _ { g g } = 7.864 \quad \sum g t = 1550.2$$
  1. Find \(\mathrm { S } _ { t t }\) and \(\mathrm { S } _ { g t }\)
  2. Calculate, to 3 significant figures, the product moment correlation coefficient between \(t\) and \(g\). The teacher also recorded the number of hours of revision, \(v\), these 10 students completed during the week before their mock exam. The correlation coefficient between \(t\) and \(v\) was -0.753
  3. Describe, giving a reason, the nature of the correlation you would expect to find between \(v\) and \(g\).
Edexcel S1 2001 June Q2
2. On a particular day in summer 1993 at 0800 hours the height above sea level, \(x\) metres, and the temperature, \(y ^ { \circ } \mathrm { C }\), were recorded in 10 Mediterranean towns. The following summary statistics were calculated from the results. $$\Sigma x = 7300 , \Sigma x ^ { 2 } = 6599600 , S _ { x y } = - 13060 , S _ { y y } = 140.9 .$$
  1. Find \(S _ { x x }\).
  2. Calculate, to 3 significant figures, the product moment correlation coefficient between \(x\) and \(y\).
  3. Give an interpretation of your coefficient.
Edexcel S1 2009 June Q1
  1. The volume of a sample of gas is kept constant. The gas is heated and the pressure, \(p\), is measured at 10 different temperatures, \(t\). The results are summarised below.
    \(\sum p = 445 \quad \sum p ^ { 2 } = 38125 \quad \sum t = 240 \quad \sum t ^ { 2 } = 27520 \quad \sum p t = 26830\)
    1. Find \(\mathrm { S } _ { p p }\) and \(\mathrm { S } _ { p t }\).
    Given that \(\mathrm { S } _ { t t } = 21760\),
  2. calculate the product moment correlation coefficient.
  3. Give an interpretation of your answer to part (b).
Edexcel S1 2011 June Q1
  1. On a particular day the height above sea level, \(x\) metres, and the mid-day temperature, \(y ^ { \circ } \mathrm { C }\), were recorded in 8 north European towns. These data are summarised below
$$\mathrm { S } _ { x x } = 3535237.5 \quad \sum y = 181 \quad \sum y ^ { 2 } = 4305 \quad \mathrm {~S} _ { x y } = - 23726.25$$
  1. Find \(\mathrm { S } _ { y y }\)
  2. Calculate, to 3 significant figures, the product moment correlation coefficient for these data.
  3. Give an interpretation of your coefficient. A student thought that the calculations would be simpler if the height above sea level, \(h\), was measured in kilometres and used the variable \(h = \frac { x } { 1000 }\) instead of \(x\).
  4. Write down the value of \(\mathrm { S } _ { h h }\)
  5. Write down the value of the correlation coefficient between \(h\) and \(y\).
Edexcel S1 2013 June Q5
5. A researcher believes that parents with a short family name tended to give their children a long first name. A random sample of 10 children was selected and the number of letters in their family name, \(x\), and the number of letters in their first name, \(y\), were recorded. The data are summarised as: $$\sum x = 60 , \quad \sum y = 61 , \quad \sum y ^ { 2 } = 393 , \quad \sum x y = 382 , \quad \mathrm {~S} _ { x x } = 28$$
  1. Find \(\mathrm { S } _ { y y }\) and \(\mathrm { S } _ { x y }\)
  2. Calculate the product moment correlation coefficient, \(r\), between \(x\) and \(y\).
  3. State, giving a reason, whether or not these data support the researcher's belief. The researcher decides to add a child with family name "Turner" to the sample.
  4. Using the definition \(\mathrm { S } _ { x x } = \sum ( x - \bar { x } ) ^ { 2 }\), state the new value of \(\mathrm { S } _ { x x }\) giving a reason for your answer. Given that the addition of the child with family name "Turner" to the sample leads to an increase in \(\mathrm { S } _ { y y }\)
  5. use the definition \(\mathrm { S } _ { x y } = \sum ( x - \bar { x } ) ( y - \bar { y } )\) to determine whether or not the value of \(r\) will increase, decrease or stay the same. Give a reason for your answer.
Edexcel S1 2004 November Q6
6. Students in Mr Brawn's exercise class have to do press-ups and sit-ups. The number of press-ups \(x\) and the number of sit-ups \(y\) done by a random sample of 8 students are summarised below. $$\begin{array} { l l } \Sigma x = 272 , & \Sigma x ^ { 2 } = 10164 , \quad \Sigma x y = 11222 ,
\Sigma y = 320 , & \Sigma y ^ { 2 } = 13464 . \end{array}$$
  1. Evaluate \(S _ { x x } , S _ { y y }\) and \(S _ { x y }\).
  2. Calculate, to 3 decimal places, the product moment correlation coefficient between \(x\) and \(y\).
  3. Give an interpretation of your coefficient.
  4. Calculate the mean and the standard deviation of the number of press-ups done by these students. Mr Brawn assumes that the number of press-ups that can be done by any student can be modelled by a normal distribution with mean \(\mu\) and standard deviation \(\sigma\). Assuming that \(\mu\) and \(\sigma\) take the same values as those calculated in part (d),
  5. find the value of \(a\) such that \(\mathrm { P } ( \mu - a < X < \mu + a ) = 0.95\).
  6. Comment on Mr Brawn's assumption of normality.