Calculate summary statistics (Sxx, Syy, Sxy)

A question is this type if and only if it asks to calculate the summary statistics Sxx, Syy, or Sxy from raw data or other given statistics.

8 questions · Moderate -1.0

Sort by: Default | Easiest first | Hardest first
Edexcel S1 2011 January Q1
6 marks Easy -1.2
  1. A random sample of 50 salmon was caught by a scientist. He recorded the length \(l \mathrm {~cm}\) and weight \(w \mathrm {~kg}\) of each salmon.
The following summary statistics were calculated from these data. \(\sum l = 4027 \quad \sum l ^ { 2 } = 327754.5 \quad \sum w = 357.1 \quad \sum l w = 29330.5 \quad S _ { w w } = 289.6\)
  1. Find \(S _ { l l }\) and \(S _ { l w }\)
  2. Calculate, to 3 significant figures, the product moment correlation coefficient between \(l\) and \(w\).
  3. Give an interpretation of your coefficient.
Edexcel S1 2012 January Q5
15 marks Moderate -0.8
  1. The age, \(t\) years, and weight, \(w\) grams, of each of 10 coins were recorded. These data are summarised below.
$$\sum t ^ { 2 } = 2688 \quad \sum t w = 1760.62 \quad \sum t = 158 \quad \sum w = 111.75 \quad S _ { w w } = 0.16$$
  1. Find \(S _ { t t }\) and \(S _ { t w }\) for these data.
  2. Calculate, to 3 significant figures, the product moment correlation coefficient between \(t\) and \(w\).
  3. Find the equation of the regression line of \(w\) on \(t\) in the form \(w = a + b t\)
  4. State, with a reason, which variable is the explanatory variable.
  5. Using this model, estimate
    1. the weight of a coin which is 5 years old,
    2. the effect of an increase of 4 years in age on the weight of a coin. It was discovered that a coin in the original sample, which was 5 years old and weighed 20 grams, was a fake.
  6. State, without any further calculations, whether the exclusion of this coin would increase or decrease the value of the product moment correlation coefficient. Give a reason for your answer.
Edexcel S1 2013 January Q1
7 marks Easy -1.2
  1. A teacher asked a random sample of 10 students to record the number of hours of television, \(t\), they watched in the week before their mock exam. She then calculated their grade, \(g\), in their mock exam. The results are summarised as follows.
$$\sum t = 258 \quad \sum t ^ { 2 } = 8702 \quad \sum g = 63.6 \quad \mathrm {~S} _ { g g } = 7.864 \quad \sum g t = 1550.2$$
  1. Find \(\mathrm { S } _ { t t }\) and \(\mathrm { S } _ { g t }\)
  2. Calculate, to 3 significant figures, the product moment correlation coefficient between \(t\) and \(g\). The teacher also recorded the number of hours of revision, \(v\), these 10 students completed during the week before their mock exam. The correlation coefficient between \(t\) and \(v\) was -0.753
  3. Describe, giving a reason, the nature of the correlation you would expect to find between \(v\) and \(g\).
Edexcel S1 2001 June Q2
5 marks Easy -1.2
2. On a particular day in summer 1993 at 0800 hours the height above sea level, \(x\) metres, and the temperature, \(y ^ { \circ } \mathrm { C }\), were recorded in 10 Mediterranean towns. The following summary statistics were calculated from the results. $$\Sigma x = 7300 , \Sigma x ^ { 2 } = 6599600 , S _ { x y } = - 13060 , S _ { y y } = 140.9 .$$
  1. Find \(S _ { x x }\).
  2. Calculate, to 3 significant figures, the product moment correlation coefficient between \(x\) and \(y\).
  3. Give an interpretation of your coefficient.
Edexcel S1 2009 June Q1
6 marks Easy -1.2
  1. The volume of a sample of gas is kept constant. The gas is heated and the pressure, \(p\), is measured at 10 different temperatures, \(t\). The results are summarised below. \(\sum p = 445 \quad \sum p ^ { 2 } = 38125 \quad \sum t = 240 \quad \sum t ^ { 2 } = 27520 \quad \sum p t = 26830\)
    1. Find \(\mathrm { S } _ { p p }\) and \(\mathrm { S } _ { p t }\).
    Given that \(\mathrm { S } _ { t t } = 21760\),
  2. calculate the product moment correlation coefficient.
  3. Give an interpretation of your answer to part (b).
Edexcel S1 2013 June Q5
11 marks Moderate -0.3
5. A researcher believes that parents with a short family name tended to give their children a long first name. A random sample of 10 children was selected and the number of letters in their family name, \(x\), and the number of letters in their first name, \(y\), were recorded. The data are summarised as: $$\sum x = 60 , \quad \sum y = 61 , \quad \sum y ^ { 2 } = 393 , \quad \sum x y = 382 , \quad \mathrm {~S} _ { x x } = 28$$
  1. Find \(\mathrm { S } _ { y y }\) and \(\mathrm { S } _ { x y }\)
  2. Calculate the product moment correlation coefficient, \(r\), between \(x\) and \(y\).
  3. State, giving a reason, whether or not these data support the researcher's belief. The researcher decides to add a child with family name "Turner" to the sample.
  4. Using the definition \(\mathrm { S } _ { x x } = \sum ( x - \bar { x } ) ^ { 2 }\), state the new value of \(\mathrm { S } _ { x x }\) giving a reason for your answer. Given that the addition of the child with family name "Turner" to the sample leads to an increase in \(\mathrm { S } _ { y y }\)
  5. use the definition \(\mathrm { S } _ { x y } = \sum ( x - \bar { x } ) ( y - \bar { y } )\) to determine whether or not the value of \(r\) will increase, decrease or stay the same. Give a reason for your answer.
Edexcel S1 2004 November Q6
18 marks Easy -1.2
6. Students in Mr Brawn's exercise class have to do press-ups and sit-ups. The number of press-ups \(x\) and the number of sit-ups \(y\) done by a random sample of 8 students are summarised below. $$\begin{array} { l l } \Sigma x = 272 , & \Sigma x ^ { 2 } = 10164 , \quad \Sigma x y = 11222 , \\ \Sigma y = 320 , & \Sigma y ^ { 2 } = 13464 . \end{array}$$
  1. Evaluate \(S _ { x x } , S _ { y y }\) and \(S _ { x y }\).
  2. Calculate, to 3 decimal places, the product moment correlation coefficient between \(x\) and \(y\).
  3. Give an interpretation of your coefficient.
  4. Calculate the mean and the standard deviation of the number of press-ups done by these students. Mr Brawn assumes that the number of press-ups that can be done by any student can be modelled by a normal distribution with mean \(\mu\) and standard deviation \(\sigma\). Assuming that \(\mu\) and \(\sigma\) take the same values as those calculated in part (d),
  5. find the value of \(a\) such that \(\mathrm { P } ( \mu - a < X < \mu + a ) = 0.95\).
  6. Comment on Mr Brawn's assumption of normality.
Edexcel S1 2010 January Q6
18 marks Moderate -0.8
The blood pressures, \(p\) mmHg, and the ages, \(t\) years, of 7 hospital patients are shown in the table below.
PatientABCDEFG
\(t\)42744835562660
\(p\)981301208818280135
[\(\sum t = 341\), \(\sum p = 833\), \(\sum t^2 = 18181\), \(\sum p^2 = 106397\), \(\sum tp = 42948\)]
  1. Find \(S_{tt}\), \(S_{pp}\) and \(S_t\) for these data. [4]
  2. Calculate the product moment correlation coefficient for these data. [3]
  3. Interpret the correlation coefficient. [1]
  4. On the graph paper on page 17, draw the scatter diagram of blood pressure against age for these 7 patients. [2]
  5. Find the equation of the regression line of \(p\) on \(t\). [4]
  6. Plot your regression line on your scatter diagram. [2]
  7. Use your regression line to estimate the blood pressure of a 40 year old patient. [2]