Calculate x on y regression line

Questions that ask to find the regression line of x on y (the reverse regression), either from summary statistics or raw data.

3 questions

OCR S1 2015 June Q4
4 The table shows the load a lorry was carrying, \(x\) tonnes, and the fuel economy, \(y \mathrm {~km}\) per litre, for 8 different journeys. You should assume that neither variable is controlled.
Load
\(( x\) tonnes \()\)
5.15.86.57.17.68.49.510.5
Fuel economy
\(( y \mathrm {~km}\) per litre \()\)
6.26.15.95.65.35.45.35.1
$$n = 8 \quad \sum x = 60.5 \quad \sum y = 44.9 \quad \sum x ^ { 2 } = 481.13 \quad \sum y ^ { 2 } = 253.17 \quad \sum x y = 334.65$$
  1. Calculate the equation of the regression line of \(y\) on \(x\).
  2. Estimate the fuel economy for a load of 9.2 tonnes.
  3. An analyst calculated the equation of the regression line of \(x\) on \(y\). Without calculating this equation, state the coordinates of the point where the two regression lines intersect.
  4. Describe briefly the method required to estimate the load when the fuel economy is 5.8 km per litre.
CAIE FP2 2015 June Q10
10 Young children at a primary school are learning to throw a ball as far as they can. The distance thrown at the beginning of the school year and the distance thrown at the end of the same school year are recorded for each child. The distance thrown, in metres, at the beginning of the year is denoted by \(x\); the distance thrown, in metres, at the end of the year is denoted by \(y\). For a random sample of 10 children, the results are shown in the following table.
Child\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)
\(x\)5.24.13.75.47.66.13.24.03.58.0
\(y\)6.24.85.05.67.77.04.04.53.68.5
$$\left[ \Sigma x = 50.8 , \quad \Sigma x ^ { 2 } = 284.16 , \quad \Sigma y = 56.9 , \quad \Sigma y ^ { 2 } = 347.59 , \quad \Sigma x y = 313.28 . \right]$$ A particular child threw the ball a distance of 7.0 metres at the beginning of the year, but he could not throw at the end of the year because he had broken his arm. By finding the equation of an appropriate regression line, estimate the distance this child would have thrown at the end of the year. The teacher suspects that, on average, the distance thrown by a child increases between the two throws by more than 0.4 metres. Stating suitable hypotheses and assuming a normal distribution, test the teacher's suspicion at the \(5 \%\) significance level.
CAIE FP2 2017 June Q11 OR
The times taken to run 200 metres at the beginning of the year and at the end of the year are recorded for each member of a large athletics club. The time taken, in seconds, at the beginning of the year is denoted by \(x\) and the time taken, in seconds, at the end of the year is denoted by \(y\). For a random sample of 8 members, the results are shown in the following table.
Member\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)
\(x\)24.223.822.825.124.524.023.822.8
\(y\)23.923.622.824.524.223.523.622.7
$$\left[ \Sigma x = 191 , \quad \Sigma x ^ { 2 } = 4564.46 , \quad \Sigma y = 188.8 , \quad \Sigma y ^ { 2 } = 4458.4 , \quad \Sigma x y = 4510.99 . \right]$$
  1. Find, showing all necessary working, the equation of the regression line of \(y\) on \(x\).
    The athletics coach believes that, on average, the time taken by an athlete to run 200 metres decreases between the beginning and the end of the year by more than 0.2 seconds.
  2. Stating suitable hypotheses and assuming a normal distribution, test the coach's belief at the \(10 \%\) significance level.