Minimize sum of squared residuals

A question is this type if and only if it involves algebraically minimizing an expression for the sum of squared residuals to derive regression line parameters.

2 questions · Moderate -0.1

Sort by: Default | Easiest first | Hardest first
OCR Further Statistics 2024 June Q7
8 marks Standard +0.3
7 The coordinates of a set of 10 points are denoted by ( \(\mathrm { x } _ { \mathrm { i } } , \mathrm { y } _ { \mathrm { i } }\) ) for \(i = 1,2 , \ldots , 10\). For a particular set of values of ( \(\mathrm { x } _ { \mathrm { i } } , \mathrm { y } _ { \mathrm { i } }\) ) and any constants \(a\) and \(b\) it can be shown that \(\Sigma \left( y _ { i } - a - b x _ { i } \right) ^ { 2 } = 10 ( 11 - a - 6 b ) ^ { 2 } + 126 \left( b - \frac { 83 } { 42 } \right) ^ { 2 } + \frac { 139 } { 14 }\).
    1. Explain why \(\sum \left( \mathrm { y } _ { \mathrm { i } } - \mathrm { a } - \mathrm { bx } _ { \mathrm { i } } \right) ^ { 2 }\) is minimised by taking \(b = \frac { 83 } { 42 }\) and \(\mathrm { a } = 11 - 6 \mathrm {~b}\).
    2. Hence explain why the equation of the regression line of \(y\) on \(x\) for these points is given by the corresponding values of \(a\) and \(b\) (so that the equation is \(\mathrm { y } = \frac { 83 } { 42 } \mathrm { x } - \frac { 6 } { 7 }\) ).
  1. State which of the following terms cannot apply to the variable \(X\) if the regression line of \(y\) on \(x\) can be used for estimating values of \(Y\). Dependent Independent Controlled Response
  2. Use the regression line to estimate the value of \(y\) corresponding to \(x = 8\).
  3. State what must be true of the value \(x = 8\) if the estimate in part (c) is to be reliable.
  4. Variables \(u\) and \(v\) are related to \(x\) and \(y\) by the following relationships. \(u = 2 + 4 x \quad v = 8 - 2 y\) Show that the gradient of the regression line of \(v\) on \(u\) is very close to - 1 .
CAIE FP2 2017 June Q11
24 marks Moderate -0.5
Answer only one of the following two alternatives. EITHER \includegraphics{figure_11a} The diagram shows a uniform thin rod \(AB\) of length \(3a\) and mass \(8m\). The end \(A\) is rigidly attached to the surface of a sphere with centre \(O\) and radius \(a\). The rod is perpendicular to the surface of the sphere. The sphere consists of two parts: an inner uniform solid sphere of mass \(m\) and radius \(a\) surrounded by a thin uniform spherical shell of mass \(m\) and also of radius \(a\). The horizontal axis \(l\) is perpendicular to the rod and passes through the point \(C\) on the rod where \(AC = a\).
  1. Show that the moment of inertia of the object, consisting of rod, shell and inner sphere, about the axis \(l\) is \(\frac{289}{15}ma^2\). [6]
The object is free to rotate about the axis \(l\). The object is held so that \(CA\) makes an angle \(\alpha\) with the downward vertical and is released from rest.
  1. Given that \(\cos \alpha = \frac{1}{6}\), find the greatest speed achieved by the centre of the sphere in the subsequent motion. [6]
OR The times taken to run \(200\) metres at the beginning of the year and at the end of the year are recorded for each member of a large athletics club. The time taken, in seconds, at the beginning of the year is denoted by \(x\) and the time taken, in seconds, at the end of the year is denoted by \(y\). For a random sample of \(8\) members, the results are shown in the following table.
MemberABCDEFGH
\(x\)24.223.822.825.124.524.023.822.8
\(y\)23.923.622.824.524.223.523.622.7
\([\Sigma x = 191, \quad \Sigma x^2 = 4564.46, \quad \Sigma y = 188.8, \quad \Sigma y^2 = 4458.4, \quad \Sigma xy = 4510.99.]\)
  1. Find, showing all necessary working, the equation of the regression line of \(y\) on \(x\). [4]
The athletics coach believes that, on average, the time taken by an athlete to run \(200\) metres decreases between the beginning and the end of the year by more than \(0.2\) seconds.
  1. Stating suitable hypotheses and assuming a normal distribution, test the coach's belief at the \(10\%\) significance level. [8]