5.09c Calculate regression line

235 questions

Sort by: Default | Easiest first | Hardest first
WJEC Further Unit 2 2018 June Q7
7 marks Moderate -0.3
A university professor conducted some research into factors that affect job satisfaction. The four factors considered were Interesting work, Good wages, Job security and Appreciation of work done. The professor interviewed workers at 14 different companies and asked them to rate their companies on each of the factors. The workers' ratings were averaged to give each company a score out of 5 on each factor. Each company was also given a score out of 100 for Job satisfaction. The following graph shows the part of the research concerning Job Satisfaction versus Interesting work. \includegraphics{figure_2}
  1. Calculate the equation of the least squares regression line of Job satisfaction (\(y\)) on Interesting work (\(x\)), given the following summary statistics. [5] \(\sum x = 46 \cdot 2\), \quad \(\sum y = 898\), \quad \(S_{xx} = 3 \cdot 48\) \(S_{xy} = 49 \cdot 45\), \quad \(S_{yy} = 1437 \cdot 714\), \quad \(n = 14\)
  2. Give two reasons why it would be inappropriate for the professor to use this equation to calculate the score for Interesting work from a Job satisfaction score of 90. [2]
WJEC Further Unit 2 2023 June Q2
8 marks Moderate -0.3
For a set of 30 pairs of observations of the variables \(x\) and \(y\), it is known that \(\sum x = 420\) and \(\sum y = 240\). The least squares regression line of \(y\) on \(x\) passes through the point with coordinates \((19, 20)\).
  1. Show that the equation of the regression line of \(y\) on \(x\) is \(y = 2 \cdot 4x - 25 \cdot 6\) and use it to predict the value of \(y\) when \(x = 26\). [6]
  2. State two reasons why your prediction in part (a) may not be reliable. [2]
WJEC Further Unit 2 Specimen Q4
9 marks Moderate -0.8
A year 12 student wishes to study at a Welsh university. For a randomly chosen year between 2000 and 2017 she collected data for seven universities in Wales from the Complete University Guide website. The data are for the variables: • 'Entry standards' – the average UCAS tariff score of new undergraduate students; • 'Student satisfaction' – a measure of student views of the teaching quality at the university taken from the National Student Survey (maximum 5); • 'Graduate prospects' – a measure of the employability of a university's first degree graduates (maximum 100); • 'Research quality' – a measure of the quality of the research undertaken in the university (maximum 4).
  1. Pearson's product-moment correlation coefficients, for each pairing of the four variables, are shown in the table below. Discuss the correlation between graduate prospects and the other three variables. [2]
    VariableEntry standardsStudent satisfactionGraduate prospectsResearch quality
    Entry standards1
    Student satisfaction-0.0301
    Graduate prospects0.7720.2361
    Research quality0.8660.0660.8271
  2. Calculate the equation of the least squares regression line to predict 'Entry standards'(y) from 'Research quality'(x), given the summary statistics: $$\sum x = 22.24, \sum y = 2522, S_{xx} = 1.0542, S_{xy} = 20193.5, S_{yy} = 122.72.$$ [5]
  3. The data for one of the Welsh universities are missing. This university has a research quality of 3.00. Use your equation to predict the entry standard for this university. [2]
SPS SPS FM Statistics 2021 January Q3
7 marks Moderate -0.3
A large field of wheat is split into 8 plots of equal area. Each plot is treated with a different amount of fertiliser, \(f\) grams/m². The yield of wheat, \(w\) tonnes, from each plot is recorded. The results are summarised below. $$\sum f = 28 \quad \sum w = 303 \quad \sum w^2 = 13447 \quad S_{ff} = 42 \quad S_{fw} = 269.5$$
  1. Calculate the product moment correlation coefficient between \(f\) and \(w\) [2]
  2. Interpret the value of your product moment correlation coefficient. [1]
  3. Find the equation of the regression line of \(w\) on \(f\) in the form \(w = a + bf\) [3]
  4. Using your equation, estimate the decrease in yield when the amount of fertiliser decreases by 0.5 grams/m² [1]
OCR Further Statistics 2021 June Q1
5 marks Moderate -0.3
A set of bivariate data \((X, Y)\) is summarised as follows. \(n = 25\), \(\Sigma x = 9.975\), \(\Sigma y = 11.175\), \(\Sigma x^2 = 5.725\), \(\Sigma y^2 = 46.200\), \(\Sigma xy = 11.575\)
  1. Calculate the value of Pearson's product-moment correlation coefficient. [1]
  2. Calculate the equation of the regression line of \(y\) on \(x\). [2]
It is desired to know whether the regression line of \(y\) on \(x\) will provide a reliable estimate of \(y\) when \(x = 0.75\).
  1. State one reason for believing that the estimate will be reliable. [1]
  2. State what further information is needed in order to determine whether the estimate is reliable. [1]
OCR Further Statistics 2017 Specimen Q1
6 marks Moderate -0.8
The table below shows the typical stopping distances \(d\) metres for a particular car travelling at \(v\) miles per hour.
\(v\)203040506070
\(d\)132436527294
  1. State each of the following words that describe the variable \(v\). Independent \quad Dependent \quad Controlled \quad Response [1]
  2. Calculate the equation of the regression line of \(d\) on \(v\). [2]
  3. Use the equation found in part (ii) to estimate the typical stopping distance when this car is travelling at 45 miles per hour. [1]
It is given that the product moment correlation coefficient for the data is 0.990 correct to three significant figures.
  1. Explain whether your estimate found in part (iii) is reliable. [2]
OCR FS1 AS 2017 Specimen Q8
10 marks Standard +0.3
The following table gives the mean per capita consumption of mozzarella cheese per annum, \(x\) pounds, and the number of civil engineering doctorates awarded, \(y\), in the United States in each of 10 years.
\(x\)9.39.79.79.79.910.210.511.010.610.6
\(y\)480501540552547622655701712708
source: www.tylervigen.com
  1. Find the equation of the regression line of \(y\) on \(x\). [2]
You are given that the product moment correlation coefficient is 0.959.
  1. Explain whether this value would be different if \(x\) is measured in kilograms instead of pounds. [1]
It is desired to carry out a hypothesis test to investigate whether there is correlation between these two variables.
  1. Assume that the data is a random sample of all years.
    1. Carry out the test at the 10\% significance level. [6]
    2. Explain whether your conclusion suggests that manufacturers of mozzarella cheese could increase consumption by sponsoring doctoral candidates in civil engineering. [1]
Pre-U Pre-U 9794/3 2013 November Q4
6 marks Moderate -0.8
As part of a study into the effects of alcohol, volunteers have their reaction times measured after they have consumed various fixed amounts of alcohol. For a random sample of 12 volunteers the following information was collected.
Units of alcohol consumed23344.55.5667889
Reaction time (seconds)12553.85.54.88.57.26.898
  1. Which is the independent variable in this experiment? [1]
  2. Find the least squares regression line of \(y\) (Reaction time) on \(x\) (Units of alcohol), and use it to estimate the reaction time of someone who has consumed 5 units of alcohol. [5]
CAIE FP2 2013 November Q9
Standard +0.3
9 For a random sample of 10 observations of pairs of values \(( x , y )\), the equations of the regression lines of \(y\) on \(x\) and of \(x\) on \(y\) are $$y = 4.21 x - 0.862 \quad \text { and } \quad x = 0.043 y + 6.36 ,$$ respectively.
  1. Find the value of the product moment correlation coefficient for the sample.
  2. Test, at the \(10 \%\) significance level, whether there is evidence of non-zero correlation between the variables.
  3. Find the mean values of \(x\) and \(y\) for this sample.
  4. Estimate the value of \(x\) when \(y = 2.3\) and comment on the reliability of your answer.
CAIE FP2 2014 June Q11
Challenging +1.2
11 Answer only one of the following two alternatives.
EITHER
A particle \(P\) of mass \(m\) is suspended from a fixed point by a light elastic string of natural length \(l\), and hangs in equilibrium. The particle is pulled vertically down to a position where the length of the string is \(\frac { 13 } { 7 } l\). The particle is released from rest in this position and reaches its greatest height when the length of the string is \(\frac { 11 } { 7 } l\).
  1. Show that the modulus of elasticity of the string is \(\frac { 7 } { 5 } \mathrm { mg }\).
  2. Show that \(P\) moves in simple harmonic motion about the equilibrium position and state the period of the motion.
  3. Find the time after release when the speed of \(P\) is first equal to half of its maximum value.
    OR
    For a random sample of 12 observations of pairs of values \(( x , y )\), the equation of the regression line of \(y\) on \(x\) and the equation of the regression line of \(x\) on \(y\) are $$y = b x + 4.5 \quad \text { and } \quad x = a y + c$$ respectively, where \(a , b\) and \(c\) are constants. The product moment correlation coefficient for the sample is 0.6 .
  4. Test, at the \(5 \%\) significance level, whether there is evidence of positive correlation between the variables.
  5. Given that \(b - a = 0.5\), find the values of \(a\) and \(b\).
  6. Given that the sum of the \(x\)-values in the sample data is 66, find the value of \(c\) and sketch the two regression lines on the same diagram. For each of the 12 pairs of values of \(( x , y )\) in the sample, another variable \(z\) is considered, where \(z = 5 y\).
  7. State the coefficient of \(x\) in the equation of the regression line of \(z\) on \(x\) and find the value of the product moment correlation coefficient between \(x\) and \(z\), justifying your answer.