Identify response/explanatory variables

A question is this type if and only if it asks to identify which variable is the independent/explanatory/controlled variable and which is the dependent/response variable.

11 questions · Moderate -0.9

5.09c Calculate regression line
Sort by: Default | Easiest first | Hardest first
Edexcel S1 2015 January Q5
13 marks Easy -1.3
  1. The resting heart rate, \(h\) beats per minute (bpm), and average length of daily exercise, \(t\) minutes, of a random sample of 8 teachers are shown in the table below.
\(t\)2035402545707590
\(h\)8885777571666054
  1. State, with a reason, which variable is the response variable. The equation of the least squares regression line of \(h\) on \(t\) is $$h = 93.5 - 0.43 t$$
  2. Give an interpretation of the gradient of this regression line.
  3. Find the value of \(\bar { t }\) and the value of \(\bar { h }\)
  4. Show that the point \(( \bar { t } , \bar { h } )\) lies on the regression line.
  5. Estimate the resting heart rate of a teacher with an average length of daily exercise of 1 hour.
  6. Comment, giving a reason, on the reliability of the estimate in part (e). The resting heart rate of teachers is assumed to be normally distributed with mean 73 bpm and standard deviation 8 bpm . The middle \(95 \%\) of resting heart rates of teachers lies between \(a\) and \(b\)
  7. Find the value of \(a\) and the value of \(b\).
CAIE FP2 2012 November Q8
11 marks Moderate -0.8
8 The yield of a particular crop on a farm is thought to depend principally on the amount of sunshine during the growing season. For a random sample of 8 years, the average yield, \(y\) kilograms per square metre, and the average amount of sunshine per day, \(x\) hours, are recorded. The results are given in the following table.
\(x\)12.210.45.26.311.810.014.22.3
\(y\)159107811126
$$\left[ \Sigma x = 72.4 , \Sigma x ^ { 2 } = 769.9 , \Sigma y = 78 , \Sigma y ^ { 2 } = 820 , \Sigma x y = 761.3 . \right]$$
  1. Find the equation of the regression line of \(y\) on \(x\).
  2. Find the product moment correlation coefficient.
  3. Test, at the \(5 \%\) significance level, whether there is positive correlation between the average yield and the average amount of sunshine per day.
OCR Further Statistics AS 2018 June Q7
8 marks Moderate -0.8
7 An environmentalist measures the mean concentration, \(c\) milligrams per litre, of a particular chemical in a group of rivers, and the mean mass, \(m\) pounds, of fish of a certain species found in those rivers. The results are given in the table.
\(c\)1.941.781.621.511.521.4
\(m\)6.57.27.47.68.39.7
  1. State which, if either, of \(m\) and \(c\) is an independent variable.
  2. Calculate the equation of the least squares regression line of \(c\) on \(m\).
  3. State what effect, if any, there would be on your answer to part (ii) if the masses of the fish had been recorded in kilograms rather than pounds. ( \(1 \mathrm {~kg} \approx 2.2\) pounds.)
  4. The data is illustrated in the scatter diagram. Explain what is meant by 'least squares', illustrating your answer using the copy of this diagram in the Printed Answer Booklet.
    [diagram]
OCR Further Statistics Specimen Q1
6 marks Easy -1.2
1 The table below shows the typical stopping distances \(d\) metres for a particular car travelling at \(v\) miles per hour.
\(v\)203040506070
\(d\)132436527294
  1. State each of the following words that describe the variable \(v\). \section*{Independent Dependent Controlled Response}
  2. Calculate the equation of the regression line of \(d\) on \(v\).
  3. Use the equation found in part (ii) to estimate the typical stopping distance when this car is travelling at 45 miles per hour. It is given that the product moment correlation coefficient for the data is 0.990 correct to three significant figures.
  4. Explain whether your estimate found in part (iii) is reliable.
Edexcel S1 2008 June Q4
15 marks Moderate -0.8
4. Crickets make a noise. The pitch, \(v \mathrm { kHz }\), of the noise made by a cricket was recorded at 15 different temperatures, \(t ^ { \circ } \mathrm { C }\). These data are summarised below. $$\sum t ^ { 2 } = 10922.81 , \sum v ^ { 2 } = 42.3356 , \sum t v = 677.971 , \sum t = 401.3 , \sum v = 25.08$$
  1. Find \(S _ { t t } , S _ { v v }\) and \(S _ { t v }\) for these data.
  2. Find the product moment correlation coefficient between \(t\) and \(v\).
  3. State, with a reason, which variable is the explanatory variable.
  4. Give a reason to support fitting a regression model of the form \(v = a + b t\) to these data.
  5. Find the value of \(a\) and the value of \(b\). Give your answers to 3 significant figures.
  6. Using this model, predict the pitch of the noise at \(19 ^ { \circ } \mathrm { C }\).
Edexcel S1 2015 June Q4
14 marks Easy -1.2
  1. Statistical models can provide a cheap and quick way to describe a real world situation.
    1. Give two other reasons why statistical models are used.
    A scientist wants to develop a model to describe the relationship between the average daily temperature, \(x ^ { \circ } \mathrm { C }\), and her household's daily energy consumption, \(y \mathrm { kWh }\), in winter. A random sample of the average daily temperature and her household's daily energy consumption are taken from 10 winter days and shown in the table.
    \(x\)- 0.4- 0.20.30.81.11.41.82.12.52.6
    \(y\)28302625262726242221
    $$\text { [You may use } \sum x ^ { 2 } = 24.76 \quad \sum y = 255 \quad \sum x y = 283.8 \quad \mathrm {~S} _ { x x } = 10.36 \text { ] }$$
  2. Find \(\mathrm { S } _ { x y }\) for these data.
  3. Find the equation of the regression line of \(y\) on \(x\) in the form \(y = a + b x\) Give the value of \(a\) and the value of \(b\) to 3 significant figures.
  4. Give an interpretation of the value of \(a\)
  5. Estimate her household's daily energy consumption when the average daily temperature is \(2 ^ { \circ } \mathrm { C }\) The scientist wants to use the linear regression model to predict her household's energy consumption in the summer.
  6. Discuss the reliability of using this model to predict her household's energy consumption in the summer.
OCR MEI Further Statistics Minor 2022 June Q2
13 marks Moderate -0.8
2 A forester is investigating the relationship between the diameter and the height of young beech trees. She selects a random sample of 15 young beech trees in a forest and records their diameters, \(d \mathrm {~cm}\), and their heights, \(h \mathrm {~m}\). The data are illustrated in the scatter diagram. \includegraphics[max width=\textwidth, alt={}, center]{e8624e9b-5143-49d2-9683-cc3a1082694e-3_649_1116_386_230}
  1. State whether either or both of the variables \(d\) and \(h\) are random variables. Summary data for the diameters and heights are as follows. $$\mathrm { n } = 15 \quad \sum \mathrm {~d} = 84.9 \quad \sum \mathrm {~h} = 124.7 \quad \sum \mathrm {~d} ^ { 2 } = 624.55 \quad \sum \mathrm {~h} ^ { 2 } = 1230.57 \quad \sum \mathrm { dh } = 866.63$$
  2. Find the equation of the regression line of \(h\) on \(d\). Give your answer in the form \(h = a d + b\), giving the values of \(a\) and \(b\) correct to \(\mathbf { 2 }\) decimal places.
  3. Use the regression line to predict the heights of beech trees with the following diameters.
    Comment on this in relation to your regression line.
  4. State the coordinates of the point at which the regression line of \(d\) on \(h\) meets the line which you calculated in part (b).
OCR MEI Further Statistics Major 2020 November Q5
13 marks Moderate -0.3
5 A hearing expert is investigating whether web-based hearing tests can be used instead of hearing tests in a hearing laboratory. The expert selects a random sample of 16 people with normal hearing. Each of them is given two hearing tests, one in the laboratory and one web-based. The scores in the laboratory-based test, \(x\), and the web-based test, \(y\), are both measured in the same suitable units.
  1. Half of the participants do the laboratory-based test first and the other half do the web-based test first. Explain why the expert adopts this approach. The scatter diagram in Fig. 5 shows the data that the expert collected. \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{8d36bc92-07ac-40c3-9e75-26f2bc9d2fcc-05_785_1360_1009_242} \captionsetup{labelformat=empty} \caption{Fig. 5}
    \end{figure} Summary statistics for these data are as follows. $$\Sigma x = 198.0 \quad \Sigma x ^ { 2 } = 2936.92 \quad \Sigma y = 188.7 \quad \Sigma y ^ { 2 } = 2605.35 \quad \Sigma x y = 2554.87$$
  2. Calculate the equation of the regression line suitable for estimating web-based scores from laboratory-based scores.
  3. Estimate the web-based scores of people whose laboratory-based scores were as follows.
    Stating the approximate coordinates of the outlier, suggest what the expert should do.
OCR S1 2010 June Q3
10 marks Moderate -0.8
  1. Some values, \((x, y)\), of a bivariate distribution are plotted on a scatter diagram and a regression line is to be drawn. Explain how to decide whether the regression line of \(y\) on \(x\) or the regression line of \(x\) on \(y\) is appropriate. [2]
  2. In an experiment the temperature, \(x\) °C, of a rod was gradually increased from 0 °C, and the extension, \(y\), was measured nine times at 50 °C intervals. The results are summarised below. \(n = 9\) \quad \(\Sigma x = 1800\) \quad \(\Sigma y = 14.4\) \quad \(\Sigma x^2 = 510000\) \quad \(\Sigma y^2 = 32.6416\) \quad \(\Sigma xy = 4080\)
    1. Show that the gradient of the regression line of \(y\) on \(x\) is 0.008 and find the equation of this line. [4]
    2. Use your equation to estimate the temperature when the extension is 2.5 mm. [1]
    3. Use your equation to estimate the extension for a temperature of \(-50\) °C. [1]
    4. Comment on the meaning and the reliability of your estimate in part (c). [2]
OCR Further Statistics 2017 Specimen Q1
6 marks Moderate -0.8
The table below shows the typical stopping distances \(d\) metres for a particular car travelling at \(v\) miles per hour.
\(v\)203040506070
\(d\)132436527294
  1. State each of the following words that describe the variable \(v\). Independent \quad Dependent \quad Controlled \quad Response [1]
  2. Calculate the equation of the regression line of \(d\) on \(v\). [2]
  3. Use the equation found in part (ii) to estimate the typical stopping distance when this car is travelling at 45 miles per hour. [1]
It is given that the product moment correlation coefficient for the data is 0.990 correct to three significant figures.
  1. Explain whether your estimate found in part (iii) is reliable. [2]
Pre-U Pre-U 9794/3 2013 November Q4
6 marks Moderate -0.8
As part of a study into the effects of alcohol, volunteers have their reaction times measured after they have consumed various fixed amounts of alcohol. For a random sample of 12 volunteers the following information was collected.
Units of alcohol consumed23344.55.5667889
Reaction time (seconds)12553.85.54.88.57.26.898
  1. Which is the independent variable in this experiment? [1]
  2. Find the least squares regression line of \(y\) (Reaction time) on \(x\) (Units of alcohol), and use it to estimate the reaction time of someone who has consumed 5 units of alcohol. [5]