5.09c Calculate regression line

235 questions

Sort by: Default | Easiest first | Hardest first
AQA S1 2011 June Q3
15 marks Moderate -0.8
3
  1. During a particular summer holiday, Rick worked in a fish and chip shop at a seaside resort. He suspected that the shop's takings, \(\pounds y\), on a weekday were dependent upon the forecast of that day's maximum temperature, \(x ^ { \circ } \mathrm { C }\), in the resort, made at 6.00 pm on the previous day. To investigate this suspicion, he recorded values of \(x\) and \(y\) for a random sample of 7 weekdays during July.
    \(\boldsymbol { x }\)23182719252022
    \(\boldsymbol { y }\)4290318851063829505742644485
    1. Calculate the equation of the least squares regression line of \(y\) on \(x\).
    2. Estimate the shop's takings on a weekday during July when the maximum temperature was forecast to be \(24 ^ { \circ } \mathrm { C }\).
    3. Explain why your equation may not be suitable for estimating the shop's takings on a weekday during February.
    4. Describe, in the context of this question, a variable other than the maximum temperature, \(x\), that may affect \(y\).
  2. Seren, who also worked in the fish and chip shop, investigated the possible linear relationship between the shop's takings, \(\pounds z\), recorded in \(\pounds 000\) s, and each of two other explanatory variables, \(v\) and \(w\).
    1. She calculated correctly that the regression line of \(z\) on \(v\) had a \(z\)-intercept of - 1 and a gradient of 0.15 . Draw this line, for values of \(v\) from 0 to 40, on Figure 1 on page 4.
    2. She also calculated correctly that the regression line of \(z\) on \(w\) had a \(z\)-intercept of 5 and a gradient of - 0.40 . Draw this line, for values of \(w\) from 0 to 10, on Figure 2 below. \begin{figure}[h]
      \captionsetup{labelformat=empty} \caption{Figure 1} \includegraphics[alt={},max width=\textwidth]{767ec629-6350-41d9-bbb9-e059a5fd8c70-4_792_604_680_717}
      \end{figure} \begin{figure}[h]
      \captionsetup{labelformat=empty} \caption{Figure 2} \includegraphics[alt={},max width=\textwidth]{767ec629-6350-41d9-bbb9-e059a5fd8c70-4_792_696_1692_687}
      \end{figure}
AQA S1 2012 June Q3
11 marks Moderate -0.3
3 The table shows the maximum weight, \(y _ { A }\) grams, of Salt \(A\) that will dissolve in 100 grams of water at various temperatures, \(x ^ { \circ } \mathrm { C }\).
\(\boldsymbol { x }\)101520253035404550607080
\(\boldsymbol { y } _ { \boldsymbol { A } }\)203548577792101111121137159182
  1. Calculate the equation of the least squares regression line of \(y _ { A }\) on \(x\).
  2. The data in the above table are plotted on the scatter diagram on page 4. Draw your regression line on this scatter diagram.
  3. For water temperatures in the range \(10 ^ { \circ } \mathrm { C }\) to \(80 ^ { \circ } \mathrm { C }\), the maximum weight, \(y _ { B }\) grams, of Salt \(B\) that will dissolve in 100 grams of water is given by the equation $$y _ { B } = 60.1 + 0.255 x$$
    1. Draw this line on the scatter diagram.
    2. Estimate the water temperature at which the maximum weight of Salt \(A\) that will dissolve in 100 grams of water is the same as that of Salt B.
    3. For Salt \(A\) and Salt \(B\), compare the effects of water temperature on the maximum weight that will dissolve in 100 grams of water. Your answer should identify two distinct differences. \section*{Temperatures and Maximum Weights}
      \includegraphics[max width=\textwidth, alt={}]{91466019-8feb-4292-b616-e8e8667e2e54-4_2023_1682_404_173}
AQA S1 2013 June Q4
17 marks Standard +0.3
4 The girth, \(g\) metres, the length, \(l\) metres, and the weight, \(y\) kilograms, of each of a sample of 20 pigs were measured. The data collected is summarised as follows. $$S _ { g g } = 0.1196 \quad S _ { l l } = 0.0436 \quad S _ { y y } = 5880 \quad S _ { g y } = 24.15 \quad S _ { l y } = 10.25$$
  1. Calculate the value of the product moment correlation coefficient between:
    1. girth and weight;
    2. length and weight.
  2. Interpret, in context, each of the values that you obtained in part (a).
  3. Weighing pigs requires expensive equipment, whereas measuring their girths and lengths simply requires a tape measure. With this in mind, the following formula is proposed to make an estimate of a pig's weight, \(x\) kilograms, from its girth and length. $$x = 69.3 \times g ^ { 2 } \times l$$ Applying this formula to the relevant data on the 20 pigs resulted in $$S _ { x x } = 5656.15 \quad S _ { x y } = 5662.97$$
    1. By calculating a third value of the product moment correlation coefficient, state which of \(g , l\) or \(x\) is the most strongly correlated with \(y\), the weight.
    2. Estimate the weight of a pig that has a girth of 1.25 metres and a length of 1.15 metres.
    3. Given the additional information that \(\bar { x } = 115.4\) and \(\bar { y } = 116.0\), calculate the equation of the least squares regression line of \(y\) on \(x\), in the form \(y = a + b x\).
    4. Comment on the likely accuracy of the estimated weight found in part (c)(ii). Your answer should make reference to the value of the product moment correlation coefficient found in part (c)(i) and to the values of \(b\) and \(a\) found in part (c)(iii).
      (4 marks)
AQA S1 2014 June Q3
11 marks Moderate -0.8
3 The table shows the body mass index (BMI), \(x\), and the systolic blood pressure (SBP), \(y \mathrm { mmHg }\), for each of a random sample of 10 men, aged between 35 years and 40 years, from a particular population.
\(\boldsymbol { x }\)13232935173425203127
\(\boldsymbol { y }\)103115124126108120113117118119
  1. Calculate the equation of the least squares regression line of \(y\) on \(x\).
  2. Use your equation to estimate the SBP of a man from this population who is aged 38 years and who has a BMI of 30 .
  3. State why your equation might not be appropriate for estimating the SBP of a man from this population:
    1. who is aged 38 years and who has a BMI of 45 ;
    2. who is aged 50 years and who has a BMI of 25 .
  4. Find the value of the residual for the point \(( 20,117 )\).
  5. The mean of the vertical distances of the 10 points from the regression line calculated in part (a) is 2.71, correct to three significant figures. Comment on the likely accuracy of your estimate in part (b).
    [0pt] [1 mark]
AQA S1 2014 June Q5
13 marks Moderate -0.5
5 As part of a study of charity shops in a small market town, two such shops, \(X\) and \(Y\), were each asked to provide details of its takings on 12 randomly selected days. The table shows, for each of the 12 days, the day's takings, \(\pounds x\), of charity shop \(X\) and the day's takings, \(\pounds y\), of charity shop \(Y\).
Day\(\mathbf { A }\)\(\mathbf { B }\)\(\mathbf { C }\)\(\mathbf { D }\)\(\mathbf { E }\)\(\mathbf { F }\)\(\mathbf { G }\)\(\mathbf { H }\)\(\mathbf { I }\)\(\mathbf { J }\)\(\mathbf { K }\)\(\mathbf { L }\)
\(\boldsymbol { x }\)4657391166277416115536861
\(\boldsymbol { y }\)781026621498729813421679583
    1. Calculate the value of the product moment correlation coefficient between \(x\) and \(y\).
    2. Interpret your value in the context of this question.
  1. Complete the scatter diagram shown on the opposite page.
  2. The investigator realised subsequently that one of the 12 selected days was a particularly popular town market day and another was a day on which the weather was extremely severe. Identify each of these days giving a reason for each choice.
  3. Removing the two days described in part (c) from the data gives the following information. $$S _ { x x } = 1292.5 \quad S _ { y y } = 3850.1 \quad S _ { x y } = 407.5$$
    1. Use this information to recalculate the value of the product moment correlation coefficient between \(x\) and \(y\).
    2. Hence revise, as necessary, your interpretation in part (a)(ii).
      [0pt] [3 marks] Shop \(X\) takings(£) \begin{figure}[h]
      \captionsetup{labelformat=empty} \caption{harity Shops} \includegraphics[alt={},max width=\textwidth]{ddf7f158-b6ae-42c6-98f1-d59c205646ad-17_33_21_294_1617}
      \end{figure} \begin{figure}[h]
      \captionsetup{labelformat=empty} \caption{harity Shops} \includegraphics[alt={},max width=\textwidth]{ddf7f158-b6ae-42c6-98f1-d59c205646ad-17_49_24_276_1710}
      \end{figure}
      \includegraphics[max width=\textwidth, alt={}]{ddf7f158-b6ae-42c6-98f1-d59c205646ad-17_1304_415_406_1391}
AQA S1 2014 June Q6
12 marks Moderate -0.8
6 A rubber seal is fitted to the bottom of a flood barrier. When no pressure is applied, the depth of the seal is 15 cm . When pressure is applied, a watertight seal is created between the flood barrier and the ground. The table shows the pressure, \(x\) kilopascals ( kPa ), applied to the seal and the resultant depth, \(y\) centimetres, of the seal.
\(\boldsymbol { x }\)255075100125150175200250300
\(\boldsymbol { y }\)14.713.412.811.911.010.39.79.07.56.7
    1. State the value that you would expect for \(a\) in the equation of the least squares regression line, \(y = a + b x\).
    2. Calculate the equation of the least squares regression line, \(y = a + b x\).
    3. Interpret, in context, your value for \(b\).
  1. Calculate an estimate of the depth of the seal when it is subjected to a pressure of 225 kPa .
    1. Give a statistical reason as to why your equation is unlikely to give a realistic estimate of the depth of the seal if it were to be subjected to a pressure of 400 kPa .
    2. Give a reason based on the context of this question as to why your equation will not give a realistic estimate of the depth of the seal if it were to be subjected to a pressure of 525 kPa .
      [0pt] [3 marks]
      \includegraphics[max width=\textwidth, alt={}]{8aeacd54-a5a1-4f2d-b936-2faf635ffce7-20_946_1709_1761_153}
      \includegraphics[max width=\textwidth, alt={}]{8aeacd54-a5a1-4f2d-b936-2faf635ffce7-21_2484_1707_221_153}
      \includegraphics[max width=\textwidth, alt={}]{8aeacd54-a5a1-4f2d-b936-2faf635ffce7-23_2484_1707_221_153}
AQA S1 2016 June Q4
9 marks Moderate -0.8
4 As part of her science project, a student found the mass, \(y\) grams, of a particular compound that dissolved in 100 ml of water at each of 12 different set temperatures, \(x ^ { \circ } \mathrm { C }\). The results are shown in the table.
\(\boldsymbol { x }\)202530354045505560657075
\(\boldsymbol { y }\)242262269290298310326355359375390412
  1. Calculate the equation of the least squares regression line of \(y\) on \(x\).
  2. Interpret, in context, your value for the gradient of this regression line.
  3. Use your equation to estimate the mass of the compound which will dissolve in 100 ml of water at \(68 ^ { \circ } \mathrm { C }\).
  4. Given that the values of the 12 residuals for the regression line of \(y\) on \(x\) lie between - 7 and + 9 , comment, with justification, on the likely accuracy of your estimate in part (c).
    [0pt] [2 marks]
Edexcel S1 Q5
12 marks Moderate -0.8
  1. The table shows the numbers of cars and vans in a company's fleet having registrations with the prefix letters shown.
Registration letter\(K\)\(L\)\(M\)\(N\)\(P\)\(R\)\(S\)\(T\)\(V\)
Number of cars \(( x )\)67911151412107
Number of vans \(( y )\)810141313151498
  1. Plot a scatter graph of this data, with the number of cars on the horizontal axis and the number of vans on the vertical axis.
  2. If there were \(4 J\)-registered cars, estimate the number of \(J\)-registered vans. Given that \(\sum x ^ { 2 } = 1001 , \sum y ^ { 2 } = 1264\) and \(\sum x y = 1106\),
  3. calculate the product-moment correlation coefficient between \(x\) and \(y\). Give a brief interpretation of your answer.
Edexcel S1 Q6
17 marks Standard +0.3
6. In a survey for a computer magazine, the times \(t\) seconds taken by eight laser printers to print a page of text were compared with the prices \(\pounds p\) of the printers. The data were coded using the equations \(x = t - 10\) and \(y = p - 150\), and it was found that $$\sum x = 42 \cdot 4 , \quad \sum x ^ { 2 } = 314 \cdot 5 , \quad \sum y = 560 , \quad \sum y ^ { 2 } = 60600 , \quad \sum x y = 1592 .$$
  1. Find the mean time and the mean price for the eight printers.
  2. Find the variance of the times.
  3. Find the equation of the regression line of \(p\) on \(t\).
  4. Estimate the price of a printer which takes 11.3 seconds to print the page.
Edexcel S1 Q4
10 marks Moderate -0.8
4. An internet service provider runs a series of television adverts at weekly intervals. To investigate the effectiveness of the adverts the company recorded the viewing figures in millions, \(v\), for the programme in which the advert was shown, and the number of new customers, \(c\), who signed up for their service the next day. The results are summarised as follows. $$\bar { v } = 4.92 , \quad \bar { c } = 104.4 , \quad S _ { v c } = 594.05 , \quad S _ { v v } = 85.44 .$$
  1. Calculate the equation of the regression line of \(c\) on \(v\) in the form \(c = a + b v\).
  2. Give an interpretation of the constants \(a\) and \(b\) in this context.
  3. Estimate the number of customers that will sign up with the company the day after an advert is shown during a programme watched by 3.7 million viewers.
  4. State two other factors besides viewing figures that will affect the success of an advert in gaining new customers for the company.
Edexcel S1 Q7
17 marks Moderate -0.8
7. A doctor wished to investigate the effects of staying awake for long periods on a person's ability to complete simple tasks. She recorded the number of times, \(n\), that a subject could clinch his or her fist in 30 seconds after being awake for \(h\) hours. The results for one subject were as follows.
\(h\) (hours)161718192021222324
\(n\)1161141091019494868180
  1. Plot a scatter diagram of \(n\) against \(h\) for these results. You may use $$\Sigma h = 180 , \quad \Sigma n = 875 , \quad \Sigma h ^ { 2 } = 3660 , \quad \Sigma h n = 17204 .$$
  2. Obtain the equation of the regression line of \(n\) on \(h\) in the form \(n = a + b h\).
  3. Give a practical interpretation of the constant b.
  4. Explain why this regression line would be unlikely to be appropriate for values of \(h\) between 0 and 16 .
    (2 marks)
    Another subject underwent the same tests giving rise to a regression line of \(n = 213.4 - 5.87\) h
  5. After how many hours of being awake together would you expect these two subjects to be able to clench their fists the same number of times in 30 seconds?
Edexcel S1 Q6
17 marks Moderate -0.8
6. A school introduced a new programme of support lessons in 1994 with a view to improving grades in GCSE English. The table below shows the number of years since 1994, n, and the corresponding percentage of students achieving A to C grades in GCSE English, \(p\), for each year.
\(n\)123456
\(p ( \% )\)35.237.140.639.043.444.8
  1. Represent these data on a scatter diagram. You may use the following values. $$\Sigma n = 21 , \quad \Sigma p = 240.1 , \quad \Sigma n ^ { 2 } = 91 , \quad \Sigma p ^ { 2 } = 9675.41 , \quad \Sigma n p = 873 .$$
  2. Find an equation of the regression line of \(p\) on \(n\) and draw it on your graph.
  3. Calculate the product moment correlation coefficient for these data and comment on the suitability of a linear model for the relationship between \(n\) and \(p\) during this period.
Edexcel S1 Q7
15 marks Moderate -0.8
7. Pipes-R-us manufacture a special lightweight aluminium tubing. The price \(\pounds P\), for each length, \(l\) metres, that the company sells is shown in the table.
\(l\) (metres)0.50.81.01.5246
\(P ( \pounds )\)2.503.404.005.206.0010.5015.00
  1. Represent these data on a scatter diagram. You may use $$\Sigma l = 15.8 , \quad \Sigma P = 46.6 , \quad \Sigma l ^ { 2 } = 60.14 , \quad \Sigma l P = 159.77$$
  2. Find the equation of the regression line of \(P\) on \(l\) in the form \(P = a + b l\).
  3. Give a practical interpretation of the constant b. In response to customer demand Pipes- \(R\)-us decide to start selling tubes cut to specific lengths. Initially the company decides to use the regression line found in part (b) as a pricing formula for this new service.
  4. Calculate the price that Pipes- \(R\)-us should charge for 5.2 metres of the tubing.
  5. Suggest a reason why Pipes- \(R\)-us might not offer prices based on the regression line for any length of tubing.
Edexcel S1 Q7
17 marks Standard +0.3
7. A new vaccine is tested over a six-month period in one health authority. The table shows the number of new cases of the disease, \(d\), reported in the \(m\) th month after the trials began.
\(m\)123456
\(d\)1026961585248
A doctor suggests that a relationship of the form \(d = a + b x\) where \(x = \frac { 1 } { m }\) can be used to model the situation.
  1. Tabulate the values of \(x\) corresponding to the given values of \(d\) and plot a scatter diagram of \(d\) against \(x\).
  2. Explain how your scatter diagram supports the suggested model. You may use $$\Sigma x = 2.45 , \quad \Sigma d = 390 , \quad \Sigma x ^ { 2 } = 1.491 , \quad \Sigma x d = 189.733$$
  3. Find an equation of the regression line \(d\) on \(x\) in the form \(d = a + b x\).
  4. Use your regression line to estimate how many new cases of the disease there will be in the 13th month after the trial began.
  5. Comment on the reliability of your answer to part (d).
Edexcel S1 Q6
14 marks Moderate -0.8
6. A physics student recorded the length, \(l \mathrm {~cm}\), of a spring when different masses, \(m\) grams, were suspended from it giving the following results.
\(m ( \mathrm {~g} )\)50100200300400500600700
\(l ( \mathrm {~cm} )\)7.810.716.522.128.033.935.235.6
  1. Represent these data on a scatter diagram with \(l\) on the vertical axis. The student decides to find the equation of a regression line of the form \(l = a + b m\) using only the data for \(m \leq 500 \mathrm {~g}\).
  2. Give a reason to support the fitting of such a regression line and explain why the student is excluding two of his values.
    (2 marks)
    You may use $$\Sigma m = 1550 , \quad \Sigma l = 119 , \quad \Sigma m ^ { 2 } = 552500 , \quad \Sigma l ^ { 2 } = 2869.2 , \quad \Sigma m l = 39540 .$$
  3. Find the values of \(a\) and \(b\).
  4. Explain the significance of the values of \(a\) and \(b\) in this situation.
Edexcel S1 Q4
11 marks Standard +0.3
  1. An engineer tested a new material under extreme conditions in a wind tunnel. He recorded the number of microfractures, \(n\), that formed and the wind speed, \(v\) metres per second, for 8 different values of \(v\) with all other conditions remaining constant. He then coded the data using \(x = v - 700\) and \(y = n - 20\) and calculated the following summary statistics.
$$\Sigma x = 100 , \quad \Sigma y = 23 , \quad \Sigma x ^ { 2 } = 215000 , \quad \Sigma x y = 11600 .$$
  1. Find an equation of the regression line of \(y\) on \(x\).
  2. Hence, find an equation of the regression line of \(n\) on \(v\).
  3. Use your regression line to estimate the number of microfractures that would be formed if the material was tested in a wind speed of 900 metres per second with all other conditions remaining constant.
    (2 marks)
AQA S3 2012 June Q1
6 marks Moderate -0.8
1 A wildlife expert measured the neck lengths, \(x\) metres, and the tail lengths, \(y\) metres, of a sample of 12 mature male giraffes as part of a study into their physical characteristics. The results are shown in the table.
AQA S3 2015 June Q1
6 marks Moderate -0.8
1 A demographer measured the length of the right foot, \(x\) millimetres, and the length of the right hand, \(y\) millimetres, of each of a sample of 12 males aged between 19 years and 25 years. The results are given in the table.
OCR MEI Further Statistics A AS 2018 June Q6
9 marks Standard +0.3
6 A researcher is investigating various bodily characteristics of frogs of various species. She collects data on length, \(x \mathrm {~mm}\), and head width, \(y \mathrm {~mm}\), of a random sample of 14 frogs of a particular species. A scatter diagram of the data is shown in Fig. 6, together with the equation of the regression line of \(y\) on \(x\) and also the value of \(r ^ { 2 }\). \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{e3ac0ba0-9692-4018-894e-2b04b07eaf32-6_949_1616_450_228} \captionsetup{labelformat=empty} \caption{Fig. 6}
\end{figure}
  1. (A) Use the equation of the regression line to estimate the mean head width for frogs of each of the following lengths.
OCR MEI Further Statistics A AS 2019 June Q5
13 marks Standard +0.3
5 A researcher is investigating births of females and males in a particular species of animal which very often produces litters of 7 offspring.
The table shows some data about the number of females per litter in 200 litters of 7 offspring. The researcher thinks that a binomial distribution \(\mathrm { B } ( 7 , p )\) may be an appropriate model for these data. (c) Complete the test at the \(5 \%\) significance level. Fig. 5 shows the probability distribution \(\mathrm { B } ( 7,0.35 )\) together with the relative frequencies of the observed data (the numbers of litters each divided by 200). \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{fd496303-10f1-450e-bbeb-421ab6f4de21-5_659_1285_342_319} \captionsetup{labelformat=empty} \caption{Fig. 5}
\end{figure} (d) Comment on the result of the test completed in part (c) by considering Fig. 5.
OCR MEI Further Statistics A AS 2019 June Q6
13 marks Standard +0.3
6 A meteorologist is investigating the relationship between altitude \(x\) metres and mean annual temperature \(y ^ { \circ } \mathrm { C }\) in an American state.
She selects 12 locations at various altitudes and then stations a remote monitoring device at each of them to measure the temperature over the course of a year. Fig. 6 illustrates the data which she obtains. \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{fd496303-10f1-450e-bbeb-421ab6f4de21-6_686_1477_486_292} \captionsetup{labelformat=empty} \caption{Fig. 6}
\end{figure}
  1. Explain why it would not be appropriate to carry out a hypothesis test for correlation based on the product moment correlation coefficient.
  2. Explain why altitude has been plotted on the horizontal axis in Fig. 6. Summary statistics for \(x\) and \(y\) are as follows. $$\sum x = 21200 \quad \sum y = 105.4 \quad \sum x ^ { 2 } = 39100000 \quad \sum y ^ { 2 } = 1004 \quad \sum x y = 176090$$
  3. Calculate the equation of the regression line of \(y\) on \(x\).
  4. Use the equation of the regression line to predict the values of the mean annual temperature at each of the following altitudes.
OCR MEI Further Statistics A AS 2022 June Q6
10 marks Moderate -0.8
6 Tom has read in a newspaper that you can tell the air temperature by counting how often a cricket chirps in a period of 20 seconds. (A cricket is a type of insect.) He wants to know exactly how the temperature can be predicted. On 8 randomly selected days, when Tom can hear crickets chirping, he records the number of chirps, \(x\), made by a cricket in a 20-second interval, and also the temperature, \(y ^ { \circ } \mathrm { C }\), at that time. The data are summarised as follows. \(n = 8 \quad \sum x = 268 \quad \sum y = 141.9 \quad \sum x ^ { 2 } = 9618 \quad \sum y ^ { 2 } = 2630.55 \quad \sum \mathrm { xy } = 5009.1\) These data are illustrated below. \includegraphics[max width=\textwidth, alt={}, center]{8f1e0c68-a334-4657-823e-386ab0994c02-5_661_1035_699_242}
  1. Determine the equation of the regression line of \(y\) on \(x\). Give your answer in the form \(\mathrm { y } = \mathrm { ax } + \mathrm { b }\), giving the values of \(a\) and \(b\) correct to \(\mathbf { 3 }\) significant figures.
  2. Use the equation of the regression line to predict the temperature for the following values of \(x\).
OCR MEI Further Statistics A AS 2024 June Q4
10 marks Standard +0.3
4 A chemist is conducting an experiment in which the concentration of a certain chemical, A , is supposed to be recorded at the start of the experiment and then every 30 seconds after the start. The time after the start is denoted by \(t \mathrm {~s}\) and the concentration by \(\mathrm { z } \mathrm { mg } \mathrm { cm } ^ { - 3 }\). The collected data are shown in the table below. Note that the concentration at \(t = 90\) was not recorded.
Time, \(t\)03060120150
Concentration of A, \(z\)40.031.327.512.811.4
The chemist wishes to plot the data on a graph.
  1. Explain why \(t\) should be plotted on the horizontal axis. You are given that the summary statistics for the data are as follows. \(n = 5 \quad \sum t = 360 \quad \sum z = 123.0 \quad \sum t ^ { 2 } = 41400 \quad \sum z ^ { 2 } = 3629.74 \quad \sum \mathrm { t } = 5835\) The regression line of \(z\) on \(t\) is given by \(\mathbf { z = a + b t }\) and is used to model the concentration of chemical A for \(t \geqslant 0\).
    1. Use the summary statistics to determine the value of \(a\) and the value of \(b\).
    2. Find the value of the residual at each of the following values of \(t\).
      • \(t = 60\)
      • \(t = 120\)
        1. Use the equation of the regression line to estimate the value of the concentration at 90 seconds.
        2. With reference to your answers to part (b)(ii), comment on the reliability of your answer to part (c)(i).
      Further experiments indicate that the model is reasonably reliable for times greater than 150 seconds up to about 200 seconds.
  2. Show that the model cannot be valid beyond a time of about 200 seconds.
OCR MEI Further Statistics A AS 2020 November Q5
8 marks Moderate -0.3
5 A doctor is investigating the relationship between the levels in the blood of a particular hormone and of calcium in healthy adults. The levels of the hormone and of calcium, each measured in suitable units, are denoted by \(x\) and \(y\) respectively. The doctor selects a random sample of 14 adults and measures the hormone and calcium levels in each of them. The spreadsheet in Fig. 5 shows the values obtained, together with a scatter diagram which illustrates the data. The equation of the regression line of \(y\) on \(x\) is shown on the scatter diagram, together with the value of the square of the product moment correlation coefficient. \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{ba3fcd3c-6834-4116-be0e-d5b27aed0a7e-5_801_1644_646_255} \captionsetup{labelformat=empty} \caption{Fig. 5}
\end{figure}
  1. Use the equation of the regression line to estimate the mean calcium level of people with the following hormone levels.
OCR MEI Further Statistics A AS 2021 November Q6
11 marks Moderate -0.3
6 A health researcher is investigating the relationship between age and maximum heart rate. A commonly quoted formula states that 'maximum heart rate \(= 220\) - age in years'. The researcher wants to check if this formula is a satisfactory model for people who work in the large hospital where she is employed. The researcher selects a random sample of 20 people who work in her hospital, and measures their maximum heart rates.
  1. Explain why the researcher selects a sample, rather than using all of the people who work in the hospital. The ages, \(x\) years, and maximum heart rates, \(y\) beats per minute, of the people in the researcher's sample are summarised as follows. \(n = 20 \quad \sum x = 922 \quad \sum y = 3638 \quad \sum x ^ { 2 } = 47250 \quad \sum y ^ { 2 } = 664610 \quad \sum x y = 164998\) These data are illustrated below. \includegraphics[max width=\textwidth, alt={}, center]{5be067ff-4668-48d6-8ed2-b8dfa3e678f7-5_758_1246_1027_244}
    1. Draw the line which represents the formula 'maximum heart rate \(= 220 -\) age in years' on the copy of the scatter diagram in the Printed Answer Booklet.
    2. Comment on how well this model fits the data.
  2. Determine the equation of the regression line of maximum heart rate on age.
  3. Use the equation of the regression line to predict the values of the maximum heart rate for each of the following ages.