9 It is thought that the pH value of sand (a measure of the sand's acidity) may affect the extent to which a particular species of plant will grow in that sand. A botanist wished to determine whether there was any correlation between the pH value of the sand on certain sand dunes, and the amount of each of two plant species growing there. She chose random sections of equal area on each of eight sand dunes and measured the pH values. She then measured the area within each section that was covered by each of the two species. The results were as follows.
| \cline { 2 - 10 }
\multicolumn{1}{c|}{} | Dune | \(A\) | \(B\) | \(C\) | \(D\) | \(E\) | \(F\) | \(G\) | \(H\) |
| \cline { 2 - 10 }
\multicolumn{1}{c|}{} | pH value, \(x\) | 8.5 | 8.5 | 9.5 | 8.5 | 6.5 | 7.5 | 8.5 | 9.0 |
\multirow{2}{*}{| Area, \(y \mathrm {~cm} ^ { 2 }\) | | covered | } | Species \(P\) | 150 | 150 | 575 | 330 | 45 | 15 | 340 | 330 |
| \cline { 2 - 10 } | Species \(Q\) | 170 | 15 | 80 | 230 | 75 | 25 | 0 | 0 |
The results for species \(P\) can be summarised by
$$n = 8 , \quad \Sigma x = 66.5 , \quad \Sigma x ^ { 2 } = 558.75 , \quad \Sigma y = 1935 , \quad \Sigma y ^ { 2 } = 711275 , \quad \Sigma x y = 17082.5 .$$
- Give a reason why it might be appropriate to calculate the equation of the regression line of \(y\) on \(x\) rather than \(x\) on \(y\) in this situation.
- Calculate the equation of the regression line of \(y\) on \(x\) for species \(P\), in the form \(y = a + b x\), giving the values of \(a\) and \(b\) correct to 3 significant figures.
- Estimate the value of \(y\) for species \(P\) on sand where the pH value is 7.0 .
The values of the product moment correlation coefficient between \(x\) and \(y\) for species \(P\) and \(Q\) are \(r _ { P } = 0.828\) and \(r _ { Q } = 0.0302\).
- Describe the relationship between the area covered by species \(Q\) and the pH value.
- State, with a reason, whether the regression line of \(y\) on \(x\) for species \(P\) will provide a reliable estimate of the value of \(y\) when the pH value is
(a) 8,
(b) 4 . - Assume that the equation of the regression line of \(y\) on \(x\) for species \(Q\) is also known. State, with a reason, whether this line will provide a reliable estimate of the value of \(y\) when the pH value is 8 .