OCR MEI Further Statistics Major 2020 November — Question 6 10 marks

Exam BoardOCR MEI
ModuleFurther Statistics Major (Further Statistics Major)
Year2020
SessionNovember
Marks10
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicHypothesis test of Pearson’s product-moment correlation coefficient
TypeTwo-tailed test for any correlation
DifficultyStandard +0.3 This is a straightforward application of hypothesis testing for correlation with standard procedures: (a) requires a routine t-test calculation with n=60 and r=0.3231, comparing to critical values at 10% significance; (b) asks for a standard validity condition (bivariate normality). The table interpretation adds minimal complexity. Slightly above average difficulty due to being Further Maths content, but mechanically routine.
Spec5.08a Pearson correlation: calculate pmcc5.08d Hypothesis test: Pearson correlation

6 A pollution control officer is investigating a possible link between the levels of various pollutants in the air and the speed of the wind at various sites. A random sample of 60 values of the windspeed together with the levels of a variety of pollutants is taken at a particular site. The product moment correlation coefficient between wind-speed and nitrogen dioxide level is 0.3231 .
  1. Carry out a hypothesis test at the \(10 \%\) significance level to investigate whether there is any correlation between wind-speed and nitrogen dioxide level.
  2. State the condition required for the test carried out in part (a) to be valid. Table 6.1 shows the values of the product moment correlation coefficient between 5 different measures of pollution and also wind-speed for a very large random sample of values at another site. Those correlations that are significant at the \(10 \%\) level are denoted by a * after the value of the correlation. \begin{table}[h]
    CorrelationsPM10SPEED\(\mathrm { NO } _ { 2 }\)\(\mathrm { O } _ { 3 }\)PM25\(\mathrm { SO } _ { 2 }\)
    PM101.00
    SPEED0.08*1.00
    \(\mathrm { NO } _ { 2 }\)0.59*0.25*1.00
    \(\mathbf { O } _ { \mathbf { 3 } }\)-0.05*-0.04*-0.30*1.00
    PM250.85*-0.010.56*-0.021.00
    \(\mathrm { SO } _ { 2 }\)0.42*0.15*0.73*-0.63*0.40*1.00
    \captionsetup{labelformat=empty} \caption{Table 6.1}
    \end{table} \begin{table}[h]
    \captionsetup{labelformat=empty} \caption{Table 6.2 shows standard guidelines for effect sizes.}
    Product moment
    correlation coefficient
    Effect size
    0.1Small
    0.3Medium
    0.5Large
    \end{table} Table 6.2 The officer analyses these data for effect size.
  3. Explain how the very large sample size relates to the interpretation of the correlation coefficients shown in Table 6.1.
  4. Comment briefly on what the pollution control officer might conclude from these tables, relevant to her investigation into wind-speed and pollutant levels.

Question 6:
AnswerMarks Guidance
6(a) H : ρ = 0 , H : ρ ≠ 0 (two-tailed test)
0 1
where ρ is the population correlation coefficient
(between wind-speed and nitrogen dioxide level)
For n = 10, 5% critical value = 0.2144
Since 0.3231 > 0.2144 the result is significant. There is
sufficient evidence to reject H
0
There is sufficient evidence at the 10% level to suggest
that there is correlation between wind-speed and
AnswerMarks
nitrogen dioxide levelB1
B1
B1
M1
A1
AnswerMarks
[5]3.3
2.5
3.4
1.1
AnswerMarks
2.2bFor both hypotheses
For defining ρ
For critical value
For comparison leading to a
conclusion
FT for conclusion in words
AnswerMarks Guidance
6(b) The population from which the data are drawn must be
bivariate Normally distributed.B1
[1]2.5 Do not allow ‘Data must be bivariate Normally distributed’ but
condone ‘Data must be from a bivariate Normal distribution’
AnswerMarks Guidance
6(c) For very large samples, the sample correlation
coefficient is very often significant.
However as in this case those significant correlations
AnswerMarks
which are very close to zero are of very little real use.E1
E1
AnswerMarks
[2]3.5a
3.5aoe eg ‘large sample size means lower values of r can be
significant’
AnswerMarks Guidance
6(d) Although four out of five of the correlation coefficients
are statistically significant, very little use can be made
of most of them as the effect sizes are small.
However the effect size of 0.25 between wind-speed
and NO suggests that the windier it is, the higher the
2
level of NO pollution.
AnswerMarks
2E1
E1
AnswerMarks
[2]2.2b
2.2b
Question 6:
6 | (a) | H : ρ = 0 , H : ρ ≠ 0 (two-tailed test)
0 1
where ρ is the population correlation coefficient
(between wind-speed and nitrogen dioxide level)
For n = 10, 5% critical value = 0.2144
Since 0.3231 > 0.2144 the result is significant. There is
sufficient evidence to reject H
0
There is sufficient evidence at the 10% level to suggest
that there is correlation between wind-speed and
nitrogen dioxide level | B1
B1
B1
M1
A1
[5] | 3.3
2.5
3.4
1.1
2.2b | For both hypotheses
For defining ρ
For critical value
For comparison leading to a
conclusion
FT for conclusion in words
6 | (b) | The population from which the data are drawn must be
bivariate Normally distributed. | B1
[1] | 2.5 | Do not allow ‘Data must be bivariate Normally distributed’ but
condone ‘Data must be from a bivariate Normal distribution’
6 | (c) | For very large samples, the sample correlation
coefficient is very often significant.
However as in this case those significant correlations
which are very close to zero are of very little real use. | E1
E1
[2] | 3.5a
3.5a | oe eg ‘large sample size means lower values of r can be
significant’
6 | (d) | Although four out of five of the correlation coefficients
are statistically significant, very little use can be made
of most of them as the effect sizes are small.
However the effect size of 0.25 between wind-speed
and NO suggests that the windier it is, the higher the
2
level of NO pollution.
2 | E1
E1
[2] | 2.2b
2.2b
6 A pollution control officer is investigating a possible link between the levels of various pollutants in the air and the speed of the wind at various sites. A random sample of 60 values of the windspeed together with the levels of a variety of pollutants is taken at a particular site. The product moment correlation coefficient between wind-speed and nitrogen dioxide level is 0.3231 .
\begin{enumerate}[label=(\alph*)]
\item Carry out a hypothesis test at the $10 \%$ significance level to investigate whether there is any correlation between wind-speed and nitrogen dioxide level.
\item State the condition required for the test carried out in part (a) to be valid.

Table 6.1 shows the values of the product moment correlation coefficient between 5 different measures of pollution and also wind-speed for a very large random sample of values at another site. Those correlations that are significant at the $10 \%$ level are denoted by a * after the value of the correlation.

\begin{table}[h]
\begin{center}
\begin{tabular}{|l|l|l|l|l|l|l|}
\hline
Correlations & PM10 & SPEED & $\mathrm { NO } _ { 2 }$ & $\mathrm { O } _ { 3 }$ & PM25 & $\mathrm { SO } _ { 2 }$ \\
\hline
PM10 & 1.00 &  &  &  &  &  \\
\hline
SPEED & 0.08* & 1.00 &  &  &  &  \\
\hline
$\mathrm { NO } _ { 2 }$ & 0.59* & 0.25* & 1.00 &  &  &  \\
\hline
$\mathbf { O } _ { \mathbf { 3 } }$ & -0.05* & -0.04* & -0.30* & 1.00 &  &  \\
\hline
PM25 & 0.85* & -0.01 & 0.56* & -0.02 & 1.00 &  \\
\hline
$\mathrm { SO } _ { 2 }$ & 0.42* & 0.15* & 0.73* & -0.63* & 0.40* & 1.00 \\
\hline
\end{tabular}
\captionsetup{labelformat=empty}
\caption{Table 6.1}
\end{center}
\end{table}

\begin{table}[h]
\begin{center}
\captionsetup{labelformat=empty}
\caption{Table 6.2 shows standard guidelines for effect sizes.}
\begin{tabular}{ | c | c | }
\hline
\begin{tabular}{ c }
Product moment \\
correlation coefficient \\
\end{tabular} & Effect size \\
\hline
0.1 & Small \\
\hline
0.3 & Medium \\
\hline
0.5 & Large \\
\hline
\end{tabular}
\end{center}
\end{table}

Table 6.2

The officer analyses these data for effect size.
\item Explain how the very large sample size relates to the interpretation of the correlation coefficients shown in Table 6.1.
\item Comment briefly on what the pollution control officer might conclude from these tables, relevant to her investigation into wind-speed and pollutant levels.
\end{enumerate}

\hfill \mbox{\textit{OCR MEI Further Statistics Major 2020 Q6 [10]}}