OCR MEI Further Statistics Minor 2020 November — Question 5 17 marks

Exam BoardOCR MEI
ModuleFurther Statistics Minor (Further Statistics Minor)
Year2020
SessionNovember
Marks17
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicHypothesis test of Spearman’s rank correlation coefficien
TypeHypothesis test for association
DifficultyModerate -0.3 This is a straightforward application of standard hypothesis testing procedures for Spearman's rank correlation. Parts (a)-(c) involve recognizing when to use Spearman's over Pearson's (routine), calculating the coefficient (mechanical with small n=8), and conducting a standard significance test. Parts (d)-(f) are basic regression calculations using a calculator. All steps are textbook procedures with no novel problem-solving required, making it slightly easier than average.
Spec5.08e Spearman rank correlation5.08f Hypothesis test: Spearman rank5.08g Compare: Pearson vs Spearman

5 A student is investigating immunisation. He wonders if there is any relationship between the percentage of young children who have been given measles vaccine and the percentage who have been given BCG vaccine in various countries. He takes a random sample of 8 countries and finds the data for the two variables. The spreadsheet in Fig. 5.1 shows the values obtained, together with a scatter diagram which illustrates the data. \begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{882f9f3c-40d8-4abb-822a-49bd505a33ea-5_910_1653_541_246} \captionsetup{labelformat=empty} \caption{Fig. 5.1}
\end{figure}
  1. The student decides that a test based on Pearson's product moment correlation coefficient is not valid. Explain why he comes to this conclusion. The student carries out a test based on Spearman's rank correlation coefficient.
  2. Calculate the value of Spearman's rank correlation coefficient.
  3. Carry out a test based on this coefficient at the \(5 \%\) significance level to investigate whether there is any association between measles and BCG vaccination levels. The student then decides to investigate the relationship between number of doctors per 1000 people in a country and unemployment rate in that country (unemployment rate is the percentage of the working age population who are not in paid work). He selects a random sample of 6 countries. The spreadsheet in Fig. 5.2 shows the values obtained, together with a scatter diagram which illustrates the data. \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{882f9f3c-40d8-4abb-822a-49bd505a33ea-6_776_1649_495_248} \captionsetup{labelformat=empty} \caption{Fig. 5.2}
    \end{figure}
  4. Use your calculator to write down the equation of the regression line of unemployment rate on doctors per 1000.
  5. Use the regression line to estimate the unemployment rate for a country with 2.00 doctors per 1000.
  6. Comment briefly on the reliability of your answer to part (e). The student decides to add the data for another country with 3.99 doctors per 1000 and unemployment rate 11.42 to his diagram.
  7. Add this point to the scatter diagram in the Printed Answer Booklet.
  8. Without doing any further calculations, comment on what difference, if any, including this extra data point would make to the usefulness of a regression line of unemployment rate on doctors per 1000.

Question 5:
AnswerMarks Guidance
5(a) Because (the grouping of points on) the scatter diagram
does not appear to be elliptical
AnswerMarks
due to the outlier/anomalous pointE1
E1
AnswerMarks Guidance
[2]3.5a
2.4Or: not bivariate Normal
5(b) Rank BCG 1 3 6 2 7 5 4 8
Rank Measles 1 3 4 2 5 7 6 8
Evaluates soi
Spearman’s ra2nk coeff = (= 0.8095… or 0.81)
∑𝑑𝑑 (= 16)
AnswerMarks
17M1
M1
A1
AnswerMarks
[3]1.1
1.1
AnswerMarks
1.1For ranking BCG & Measles
(may be reversed or implied from d
values)
AnswerMarks
BCUse of percentage values
is M0 M0
AnswerMarks Guidance
5(c) 21
H : There is no association between (vaccination rates
0
for) BCG and measles in the population
H : There is some association between (vaccination
1
rates for) BCG and measles in the population
For n = 8, 5% critical value is (±)0.7381
0.8095 > 0.7381 Reject H oe
0
There is sufficient evidence to suggest that there is
some association between (vaccination rates for) BCG
AnswerMarks
and measles (in the population)B1
B1
B1
M1
A1
AnswerMarks
[5]3.3
1.2
3.4
1.1
AnswerMarks
2.2bNeed to see population in one or
other of the hypotheses
For comparison with sensible critical
AnswerMarks Guidance
value,r < 1 and conclusion correct
s
FT their r
s
AnswerMarks
Not too assertive and includes contextSC if outlier excluded in
(b) may still score 5/5
For n = 7, c.v. is 0.7857
0.7143 < 0.7857
Do not reject H
0
Insufficient evidence…
AnswerMarks Guidance
5(d) y = -4.80x + 24.49 awrt
x = doctors, y = unemploymentB1
B1
AnswerMarks
[2]1.1
1.1BC
Variables defined (other letters may
be used)
AnswerMarks Guidance
5(e) 14.89 [14.8, 15.0]
[1]1.1
5(f) Not reliable since extrapolation oe
[1]2.4
5(g) Data point (3.99, 11.42) plotted correctly
[1]1.1
5(h) The fit would be worse.
The regression line might no longer be valid due to this
AnswerMarks
point which is perhaps an outlier.E1
E1
AnswerMarks Guidance
[2]2.2b
2.4may refer to residuals or correlation
Rank BCG1 3
Rank Measles1 3
Question 5:
5 | (a) | Because (the grouping of points on) the scatter diagram
does not appear to be elliptical
due to the outlier/anomalous point | E1
E1
[2] | 3.5a
2.4 | Or: not bivariate Normal
5 | (b) | Rank BCG 1 3 6 2 7 5 4 8
Rank Measles 1 3 4 2 5 7 6 8
Evaluates soi
Spearman’s ra2nk coeff = (= 0.8095… or 0.81)
∑𝑑𝑑 (= 16)
17 | M1
M1
A1
[3] | 1.1
1.1
1.1 | For ranking BCG & Measles
(may be reversed or implied from d
values)
BC | Use of percentage values
is M0 M0
5 | (c) | 21
H : There is no association between (vaccination rates
0
for) BCG and measles in the population
H : There is some association between (vaccination
1
rates for) BCG and measles in the population
For n = 8, 5% critical value is (±)0.7381
0.8095 > 0.7381 Reject H oe
0
There is sufficient evidence to suggest that there is
some association between (vaccination rates for) BCG
and measles (in the population) | B1
B1
B1
M1
A1
[5] | 3.3
1.2
3.4
1.1
2.2b | Need to see population in one or
other of the hypotheses
For comparison with sensible critical
value, |r| < 1 and conclusion correct
s
FT their r
s
Not too assertive and includes context | SC if outlier excluded in
(b) may still score 5/5
For n = 7, c.v. is 0.7857
0.7143 < 0.7857
Do not reject H
0
Insufficient evidence…
5 | (d) | y = -4.80x + 24.49 awrt
x = doctors, y = unemployment | B1
B1
[2] | 1.1
1.1 | BC
Variables defined (other letters may
be used)
5 | (e) | 14.89 [14.8, 15.0] | B1
[1] | 1.1
5 | (f) | Not reliable since extrapolation oe | E1
[1] | 2.4
5 | (g) | Data point (3.99, 11.42) plotted correctly | B1
[1] | 1.1
5 | (h) | The fit would be worse.
The regression line might no longer be valid due to this
point which is perhaps an outlier. | E1
E1
[2] | 2.2b
2.4 | may refer to residuals or correlation
Rank BCG | 1 | 3 | 6 | 2 | 7 | 5 | 4 | 8
Rank Measles | 1 | 3 | 4 | 2 | 5 | 7 | 6 | 8
5 A student is investigating immunisation. He wonders if there is any relationship between the percentage of young children who have been given measles vaccine and the percentage who have been given BCG vaccine in various countries.

He takes a random sample of 8 countries and finds the data for the two variables. The spreadsheet in Fig. 5.1 shows the values obtained, together with a scatter diagram which illustrates the data.

\begin{figure}[h]
\begin{center}
  \includegraphics[alt={},max width=\textwidth]{882f9f3c-40d8-4abb-822a-49bd505a33ea-5_910_1653_541_246}
\captionsetup{labelformat=empty}
\caption{Fig. 5.1}
\end{center}
\end{figure}
\begin{enumerate}[label=(\alph*)]
\item The student decides that a test based on Pearson's product moment correlation coefficient is not valid. Explain why he comes to this conclusion.

The student carries out a test based on Spearman's rank correlation coefficient.
\item Calculate the value of Spearman's rank correlation coefficient.
\item Carry out a test based on this coefficient at the $5 \%$ significance level to investigate whether there is any association between measles and BCG vaccination levels.

The student then decides to investigate the relationship between number of doctors per 1000 people in a country and unemployment rate in that country (unemployment rate is the percentage of the working age population who are not in paid work). He selects a random sample of 6 countries. The spreadsheet in Fig. 5.2 shows the values obtained, together with a scatter diagram which illustrates the data.

\begin{figure}[h]
\begin{center}
  \includegraphics[alt={},max width=\textwidth]{882f9f3c-40d8-4abb-822a-49bd505a33ea-6_776_1649_495_248}
\captionsetup{labelformat=empty}
\caption{Fig. 5.2}
\end{center}
\end{figure}
\item Use your calculator to write down the equation of the regression line of unemployment rate on doctors per 1000.
\item Use the regression line to estimate the unemployment rate for a country with 2.00 doctors per 1000.
\item Comment briefly on the reliability of your answer to part (e).

The student decides to add the data for another country with 3.99 doctors per 1000 and unemployment rate 11.42 to his diagram.
\item Add this point to the scatter diagram in the Printed Answer Booklet.
\item Without doing any further calculations, comment on what difference, if any, including this extra data point would make to the usefulness of a regression line of unemployment rate on doctors per 1000.
\end{enumerate}

\hfill \mbox{\textit{OCR MEI Further Statistics Minor 2020 Q5 [17]}}