OCR MEI Further Statistics Major Specimen — Question 9 13 marks

Exam BoardOCR MEI
ModuleFurther Statistics Major (Further Statistics Major)
SessionSpecimen
Marks13
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicChi-squared goodness of fit
TypeSpreadsheet-based chi-squared test
DifficultyStandard +0.3 This is a straightforward chi-squared test of independence with standard spreadsheet interpretation. Students must identify the test, state hypotheses, calculate missing expected frequencies and contributions using given formulas, and complete the test using tables. All steps are routine applications of the chi-squared procedure taught in Further Statistics, requiring no novel insight or complex reasoning.
Spec5.06a Chi-squared: contingency tables

9 A random sample of adults in the UK were asked to state their primary source of news: television (T), internet (I), newspapers (N) or radio (R). The responses were classified by age group, and an analysis was carried out to see if there is any association between age group and primary source of news. Fig. 9 is a screenshot showing part of the spreadsheet used to analyse the data. Some values in the spreadsheet have been deliberately omitted. \begin{table}[h]
ABCDEF
1SourceAge group
2of news18-3233-4748-6465+
3T63617180275
4I33332212100
5N98112048
6R499527
7109111113117450
8
9Expected frequencies
1066.6167.8369.0671.50
1124.2224.6726.00
1211.6311.8412.0512.48
136.546.666.787.02
14
15Contributions to the test statistic
160.200.690.051.01
173.182.827.54
180.590.094.53
190.990.820.730.58
20test statistic25.45
\captionsetup{labelformat=empty} \caption{Fig. 9}
\end{table}
  1. (A) State the sample size.
    (B) Give the name of the appropriate hypothesis test.
    (C) State the null and alternative hypotheses.
  2. Showing your calculations, find the missing values in cells

Question 9:
AnswerMarks Guidance
9(i) (A)
[1]2.2a
9(i) (B)
[1]1.2
9(i) (C)
0
source
H : some association between age and news
1
AnswerMarks Guidance
sourceB1
[1]2.5 N
9(ii) 113
D11= (cid:117)100
450
=25.11
(cid:11)8(cid:16)11.84(cid:12)2
C18 = (cid:32)1.25
11.84
(cid:11)22(cid:16)25.11(cid:12)2
D17 = (cid:32)0.39
AnswerMarks
25.11M1
A1
M1
A1
E
AnswerMarks
[4]3.4
1.1
I1.1
AnswerMarks
C1.1E
M
(cid:11)O(cid:16)E(cid:12)2
M1 for applied at
E
least once
A1 for both correct: accept
1.245, 0.385
(NB one can be calculated by
subtraction)
AnswerMarks Guidance
9(iii) P
Degrees of freedom = 9
S
Critical value = 16.92
Test statistic = 25.45
25.45 > 16.92 so reject H
0
There is sufficient evidence to suggest that
there is some association between age and
AnswerMarks
primary news sourceB1
B1
M1
A1
AnswerMarks
[4]3.3
1.1
2.2b
3.5a
AnswerMarks Guidance
9(iv) For age group 18-32 and 33-47, the
contributions of 3.18 and 2.82 show that
more than expected have primary source the
internet
For age group 65+, the contributions of 7.54
and 4.53 show that fewer than expected have
primary source the internet and more than
expected have primary source newspapers.
For age group 48 - 64 the contributions show
AnswerMarks
that primary sources are as expectedE1
E1
E1
AnswerMarks
[3]3.5a
3.5a
AnswerMarks
3.5aN
E
Allow other suitable answers.
M
Max 2 out of 3 if numerical
values of contributions to test
statistic not mentioned
Question 9:
9 | (i) | (A) | Sample size = 450 | B1
[1] | 2.2a
9 | (i) | (B) | Chi-squared test [for a contingency table] | B1
[1] | 1.2
9 | (i) | (C) | H : no association between age and news
0
source
H : some association between age and news
1
source | B1
[1] | 2.5 | N
9 | (ii) | 113
D11= (cid:117)100
450
=25.11
(cid:11)8(cid:16)11.84(cid:12)2
C18 = (cid:32)1.25
11.84
(cid:11)22(cid:16)25.11(cid:12)2
D17 = (cid:32)0.39
25.11 | M1
A1
M1
A1
E
[4] | 3.4
1.1
I1.1
C1.1 | E
M
(cid:11)O(cid:16)E(cid:12)2
M1 for applied at
E
least once
A1 for both correct: accept
1.245, 0.385
(NB one can be calculated by
subtraction)
9 | (iii) | P
Degrees of freedom = 9
S
Critical value = 16.92
Test statistic = 25.45
25.45 > 16.92 so reject H
0
There is sufficient evidence to suggest that
there is some association between age and
primary news source | B1
B1
M1
A1
[4] | 3.3
1.1
2.2b
3.5a
9 | (iv) | For age group 18-32 and 33-47, the
contributions of 3.18 and 2.82 show that
more than expected have primary source the
internet
For age group 65+, the contributions of 7.54
and 4.53 show that fewer than expected have
primary source the internet and more than
expected have primary source newspapers.
For age group 48 - 64 the contributions show
that primary sources are as expected | E1
E1
E1
[3] | 3.5a
3.5a
3.5a | N
E
Allow other suitable answers.
M
Max 2 out of 3 if numerical
values of contributions to test
statistic not mentioned
9 A random sample of adults in the UK were asked to state their primary source of news: television (T), internet (I), newspapers (N) or radio (R). The responses were classified by age group, and an analysis was carried out to see if there is any association between age group and primary source of news.

Fig. 9 is a screenshot showing part of the spreadsheet used to analyse the data. Some values in the spreadsheet have been deliberately omitted.

\begin{table}[h]
\begin{center}
\begin{tabular}{|l|l|l|l|l|l|l|}
\hline
 & A & B & C & D & E & F \\
\hline
1 & Source & \multicolumn{4}{|c|}{Age group} &  \\
\hline
2 & of news & 18-32 & 33-47 & 48-64 & 65+ &  \\
\hline
3 & T & 63 & 61 & 71 & 80 & 275 \\
\hline
4 & I & 33 & 33 & 22 & 12 & 100 \\
\hline
5 & N & 9 & 8 & 11 & 20 & 48 \\
\hline
6 & R & 4 & 9 & 9 & 5 & 27 \\
\hline
7 &  & 109 & 111 & 113 & 117 & 450 \\
\hline
8 &  &  &  &  &  &  \\
\hline
9 &  & \multicolumn{4}{|l|}{Expected frequencies} &  \\
\hline
10 &  & 66.61 & 67.83 & 69.06 & 71.50 &  \\
\hline
11 &  & 24.22 & 24.67 &  & 26.00 &  \\
\hline
12 &  & 11.63 & 11.84 & 12.05 & 12.48 &  \\
\hline
13 &  & 6.54 & 6.66 & 6.78 & 7.02 &  \\
\hline
14 &  &  &  &  &  &  \\
\hline
15 & \multicolumn{5}{|c|}{Contributions to the test statistic} &  \\
\hline
16 &  & 0.20 & 0.69 & 0.05 & 1.01 &  \\
\hline
17 &  & 3.18 & 2.82 &  & 7.54 &  \\
\hline
18 &  & 0.59 &  & 0.09 & 4.53 &  \\
\hline
19 &  & 0.99 & 0.82 & 0.73 & 0.58 &  \\
\hline
20 & \multicolumn{5}{|r|}{test statistic} & 25.45 \\
\hline
\end{tabular}
\captionsetup{labelformat=empty}
\caption{Fig. 9}
\end{center}
\end{table}
\begin{enumerate}[label=(\roman*)]
\item (A) State the sample size.\\
(B) Give the name of the appropriate hypothesis test.\\
(C) State the null and alternative hypotheses.
\item Showing your calculations, find the missing values in cells

\begin{itemize}
  \item D11,
  \item D17 and
  \item C18.
\item Complete the appropriate hypothesis test at the $5 \%$ level of significance.
\item Discuss briefly what the data suggest about primary source of news. You should make a comment for each age group.
\end{itemize}
\end{enumerate}

\hfill \mbox{\textit{OCR MEI Further Statistics Major  Q9 [13]}}