| Exam Board | OCR MEI |
|---|---|
| Module | S2 (Statistics 2) |
| Year | 2007 |
| Session | January |
| Marks | 18 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Chi-squared goodness of fit |
| Type | Chi-squared test of independence |
| Difficulty | Standard +0.3 This is a standard chi-squared test for independence question with routine calculations of expected frequencies, test statistics, and hypothesis testing. Part (i) requires a complete chi-squared test with a 2×2 table (straightforward), part (ii) extends to a 2×3 table with given test statistic (minimal calculation), and part (iii) asks for interpretation. All steps are textbook procedures with no novel problem-solving required, making it slightly easier than average. |
| Spec | 5.06a Chi-squared: contingency tables5.06b Fit prescribed distribution: chi-squared test |
| \multirow{2}{*}{Observed} | Home location | ||
| \cline{2-3} | City | Non-city | |
| \multirow{2}{*}{Ambition} | Good results | 102 | 147 |
| \cline{2-3} | Other | 75 | 156 |
| \multirow{2}{*}{Observed} | Home location | |||
| \cline{2-4} | City | Town | Country | |
| \multirow{2}{*}{Ambition} | Good results | 102 | 83 | 64 |
| \cline{2-4} | Other | 75 | 64 | 92 |
\multirow{2}{*}{
| Home location | ||||
| \cline{2-4} | City | Town | Country | ||
| \multirow{2}{*}{Ambition} | Good results | 1.129 | 0.596 | 3.540 | |
| \cline{2-4} | Other | 1.217 | 0.643 | 3.816 | |
| Answer | Marks | Guidance |
|---|---|---|
| Observed | Home location | |
| City | Non-city | |
| Ambition | Good results | 102 |
| Other | 75 | 156 |
| Expected | Home location | |
| City | Non-city | |
| Ambition | Good results | 91.82 |
| Other | 85.18 | 145.82 |
| Contribution to the test statistic | Home location | |
| City | Non-city | |
| Ambition | Good results | 1.129 |
| Other | 1.217 | 0.711 |
| Answer | Marks | Guidance |
|---|---|---|
| NB if \(H_0, H_1\) reversed, or 'correlation' mentioned, do not award first B1or final B1 or final E1 | B1 in context; M1 A1 for attempt at expected values; M1 for valid attempt at (O-E)²/E; A1CAO for \(X^2\); B1 for 1 dof SOI; B1 CAO for cv; B1 dep on attempt at cv; E1 conclusion in context | 8 |
| Answer | Marks | Guidance |
|---|---|---|
| Expected Country, Other = \(231 \times 156 / 480 = 75.08\) | B1; B1 | 2 |
| Answer | Marks | Guidance |
|---|---|---|
| There is evidence to conclude that there is association between home location and ambition. | B1 for 2 dof SOI; B1 CAO for cv; E1 for conclusion in context | 3 |
| Answer | Marks | Guidance |
|---|---|---|
| 'Country' students are much less likely than city or town to have 'Results' as their main ambition. Low contributions show that city and town students do not appear to differ markedly in their ambitions. | E1 for correct obs" for 'Country'; E1 for additional correct observation (must refer to contributions) | 2 |
| Answer | Marks | Guidance |
|---|---|---|
| Conclusion in (i) is valid if only categorizing home location into city and non-city. However if non-city is subdivided into town and country this additional subdivision gives the data more precision and allows the relationship in part (ii) (C) to be revealed. | E1; E1 | 2 |
## (i)
$H_0:$ no association between ambition and home location;
$H_1:$ some association between ambition and home location;
| Observed | Home location | | |
|----------|---|---|
| | City | Non-city |
| Ambition | Good results | 102 | 147 |
| | Other | 75 | 156 |
| Expected | Home location | | |
|----------|---|---|
| | City | Non-city |
| Ambition | Good results | 91.82 | 157.18 |
| | Other | 85.18 | 145.82 |
| Contribution to the test statistic | Home location | | |
|----------|---|---|
| | City | Non-city |
| Ambition | Good results | 1.129 | 0.659 |
| | Other | 1.217 | 0.711 |
$X^2 = 3.716$, Refer to $\chi_1^2$
Critical value at 5% level = 3.841
Result is not significant
There is insufficient evidence to conclude that there is any association between home location and ambition.
NB if $H_0, H_1$ reversed, or 'correlation' mentioned, do not award first B1or final B1 or final E1 | B1 in context; M1 A1 for attempt at expected values; M1 for valid attempt at (O-E)²/E; A1CAO for $X^2$; B1 for 1 dof SOI; B1 CAO for cv; B1 dep on attempt at cv; E1 conclusion in context | 8
## (ii)(A)
Expected Country, Results = $249 \times 156 / 480 = 80.93$
Expected Country, Other = $231 \times 156 / 480 = 75.08$ | B1; B1 | 2
## (ii)(B)
Refer to $\chi_2^2$
Critical value at 5% level = 5.991
Result is significant
There is evidence to conclude that there is association between home location and ambition. | B1 for 2 dof SOI; B1 CAO for cv; E1 for conclusion in context | 3
## (ii)(C)
'Country' students are much less likely than city or town to have 'Results' as their main ambition. Low contributions show that city and town students do not appear to differ markedly in their ambitions. | E1 for correct obs" for 'Country'; E1 for additional correct observation (must refer to contributions) | 2
## (iii)
Conclusion in (i) is valid if only categorizing home location into city and non-city. However if non-city is subdivided into town and country this additional subdivision gives the data more precision and allows the relationship in part (ii) (C) to be revealed. | E1; E1 | 2
---
**Total: 18 marks**
Two educational researchers are investigating the relationship between personal ambitions and home location of students. The researchers classify students into those whose main personal ambition is good academic results and those who have some other ambition. A random sample of 480 students is selected.
\begin{enumerate}[label=(\roman*)]
\item One researcher summarises the data as follows.
\begin{center}
\begin{tabular}{|c|c|c|c|}
\hline
\multirow{2}{*}{\textbf{Observed}} & \multicolumn{2}{|c|}{\textbf{Home location}} \\
\cline{2-3}
& City & Non-city \\
\hline
\multirow{2}{*}{Ambition} & Good results & 102 & 147 \\
\cline{2-3}
& Other & 75 & 156 \\
\hline
\end{tabular}
\end{center}
Carry out a test at the 5\% significance level to examine whether there is any association between home location and ambition. State carefully your null and alternative hypotheses. Your working should include a table showing the contributions of each cell to the test statistic. [9]
\item The other researcher summarises the same data in a different way as follows.
\begin{center}
\begin{tabular}{|c|c|c|c|c|}
\hline
\multirow{2}{*}{\textbf{Observed}} & \multicolumn{3}{|c|}{\textbf{Home location}} \\
\cline{2-4}
& City & Town & Country \\
\hline
\multirow{2}{*}{Ambition} & Good results & 102 & 83 & 64 \\
\cline{2-4}
& Other & 75 & 64 & 92 \\
\hline
\end{tabular}
\end{center}
\begin{enumerate}[label=(\Alph*)]
\item Calculate the expected frequencies for both 'Country' cells. [2]
\item The test statistic for these data is 10.94. Carry out a test at the 5\% level based on this table, using the same hypotheses as in part (i). [3]
\item The table below gives the contribution of each cell to the test statistic. Discuss briefly how personal ambitions are related to home location. [2]
\begin{center}
\begin{tabular}{|c|c|c|c|c|}
\hline
\multirow{2}{*}{\begin{tabular}{c}\textbf{Contribution to the}\\\textbf{test statistic}\end{tabular}} & \multicolumn{3}{|c|}{\textbf{Home location}} \\
\cline{2-4}
& City & Town & Country \\
\hline
\multirow{2}{*}{Ambition} & Good results & 1.129 & 0.596 & 3.540 \\
\cline{2-4}
& Other & 1.217 & 0.643 & 3.816 \\
\hline
\end{tabular}
\end{center}
\end{enumerate}
\item Comment briefly on whether the analysis in part (ii) means that the conclusion in part (i) is invalid. [2]
\end{enumerate}
\hfill \mbox{\textit{OCR MEI S2 2007 Q4 [18]}}