OCR PURE — Question 11 6 marks

Exam BoardOCR
ModulePURE
Marks6
PaperDownload PDF ↗
TopicBivariate data
TypeInterpret census or real-world data
DifficultyModerate -0.5 This question tests interpretation of scatter diagrams with real-world census data, requiring conceptual understanding rather than calculation. While it involves multiple parts and requires careful reasoning about causation vs correlation and identifying patterns, the individual components are straightforward: explaining why age groups are compared (they're the same cohort 10 years later), interpreting y=x (population stability), understanding that these ARE causally linked (same people aged), and identifying outliers (universities attract 18-19 year olds). This is easier than average A-level as it requires no mathematical computation, only data interpretation skills.
Spec2.02c Scatter diagrams and regression lines2.02e Correlation does not imply causation

11 A student is investigating changes in the number of residents in Local Authorities in the SouthEast Region between 2001 and 2011. The scatter diagram shows the number \(x\) of residents in these Local Authorities in the age group 8 to 9 in 2001 and the number \(y\) of residents in the same Local Authorities in the age group 18 to 19 in 2011.
  1. Suggest a reason why the student is comparing these two age groups in 2001 and 2011. The student notices that most of the data points are close to the line \(y = x\).
    1. Explain what this suggests about the residents in these Local Authorities.
    2. The student says that correlation does not imply causation, so there is no causal link between the values of \(x\) and the values of \(y\). Explain whether or not they are correct.
  2. Some of these Local Authorities contain universities.
    1. On the diagram in the Printed Answer Booklet, circle three points that are likely to represent Local Authorities containing universities.
    2. Give a reason for your choice of points in part (c)(i). Assume that the proportion of residents in age group 8 to 9 in 2001 was roughly the same in each Local Authority in the South-East. The Local Authority in this region with the largest population is Medway.
  3. On the diagram in the Printed Answer Booklet, label clearly with the letter \(M\) the point that corresponds to Medway.

Question 11:
Part (a):
AnswerMarks Guidance
Those in age group 8 to 9 in 2001 will be in age group 18 to 19 in 2011B1, [1] or e.g. "Same people". May be implied
Part (b)(i):
AnswerMarks Guidance
Few people joined (or left) this group, or "Not much change in this group" or "Most have lived there since 2001"B1, [1] LAs with a high number of 8-9s in 2001 have a high number of 18-19s in 2011. Most have stayed in the same area since childhood.
Part (b)(ii):
AnswerMarks Guidance
(No) The nos of 18-19s in 2011 is (largely) caused by the nos of 8-9s in 2001, (but this cannot be inferred from the correlation).B1, [1] (No) There is a link. They are the same people. Must state or imply that there is a link between \(x\) and \(y\) in this context.
Part (c)(i):
AnswerMarks Guidance
3 (or 4 or 5) of the highest five \(y\)-valuesB1, [1] Must not include right-most point
Part (c)(ii):
AnswerMarks Guidance
In these LAs, the no. of 18-19s in 2011 is more than the no. of 8-9s in 2001. In these LAs there are large numbers of 18+B1, [1] e.g. the no. of people in this group in 2011 is more than it was in 2001. Students are aged 18+ so they cause an increase in this age group. Allow numerical example. May be implied. Not just "These are outliers" oe
Part (d):
AnswerMarks
Right most point indicatedB1, [1]
# Question 11:

## Part (a):
Those in age group 8 to 9 in 2001 will be in age group 18 to 19 in 2011 | B1, [1] | or e.g. "Same people". May be implied

## Part (b)(i):
Few people joined (or left) this group, or "Not much change in this group" or "Most have lived there since 2001" | B1, [1] | LAs with a high number of 8-9s in 2001 have a high number of 18-19s in 2011. Most have stayed in the same area since childhood.

## Part (b)(ii):
(No) The nos of 18-19s in 2011 is (largely) caused by the nos of 8-9s in 2001, (but this cannot be inferred from the correlation). | B1, [1] | (No) There is a link. They are the same people. Must state or imply that there is a link between $x$ and $y$ in this context.

## Part (c)(i):
3 (or 4 or 5) of the highest five $y$-values | B1, [1] | Must not include right-most point

## Part (c)(ii):
In these LAs, the no. of 18-19s in 2011 is more than the no. of 8-9s in 2001. In these LAs there are large numbers of 18+ | B1, [1] | e.g. the no. of people in this group in 2011 is more than it was in 2001. Students are aged 18+ so they cause an increase in this age group. Allow numerical example. May be implied. Not just "These are outliers" oe

## Part (d):
Right most point indicated | B1, [1] |

---
11 A student is investigating changes in the number of residents in Local Authorities in the SouthEast Region between 2001 and 2011. The scatter diagram shows the number $x$ of residents in these Local Authorities in the age group 8 to 9 in 2001 and the number $y$ of residents in the same Local Authorities in the age group 18 to 19 in 2011.\\
\begin{tikzpicture}[
    x=0.0012cm,
    y=0.0010cm,
    font=\rmfamily
]

% 1. Background Grid Lines
\foreach \x in {2000, 4000, 6000, 8000} {
    \draw[thin, black] (\x, 0) -- (\x, 12000);
}
\foreach \y in {2000, 4000, 6000, 8000, 10000, 12000} {
    \draw[thin, black] (0, \y) -- (8000, \y);
}

% 2. Axes Lines
\draw[-latex, thin] (0, 0) -- (8800, 0);
\draw[-latex, thin] (0, 0) -- (0, 12800);

% 3. Tick Marks and Values
\foreach \x in {0, 2000, 4000, 6000, 8000} {
    \draw (\x, 0) -- ++(0, -150) node[below] {\x};
}
\foreach \y in {2000, 4000, 6000, 8000, 10000, 12000} {
    \draw (0, \y) -- ++(-150, 0) node[left] {\y};
}

% 4. Axis Labels
\node[anchor=north] at (4000, -1200) {Number of residents in age group 8 to 9 in 2001};
\node[anchor=east, align=left] at (-1800, 6000) {Number of \\ residents in \\ age group \\ 18 to 19 \\ in 2011};

% 5. Data Points
\foreach \Point in {
    (1450,1200), (1650,1300), (1600,1700), (1900,1650), (1950,1800),
    (2000,1750), (2000,1950), (2200,1700), (2400,1750), (2400,1900),
    (2100,2600), (2200,2100), (2300,2300), (2400,2100), (2400,2200),
    (2500,2250), (2550,2450), (2550,2850), (2600,2100), (2600,3400),
    (2700,2150), (2700,2300), (2750,2500), (2750,2700), (2800,2450),
    (2850,2300), (2850,2650), (2900,2600), (2900,2850), (3000,2200),
    (3000,2500), (3100,2600), (3100,2900), (3150,2550), (3150,2700),
    (3200,2400), (3200,2700), (3200,3150), (3300,2300), (3300,2650),
    (3350,2700), (3400,2700), (3400,3100), (3450,3350), (3500,3100),
    (3550,3200), (3550,3350), (3600,3100), (3600,3300), (3950,3200),
    (4100,3500), (4150,3300), (4400,4100), (4500,3700),
    (1800,2900), (3050,4450), (3400,4850), (6250,5150),
    (2800,8300), (3200,7900), (4600,9000), (5100,9800), (5400,10600),
    (7400,8100)
} {
    \fill \Point circle (2.0pt);
}

\end{tikzpicture}
\begin{enumerate}[label=(\alph*)]
\item Suggest a reason why the student is comparing these two age groups in 2001 and 2011.

The student notices that most of the data points are close to the line $y = x$.
\item \begin{enumerate}[label=(\roman*)]
\item Explain what this suggests about the residents in these Local Authorities.
\item The student says that correlation does not imply causation, so there is no causal link between the values of $x$ and the values of $y$.

Explain whether or not they are correct.
\end{enumerate}\item Some of these Local Authorities contain universities.
\begin{enumerate}[label=(\roman*)]
\item On the diagram in the Printed Answer Booklet, circle three points that are likely to represent Local Authorities containing universities.
\item Give a reason for your choice of points in part (c)(i).

Assume that the proportion of residents in age group 8 to 9 in 2001 was roughly the same in each Local Authority in the South-East. The Local Authority in this region with the largest population is Medway.
\end{enumerate}\item On the diagram in the Printed Answer Booklet, label clearly with the letter $M$ the point that corresponds to Medway.
\end{enumerate}

\hfill \mbox{\textit{OCR PURE  Q11 [6]}}