| Exam Board | Edexcel |
|---|---|
| Module | Paper 3 (Paper 3) |
| Year | 2021 |
| Session | October |
| Marks | 6 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Hypothesis test of Pearson’s product-moment correlation coefficient |
| Type | Describe correlation from scatter diagram |
| Difficulty | Standard +0.3 This is a straightforward hypothesis testing question on correlation that follows a standard template. Parts (a) and (b) require basic interpretation of a scatter diagram, part (c) is calculator work, and part (d) is a routine one-tailed hypothesis test with clearly defined steps. While it's a Further Maths Statistics topic, the execution is mechanical with no novel problem-solving required, making it slightly easier than average overall. |
| Spec | 2.02d Informal interpretation of correlation2.05g Hypothesis test using Pearson's r |
| \(x\) | 3 | 6 | 8 | 7 | 5 | 3 | 11 | 3 | 4 | 5 | 4 | 9 | 7 | 10 | 6 | 6 |
| \(y\) | 7 | 7 | 4 | 4 | 6 | 8 | 5 | 5 | 8 | 4 | 7 | 4 | 5 | 5 | 6 | 3 |
| Answer | Marks | Guidance |
|---|---|---|
| Part | Answer/Working | Marks |
| (a) | Negative | B1 |
| (b) | Marc's suggestion is compatible because it's negative correlation | B1 |
| (c) | \((r =) -0.54458266...\) awrt \(\mathbf{-0.545}\) | B1 |
| (d) | \(H_0: \rho = 0\) \(H_1: \rho < 0\) | B1 |
| \([5\% \text{ 1-tail cv} =] (\pm) 0.4259\) (significant result / reject \(H_0\)) There is evidence of negative correlation between the number of letters in (or length of) a student's last name and their first name | M1, A1 | M1 for a critical value compatible with their \(H_1\): 1-tail: awrt \(\pm 0.426\) (condone \(\pm 0.425\)) or 2-tail (B0 scored for \(H_1\)) : awrt \(\pm 0.497\). If hypotheses are in words and can deduce whether one or two-tail then use their words. If no hypotheses or their \(H_1\) is not clearly one or two tail assume one-tail. A1 for compatible signs between cv and \(r\) and a correct conclusion in context mentioning correlation and number of letters or length and name (if their value from (c)). Do NOT award this A mark if contradictory statements or working seen e.g. "accept \(H_0\)" or comparison of 0.426 with significance level of 0.05 etc. |
| Part | Answer/Working | Marks | Guidance |
|------|---|---|---|
| (a) | Negative | B1 | B1 for "negative". Allow "slight" or "weak" etc. Allow a description e.g. "as $x$ increases $y$ decreases" or in context e.g. "people with longer last names tend to have shorter first names". A comment of "negative skew" is B0. **Need to see distinct or separate responses for (a) and (b)** |
| (b) | Marc's suggestion is compatible because it's negative correlation | B1 | B1 for a comment that suggests data is compatible with the suggestion and a suitable reason such as "there is negative correlation" or a description in $x$ and $y$ or in context, or the points lie close to a line with negative gradient, or draw line $y = x$ and state that more points below the line so supports (or is compatible with) his suggestion. A reason based on just a **single point** is B0. e.g. "11 letters in last name has only 5 in first name" |
| (c) | $(r =) -0.54458266...$ awrt $\mathbf{-0.545}$ | B1 | B1 for awrt $-0.545$ |
| (d) | $H_0: \rho = 0$ $H_1: \rho < 0$ | B1 | B1 for both hypotheses correct in terms of $\rho$ |
| | $[5\% \text{ 1-tail cv} =] (\pm) 0.4259$ (significant result / reject $H_0$) There is evidence of negative correlation between the number of letters in (or length of) a student's last name and their first name | M1, A1 | M1 for a critical value compatible with their $H_1$: 1-tail: awrt $\pm 0.426$ (condone $\pm 0.425$) or 2-tail (B0 scored for $H_1$) : awrt $\pm 0.497$. If hypotheses are in words and can deduce whether one or two-tail then use their words. If no hypotheses or their $H_1$ is not clearly one or two tail assume one-tail. A1 for compatible signs between cv and $r$ and a correct conclusion in context mentioning **correlation and number of letters or length and name** (if their value from (c)). **Do NOT award this A mark if contradictory statements or working seen** e.g. "accept $H_0$" or comparison of 0.426 with significance level of 0.05 etc. |
\begin{enumerate}
\item Marc took a random sample of 16 students from a school and for each student recorded
\end{enumerate}
\begin{itemize}
\item the number of letters, $x$, in their last name
\item the number of letters, $y$, in their first name
\end{itemize}
His results are shown in the scatter diagram on the next page.\\
(a) Describe the correlation between $x$ and $y$.
Marc suggests that parents with long last names tend to give their children shorter first names.\\
(b) Using the scatter diagram comment on Marc's suggestion, giving a reason for your answer.
The results from Marc's random sample of 16 observations are given in the table below.
\begin{center}
\begin{tabular}{ | l | l | l | l | l | l | l | l | l | l | l | l | l | l | l | l | l | }
\hline
$x$ & 3 & 6 & 8 & 7 & 5 & 3 & 11 & 3 & 4 & 5 & 4 & 9 & 7 & 10 & 6 & 6 \\
\hline
$y$ & 7 & 7 & 4 & 4 & 6 & 8 & 5 & 5 & 8 & 4 & 7 & 4 & 5 & 5 & 6 & 3 \\
\hline
\end{tabular}
\end{center}
(c) Use your calculator to find the product moment correlation coefficient between $x$ and $y$ for these data.\\
(d) Test whether or not there is evidence of a negative correlation between the number of letters in the last name and the number of letters in the first name.
You should
\begin{itemize}
\item state your hypotheses clearly
\item use a $5 \%$ level of significance
\end{itemize}
\section*{Question 2 continued.}
\begin{center}
\includegraphics[max width=\textwidth, alt={}]{10736735-3050-43eb-9e76-011ca6fa48b8-05_1125_1337_294_372}
\end{center}
\section*{Question 2 continued.}
\section*{Question 2 continued.}
\hfill \mbox{\textit{Edexcel Paper 3 2021 Q2 [6]}}