Edexcel S4 2003 June — Question 7 17 marks

Exam BoardEdexcel
ModuleS4 (Statistics 4)
Year2003
SessionJune
Marks17
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicT-tests (unknown variance)
TypeTwo-sample t-test equal variance
DifficultyStandard +0.3 This is a standard S4 question testing understanding of two-sample vs paired t-tests with all summary statistics provided. Part (a) is routine calculation with given values, part (b) requires computing differences and performing a paired test (standard procedure), and part (c) tests conceptual understanding of experimental design. The question is slightly easier than average because the calculations are straightforward and the experimental design clearly suggests pairing, making it a textbook application rather than requiring novel insight.
Spec5.05b Unbiased estimates: of population mean and variance5.05c Hypothesis test: normal distribution for population mean

Two methods of extracting juice from an orange are to be compared. Eight oranges are halved. One half of each orange is chosen at random and allocated to Method \(A\) and the other half is allocated to Method \(B\). The amounts of juice extracted, in ml, are given in the table.
Orange12345678
Method A2930262526222328
Method B2725282423262225
One statistician suggests performing a two-sample \(t\)-test to investigate whether or not there is a difference between the mean amounts of juice extracted by the two methods.
  1. Stating your hypotheses clearly and using a 5\% significance level, carry out this test. (You may assume \(\bar{x}_A = 26.125\), \(s_A^2 = 7.84\), \(\bar{x}_B = 25\), \(s_B^2 = 4\) and \(\sigma_A^2 = \sigma_B^2\)) [7]
Another statistician suggests analysing these data using a paired \(t\)-test.
  1. Using a 5\% significance level, carry out this test. [9]
  2. State which of these two tests you consider to be more appropriate. Give a reason for your choice. [1]

Part (a)
AnswerMarks Guidance
\(s_p^2 = \frac{7 \times 7.84 + 7 \times 4}{7 + 7} = 5.92\) M1
\(s_p = 2.433105\)awrt 2.43 A1
\(H_0 : \mu_A = \mu_B\), \(H_1 : \mu_A \neq \mu_B\)both B1
\(t = \frac{26.125 - 25}{2.43\sqrt{\frac{1}{8} + \frac{1}{8}}} = 0.92474\)awrt 0.925 M1A1
\(t_{14}(2.5\%) = 2.145\) B1
Insufficient evidence to reject \(H_0\) that there is no difference in the means. A1\(\mathcal{f}\)
Part (b)
AnswerMarks Guidance
\(d = M1 - M2\): \(2.5, -2, 1.3, -4, 1.3\) M1
\(\bar{d} = \frac{9}{8} = 1.125\) B1
\(s_d^2 = \frac{69 - 8 \times 1.125^2}{7} = 8.410714\)awrt 8.41 M1A1
\(H_0 : \delta = 0\), \(H_1 : \delta \neq 0\)both B1
\(t = \frac{1.125}{\sqrt{\frac{8.41}{8}}} = 1.0972\)awrt 1.10 M1A1
\(t_7(2.5\%) = 2.365\) B1
There is no significant evidence of a difference between method A and method B. A1\(\mathcal{f}\)
Part (c)
AnswerMarks Guidance
Paired sample as they are two measurements on the same orange B1
## Part (a)
$s_p^2 = \frac{7 \times 7.84 + 7 \times 4}{7 + 7} = 5.92$ | | M1 |
$s_p = 2.433105$ | awrt 2.43 | A1 |
$H_0 : \mu_A = \mu_B$, $H_1 : \mu_A \neq \mu_B$ | both | B1 |
$t = \frac{26.125 - 25}{2.43\sqrt{\frac{1}{8} + \frac{1}{8}}} = 0.92474$ | awrt 0.925 | M1A1 |
$t_{14}(2.5\%) = 2.145$ | | B1 |
Insufficient evidence to reject $H_0$ that there is no difference in the means. | | A1$\mathcal{f}$ | (7)

## Part (b)
$d = M1 - M2$: $2.5, -2, 1.3, -4, 1.3$ | | M1 |
$\bar{d} = \frac{9}{8} = 1.125$ | | B1 |
$s_d^2 = \frac{69 - 8 \times 1.125^2}{7} = 8.410714$ | awrt 8.41 | M1A1 |
$H_0 : \delta = 0$, $H_1 : \delta \neq 0$ | both | B1 |
$t = \frac{1.125}{\sqrt{\frac{8.41}{8}}} = 1.0972$ | awrt 1.10 | M1A1 |
$t_7(2.5\%) = 2.365$ | | B1 |
There is no significant evidence of a difference between method A and method B. | | A1$\mathcal{f}$ | (9)

## Part (c)
Paired sample as they are two measurements on the same orange | | B1 | (1)
Two methods of extracting juice from an orange are to be compared. Eight oranges are halved. One half of each orange is chosen at random and allocated to Method $A$ and the other half is allocated to Method $B$. The amounts of juice extracted, in ml, are given in the table.

\begin{center}
\begin{tabular}{|c|c|c|c|c|c|c|c|c|}
\hline
Orange & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\
\hline
Method A & 29 & 30 & 26 & 25 & 26 & 22 & 23 & 28 \\
\hline
Method B & 27 & 25 & 28 & 24 & 23 & 26 & 22 & 25 \\
\hline
\end{tabular}
\end{center}

One statistician suggests performing a two-sample $t$-test to investigate whether or not there is a difference between the mean amounts of juice extracted by the two methods.

\begin{enumerate}[label=(\alph*)]
\item Stating your hypotheses clearly and using a 5\% significance level, carry out this test.

(You may assume $\bar{x}_A = 26.125$, $s_A^2 = 7.84$, $\bar{x}_B = 25$, $s_B^2 = 4$ and $\sigma_A^2 = \sigma_B^2$) [7]
\end{enumerate}

Another statistician suggests analysing these data using a paired $t$-test.

\begin{enumerate}[label=(\alph*)]
\setcounter{enumi}{1}
\item Using a 5\% significance level, carry out this test. [9]

\item State which of these two tests you consider to be more appropriate. Give a reason for your choice. [1]
\end{enumerate}

\hfill \mbox{\textit{Edexcel S4 2003 Q7 [17]}}