OCR MEI S3 2010 January — Question 3 18 marks

Exam BoardOCR MEI
ModuleS3 (Statistics 3)
Year2010
SessionJanuary
Marks18
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicWilcoxon tests
TypePaired t-test
DifficultyStandard +0.3 This is a straightforward paired t-test application with clear data and standard procedures. Part (i) requires stating normality assumption, calculating differences, and performing a one-tailed t-test—all routine S3 techniques. Part (ii) involves working backwards from a confidence interval to find sample statistics using the standard formula, which is slightly less routine but still mechanical. The context is accessible and the calculations are standard, making this slightly easier than average for S3.
Spec5.05c Hypothesis test: normal distribution for population mean

3 Cholesterol is a lipid (fat) which is manufactured by the liver from the fatty foods that we eat. It plays a vital part in allowing the body to function normally. However, when high levels of cholesterol are present in the blood there is a risk of arterial disease. Among the factors believed to assist with achieving and maintaining low cholesterol levels are weight loss and exercise. A doctor wishes to test the effectiveness of exercise in lowering cholesterol levels. For a random sample of 12 of her patients, she measures their cholesterol levels before and after they have followed a programme of exercise. The measurements obtained are as follows.
PatientABCDEFGHIJKL
Before5.75.74.06.87.45.56.76.47.27.27.14.4
After5.84.05.25.76.05.05.84.27.35.26.44.1
  1. A \(t\) test is to be used in order to see if, on average, the exercise programme seems to be effective in lowering cholesterol levels. State the distributional assumption necessary for the test, and carry out the test using a \(1 \%\) significance level.
  2. A second random sample of 12 patients gives a \(95 \%\) confidence interval of \(( - 0.5380,1.4046 )\) for the true mean reduction (before - after) in cholesterol level. Find the mean and standard deviation for this sample. How might the doctor interpret this interval in relation to the exercise programme?

Question 3:
Part (i):
AnswerMarks Guidance
Answer/WorkingMark Guidance
Must assume: Normality of population; of differencesB1, B1
\(H_0: \mu_D = 0\); \(H_1: \mu_D > 0\)B1 Both. Accept alternatives e.g. \(\mu_D < 0\) for \(H_1\), or \(\mu_B - \mu_A\) etc provided adequately defined. Hypotheses in words must include "population"
Where \(\mu_D\) is the (population) mean reduction/difference in cholesterol levelB1 For adequate verbal definition
MUST be PAIRED COMPARISON \(t\) test
Differences: \(-0.1,\ 1.7,\ -1.2,\ 1.1,\ 1.4,\ 0.5,\ 0.9,\ 2.2,\ -0.1,\ 2.0,\ 0.7,\ 0.3\)
\(\bar{x} = 0.7833\), \(s_{n-1} = 0.9833(46)\)B1 Do not allow \(s_n = 0.9415\)
Test statistic \(= \frac{0.7833 - 0}{\frac{0.9833}{\sqrt{12}}} = 2.7595\)M1, A1 Allow candidate's \(\bar{x}\) and/or \(s_{n-1}\). c.a.o. but ft from here
Refer to \(t_{11}\)M1 No ft from here if wrong. \(P(t > 2.7595) = 0.009286\)
Single-tailed 1% point is 2.718A1 No ft from here if wrong
SignificantA1 ft only candidate's test statistic
Seems mean cholesterol level has fallenA1 ft only candidate's test statistic [11]
Part (ii):
AnswerMarks Guidance
Answer/WorkingMark Guidance
CI is \(\bar{x} \pm 2.201 \times \frac{s}{\sqrt{12}}\)M1, B1 Overall structure seen or implied. From \(t_{11}\) seen or implied
\(= (-0.5380,\ 1.4046)\)A1 Fully correct pair of equations using the given interval
\(\bar{x} = \frac{1}{2}(1.4046 - 0.5380) = 0.4333\)B1
\(s = (1.4046 - 0.4333) \times \frac{\sqrt{12}}{2.201} = 1.5287\)M1, A1 Substitute \(\bar{x}\) and rearrange to find \(s\). c.a.o.
Using this interval the doctor might conclude that the mean cholesterol level did not seem to have been reducedE1 Accept any sensible comment or interpretation of this interval [7]
# Question 3:

## Part (i):
| Answer/Working | Mark | Guidance |
|---|---|---|
| Must assume: Normality of population; of differences | B1, B1 | |
| $H_0: \mu_D = 0$; $H_1: \mu_D > 0$ | B1 | Both. Accept alternatives e.g. $\mu_D < 0$ for $H_1$, or $\mu_B - \mu_A$ etc provided adequately defined. Hypotheses in words must include "population" |
| Where $\mu_D$ is the (population) mean reduction/difference in cholesterol level | B1 | For adequate verbal definition |
| MUST be PAIRED COMPARISON $t$ test | | |
| Differences: $-0.1,\ 1.7,\ -1.2,\ 1.1,\ 1.4,\ 0.5,\ 0.9,\ 2.2,\ -0.1,\ 2.0,\ 0.7,\ 0.3$ | | |
| $\bar{x} = 0.7833$, $s_{n-1} = 0.9833(46)$ | B1 | Do not allow $s_n = 0.9415$ |
| Test statistic $= \frac{0.7833 - 0}{\frac{0.9833}{\sqrt{12}}} = 2.7595$ | M1, A1 | Allow candidate's $\bar{x}$ and/or $s_{n-1}$. c.a.o. but ft from here |
| Refer to $t_{11}$ | M1 | No ft from here if wrong. $P(t > 2.7595) = 0.009286$ |
| Single-tailed 1% point is 2.718 | A1 | No ft from here if wrong |
| Significant | A1 | ft only candidate's test statistic |
| Seems mean cholesterol level has fallen | A1 | ft only candidate's test statistic **[11]** |

## Part (ii):
| Answer/Working | Mark | Guidance |
|---|---|---|
| CI is $\bar{x} \pm 2.201 \times \frac{s}{\sqrt{12}}$ | M1, B1 | Overall structure seen or implied. From $t_{11}$ seen or implied |
| $= (-0.5380,\ 1.4046)$ | A1 | Fully correct pair of equations using the given interval |
| $\bar{x} = \frac{1}{2}(1.4046 - 0.5380) = 0.4333$ | B1 | |
| $s = (1.4046 - 0.4333) \times \frac{\sqrt{12}}{2.201} = 1.5287$ | M1, A1 | Substitute $\bar{x}$ and rearrange to find $s$. c.a.o. |
| Using this interval the doctor might conclude that the mean cholesterol level did not seem to have been reduced | E1 | Accept any sensible comment or interpretation of this interval **[7]** |

---
3 Cholesterol is a lipid (fat) which is manufactured by the liver from the fatty foods that we eat. It plays a vital part in allowing the body to function normally. However, when high levels of cholesterol are present in the blood there is a risk of arterial disease. Among the factors believed to assist with achieving and maintaining low cholesterol levels are weight loss and exercise.

A doctor wishes to test the effectiveness of exercise in lowering cholesterol levels. For a random sample of 12 of her patients, she measures their cholesterol levels before and after they have followed a programme of exercise. The measurements obtained are as follows.

\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | c | c | c | c | c | c | }
\hline
Patient & A & B & C & D & E & F & G & H & I & J & K & L \\
\hline
Before & 5.7 & 5.7 & 4.0 & 6.8 & 7.4 & 5.5 & 6.7 & 6.4 & 7.2 & 7.2 & 7.1 & 4.4 \\
\hline
After & 5.8 & 4.0 & 5.2 & 5.7 & 6.0 & 5.0 & 5.8 & 4.2 & 7.3 & 5.2 & 6.4 & 4.1 \\
\hline
\end{tabular}
\end{center}

(i) A $t$ test is to be used in order to see if, on average, the exercise programme seems to be effective in lowering cholesterol levels. State the distributional assumption necessary for the test, and carry out the test using a $1 \%$ significance level.\\
(ii) A second random sample of 12 patients gives a $95 \%$ confidence interval of $( - 0.5380,1.4046 )$ for the true mean reduction (before - after) in cholesterol level. Find the mean and standard deviation for this sample. How might the doctor interpret this interval in relation to the exercise programme?

\hfill \mbox{\textit{OCR MEI S3 2010 Q3 [18]}}