CAIE S2 2014 June — Question 7 10 marks

Exam BoardCAIE
ModuleS2 (Statistics 2)
Year2014
SessionJune
Marks10
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicType I/II errors and power of test
TypeState meaning of Type I error
DifficultyStandard +0.3 This is a straightforward hypothesis testing question covering standard S2 content: sampling methods, Type I error definition, one-tailed z-test with known variance, and CLT understanding. All parts are routine recall and application with no novel problem-solving required. The calculation in part (iii) is mechanical (find sample mean, compute z-statistic, compare to critical value). Slightly easier than average due to the guided structure and basic conceptual questions.
Spec2.01c Sampling techniques: simple random, opportunity, etc2.01d Select/critique sampling: in context2.05a Hypothesis testing language: null, alternative, p-value, significance5.05a Sample mean distribution: central limit theorem5.05c Hypothesis test: normal distribution for population mean

7 A researcher is investigating the actual lengths of time that patients spend with the doctor at their appointments. He plans to choose a sample of 12 appointments on a particular day.
  1. Which of the following methods is preferable, and why?
    • Choose the first 12 appointments of the day.
    • Choose 12 appointments evenly spaced throughout the day.
    Appointments are scheduled to last 10 minutes. The actual lengths of time, in minutes, that patients spend with the doctor may be assumed to have a normal distribution with mean \(\mu\) and standard deviation 3.4. The researcher suspects that the actual time spent is more than 10 minutes on average. To test this suspicion, he recorded the actual times spent for a random sample of 12 appointments and carried out a hypothesis test at the 1\% significance level.
  2. State the probability of making a Type I error and explain what is meant by a Type I error in this context.
  3. Given that the total length of time spent for the 12 appointments was 147 minutes, carry out the test.
  4. Give a reason why the Central Limit theorem was not needed in part (iii).

AnswerMarks Guidance
(i) 2nd more representative of all appointments or Lengths may vary during the dayB1 Any implication that times or conditions vary throughout day, e.g. doctors get tired
or 1st does not include later appts so not representativeB1 [2]
(ii) \(0.01\) o.e.B1 [1]
Concluding that times spent are too long when they are not.B1 [1] Concluding that the mean time spent is more than 10 mins when it is not. Must be in context.
(iii) H0: Pop mean appt time (or \(\mu\)) = 10B1 Both correct. Allow \(\mu\), but not just "mean"
H1: Pop mean appt time (or \(\mu\)) \(> 10\)
AnswerMarks Guidance
\(\frac{147 - 10}{\sqrt{3.4^2/12}} = (\pm)2.292\) or (\(0.0109\) if area comparison done)M1 Allow incorrect \(10 + 2.326 \times \frac{3.4}{12}\) M1. Must have \(\sqrt{12}\) (accept totals method)
A1= 12.28 A1
\("2.292" < 2.326\) o.e. (No evidence to reject H0.)M1 For valid comparison \(< 12.28\) M1. Comp "2.292" with 2.326 Or 0.0109 with 0.01 Or 147/12 with 12.28
No reason to believe appts are too longA1 [5] Dep 2.326, ft their "2.292"
(iv) Normal populationB1 [1] No contradictions. Must have "population" or equiv
**(i)** 2nd more representative of all appointments or Lengths may vary during the day | B1 | Any implication that times or conditions vary throughout day, e.g. doctors get tired

or 1st does not include later appts so not representative | B1 [2]

**(ii)** $0.01$ o.e. | B1 [1]

Concluding that times spent are too long when they are not. | B1 [1] | Concluding that the mean time spent is more than 10 mins when it is not. Must be in context.

**(iii)** **H0:** Pop mean appt time (or $\mu$) = 10 | B1 | Both correct. Allow $\mu$, but not just "mean"

**H1:** Pop mean appt time (or $\mu$) $> 10$

$\frac{147 - 10}{\sqrt{3.4^2/12}} = (\pm)2.292$ or ($0.0109$ if area comparison done) | M1 | Allow incorrect $10 + 2.326 \times \frac{3.4}{12}$ M1. Must have $\sqrt{12}$ (accept totals method)

| A1 | = 12.28 A1

$"2.292" < 2.326$ o.e. (No evidence to reject **H0**.) | M1 | For valid comparison $< 12.28$ M1. Comp "2.292" with 2.326 Or 0.0109 with 0.01 Or 147/12 with 12.28

No reason to believe appts are too long | A1 [5] | Dep 2.326, ft their "2.292"

**(iv)** Normal population | B1 [1] | No contradictions. Must have "population" or equiv
7 A researcher is investigating the actual lengths of time that patients spend with the doctor at their appointments. He plans to choose a sample of 12 appointments on a particular day.\\
(i) Which of the following methods is preferable, and why?

\begin{itemize}
  \item Choose the first 12 appointments of the day.
  \item Choose 12 appointments evenly spaced throughout the day.
\end{itemize}

Appointments are scheduled to last 10 minutes. The actual lengths of time, in minutes, that patients spend with the doctor may be assumed to have a normal distribution with mean $\mu$ and standard deviation 3.4. The researcher suspects that the actual time spent is more than 10 minutes on average. To test this suspicion, he recorded the actual times spent for a random sample of 12 appointments and carried out a hypothesis test at the 1\% significance level.\\
(ii) State the probability of making a Type I error and explain what is meant by a Type I error in this context.\\
(iii) Given that the total length of time spent for the 12 appointments was 147 minutes, carry out the test.\\
(iv) Give a reason why the Central Limit theorem was not needed in part (iii).

\hfill \mbox{\textit{CAIE S2 2014 Q7 [10]}}