Question 1 - A-Level Maths

Edexcel S4 2015 June — Question 1 9 marks

Exam Board	Edexcel
Module	S4 (Statistics 4)
Year	2015
Session	June
Marks	9
Paper	Download PDF ↗
Mark scheme	Download PDF ↗
Topic	T-tests (unknown variance)
Type	Paired sample t-test
Difficulty	Standard +0.3 This is a straightforward application of a paired t-test with clear data provided. Students need to calculate differences, find mean and standard deviation, compute the test statistic, and compare to critical value. While it requires multiple steps and knowledge of S4 content (Further Maths Statistics), the procedure is entirely standard with no novel problem-solving or conceptual challenges beyond routine application of the paired t-test formula.
Spec	5.07b Sign test: and Wilcoxon signed-rank 5.07d Paired vs two-sample: selection

The Sales Manager of a large chain of convenience stores is studying the sale of lottery tickets in her stores. She randomly selects 8 of her stores. From these stores she collects data for the total sales of lottery tickets in the previous January and July. The data are shown below

Store	A	B	C	D	E	F	G	H
January ticket sales \(( \boldsymbol { \pounds } )\)	1080	1639	710	1108	915	1066	1322	819
July ticket sales \(( \boldsymbol { \pounds } )\)	1113	1702	831	1048	861	1090	1303	852

Use a paired \(t\)-test to determine whether or not there is evidence, at the \(5 \%\) level of significance, that the mean sales of lottery tickets in this chain's stores are higher in July than in January. You should state your hypotheses and show your working clearly.
State what assumption the Sales Manager needs to make about the sales of lottery tickets in her stores for the test in part (a) to be valid.

Show mark scheme Show mark scheme source

Question 1:

Part (a)

Answer	Marks	Guidance
Answer	Mark	Guidance
\(d_i = \text{July} - \text{January}\): 33, 63, 121, −60, −54, 24, −19, 33	B1	All differences correct (allow January − July)
\(H_0: \mu_d = 0\), \(H_1: \mu_d > 0\)	B1	Both hypotheses correct, where \(\mu_d\) is the population mean difference
\(\bar{d} = \frac{141}{8} = 17.625\)	M1 A1	Method for mean of differences
\(S_{dd} = 33^2 + 63^2 + 121^2 + (-60)^2 + (-54)^2 + 24^2 + (-19)^2 + 33^2 - \frac{141^2}{8}\)	M1	Correct method for \(S_{dd}\)
\(= 26557 - 2485.125 = 24071.875\)	A1	Correct value
\(s_d^2 = \frac{24071.875}{7} = 3438.839...\) so \(s_d = 58.64...\)
\(t = \frac{17.625}{\frac{58.64...}{\sqrt{8}}} = \frac{17.625}{20.737...} = 0.8499...\)	M1	Correct \(t\)-statistic method
Critical value \(t_7(5\%) = 1.895\) (one-tailed)	B1	Correct critical value
Since \(0.850 < 1.895\), do not reject \(H_0\). No evidence at 5% significance level that mean sales are higher in July than January.	A1	Correct conclusion in context

Part (b)

Answer	Marks	Guidance
Answer	Mark	Guidance
The differences in sales are normally distributed	B1	Must reference the differences (not just sales) being normally distributed

# Question 1:

## Part (a)

| Answer | Mark | Guidance |
|--------|------|----------|
| $d_i = \text{July} - \text{January}$: 33, 63, 121, −60, −54, 24, −19, 33 | B1 | All differences correct (allow January − July) |
| $H_0: \mu_d = 0$, $H_1: \mu_d > 0$ | B1 | Both hypotheses correct, where $\mu_d$ is the population mean difference |
| $\bar{d} = \frac{141}{8} = 17.625$ | M1 A1 | Method for mean of differences |
| $S_{dd} = 33^2 + 63^2 + 121^2 + (-60)^2 + (-54)^2 + 24^2 + (-19)^2 + 33^2 - \frac{141^2}{8}$ | M1 | Correct method for $S_{dd}$ |
| $= 26557 - 2485.125 = 24071.875$ | A1 | Correct value |
| $s_d^2 = \frac{24071.875}{7} = 3438.839...$ so $s_d = 58.64...$ | | |
| $t = \frac{17.625}{\frac{58.64...}{\sqrt{8}}} = \frac{17.625}{20.737...} = 0.8499...$ | M1 | Correct $t$-statistic method |
| Critical value $t_7(5\%) = 1.895$ (one-tailed) | B1 | Correct critical value |
| Since $0.850 < 1.895$, do not reject $H_0$. No evidence at 5% significance level that mean sales are higher in July than January. | A1 | Correct conclusion in context |

## Part (b)

| Answer | Mark | Guidance |
|--------|------|----------|
| The differences in sales are normally distributed | B1 | Must reference the differences (not just sales) being normally distributed |

---

Show LaTeX source

\begin{enumerate}
  \item The Sales Manager of a large chain of convenience stores is studying the sale of lottery tickets in her stores. She randomly selects 8 of her stores. From these stores she collects data for the total sales of lottery tickets in the previous January and July. The data are shown below
\end{enumerate}

\begin{center}
\begin{tabular}{ | l | c | c | c | c | c | c | c | c | }
\hline
Store & A & B & C & D & E & F & G & H \\
\hline
January ticket sales $( \boldsymbol { \pounds } )$ & 1080 & 1639 & 710 & 1108 & 915 & 1066 & 1322 & 819 \\
\hline
July ticket sales $( \boldsymbol { \pounds } )$ & 1113 & 1702 & 831 & 1048 & 861 & 1090 & 1303 & 852 \\
\hline
\end{tabular}
\end{center}

(a) Use a paired $t$-test to determine whether or not there is evidence, at the $5 \%$ level of significance, that the mean sales of lottery tickets in this chain's stores are higher in July than in January. You should state your hypotheses and show your working clearly.\\
(b) State what assumption the Sales Manager needs to make about the sales of lottery tickets in her stores for the test in part (a) to be valid.

\hfill \mbox{\textit{Edexcel S4 2015 Q1 [9]}}

This paper (6 questions)

View full paper

Q1 9 Q2 14 Q3 14 Q4 11 Q5 9 Q6 18