Edexcel S2 2018 Specimen — Question 3 11 marks

Exam BoardEdexcel
ModuleS2 (Statistics 2)
Year2018
SessionSpecimen
Marks11
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicCentral limit theorem
TypeSampling distribution theory
DifficultyModerate -0.3 This is a straightforward S2 question testing basic understanding of sampling distributions through enumeration. Parts (a)-(b) are definitional recall, part (c) is routine weighted mean/variance calculation, and parts (d)-(e) involve listing all possible samples (only 8 outcomes) and computing their probabilities—mechanical work with no novel insight required. Slightly easier than average due to small sample space and step-by-step scaffolding.
Spec2.01a Population and sample: terminology5.01a Permutations and combinations: evaluate probabilities5.04a Linear combinations: E(aX+bY), Var(aX+bY)5.05a Sample mean distribution: central limit theorem

3. Explain what you understand by
  1. a statistic,
  2. a sampling distribution. A factory stores screws in packets. A small packet contains 100 screws and a large packet contains 200 screws. The factory keeps small and large packets in the ratio 4:3 respectively.
  3. Find the mean and the variance of the number of screws in the packets stored at the factory. A random sample of 3 packets is taken from the factory and \(Y _ { 1 } , Y _ { 2 }\) and \(Y _ { 3 }\) denote the number of screws in each of these packets.
  4. List all the possible samples.
  5. Find the sampling distribution of \(\bar { Y }\)
    VIIIV SIHI NI IIIYM ION OCVIUV SIHI NI JIIIM I I ON OCVEXV SIHII NI JIIIM I ION OO

Question 3:
Part (a)
AnswerMarks Guidance
Answer/WorkingMark Guidance
A random variable that is a function of a (random) sample involving no unknown quantities/parameters; or a quantity calculated solely from a random sampleB1 Requires: (1) function/quantity, (2) sample/observations/data, (3) no unknown parameters
Part (b)
AnswerMarks Guidance
Answer/WorkingMark Guidance
If all possible samples are chosen from a population; then the values of a statistic and the associated probabilities is a sampling distribution; or a probability distribution of a statisticB1 Requires all underlined words: All values of a statistic with their associated probabilities; or probability distribution of a statistic
Part (c)
AnswerMarks Guidance
Answer/WorkingMark Guidance
Mean \(= 100 \times \frac{4}{7} + 200 \times \frac{3}{7} = \frac{1000}{7}\)B1 awrt 143
Variance \(= 100^2 \times \frac{4}{7} + 200^2 \times \frac{3}{7} - \left(\frac{1000}{7}\right)^2\)M1 \(100^2 \times \frac{4}{7} + 200^2 \times \frac{3}{7} -\) (their mean)\(^2\)
\(= \frac{120000}{49}\)A1 awrt 2450 (to 3sf)
Part (d)
AnswerMarks Guidance
Answer/WorkingMark Guidance
\((100,100,100)\)B2 Any 2 of the listed sample types for B1; all correct for B2
\((100,100,200)\), \((100,200,100)\), \((200,100,100)\) or \(3\times(100,100,200)\) Allow other notation for 100 and 200 e.g. Small and Large
\((100,200,200)\), \((200,100,200)\), \((200,200,100)\) or \(3\times(100,200,200)\)
\((200,200,200)\)
Part (e)
AnswerMarks Guidance
Answer/WorkingMark Guidance
\((100,100,100)\): \(\left(\frac{4}{7}\right)^3 = \frac{64}{343}\) awrt 0.187; \((200,200,200)\): \(\left(\frac{3}{7}\right)^3 = \frac{27}{343}\) awrt 0.0787B1 Both probabilities correct
\((100,100,200)\): \(3\times\left(\frac{4}{7}\right)^2\times\left(\frac{3}{7}\right) = \frac{144}{343}\) awrt 0.420M1 \(3 \times p^2 \times (1-p)\)
\((100,200,200)\): \(3\times\left(\frac{4}{7}\right)\times\left(\frac{3}{7}\right)^2 = \frac{108}{343}\) awrt 0.315A1 Either correct
Full table with \(m\): \(100\), \(\frac{400}{3}\), \(\frac{500}{3}\), \(200\) and all probabilities correctA1 All means correct and all probabilities correct (table not required but means must be associated with correct probabilities)
# Question 3:

## Part (a)

| Answer/Working | Mark | Guidance |
|---|---|---|
| A random variable that is a function of a (random) sample involving no unknown quantities/parameters; or a quantity calculated solely from a random sample | B1 | Requires: (1) function/quantity, (2) sample/observations/data, (3) no unknown parameters |

## Part (b)

| Answer/Working | Mark | Guidance |
|---|---|---|
| If all possible samples are chosen from a population; then the values of a statistic and the associated probabilities is a sampling distribution; or a probability distribution of a statistic | B1 | Requires all underlined words: **All values** of a **statistic** with their associated **probabilities**; or **probability distribution** of a **statistic** |

## Part (c)

| Answer/Working | Mark | Guidance |
|---|---|---|
| Mean $= 100 \times \frac{4}{7} + 200 \times \frac{3}{7} = \frac{1000}{7}$ | B1 | awrt 143 |
| Variance $= 100^2 \times \frac{4}{7} + 200^2 \times \frac{3}{7} - \left(\frac{1000}{7}\right)^2$ | M1 | $100^2 \times \frac{4}{7} + 200^2 \times \frac{3}{7} -$ (their mean)$^2$ |
| $= \frac{120000}{49}$ | A1 | awrt 2450 (to 3sf) |

## Part (d)

| Answer/Working | Mark | Guidance |
|---|---|---|
| $(100,100,100)$ | B2 | Any 2 of the listed sample types for B1; all correct for B2 |
| $(100,100,200)$, $(100,200,100)$, $(200,100,100)$ or $3\times(100,100,200)$ | | Allow other notation for 100 and 200 e.g. Small and Large |
| $(100,200,200)$, $(200,100,200)$, $(200,200,100)$ or $3\times(100,200,200)$ | | |
| $(200,200,200)$ | | |

## Part (e)

| Answer/Working | Mark | Guidance |
|---|---|---|
| $(100,100,100)$: $\left(\frac{4}{7}\right)^3 = \frac{64}{343}$ awrt 0.187; $(200,200,200)$: $\left(\frac{3}{7}\right)^3 = \frac{27}{343}$ awrt 0.0787 | B1 | Both probabilities correct |
| $(100,100,200)$: $3\times\left(\frac{4}{7}\right)^2\times\left(\frac{3}{7}\right) = \frac{144}{343}$ awrt 0.420 | M1 | $3 \times p^2 \times (1-p)$ |
| $(100,200,200)$: $3\times\left(\frac{4}{7}\right)\times\left(\frac{3}{7}\right)^2 = \frac{108}{343}$ awrt 0.315 | A1 | Either correct |
| Full table with $m$: $100$, $\frac{400}{3}$, $\frac{500}{3}$, $200$ and all probabilities correct | A1 | All means correct **and** all probabilities correct (table not required but means must be associated with correct probabilities) |
3. Explain what you understand by
\begin{enumerate}[label=(\alph*)]
\item a statistic,
\item a sampling distribution.

A factory stores screws in packets. A small packet contains 100 screws and a large packet contains 200 screws. The factory keeps small and large packets in the ratio 4:3 respectively.
\item Find the mean and the variance of the number of screws in the packets stored at the factory.

A random sample of 3 packets is taken from the factory and $Y _ { 1 } , Y _ { 2 }$ and $Y _ { 3 }$ denote the number of screws in each of these packets.
\item List all the possible samples.
\item Find the sampling distribution of $\bar { Y }$\\

\begin{center}
\begin{tabular}{|l|l|l|}
\hline
VIIIV SIHI NI IIIYM ION OC & VIUV SIHI NI JIIIM I I ON OC & VEXV SIHII NI JIIIM I ION OO \\
\hline
\end{tabular}
\end{center}
\end{enumerate}

\hfill \mbox{\textit{Edexcel S2 2018 Q3 [11]}}