| Exam Board | WJEC |
|---|---|
| Module | Unit 2 (Unit 2) |
| Session | Specimen |
| Marks | 12 |
| Paper | Download PDF ↗ |
| Mark scheme | Download PDF ↗ |
| Topic | Data representation |
| Type | Critique given sampling method |
| Difficulty | Easy -1.2 This is a straightforward AS-level statistics question testing basic concepts: sampling critique, histogram corrections, effect of outliers on mean/SD, and outlier calculation using IQR rule. All parts require recall of standard procedures with minimal problem-solving or mathematical manipulation. |
| Spec | 2.02a Interpret single variable data: tables and diagrams2.02f Measures of average and spread2.02h Recognize outliers2.02i Select/critique data presentation |
| Length of singles for top 50 UK Official Chart singles, 17 June 2016 | |||||||||
| 2.5-(3.0) | 3.0-(3.5) | 3.5-(4.0) | 4.0-(4.5) | 4.5-(5.0) | 5.0-(5.5) | 5.5-(6.0) | 6.0-(6.5) | 6.5-(7.0) | 7.0-(7.5) |
| 3 | 17 | 22 | 7 | 0 | 0 | 0 | 0 | 0 | 1 |
| Summary statistics | ||||||||
| Length of single for top 50 UK Official Singles Chart (minutes) | ||||||||
| Length of single | N | Mean | Standard deviation | Minimum | Lower quartile | Median | Upper quartile | Maximum |
| 49 | 3.57 | 0.393 | 2.77 | 3.26 | 3.60 | 3.89 | 4.38 | |
| Summary statistics | ||||||||
| Length of single for Gareth's random sample of 30 singles (minutes) | ||||||||
| Length of single | N | Mean | Standard deviation | Minimum | Lower quartile | Median | Upper quartile | Maximum |
| 30 | 3.13 | 0.364 | 2.58 | 2.73 | 2.92 | 3.22 | 3.95 | |
Gareth has a keen interest in pop music. He recently read the following claim in a music magazine.
\textbf{In the pop industry most songs on the radio are not longer than three minutes.}
\begin{enumerate}[label=(\alph*)]
\item He decided to investigate this claim by recording the lengths of the top 50 singles in the UK Official Singles Chart for the week beginning 17 June 2016. (A 'single' in this context is one digital audio track.)
Comment on the suitability of this sample to investigate the magazine's claim. [1]
\item Gareth recorded the data in the table below.
\begin{center}
\begin{tabular}{|c|c|c|c|c|c|c|c|c|c|}
\hline
\multicolumn{10}{|c|}{Length of singles for top 50 UK Official Chart singles, 17 June 2016} \\
\hline
2.5-(3.0) & 3.0-(3.5) & 3.5-(4.0) & 4.0-(4.5) & 4.5-(5.0) & 5.0-(5.5) & 5.5-(6.0) & 6.0-(6.5) & 6.5-(7.0) & 7.0-(7.5) \\
\hline
3 & 17 & 22 & 7 & 0 & 0 & 0 & 0 & 0 & 1 \\
\hline
\end{tabular}
\end{center}
He used these data to produce a graph of the distributions of the lengths of singles
\includegraphics{figure_2}
State two corrections that Gareth needs to make to the histogram so that it accurately represents the data in the table. [2]
\item Gareth also produced a box plot of the lengths of singles.
\includegraphics{figure_3}
He sees that there is one obvious outlier.
\begin{enumerate}[label=(\roman*)]
\item What will happen to the mean if the outlier is removed?
\item What will happen to the standard deviation if the outlier is removed? [2]
\end{enumerate}
\item Gareth decided to remove the outlier. He then produced a table of summary statistics.
\begin{enumerate}[label=(\roman*)]
\item Use the appropriate statistics from the table to show, by calculation, that the maximum value for the length of a single is not an outlier.
\begin{center}
\begin{tabular}{|c|c|c|c|c|c|c|c|}
\hline
\multicolumn{8}{|c|}{Summary statistics} \\
\multicolumn{8}{|c|}{Length of single for top 50 UK Official Singles Chart (minutes)} \\
\hline
Length of single & N & Mean & Standard deviation & Minimum & Lower quartile & Median & Upper quartile & Maximum \\
\hline
& 49 & 3.57 & 0.393 & 2.77 & 3.26 & 3.60 & 3.89 & 4.38 \\
\hline
\end{tabular}
\end{center}
\item State, with a reason, whether these statistics support the magazine's claim. [4]
\end{enumerate}
\item Gareth also calculated summary statistics for the lengths of 30 singles selected at random from his personal collection.
\begin{center}
\begin{tabular}{|c|c|c|c|c|c|c|c|}
\hline
\multicolumn{8}{|c|}{Summary statistics} \\
\multicolumn{8}{|c|}{Length of single for Gareth's random sample of 30 singles (minutes)} \\
\hline
Length of single & N & Mean & Standard deviation & Minimum & Lower quartile & Median & Upper quartile & Maximum \\
\hline
& 30 & 3.13 & 0.364 & 2.58 & 2.73 & 2.92 & 3.22 & 3.95 \\
\hline
\end{tabular}
\end{center}
Compare and contrast the distribution of lengths of singles in Gareth's personal collection with the distribution in the top 50 UK Official Singles Chart. [3]
\end{enumerate}
\hfill \mbox{\textit{WJEC Unit 2 Q5 [12]}}