Edexcel S1 2013 January — Question 5 15 marks

Exam BoardEdexcel
ModuleS1 (Statistics 1)
Year2013
SessionJanuary
Marks15
PaperDownload PDF ↗
Mark schemeDownload PDF ↗
TopicData representation
TypeEstimate mean and standard deviation from frequency table
DifficultyModerate -0.8 This is a routine S1 statistics question testing standard procedures: histogram scaling, linear interpolation for median, mean/SD from grouped frequency table (with Σfy² given), and basic normal distribution application. All techniques are textbook exercises requiring only formula recall and arithmetic, with no problem-solving insight needed.
Spec2.02b Histogram: area represents frequency2.02f Measures of average and spread2.02g Calculate mean and standard deviation2.04e Normal distribution: as model N(mu, sigma^2)2.04f Find normal probabilities: Z transformation

  1. A survey of 100 households gave the following results for weekly income \(\pounds y\).
Income \(y\) (£)Mid-pointFrequency \(f\)
\(0 \leqslant y < 200\)10012
\(200 \leqslant y < 240\)22028
\(240 \leqslant y < 320\)28022
\(320 \leqslant y < 400\)36018
\(400 \leqslant y < 600\)50012
\(600 \leqslant y < 800\)7008
(You may use \(\sum f y ^ { 2 } = 12452\) 800)
A histogram was drawn and the class \(200 \leqslant y < 240\) was represented by a rectangle of width 2 cm and height 7 cm .
  1. Calculate the width and the height of the rectangle representing the class $$320 \leqslant y < 400$$
  2. Use linear interpolation to estimate the median weekly income to the nearest pound.
  3. Estimate the mean and the standard deviation of the weekly income for these data. One measure of skewness is \(\frac { 3 ( \text { mean } - \text { median } ) } { \text { standard deviation } }\).
  4. Use this measure to calculate the skewness for these data and describe its value. Katie suggests using the random variable \(X\) which has a normal distribution with mean 320 and standard deviation 150 to model the weekly income for these data.
  5. Find \(\mathrm { P } ( 240 < X < 400 )\).
  6. With reference to your calculations in parts (d) and (e) and the data in the table, comment on Katie's suggestion.

Question 5:
Part (a)
AnswerMarks Guidance
WorkingMark Guidance
Width \(=4\) (cm)B1 Ignore units
Area of \(14\text{ cm}^2\) represents frequency 28 and area of \(4h\) represents 18; or \(\frac{4h}{18}=\frac{14}{28}\)M1 Clear method using area and frequency, or width \(\times\) height \(=9\)
\(h=\mathbf{2.25}\) (cm)A1
Part (b)
AnswerMarks Guidance
WorkingMark Guidance
\(m=(240)+\frac{10}{22}\times 80\)M1 Allow use of \((n+1)\) leading to £278.18 or [278, 278.5]
\(=276.36\ldots\) \(\left(\frac{3040}{11}\right)\); \((£)\mathbf{276}\leq m<(£)276.5\)A1 Do not award if incorrect end-point seen
Part (c)
AnswerMarks Guidance
WorkingMark Guidance
\(\sum fy=31600\) leading to \(\bar{y}=316\)M1A1 Attempt \(\sum fy\) with at least 3 correct products or answer rounding to 30000 (1 sf)
\(\sigma_y=\sqrt{\frac{12452800}{100}-(\bar{y})^2}=157.07\ldots\) (awrt \(\mathbf{157}\)); allow \(s=157.86\ldots\)M1A1 Correct expression including \(\sqrt{\phantom{x}}\); need \(\sum fy^2\) correct; condone minor transcription error
Part (d)
AnswerMarks Guidance
WorkingMark Guidance
Skewness \(=0.764\ldots\) (awrt \(\mathbf{0.76}\) or \(\mathbf{0.75}\))B1 For awrt 0.76/0.75 for \(m=£276\) or awrt 0.73/0.72 for \(m=£278\)
Positive skewB1ft Correct description based on their measure, or based on mean and median if no measure given
Part (e)
AnswerMarks Guidance
WorkingMark Guidance
\(z=\pm\frac{80}{150}\)M1 Attempt to standardise using 320 and 150 and either 240 or 400
\(P(240A1 Answer in range \([0.40, 0.41]\); answer only is 2/2
Part (f)
AnswerMarks Guidance
WorkingMark Guidance
(e) suggests a reasonable fit for this range BUT (d) since skew it will not be a good fit overallB2/1/0 Need 2 comments referencing each of parts (e) and (d); one comment not good since skew, other since matches range in (e); do not use B0B1
# Question 5:

## Part (a)
| Working | Mark | Guidance |
|---------|------|---------|
| Width $=4$ (cm) | B1 | Ignore units |
| Area of $14\text{ cm}^2$ represents frequency 28 and area of $4h$ represents 18; or $\frac{4h}{18}=\frac{14}{28}$ | M1 | Clear method using area and frequency, or width $\times$ height $=9$ |
| $h=\mathbf{2.25}$ (cm) | A1 | |

## Part (b)
| Working | Mark | Guidance |
|---------|------|---------|
| $m=(240)+\frac{10}{22}\times 80$ | M1 | Allow use of $(n+1)$ leading to £278.18 or [278, 278.5] |
| $=276.36\ldots$ $\left(\frac{3040}{11}\right)$; $(£)\mathbf{276}\leq m<(£)276.5$ | A1 | Do not award if incorrect end-point seen |

## Part (c)
| Working | Mark | Guidance |
|---------|------|---------|
| $\sum fy=31600$ leading to $\bar{y}=316$ | M1A1 | Attempt $\sum fy$ with at least 3 correct products or answer rounding to 30000 (1 sf) |
| $\sigma_y=\sqrt{\frac{12452800}{100}-(\bar{y})^2}=157.07\ldots$ (awrt $\mathbf{157}$); allow $s=157.86\ldots$ | M1A1 | Correct expression including $\sqrt{\phantom{x}}$; need $\sum fy^2$ correct; condone minor transcription error |

## Part (d)
| Working | Mark | Guidance |
|---------|------|---------|
| Skewness $=0.764\ldots$ (awrt $\mathbf{0.76}$ or $\mathbf{0.75}$) | B1 | For awrt 0.76/0.75 for $m=£276$ or awrt 0.73/0.72 for $m=£278$ |
| Positive skew | B1ft | Correct description based on their measure, or based on mean and median if no measure given |

## Part (e)
| Working | Mark | Guidance |
|---------|------|---------|
| $z=\pm\frac{80}{150}$ | M1 | Attempt to standardise using 320 and 150 and either 240 or 400 |
| $P(240<X<400)=\mathbf{0.40\sim 0.41}$ | A1 | Answer in range $[0.40, 0.41]$; answer only is 2/2 |

## Part (f)
| Working | Mark | Guidance |
|---------|------|---------|
| (e) suggests a reasonable fit for this range BUT (d) since skew it will not be a good fit overall | B2/1/0 | Need 2 comments referencing each of parts (e) and (d); one comment not good since skew, other since matches range in (e); do not use B0B1 |

---
\begin{enumerate}
  \item A survey of 100 households gave the following results for weekly income $\pounds y$.
\end{enumerate}

\begin{center}
\begin{tabular}{|l|l|l|}
\hline
Income $y$ (£) & Mid-point & Frequency $f$ \\
\hline
$0 \leqslant y < 200$ & 100 & 12 \\
\hline
$200 \leqslant y < 240$ & 220 & 28 \\
\hline
$240 \leqslant y < 320$ & 280 & 22 \\
\hline
$320 \leqslant y < 400$ & 360 & 18 \\
\hline
$400 \leqslant y < 600$ & 500 & 12 \\
\hline
$600 \leqslant y < 800$ & 700 & 8 \\
\hline
\end{tabular}
\end{center}

(You may use $\sum f y ^ { 2 } = 12452$ 800)\\
A histogram was drawn and the class $200 \leqslant y < 240$ was represented by a rectangle of width 2 cm and height 7 cm .\\
(a) Calculate the width and the height of the rectangle representing the class

$$320 \leqslant y < 400$$

(b) Use linear interpolation to estimate the median weekly income to the nearest pound.\\
(c) Estimate the mean and the standard deviation of the weekly income for these data.

One measure of skewness is $\frac { 3 ( \text { mean } - \text { median } ) } { \text { standard deviation } }$.\\
(d) Use this measure to calculate the skewness for these data and describe its value.

Katie suggests using the random variable $X$ which has a normal distribution with mean 320 and standard deviation 150 to model the weekly income for these data.\\
(e) Find $\mathrm { P } ( 240 < X < 400 )$.\\
(f) With reference to your calculations in parts (d) and (e) and the data in the table, comment on Katie's suggestion.

\hfill \mbox{\textit{Edexcel S1 2013 Q5 [15]}}