Estimate from grouped frequency data

Given a frequency table with class intervals, estimate μ and σ using mid-interval values and grouped data formulas, then model with a normal distribution.

4 questions · Moderate -0.6

2.04f Find normal probabilities: Z transformation
Sort by: Default | Easiest first | Hardest first
OCR H240/02 Q8
7 marks Moderate -0.8
8 A market gardener records the masses of a random sample of 100 of this year's crop of plums. The table shows his results.
Mass,
\(m\) grams
\(m < 25\)\(25 \leq m < 35\)\(35 \leq m < 45\)\(45 \leq m < 55\)\(55 \leq m < 65\)\(65 \leq m < 75\)\(m \geq 75\)
Number
of plums
0329363020
  1. Explain why the normal distribution might be a reasonable model for this distribution. The market gardener models the distribution of masses by \(\mathrm { N } \left( 47.5,10 ^ { 2 } \right)\).
  2. Find the number of plums in the sample that this model would predict to have masses in the range:
    1. \(35 \leq m < 45\)
    2. \(m < 25\).
  3. Use your answers to parts (b)(i) and (b)(ii) to comment on the suitability of this model. The market gardener plans to use this model to predict the distribution of the masses of next year's crop of plums.
  4. Comment on this plan.
OCR MEI Paper 2 2020 November Q8
12 marks Moderate -0.8
8 Rosella is carrying out an investigation into the age at which adults retire from work in the city where she lives. She collects a sample of size 50 , ensuring this comprises of 25 randomly selected retired men and 25 randomly selected retired women.
  1. State the name of the sampling method she uses. Fig. 8.1 shows the data she obtains in a frequency table and Fig. 8.2 shows these data displayed in a histogram. \begin{table}[h]
    Age in years at retirement\(45 -\)\(50 -\)\(55 -\)\(60 -\)\(65 -\)\(70 -\)\(75 - 80\)
    Frequency density0.41.82.42.21.81.20.2
    \captionsetup{labelformat=empty} \caption{Fig. 8.1}
    \end{table} \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{cea67565-8074-4703-8e1a-09b98e380baf-08_805_1006_1160_244} \captionsetup{labelformat=empty} \caption{Fig. 8.2}
    \end{figure}
  2. How many people in the sample are aged between 50 and 55? Rosella obtains a list of the names of all 4960 people who have retired in the city during the previous month.
  3. Describe how Rosella could collect a sample of size 200 from her list using
    Rosella collects two simple random samples, one of size 200 and one of size 500, from her list. The histograms in Fig. 8.3 show the data from the sample of size 200 on the left and the data from the sample of 500 on the right. \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{cea67565-8074-4703-8e1a-09b98e380baf-09_659_1909_388_77} \captionsetup{labelformat=empty} \caption{Fig. 8.3}
    \end{figure}
  4. With reference to the histograms shown in Fig. 8.2 and Fig. 8.3, explain why it appears reasonable to model the age of retirement in this city using the Normal distribution. Summary statistics for the sample of 500 are shown in Fig. 8.4. \begin{table}[h]
    Statistics
    n500
    Mean60.0515
    \(\sigma\)6.5717
    s6.5783
    \(\Sigma x\)30025.7601
    \(\Sigma \mathrm { x } ^ { 2 }\)1824686.322
    Min36.0793
    Q155.2573
    Median59.9202
    Q364.4239
    Max81.742
    \captionsetup{labelformat=empty} \caption{Fig. 8.4}
    \end{table}
  5. Use an appropriate Normal model based on the information in Fig. 8.4 to estimate the number of people aged over 65 who retired in the city in the previous month.
  6. Identify a limitation in using this model to predict the number of people aged over 65 retiring in the following month.
AQA S1 2011 January Q3
13 marks Moderate -0.3
3 The volume, \(X\) litres, of orange juice in a 1-litre carton may be modelled by a normal distribution with unknown mean \(\mu\). The volumes, \(x\) litres, recorded to the nearest 0.01 litre, in a random sample of 100 cartons are shown in the table.
Volume ( \(\boldsymbol { x }\) litres)Number of cartons (f)
0.95-0.972
0.98-1.007
1.01-1.0315
1.04-1.0632
1.07-1.0922
1.10-1.1214
1.13-1.157
1.16-1.181
Total100
  1. For the group ' \(0.98 - 1.00\) ':
    1. show that it has a mid-point of 0.99 litres;
    2. state the minimum and the maximum values of \(x\) that could be included in this group.
  2. Calculate, to three decimal places, estimates of the mean and the standard deviation of these 100 volumes.
    1. Construct an approximate \(99 \%\) confidence interval for \(\mu\).
    2. State why use of the Central Limit Theorem was not required when calculating this confidence interval.
    3. Give a reason why the confidence interval is approximate rather than exact.
  3. Give a reason in support of the claim that:
    1. \(\mu > 1\);
    2. \(\mathrm { P } ( 0.94 < X < 1.16 )\) is approximately 1 .
      \includegraphics[max width=\textwidth, alt={}]{156f9453-ebc6-4406-b5bc-08d1918ebc62-10_2486_1714_221_153}
      \includegraphics[max width=\textwidth, alt={}]{156f9453-ebc6-4406-b5bc-08d1918ebc62-11_2486_1714_221_153}
OCR H240/02 2018 September Q9
12 marks Moderate -0.3
9 The finance department of a retail firm recorded the daily income each day for 300 days. The results are summarised in the histogram. \includegraphics[max width=\textwidth, alt={}, center]{85de9a39-f8be-40ee-b0c8-e2e632be93d8-6_689_1575_488_246}
  1. Find the number of days on which the daily income was between \(\pounds 4000\) and \(\pounds 6000\).
  2. Calculate an estimate of the number of days on which the daily income was between \(\pounds 2700\) and \(\pounds 3600\).
  3. Use the midpoints of the classes to show that an estimate of the mean daily income is \(\pounds 3275\). An estimate of the standard deviation of the daily income is \(\pounds 1060\). The finance department uses the distribution \(\mathrm { N } \left( 3275,1060 ^ { 2 } \right)\) to model the daily income, in pounds.
  4. Calculate the number of days on which, according to this model, the daily income would be between \(\pounds 4000\) and \(\pounds 6000\).
  5. It is given that approximately \(95 \%\) of values of the distribution \(\mathrm { N } \left( \mu , \sigma ^ { 2 } \right)\) lie within the range \(\mu \pm 2 \sigma\). Without further calculation, use this fact to comment briefly on whether the proposed model is a good fit to the data illustrated in the histogram.