2.05d - OCR Spec

CAIE S2 2017 June Q1

4 marks Easy -1.2

1 In a survey of 2000 randomly chosen adults, 1602 said that they owned a smartphone. Calculate an approximate $95 \%$ confidence interval for the proportion of adults in the whole population who own a smartphone.
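
A possible numerical check for this question, sketched under the usual assumption that the sample proportion is approximately normal with the variance estimated from the sample. The function name `proportion_ci` is illustrative and not part of the question.

```python
from math import sqrt
from statistics import NormalDist

def proportion_ci(successes, n, level=0.95):
    """Approximate confidence interval for a population proportion."""
    p_hat = successes / n
    # two-sided critical value, about 1.96 for a 95% interval
    z = NormalDist().inv_cdf(0.5 + level / 2)
    half_width = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width

low, high = proportion_ci(1602, 2000)
print(f"{low:.4f} to {high:.4f}")
```

The interval is centred on the sample proportion 1602/2000 = 0.801.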

OCR S2 2007 January Q4

7 marks Moderate -0.8

4 A set of observations of a random variable $W$ can be summarised as follows: $$n = 14 , \quad \Sigma w = 100.8 , \quad \Sigma w ^ { 2 } = 938.70 .$$

Calculate an unbiased estimate of the variance of $W$.
The mean of 70 observations of $W$ is denoted by $\bar { W }$. State the approximate distribution of $\bar { W }$, including unbiased estimate(s) of any parameter(s).
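
One way to check the arithmetic from the summary statistics, as a sketch rather than a mark scheme. It uses the standard unbiased estimator $s^2 = \frac{1}{n-1}\left(\Sigma w^2 - \frac{(\Sigma w)^2}{n}\right)$; the helper name `unbiased_variance` is an assumption made for illustration.

```python
def unbiased_variance(n, sum_x, sum_x2):
    """Unbiased variance estimate from n, the sum and the sum of squares."""
    return (sum_x2 - sum_x ** 2 / n) / (n - 1)

n, sum_w, sum_w2 = 14, 100.8, 938.70
mean_w = sum_w / n                        # unbiased estimate of the mean
var_w = unbiased_variance(n, sum_w, sum_w2)

# By the Central Limit Theorem, the mean of 70 observations is
# approximately N(mean_w, var_w / 70).
var_of_mean = var_w / 70
print(mean_w, var_w, var_of_mean)
```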

OCR S2 2008 January Q4

7 marks Moderate -0.8

4 The random variable $Y$ has the distribution $\mathrm { N } \left( \mu , \sigma ^ { 2 } \right)$. The results of 40 independent observations of $Y$ are summarised by $$\Sigma y = 3296.0 , \quad \Sigma y ^ { 2 } = 286800.40$$

(i) Calculate unbiased estimates of $\mu$ and $\sigma ^ { 2 }$.
(ii) Use your answers to part (i) to estimate the probability that a single random observation of $Y$ will be less than 60.0.
(iii) Explain whether it is necessary to know that $Y$ is normally distributed in answering part (i) of this question.
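
A brief sketch of how the estimates feed into the probability calculation, assuming the estimated parameters are plugged directly into a normal model. Variable names are illustrative only.

```python
from math import sqrt
from statistics import NormalDist

n, sum_y, sum_y2 = 40, 3296.0, 286800.40

mu_hat = sum_y / n                                  # estimate of mu
var_hat = (sum_y2 - n * mu_hat ** 2) / (n - 1)      # unbiased estimate of sigma^2

# Plug the estimates into N(mu_hat, var_hat) to estimate P(Y < 60.0)
prob_below_60 = NormalDist(mu_hat, sqrt(var_hat)).cdf(60.0)
print(mu_hat, var_hat, round(prob_below_60, 3))
```

The estimates themselves do not use normality; only the final probability step relies on the normal model.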

OCR S2 2006 June Q7

18 marks Standard +0.3

7 Three independent researchers, $A , B$ and $C$, carry out significance tests on the power consumption of a manufacturer's domestic heaters. The power consumption, $X$ watts, is a normally distributed random variable with mean $\mu$ and standard deviation 60. Each researcher tests the null hypothesis $\mathrm { H } _ { 0 } : \mu = 4000$ against the alternative hypothesis $\mathrm { H } _ { 1 } : \mu > 4000$. Researcher $A$ uses a sample of size 50 and a significance level of $5 \%$.

(i) Find the critical region for this test, giving your answer correct to 4 significant figures.
In fact the value of $\mu$ is 4020.
(ii) Calculate the probability that Researcher $A$ makes a Type II error.
(iii) Researcher $B$ uses a sample bigger than 50 and a significance level of $5 \%$. Explain whether the probability that Researcher $B$ makes a Type II error is less than, equal to, or greater than your answer to part (ii).
(iv) Researcher $C$ uses a sample of size 50 and a significance level bigger than $5 \%$. Explain whether the probability that Researcher $C$ makes a Type II error is less than, equal to, or greater than your answer to part (ii).
(v) State with a reason whether it is necessary to use the Central Limit Theorem at any point in this question.
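
The critical region and Type II error calculations can be sketched as below. This is a numerical check only, not an official solution. The function `type_ii_error` is a hypothetical helper, and the sample size 100 and significance level 10% used for Researchers $B$ and $C$ are assumed values chosen purely for illustration, since the question gives no specific figures.

```python
from math import sqrt
from statistics import NormalDist

MU_0, MU_TRUE, SIGMA = 4000, 4020, 60

def critical_value(n, alpha):
    """Lower boundary of the critical region for the sample mean (upper-tail test)."""
    return MU_0 + NormalDist().inv_cdf(1 - alpha) * SIGMA / sqrt(n)

def type_ii_error(n, alpha):
    """P(sample mean falls below the critical value when mu is really 4020)."""
    se = SIGMA / sqrt(n)
    return NormalDist(MU_TRUE, se).cdf(critical_value(n, alpha))

c_a = critical_value(50, 0.05)        # Researcher A: reject H0 if the mean exceeds c_a
beta_a = type_ii_error(50, 0.05)
beta_b = type_ii_error(100, 0.05)     # assumed larger sample for Researcher B
beta_c = type_ii_error(50, 0.10)      # assumed larger significance level for Researcher C
print(round(c_a, 1), round(beta_a, 3), beta_b < beta_a, beta_c < beta_a)
```

Because $X$ is stated to be normal, the sample mean is exactly normal, so the sketch needs no large-sample approximation.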

OCR H240/02 2018 June Q10

9 marks Standard +0.3

10 A certain forest contains only trees of a particular species. Dipak wished to take a random sample of 5 trees from the forest. He numbered the trees from 1 to 784. Then, using his calculator, he generated the random digits 14781049. Using these digits, Dipak formed 5 three-digit numbers. He took the first, second and third digits, followed by the second, third and fourth digits and so on. In this way he obtained the following list of numbers for his sample. $$\begin{array} { l l l l l } 147 & 478 & 781 & 104 & 49 \end{array}$$

Explain why Dipak omitted the number 810 from his list.
Explain why Dipak's sample is not random.
The mean height of all trees of this species is known to be 4.2 m. Dipak wishes to test whether the mean height of trees in the forest is less than 4.2 m. He now uses a correct method to choose a random sample of 50 trees and finds that their mean height is 4.0 m. It is given that the standard deviation of the heights of trees in the forest is 0.8 m.
Carry out the test at the $2 \%$ significance level.
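
A minimal sketch of the final test, assuming a one-tailed $z$-test with $\mathrm{H}_0: \mu = 4.2$ against $\mathrm{H}_1: \mu < 4.2$ and the sample mean treated as normally distributed with the given standard deviation. Variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu_0, sigma, n = 4.2, 0.8, 50
sample_mean, alpha = 4.0, 0.02

# Standardise the sample mean under H0
z = (sample_mean - mu_0) / (sigma / sqrt(n))

# Lower-tail p-value, because H1 is mu < 4.2
p_value = NormalDist().cdf(z)
reject_h0 = p_value < alpha
print(round(z, 3), round(p_value, 4), reject_h0)
```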

OCR MEI Paper 2 2018 June Q13

10 marks Challenging +1.2

13 Each weekday Keira drives to work with her son Kaito. She always sets off at 8.00 a.m. She models her journey time, $x$ minutes, by the distribution $X \sim \mathrm {~N} ( 15,4 )$. Over a long period of time she notes that her journey takes less than 14 minutes on $7 \%$ of the journeys, and takes more than 18 minutes on $31 \%$ of the journeys.

Investigate whether Keira's model is a good fit for the data.
Kaito believes that Keira's value for the variance is correct, but realises that the mean is not correct.
Find, correct to two significant figures, the value of the mean that Keira should use in a refined model which does fit the data.
Keira buys a new car. After driving to work in it each day for several weeks, she randomly selects the journey times for $n$ of these days. Her mean journey time for these $n$ days is 16 minutes. Using the refined model she conducts a hypothesis test to see if her mean journey time has changed, and finds that the result is significant at the $5 \%$ level.
Determine the smallest possible value of $n$.
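
One possible line of working for the three parts, sketched numerically. It assumes the test in the last part is two-tailed (the wording is "has changed") and that the refined mean is rounded to two significant figures before being used. All variable names are illustrative.

```python
from math import floor
from statistics import NormalDist

std_normal = NormalDist()
sigma = 2.0  # square root of the model variance 4

# Keira's model N(15, 4) compared with the observed proportions 0.07 and 0.31
model = NormalDist(15, sigma)
p_below_14 = model.cdf(14)
p_above_18 = 1 - model.cdf(18)

# Refined mean: keep sigma = 2 and solve each observed proportion for mu
mu_from_14 = 14 - sigma * std_normal.inv_cdf(0.07)
mu_from_18 = 18 - sigma * std_normal.inv_cdf(1 - 0.31)
mu_refined = float(f"{mu_from_14:.2g}")  # rounded to 2 significant figures

# Two-tailed 5% test with sample mean 16:
# significant when |16 - mu| / (sigma / sqrt(n)) exceeds the critical value
z_crit = std_normal.inv_cdf(0.975)
bound = (z_crit * sigma / abs(16 - mu_refined)) ** 2
n_min = floor(bound) + 1  # smallest integer strictly above the bound
print(round(p_below_14, 3), round(p_above_18, 3), mu_refined, n_min)
```

The model probabilities differ markedly from the observed 7% and 31%, while both observed proportions lead to the same refined mean at two significant figures.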

OCR MEI Paper 2 2019 June Q15

6 marks Challenging +1.2

15 You must show detailed reasoning in this question. The screenshot in Fig. 15 shows the probability distribution for the continuous random variable $X$, where $X \sim \mathrm { N } \left( \mu , \sigma ^ { 2 } \right)$.

\begin{figure}[h]
\includegraphics[alt={},max width=\textwidth]{95eb3bcc-6d3c-4f7e-9b27-5e046ab57ec5-11_387_954_1599_260}
\captionsetup{labelformat=empty} \caption{Fig. 15}
\end{figure}

The distribution is symmetrical about the line $x = 35$ and there is a point of inflection at $x = 31$.
Fifty independent readings of $X$ are made. Show that the probability that at least 45 of these readings are between 30 and 40 is less than 0.05.
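
A numerical check of the claim, offered as a sketch and not a substitute for the detailed reasoning the question asks for. It relies on the fact that a normal curve has its points of inflection at $\mu \pm \sigma$, so symmetry about 35 and an inflection at 31 give $\mu = 35$ and $\sigma = 4$. Variable names are illustrative.

```python
from math import comb
from statistics import NormalDist

mu, sigma = 35, 4  # line of symmetry at 35, inflection point at mu - sigma = 31
x_dist = NormalDist(mu, sigma)

# Probability that a single reading lies between 30 and 40
p = x_dist.cdf(40) - x_dist.cdf(30)

# Number of readings in range is Binomial(50, p); sum the upper tail from 45
n = 50
prob_at_least_45 = sum(
    comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(45, n + 1)
)
print(round(p, 4), round(prob_at_least_45, 4), prob_at_least_45 < 0.05)
```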

Edexcel Paper 3 Specimen Q3

12 marks Standard +0.3

A machine cuts strips of metal to length $L \mathrm {~cm}$, where $L$ is normally distributed with standard deviation 0.5 cm.

Strips with length either less than 49 cm or greater than 50.75 cm cannot be used.
Given that 2.5\% of the cut lengths exceed 50.98 cm, find the probability that a randomly chosen strip of metal can be used.
Ten strips of metal are selected at random.
Find the probability that fewer than 4 of these strips cannot be used.
A second machine cuts strips of metal of length $X \mathrm {~cm}$, where $X$ is normally distributed with standard deviation 0.6 cm. A random sample of 15 strips cut by this second machine was found to have a mean length of 50.4 cm.
Stating your hypotheses clearly and using a $1 \%$ level of significance, test whether or not the mean length of all the strips cut by the second machine is greater than 50.1 cm.
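
The first two parts can be checked numerically as sketched below. The sketch assumes the mean of $L$ is recovered from the 97.5th percentile and that the ten strips are independent, so the number of unusable strips is binomial. Variable names are illustrative.

```python
from math import comb
from statistics import NormalDist

sd = 0.5
# 2.5% of lengths exceed 50.98 cm, so 50.98 is the 97.5th percentile of L
mean_length = 50.98 - NormalDist().inv_cdf(0.975) * sd
lengths = NormalDist(mean_length, sd)

# A strip can be used when its length lies between 49 cm and 50.75 cm
p_usable = lengths.cdf(50.75) - lengths.cdf(49)
p_unusable = 1 - p_usable

# Number of unusable strips out of 10 is Binomial(10, p_unusable);
# "fewer than 4" means 0, 1, 2 or 3
n_strips = 10
p_fewer_than_4 = sum(
    comb(n_strips, k) * p_unusable ** k * p_usable ** (n_strips - k)
    for k in range(4)
)
print(round(mean_length, 2), round(p_usable, 4), round(p_fewer_than_4, 4))
```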

OCR MEI Further Statistics Major Specimen Q7

11 marks Standard +0.3

7 A newspaper reports that the average price of unleaded petrol in the UK is 110.2 p per litre. The price, in pence, of a litre of unleaded petrol at a random sample of 15 petrol stations in Yorkshire is shown below together with some output from software used to analyse the data.

116.9	114.9	110.9	113.9	114.9
117.9	112.9	99.9	114.9	103.9
123.9	105.7	108.9	102.9	112.7

\begin{table}[h]
\begin{tabular}{|l|l|}
\hline
\multicolumn{2}{|l|}{Statistics} \\
\hline
n & 15 \\
Mean & 111.6733 \\
$\sigma$ & 6.1877 \\
s & 6.4048 \\
$\Sigma x$ & 1675.1 \\
$\Sigma x ^ { 2 }$ & 187638.31 \\
Min & 99.9 \\
Q1 & 105.7 \\
Median & 112.9 \\
Q3 & 114.9 \\
Max & 123.9 \\
\hline
\end{tabular}
\captionsetup{labelformat=empty} \caption{Fig. 7.1}
\end{table}

\begin{tabular}{|l|l|}
\hline
$n$ & 15 \\
Kolmogorov-Smirnov test & $p > 0.15$ \\
Null hypothesis & The data can be modelled by a Normal distribution \\
Alternative hypothesis & The data cannot be modelled by a Normal distribution \\
\hline
\end{tabular}

Select a suitable hypothesis test to investigate whether there is any evidence that the average price of unleaded petrol in Yorkshire is different from 110.2 p. Justify your choice of test.
Conduct the hypothesis test at the $5 \%$ level of significance.

WJEC Unit 4 Specimen Q4

11 marks Moderate -0.3

4. Automatic coin counting machines sort, count and batch coins. A particular brand of these machines rejects 2p coins that are less than 6.12 grams or greater than 8.12 grams.

The histogram represents the distribution of the weight of UK 2p coins supplied by the Royal Mint. This distribution has mean 7.12 grams and standard deviation 0.357 grams.

\begin{figure}[h]
\captionsetup{labelformat=empty} \caption{Weight of UK two pence coins}
\includegraphics[alt={},max width=\textwidth]{b35e94ab-a426-4fca-9ecb-c659e0143ed7-3_602_969_664_589}
\end{figure}

Explain why the weight of 2p coins can be modelled using a normal distribution.
Assume that the weight of 2p coins is normally distributed. Calculate the proportion of 2p coins that are rejected by this brand of coin counting machine.
A manager suspects that a large batch of 2p coins is counterfeit. A random sample of 30 of the suspect coins is selected. Each of the coins in the sample is weighed. The results are shown in the summary statistics table.
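\begin{tabular}{|l|l|l|l|l|l|l|}
\hline
\multicolumn{7}{|l|}{Summary statistics} \\
\hline
Mean & Standard deviation & Minimum & Lower quartile & Median & Upper quartile & Maximum \\
\hline
6.89 & 0.296 & 6.45 & 6.63 & 6.88 & 7.08 & 7.48 \\
\hline
\end{tabular}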
i) What assumption must be made about the weights of coins in this batch in order to conduct a test of significance on the sample mean? State, with a reason, whether you think this assumption is reasonable.
ii) Assuming the population standard deviation is 0.357 grams, test at the $1 \%$ significance level whether the mean weight of the 2p coins in this batch is less than 7.12 grams.

AQA S2 2010 June Q1

9 marks Moderate -0.3

Judith, the village postmistress, believes that, since moving the post office counter into the local pharmacy, the mean daily number of customers that she serves has increased from $79$. In order to investigate her belief, she counts the number of customers that she serves on $12$ randomly selected days, with the following results.
$$88 \quad 81 \quad 84 \quad 89 \quad 90 \quad 77 \quad 72 \quad 80 \quad 82 \quad 81 \quad 75 \quad 85$$
Stating a necessary distributional assumption, test Judith's belief at the $5\%$ level of significance. [9 marks]

AQA Paper 3 2021 June Q15

7 marks Standard +0.3

A team game involves solving puzzles to escape from a room. Using data from the past, the mean time to solve the puzzles and escape from one of these rooms is 65 minutes with a standard deviation of 11.3 minutes. After recent changes to the puzzles in the room, it is claimed that the mean time to solve the puzzles and escape has changed. To test this claim, a random sample of 100 teams is selected. The total time to solve the puzzles and escape for the 100 teams is 6780 minutes. Assuming that the times are normally distributed, test at the 2% level the claim that the mean time has changed. [7 marks]
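
A minimal sketch of this test, assuming a two-tailed $z$-test with $\mathrm{H}_0: \mu = 65$ against $\mathrm{H}_1: \mu \neq 65$ and the historical standard deviation 11.3 still applying after the changes. Variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu_0, sigma, n = 65, 11.3, 100
total_time, alpha = 6780, 0.02

sample_mean = total_time / n
z = (sample_mean - mu_0) / (sigma / sqrt(n))

# Two-tailed p-value, because the claim is only that the mean has changed
p_value = 2 * (1 - NormalDist().cdf(abs(z)))
reject_h0 = p_value < alpha
print(sample_mean, round(z, 3), round(p_value, 4), reject_h0)
```

An equivalent check compares $|z|$ with the two-tailed critical value `NormalDist().inv_cdf(1 - alpha / 2)`.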

2.05d Sample mean as random variable