Questions — OCR (4619 questions)

Browse by board
AQA AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further AS Paper 1 Further AS Paper 2 Discrete Further AS Paper 2 Mechanics Further AS Paper 2 Statistics Further Paper 1 Further Paper 2 Further Paper 3 Discrete Further Paper 3 Mechanics Further Paper 3 Statistics M1 M2 M3 Paper 1 Paper 2 Paper 3 S1 S2 S3 CAIE FP1 FP2 Further Paper 1 Further Paper 2 Further Paper 3 Further Paper 4 M1 M2 P1 P2 P3 S1 S2 Edexcel AEA AS Paper 1 AS Paper 2 C1 C12 C2 C3 C34 C4 CP AS CP1 CP2 D1 D2 F1 F2 F3 FD1 FD1 AS FD2 FD2 AS FM1 FM1 AS FM2 FM2 AS FP1 FP1 AS FP2 FP2 AS FP3 FS1 FS1 AS FS2 FS2 AS M1 M2 M3 M4 M5 P1 P2 P3 P4 PMT Mocks Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 OCR AS Pure C1 C2 C3 C4 D1 D2 FD1 AS FM1 AS FP1 FP1 AS FP2 FP3 FS1 AS Further Additional Pure Further Additional Pure AS Further Discrete Further Discrete AS Further Mechanics Further Mechanics AS Further Pure Core 1 Further Pure Core 2 Further Pure Core AS Further Statistics Further Statistics AS H240/01 H240/02 H240/03 M1 M2 M3 M4 Mechanics 1 PURE Pure 1 S1 S2 S3 S4 Stats 1 OCR MEI AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further Extra Pure Further Mechanics A AS Further Mechanics B AS Further Mechanics Major Further Mechanics Minor Further Numerical Methods Further Pure Core Further Pure Core AS Further Pure with Technology Further Statistics A AS Further Statistics B AS Further Statistics Major Further Statistics Minor M1 M2 M3 M4 Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 SPS SPS ASFM SPS ASFM Mechanics SPS ASFM Pure SPS ASFM Statistics SPS FM SPS FM Mechanics SPS FM Pure SPS FM Statistics SPS SM SPS SM Mechanics SPS SM Pure SPS SM Statistics WJEC Further Unit 1 Further Unit 2 Further Unit 3 Further Unit 4 Further Unit 5 Further Unit 6 Unit 1 Unit 2 Unit 3 Unit 4
OCR S3 Specimen Q3
3 A random sample of 80 precision-engineered cylindrical components is checked as part of a quality control process. The diameters of the cylinders should be 25.00 cm . Accurate measurements of the diameters, \(x \mathrm {~cm}\), for the sample are summarised by $$\Sigma ( x - 25 ) = 0.44 , \quad \Sigma ( x - 25 ) ^ { 2 } = 0.2287 .$$
  1. Calculate a \(99 \%\) confidence interval for the population mean diameter of the components.
  2. For the calculation in part (i) to be valid, is it necessary to assume that component diameters are normally distributed? Justify your answer.
OCR S3 Specimen Q4
4 The lengths of time, in seconds, between vehicles passing a fixed observation point on a road were recorded at a time when traffic was flowing freely. The frequency distribution in Table 1 is a summary of the data from 100 observations. \begin{table}[h]
Time interval \(( x\) seconds \()\)\(0 < x \leqslant 5\)\(5 < x \leqslant 10\)\(10 < x \leqslant 20\)\(20 < x \leqslant 40\)\(40 < x\)
Observed frequency49222072
\captionsetup{labelformat=empty} \caption{Table 1}
\end{table} It is thought that the distribution of times might be modelled by the continuous random variable \(X\) with probability density function given by $$f ( x ) = \begin{cases} 0.1 e ^ { - 0.1 x } & x > 0
0 & \text { otherwise } \end{cases}$$ Using this model, the expected frequencies (correct to 2 decimal places) for the given time intervals are shown in Table 2. \begin{table}[h]
Time interval \(( x\) seconds \()\)\(0 < x \leqslant 5\)\(5 < x \leqslant 10\)\(10 < x \leqslant 20\)\(20 < x \leqslant 40\)\(40 < x\)
Expected frequency39.3523.8723.2511.701.83
\captionsetup{labelformat=empty} \caption{Table 2}
\end{table}
  1. Show how the expected frequency of 23.87, corresponding to the interval \(5 < x \leqslant 10\), is obtained.
  2. Test, at the 10\% significance level, the goodness of fit of the model to the data.
OCR S3 Specimen Q5
5 The continuous random variable \(X\) has a triangular distribution with probability density function given by $$f ( x ) = \left\{ \begin{array} { l r } 1 + x & - 1 \leqslant x \leqslant 0
1 - x & 0 \leqslant x \leqslant 1
0 & \text { otherwise } \end{array} \right.$$
  1. Show that, for \(0 \leqslant a \leqslant 1\), $$\mathrm { P } ( | X | \leqslant a ) = 2 a - a ^ { 2 } .$$ The random variable \(Y\) is given by \(Y = X ^ { 2 }\).
  2. Express \(\mathrm { P } ( Y \leqslant y )\) in terms of \(y\), for \(0 \leqslant y \leqslant 1\), and hence show that the probability density function of \(Y\) is given by $$g ( y ) = \frac { 1 } { \sqrt { } y } - 1 , \quad \text { for } 0 < y \leqslant 1 .$$
  3. Use the probability density function of \(Y\) to find \(\mathrm { E } ( Y )\), and show how the value of \(\mathrm { E } ( Y )\) may also be obtained directly using the probability density function of \(X\).
  4. Find \(\mathrm { E } ( \sqrt { } Y )\).
OCR S3 Specimen Q6
6 Certain types of food are now sold in metric units. A random sample of 1000 shoppers was asked whether they were in favour of the change to metric units or not. The results, classified according to age, were as shown in the table.
\cline { 2 - 4 } \multicolumn{1}{c|}{}Age of shopper
\cline { 2 - 4 } \multicolumn{1}{c|}{}Under 3535 and overTotal
In favour of change187161348
Not in favour of change283369652
Total4705301000
  1. Use a \(\chi ^ { 2 }\) test to show that there is very strong evidence that shoppers' views about changing to metric units are not independent of their ages.
  2. The data may also be regarded as consisting of two random samples of shoppers; one sample consists of 470 shoppers aged under 35 , of whom 187 were in favour of change, and the second sample consists of 530 shoppers aged 35 or over, of whom 161 were in favour of change. Determine whether a test for equality of population proportions supports the conclusion in part (i).
OCR S3 Specimen Q7
7 A factory manager wished to compare two methods of assembling a new component, to determine which method could be carried out more quickly, on average, by the workforce. A random sample of 12 workers was taken, and each worker tried out each of the methods of assembly. The times taken, in seconds, are shown in the table.
Worker\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)\(K\)\(L\)
Time in seconds for Method 1483847596241505258544960
Time in seconds for Method 2474038555742424062474751
  1. (a) Carry out an appropriate \(t\)-test, using a \(2 \%\) significance level, to test whether there is any difference in the times for the two methods of assembly.
    (b) State an assumption needed in carrying out this test.
    (c) Calculate a \(95 \%\) confidence interval for the population mean time difference for the two methods of assembly.
  2. Instead of using the same 12 workers to try both methods, the factory manager could have used two independent random samples of workers, allocating Method 1 to the members of one sample and Method 2 to the members of the other sample.
    (a) State one disadvantage of a procedure based on two independent random samples.
    (b) State any assumptions that would need to be made to carry out a \(t\)-test based on two independent random samples.
OCR S4 2007 June Q1
1 For the events \(A\) and \(B , \mathrm { P } ( A ) = 0.3 , \mathrm { P } ( B ) = 0.6\) and \(\mathrm { P } \left( A ^ { \prime } \cap B ^ { \prime } \right) = c\), where \(c \neq 0\).
  1. Find \(\mathrm { P } ( A \cap B )\) in terms of \(c\).
  2. Find \(\mathrm { P } ( B \mid A )\) and deduce that \(0.1 \leqslant c \leqslant 0.4\).
OCR S4 2007 June Q2
2 Of 9 randomly chosen students attending a lecture, 4 were found to be smokers and 5 were nonsmokers. During the lecture their pulse-rates were measured, with the following results in beats per minute.
Smokers77859098
Non-smokers5964688088
It may be assumed that these two groups of students were random samples from the student populations of smokers and non-smokers. Using a suitable Wilcoxon test at the \(10 \%\) significance level, test whether there is a difference in the median pulse-rates of the two populations.
OCR S4 2007 June Q3
3 The discrete random variables \(X\) and \(Y\) have the joint probability distribution given in the following table.
\(X\)
\cline { 2 - 5 } \multicolumn{1}{l}{}- 101
10.240.220.04
20.260.180.06
  1. Show that \(\operatorname { Cov } ( X , Y ) = 0\).
  2. Find the conditional distribution of \(X\) given that \(Y = 2\).
OCR S4 2007 June Q4
4 The levels of impurity in a particular alloy were measured using a random sample of 20 specimens. The results, in suitable units, were as follows.
3.002.053.152.653.503.252.853.352.652.75
2.902.202.953.053.653.452.552.152.802.60
  1. Use the sign test, at the \(5 \%\) significance level, to decide if there is evidence that the population median level of impurity is greater than 2.70 .
  2. State what other test might have been used, and give one advantage and one disadvantage this other test has over the sign test.
OCR S4 2007 June Q5
5 The continuous random variable \(X\) has probability density function given by $$\mathrm { f } ( x ) = \begin{cases} \frac { 1 } { ( \alpha - 1 ) ! } x ^ { \alpha - 1 } \mathrm { e } ^ { - x } & x \geqslant 0
0 & x < 0 \end{cases}$$ where \(\alpha\) is a positive integer.
  1. Explain how you can deduce that \(\int _ { 0 } ^ { \infty } x ^ { \alpha - 1 } \mathrm { e } ^ { - x } \mathrm {~d} x = ( \alpha - 1 )\) !.
  2. Write down an integral for the moment generating function \(\mathrm { M } _ { X } ( t )\) of \(X\) and show, by using the substitution \(x = \frac { u } { 1 - t }\), that \(\mathrm { M } _ { X } ( t ) = ( 1 - t ) ^ { - \alpha }\).
  3. Use the moment generating function to find, in terms of \(\alpha\),
    (a) \(\mathrm { E } ( X )\),
    (b) \(\operatorname { Var } ( X )\).
OCR S4 2007 June Q6
6 The discrete random variable \(X\) takes the values 0 and 1 with \(\mathrm { P } ( X = 0 ) = q\) and \(\mathrm { P } ( X = 1 ) = p\), where \(p + q = 1\).
  1. Write down the probability generating function of \(X\). The sum of \(n\) independent observations of \(X\) is denoted by \(S\).
  2. Write down the probability generating function of \(S\), and name the distribution of \(S\).
  3. Use the probability generating function of \(S\) to find \(\mathrm { E } ( S )\) and \(\operatorname { Var } ( S )\).
  4. The independent random variables \(Y\) and \(Z\) are such that \(Y\) has the distribution \(\mathrm { B } \left( 10 , \frac { 1 } { 2 } \right)\), and \(Z\) has probability generating function \(\mathrm { e } ^ { - ( 1 - t ) }\). Find the probability that the sum of one random observation of \(Y\) and one random observation of \(Z\) is equal to 2 .
OCR S4 2007 June Q7
7 The continuous random variable \(X\) has a uniform distribution over the interval \([ 0 , \theta ]\) so that the probability density function is given by $$f ( x ) = \begin{cases} \frac { 1 } { \theta } & 0 \leqslant x \leqslant \theta
0 & \text { otherwise } \end{cases}$$ where \(\theta\) is a positive constant. A sample of \(n\) independent observations of \(X\) is taken and the sample mean is denoted by \(\bar { X }\).
  1. The estimator \(T _ { 1 }\) is defined by \(T _ { 1 } = 2 \bar { X }\). Show that \(T _ { 1 }\) is an unbiased estimator of \(\theta\). It is given that the probability density function of the largest value, \(U\), in the sample is $$g ( u ) = \begin{cases} \frac { n u ^ { n - 1 } } { \theta ^ { n } } & 0 \leqslant u \leqslant \theta
    0 & \text { otherwise } \end{cases}$$
  2. Find \(\mathrm { E } ( U )\) and show that \(\operatorname { Var } ( U ) = \frac { n \theta ^ { 2 } } { ( n + 1 ) ^ { 2 } ( n + 2 ) }\).
  3. The estimator \(T _ { 2 }\) is defined by \(T _ { 2 } = \frac { n + 1 } { n } U\). Given that \(T _ { 2 }\) is also an unbiased estimator of \(\theta\), show that \(T _ { 2 }\) is a more efficient estimator than \(T _ { 1 }\) for \(n > 1\).
OCR S4 2008 June Q1
1 For the mutually exclusive events \(A\) and \(B , \mathrm { P } ( A ) = \mathrm { P } ( B ) = x\), where \(x \neq 0\).
  1. Show that \(x \leqslant \frac { 1 } { 2 }\).
  2. Show that \(A\) and \(B\) are not independent. The event \(C\) is independent of \(A\) and also independent of \(B\), and \(\mathrm { P } ( C ) = 2 x\).
  3. Show that \(\mathrm { P } ( A \cup B \cup C ) = 4 x ( 1 - x )\).
OCR S4 2008 June Q2
2 Part of Helen’s psychology dissertation involved the reaction times to a certain stimulus. She measured the reaction times of 30 randomly selected students, in seconds correct to 2 decimal places. The results are shown in the following stem-and-leaf diagram.
1412
1524
16036
17157
1834579
19246789
2001345789
217
Key: 18 | 3 means 1.83 seconds Helen wishes to test whether the population median time exceeds 1.80 seconds.
  1. Give a reason why the Wilcoxon signed-rank test should not be used.
  2. Carry out a suitable non-parametric test at the \(5 \%\) significance level.
OCR S4 2008 June Q3
3 From the records of Mulcaster United Football Club the following distribution was suggested as a probability model for future matches. \(X\) and \(Y\) denoted the numbers of goals scored by the home team and the away team respectively.
\(X\)
\cline { 2 - 5 } \multicolumn{1}{c}{}0123
00.110.040.060.08
10.080.050.120.05
20.050.080.070.03
30.030.060.070.02
Use the model to find
  1. \(\mathrm { E } ( X )\),
  2. the probability that the away team wins a randomly chosen match,
  3. the probability that the away team wins a randomly chosen match, given that the home team scores. One of the directors, an amateur statistician, finds that \(\operatorname { Cov } ( X , Y ) = 0.007\). He states that, as this value is very close to zero, \(X\) and \(Y\) may be considered to be independent.
  4. Comment on the director's statement.
OCR S4 2008 June Q4
4 William takes a bus regularly on the same journey, sometimes in the morning and sometimes in the afternoon. He wishes to compare morning and afternoon journey times. He records the journey times on 7 randomly chosen mornings and 8 randomly chosen afternoons. The results, each correct to the nearest minute, are as follows, where M denotes a morning time and A denotes an afternoon time.
MAAMMMMMMAAAAAA
192022242526283031333537383942
William wishes to test for a difference between the average times of morning and afternoon journeys.
  1. Given that \(s _ { M } ^ { 2 } = 16.5\) and \(s _ { A } ^ { 2 } = 64.5\), with the usual notation, explain why a \(t\)-test is not appropriate in this case.
  2. William chooses a non-parametric test at the \(5 \%\) significance level. Carry out the test, stating the rejection region.
OCR S4 2008 June Q5
5 The discrete random variable \(X\) has moment generating function \(\frac { 1 } { 4 } \mathrm { e } ^ { 2 t } + a \mathrm { e } ^ { 3 t } + b \mathrm { e } ^ { 4 t }\), where \(a\) and \(b\) are constants. It is given that \(\mathrm { E } ( X ) = 3 \frac { 3 } { 8 }\).
  1. Show that \(a = \frac { 1 } { 8 }\), and find the value of \(b\).
  2. Find \(\operatorname { Var } ( X )\).
  3. State the possible values of \(X\).
OCR S4 2008 June Q6
6 The continuous random variable \(Y\) has cumulative distribution function given by $$\mathrm { F } ( y ) = \begin{cases} 0 & y < a ,
1 - \frac { a ^ { 3 } } { y ^ { 3 } } & y \geqslant a , \end{cases}$$ where \(a\) is a positive constant. A random sample of 3 observations, \(Y _ { 1 } , Y _ { 2 } , Y _ { 3 }\), is taken, and the smallest is denoted by \(S\).
  1. Show that \(\mathrm { P } ( S > s ) = \left( \frac { a } { s } \right) ^ { 9 }\) and hence obtain the probability density function of \(S\).
  2. Show that \(S\) is not an unbiased estimator of \(a\), and construct an unbiased estimator, \(T _ { 1 }\), based on \(S\). It is given that \(T _ { 2 }\), where \(T _ { 2 } = \frac { 2 } { 9 } \left( Y _ { 1 } + Y _ { 2 } + Y _ { 3 } \right)\), is another unbiased estimator of \(a\).
  3. Given that \(\operatorname { Var } ( Y ) = \frac { 3 } { 4 } a ^ { 2 }\) and \(\operatorname { Var } ( S ) = \frac { 9 } { 448 } a ^ { 2 }\), determine which of \(T _ { 1 }\) and \(T _ { 2 }\) is the more efficient estimator.
  4. The values of \(Y\) for a particular sample are 12.8, 4.5 and 7.0. Find the values of \(T _ { 1 }\) and \(T _ { 2 }\) for this sample, and give a reason, unrelated to efficiency, why \(T _ { 1 }\) gives a better estimate of \(a\) than \(T _ { 2 }\) in this case.
OCR S4 2008 June Q7
7 The probability generating function of the random variable \(X\) is given by $$\mathrm { G } ( t ) = \frac { 1 + a t } { 4 - t }$$ where \(a\) is a constant.
  1. Find the value of \(a\).
  2. Find \(\mathrm { P } ( X = 3 )\). The sum of 3 independent observations of \(X\) is denoted by \(Y\). The probability generating function of \(Y\) is denoted by \(\mathrm { H } ( t )\).
  3. Use \(\mathrm { H } ( t )\) to find \(\mathrm { E } ( Y )\).
  4. By considering \(\mathrm { H } ( - 1 ) + \mathrm { H } ( 1 )\), show that \(\mathrm { P } ( Y\) is an even number \() = \frac { 62 } { 125 }\). \footnotetext{Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every reasonable effort has been made by the publisher (OCR) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the publisher will be pleased to make amends at the earliest possible opportunity. OCR is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge. }
OCR S4 2011 June Q1
1 The random variable \(X\) has the distribution \(\mathrm { B } ( n , p )\).
  1. Show, from the definition, that the probability generating function of \(X\) is \(( q + p t ) ^ { n }\), where \(q = 1 - p\).
  2. The independent random variable \(Y\) has the distribution \(\mathrm { B } ( 2 n , p )\) and \(T = X + Y\). Use probability generating functions to determine the distribution of \(T\), giving its parameters.
OCR S4 2011 June Q2
2 A botanist believes that some species of plants produce more flowers at high altitudes than at low altitudes. In order to investigate this belief the botanist randomly samples 11 species of plants each of which occurs at both altitudes. The numbers of flowers on the plants are shown in the table.
Species1234567891011
Number of flowers at low altitude534729654112
Number of flowers at high altitude161081416202115212
  1. Use the Wilcoxon signed rank test at the 5\% significance level to test the botanist's belief.
  2. Explain why the Wilcoxon rank sum test should not be used for this test.
OCR S4 2011 June Q3
3 For the events \(A\) and \(B , \mathrm { P } ( A ) = \mathrm { P } ( B ) = \frac { 3 } { 4 }\) and \(\mathrm { P } \left( A \mid B ^ { \prime } \right) = \frac { 1 } { 2 }\).
  1. Find \(\mathrm { P } ( A \cap B )\). For a third event \(C , \mathrm { P } ( C ) = \frac { 1 } { 4 }\) and \(C\) is independent of the event \(A \cap B\).
  2. Find \(\mathrm { P } ( A \cap B \cap C )\).
  3. Given that \(\mathrm { P } ( C \mid A ) = \lambda\) and \(\mathrm { P } ( B \mid C ) = 3 \lambda\), and that no event occurs outside \(A \cup B \cup C\), find the value of \(\lambda\).
OCR S4 2011 June Q4
4 The discrete random variable \(X\) has moment generating function \(\left( \frac { 1 } { 4 } + \frac { 3 } { 4 } \mathrm { e } ^ { t } \right) ^ { 3 }\).
  1. Find \(\mathrm { E } ( X )\).
  2. Find \(\mathrm { P } ( X = 2 )\).
  3. Show that \(X\) can be expressed as a sum of 3 independent observations of a random variable \(Y\). Obtain the probability distribution of \(Y\), and the variance of \(Y\).
OCR S4 2011 June Q5
5 A test was carried out to compare the breaking strengths of two brands of elastic band, \(A\) and \(B\), of the same size. Random samples of 6 were selected from each brand and the breaking strengths were measured. The results, in suitable units and arranged in ascending order for each brand, are as follows.
Brand \(A :\)5.68.79.210.711.212.6
Brand \(B :\)10.111.612.012.212.913.5
  1. Give one advantage that a non-parametric test might have over a parametric test in this context.
  2. Carry out a suitable Wilcoxon test at the \(5 \%\) significance level of whether there is a difference between the average breaking strengths of the two brands.
  3. An extra elastic band of brand \(B\) was tested and found to have a breaking strength exceeding all of the other 12 bands. Determine whether this information alters the conclusion of your test.
OCR S4 2011 June Q6
6 A City Council comprises 16 Labour members, 14 Conservative members and 6 members of Other parties. A sample of two members was chosen at random to represent the Council at an event. The number of Labour members and the number of Conservative members in this sample are denoted by \(L\) and \(C\) respectively. The joint probability distribution of \(L\) and \(C\) is given in the following table. \(C\)
\(L\)
012
0\(\frac { 1 } { 42 }\)\(\frac { 16 } { 105 }\)\(\frac { 4 } { 21 }\)
1\(\frac { 2 } { 15 }\)\(\frac { 16 } { 45 }\)0
2\(\frac { 13 } { 90 }\)00
  1. Verify the two non-zero probabilities in the table for which \(C = 1\).
  2. Find the expected number of Conservatives in the sample.
  3. Find the expected number of Other members in the sample.
  4. Explain why \(L\) and \(C\) are not independent, and state what can be deduced about \(\operatorname { Cov } ( L , C )\).