Questions — OCR MEI S2 (80 questions)

Browse by board
AQA AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further AS Paper 1 Further AS Paper 2 Discrete Further AS Paper 2 Mechanics Further AS Paper 2 Statistics Further Paper 1 Further Paper 2 Further Paper 3 Discrete Further Paper 3 Mechanics Further Paper 3 Statistics M1 M2 M3 Paper 1 Paper 2 Paper 3 S1 S2 S3 CAIE FP1 FP2 Further Paper 1 Further Paper 2 Further Paper 3 Further Paper 4 M1 M2 P1 P2 P3 S1 S2 Edexcel AEA AS Paper 1 AS Paper 2 C1 C12 C2 C3 C34 C4 CP AS CP1 CP2 D1 D2 F1 F2 F3 FD1 FD1 AS FD2 FD2 AS FM1 FM1 AS FM2 FM2 AS FP1 FP1 AS FP2 FP2 AS FP3 FS1 FS1 AS FS2 FS2 AS M1 M2 M3 M4 M5 P1 P2 P3 P4 PMT Mocks Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 OCR AS Pure C1 C2 C3 C4 D1 D2 FD1 AS FM1 AS FP1 FP1 AS FP2 FP3 FS1 AS Further Additional Pure Further Additional Pure AS Further Discrete Further Discrete AS Further Mechanics Further Mechanics AS Further Pure Core 1 Further Pure Core 2 Further Pure Core AS Further Statistics Further Statistics AS H240/01 H240/02 H240/03 M1 M2 M3 M4 Mechanics 1 PURE Pure 1 S1 S2 S3 S4 Stats 1 OCR MEI AS Paper 1 AS Paper 2 C1 C2 C3 C4 D1 D2 FP1 FP2 FP3 Further Extra Pure Further Mechanics A AS Further Mechanics B AS Further Mechanics Major Further Mechanics Minor Further Numerical Methods Further Pure Core Further Pure Core AS Further Pure with Technology Further Statistics A AS Further Statistics B AS Further Statistics Major Further Statistics Minor M1 M2 M3 M4 Paper 1 Paper 2 Paper 3 S1 S2 S3 S4 SPS SPS ASFM SPS ASFM Mechanics SPS ASFM Pure SPS ASFM Statistics SPS FM SPS FM Mechanics SPS FM Pure SPS FM Statistics SPS SM SPS SM Mechanics SPS SM Pure SPS SM Statistics WJEC Further Unit 1 Further Unit 2 Further Unit 3 Further Unit 4 Further Unit 5 Further Unit 6 Unit 1 Unit 2 Unit 3 Unit 4
OCR MEI S2 2009 June Q4
4 In a traffic survey a random sample of 400 cars passing a particular location during the rush hour is selected. The type of car and the sex of the driver are classified as follows.
\multirow{2}{*}{}Sex\multirow{2}{*}{Row totals}
MaleFemale
\multirow{5}{*}{Type of car}Hatchback9636132
Saloon7735112
People carrier384482
4WD19827
Sports car222547
Column totals252148400
  1. Carry out a test at the \(5 \%\) significance level to examine whether there is any association between type of car and sex of driver. State carefully your null and alternative hypotheses. Your working should include a table showing the contributions of each cell to the test statistic.
  2. For each type of car, comment briefly on how the number of drivers of each sex compares with what would be expected if there were no association. OCR is committed to seeking permission to reproduce all third-party content that it uses in its assessment materials. OCR has attempted to identify and contact all copyright holders whose work is used in this paper. To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced in the OCR Copyright Acknowledgements Booklet. This is produced for each series of examinations, is given to all schools that receive assessment material and is freely available to download from our public website (\href{http://www.ocr.org.uk}{www.ocr.org.uk}) after the live examination series.
    If OCR has unwittingly failed to correctly acknowledge or clear any third-party content in this assessment material, OCR will be happy to correct its mistake at the earliest possible opportunity. For queries or further information please contact the Copyright Team, First Floor, 9 Hills Road, Cambridge CB2 1PB.
    OCR is part of the Cambridge Assessment Group; Cambridge Assessment is the brand name of University of Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
OCR MEI S2 2010 June Q1
1 Two celebrities judge a talent contest. Each celebrity gives a score out of 20 to each of a random sample of 8 contestants. The scores, \(x\) and \(y\), given by the celebrities to each contestant are shown below.
ContestantABCDEFGH
\(x\)61792013151114
\(y\)6131011971215
  1. Calculate the value of Spearman's rank correlation coefficient.
  2. Carry out a hypothesis test at the \(5 \%\) significance level to determine whether there is positive association between the scores allocated by the two celebrities.
  3. State the distributional assumption required for a test based on the product moment correlation coefficient. Sketch a scatter diagram of the scores above, and discuss whether it appears that the assumption is likely to be valid.
OCR MEI S2 2010 June Q2
2 A radioactive source is decaying at a mean rate of 3.4 counts per 5 seconds.
  1. State conditions for a Poisson distribution to be a suitable model for the rate of decay of the source. You may assume that a Poisson distribution with a mean rate of 3.4 counts per 5 seconds is a suitable model.
  2. State the variance of this Poisson distribution.
  3. Find the probability of
    (A) exactly 3 counts in a 5 -second period,
    (B) at least 3 counts in a 5 -second period.
  4. Find the probability of exactly 40 counts in a period of 60 seconds.
  5. Use a suitable approximating distribution to find the probability of at least 40 counts in a period of 60 seconds.
  6. The background radiation rate also, independently, follows a Poisson distribution and produces a mean count of 1.4 per 5 seconds. Find the probability that the radiation source together with the background radiation give a total count of at least 8 in a 5 -second period.
OCR MEI S2 2010 June Q3
3 In a men's cycling time trial, the times are modelled by the random variable \(X\) minutes which is Normally distributed with mean 63 and standard deviation 5.2.
  1. Find $$\begin{aligned} & \text { (A) } \mathrm { P } ( X < 65 ) \text {, }
    & \text { (B) } \mathrm { P } ( 60 < X < 65 ) \text {. } \end{aligned}$$
  2. Find the probability that 5 riders selected at random all record times between 60 and 65 minutes.
  3. A competitor aims to be in the fastest \(5 \%\) of entrants (i.e. those with the lowest times). Find the maximum time that he can take. It is suggested that holding the time trial on a new course may result in lower times. To investigate this, a random sample of 15 competitors is selected. These 15 competitors do the time trial on the new course. The mean time taken by these riders is 61.7 minutes. You may assume that times are Normally distributed and the standard deviation is still 5.2 minutes. A hypothesis test is carried out to investigate whether times on the new course are lower.
  4. Write down suitable null and alternative hypotheses for the test. Carry out the test at the 5\% significance level.
OCR MEI S2 2010 June Q4
4 In a survey a random sample of 63 runners is selected. The category of runner and the type of running are classified as follows.
\multirow{2}{*}{}Category of runner\multirow{2}{*}{Row totals}
JuniorSeniorVeteran
\multirow{3}{*}{Type of running}Track98219
Road481224
Both410620
Column totals17262063
  1. Carry out a test at the \(5 \%\) significance level to examine whether there is any association between category of runner and the type of running. State carefully your null and alternative hypotheses. Your working should include a table showing the contributions of each cell to the test statistic.
  2. For each category of runner, comment briefly on how the type of running compares with what would be expected if there were no association.
OCR MEI S2 2011 June Q1
1 An experiment is performed to determine the response of maize to nitrogen fertilizer. Data for the amount of nitrogen fertilizer applied, \(x \mathrm {~kg} / \mathrm { hectare }\), and the average yield of maize, \(y\) tonnes/hectare, in 5 experimental plots are given in the table below.
\(x\)0306090120
\(y\)0.52.54.76.27.4
  1. Draw a scatter diagram to illustrate these data.
  2. Calculate the equation of the regression line of \(y\) on \(x\).
  3. Draw your regression line on your scatter diagram and comment briefly on its fit.
  4. Calculate the value of the residual for the data point where \(x = 30\) and \(y = 2.5\).
  5. Use the equation of the regression line to calculate estimates of average yield with nitrogen fertilizer applications of
    (A) \(45 \mathrm {~kg} / \mathrm { hectare }\),
    (B) \(150 \mathrm {~kg} /\) hectare.
  6. In a plot where \(150 \mathrm {~kg} /\) hectare of nitrogen fertilizer is applied, the average yield of maize is 8.7 tonnes/hectare. Comment on this result.
OCR MEI S2 2011 June Q2
2 At a drive-through fast food takeaway, cars arrive independently, randomly and at a uniform average rate. The numbers of cars arriving per minute may be modelled by a Poisson distribution with mean 0.62.
  1. Briefly explain the meaning of each of the three terms 'independently', 'randomly' and 'at a uniform average rate', in the context of cars arriving at a fast food takeaway.
  2. Find the probability of at most 1 car arriving in a period of 1 minute.
  3. Find the probability of more than 5 cars arriving in a period of 10 minutes.
  4. State the exact distribution of the number of cars arriving in a period of 1 hour.
  5. Use a suitable approximating distribution to find the probability that at least 40 cars arrive in a period of 1 hour.
OCR MEI S2 2011 June Q3
3 The weights of Braeburn apples on display in a supermarket, measured in grams, are Normally distributed with mean 210.5 and standard deviation 15.2.
  1. Find the probability that a randomly selected apple weighs at least 220 grams.
  2. These apples are sold in packs of 3. You may assume that the weights of apples in each pack are independent. Find the probability that all 3 of the apples in a randomly selected pack weigh at least 220 grams.
  3. 100 packs are selected at random.
    (A) State the exact distribution of the number of these 100 packs in which all 3 apples weigh at least 220 grams.
    (B) Use a suitable approximating distribution to find the probability that in at most one of these packs all 3 apples weigh at least 220 grams.
    (C) Explain why this approximating distribution is suitable.
  4. The supermarket also sells Cox's Orange Pippin apples. The weights of these apples, measured in grams, are Normally distributed with mean 185 and standard deviation \(\sigma\).
    (A) Given that \(10 \%\) of randomly selected Cox's Orange Pippin apples weigh less than 170 grams, calculate the value of \(\sigma\).
    (B) Sketch the distributions of the weights of both types of apple on a single diagram.
OCR MEI S2 2011 June Q4
4
  1. In a survey on internet usage, a random sample of 200 people is selected. The people are asked how much they have spent on internet shopping during the last three months. The results, classified by amount spent and sex, are shown in the table.
    \multirow{2}{*}{}Sex\multirow{2}{*}{Row totals}
    MaleFemale
    \multirow{5}{*}{Amount spent}Nothing283462
    Less than £50172138
    £50 up to £200222648
    £200 up to £1000231639
    £1000 or more8513
    Column totals98102200
    1. Write down null and alternative hypotheses for a test to examine whether there is any association between amount spent and sex of person. The contributions to the test statistic for the usual \(\chi ^ { 2 }\) test are shown in the table below.
      \multirow{2}{*}{}Sex
      MaleFemale
      \multirow{5}{*}{Amount spent}Nothing0.18650.1791
      Less than £500.14090.1354
      £50 up to £2000.09820.0944
      £200 up to £10000.79180.7608
      £1000 or more0.41710.4007
      The sum of these contributions, correct to 3 decimal places, is 3.205.
    2. Calculate the expected frequency for females spending nothing. Verify the corresponding contribution, 0.1791 , to the test statistic.
    3. Carry out the test at the \(5 \%\) level of significance, stating your conclusion clearly.
  2. A bakery sells loaves specified as having a mean weight of 400 grams. It is known that the weights of these loaves are Normally distributed and that the standard deviation is 5.7 grams. An inspector suspects that the true mean weight may be less than 400 grams. In order to test this, the inspector takes a random sample of 6 loaves. Carry out a suitable test at the \(5 \%\) level, given that the weights, in grams, of the 6 loaves are as follows.
    \(\begin{array} { l l l l l l } 392.1 & 405.8 & 401.3 & 387.4 & 391.8 & 400.6 \end{array}\) RECOGNISING ACHIEVEMENT
OCR MEI S2 2012 June Q1
1 The times, in seconds, taken by ten randomly selected competitors for the first and last sections of an Olympic bobsleigh run are denoted by \(x\) and \(y\) respectively. Summary statistics for these data are as follows. $$\Sigma x = 113.69 \quad \Sigma y = 52.81 \quad \Sigma x ^ { 2 } = 1292.56 \quad \Sigma y ^ { 2 } = 278.91 \quad \Sigma x y = 600.41 \quad n = 10$$
  1. Calculate the sample product moment correlation coefficient.
  2. Carry out a hypothesis test at the \(10 \%\) significance level to investigate whether there is any correlation between times taken for the first and last sections of the bobsleigh run.
  3. State the distributional assumption which is necessary for this test to be valid. Explain briefly how a scatter diagram may be used to check whether this assumption is likely to be valid.
  4. A commentator says that in order to have a fast time on the last section, you must have a fast time on the first section. Comment briefly on this suggestion.
  5. (A) Would your conclusion in part (ii) have been different if you had carried out the hypothesis test at the \(1 \%\) level rather than the \(10 \%\) level? Explain your answer.
    (B) State one advantage and one disadvantage of using a \(1 \%\) significance level rather than a \(10 \%\) significance level in a hypothesis test.
OCR MEI S2 2012 June Q2
2 A particular genetic mutation occurs in one in every 300 births on average. A random sample of 1200 births is selected.
  1. State the exact distribution of \(X\), the number of births in the sample which have the mutation.
  2. Explain why \(X\) has, approximately, a Poisson distribution.
  3. Use a Poisson approximating distribution to find
    (A) \(\mathrm { P } ( X = 1 )\),
    (B) \(\mathrm { P } ( X > 4 )\).
  4. Twenty independent samples, each of 1200 births, are selected. State the mean and variance of a Normal approximating distribution suitable for modelling the total number of births with the mutation in the twenty samples.
  5. Use this Normal approximating distribution to
    (A) find the probability that there are at least 90 births which have the mutation,
    ( \(B\) ) find the least value of \(k\) such that the probability that there are at most \(k\) births with this mutation is greater than 5\%.
OCR MEI S2 2012 June Q3
3 At a vineyard, the process used to fill bottles with wine is subject to variation. The contents of bottles are independently Normally distributed with mean \(\mu = 751.4 \mathrm { ml }\) and standard deviation \(\sigma = 2.5 \mathrm { ml }\).
  1. Find the probability that a randomly selected bottle contains at least 750 ml .
  2. A case of wine consists of 6 bottles. Find the probability that all 6 bottles in a case contain at least 750 ml .
  3. Find the probability that, in a random sample of 25 cases, there are at least 2 cases in which all 6 bottles contain at least 750 ml . It is decided to increase the proportion of bottles which contain at least 750 ml to \(98 \%\).
  4. This can be done by changing the value of \(\mu\), but retaining the original value of \(\sigma\). Find the required value of \(\mu\).
  5. An alternative is to change the value of \(\sigma\), but retain the original value of \(\mu\). Find the required value of \(\sigma\).
  6. Comment briefly on which method might be easier to implement and which might be preferable to the vineyard owners.
OCR MEI S2 2012 June Q4
9 marks
4
  1. Mary is opening a cake shop. As part of her market research, she carries out a survey into which type of cake people like best. She offers people 4 types of cake to taste: chocolate, carrot, lemon and ginger. She selects a random sample of 150 people and she classifies the people as children and adults. The results are as follows.
    \multirow{2}{*}{}Classification of person\multirow{2}{*}{Row totals}
    ChildAdult
    \multirow{4}{*}{Type of cake}Chocolate342357
    Carrot161834
    Lemon41822
    Ginger132437
    Column totals6783150
    The contributions to the test statistic for the usual \(\chi ^ { 2 }\) test are shown in the table below.
    Classification of person
    \cline { 3 - 4 } \multicolumn{2}{|c|}{}ChildAdult
    \multirow{3}{*}{
    Type
    of
    cake
    }
    Chocolate2.86462.3124
    \cline { 2 - 4 }Carrot0.04360.0352
    \cline { 2 - 4 }Lemon3.45492.7889
    \cline { 2 - 4 }Ginger0.75260.6075
    The sum of these contributions, correct to 2 decimal places, is 12.86 .
    1. Calculate the expected frequency for children preferring chocolate cake. Verify the corresponding contribution, 2.8646, to the test statistic.
    2. Carry out the test at the \(1 \%\) level of significance.
  2. Mary buys flour in bags which are labelled as containing 5 kg . She suspects that the average contents of these bags may be less than 5 kg . In order to test this, she selects a random sample of 8 bags and weighs their contents. Assuming that weights are Normally distributed with standard deviation 0.0072 kg , carry out a test at the \(5 \%\) level, given that the weights of the 8 bags in kg are as follows.
    4.992
    4.981
    4.982
    4.996
    4.991
    5.006
    5.009
    5.003
    [0pt] [9] OCR is committed to seeking permission to reproduce all third-party content that it uses in its assessment materials. OCR has attempted to identify and contact all copyright holders whose work is used in this paper. To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced in the OCR Copyright Acknowledgements Booklet. This is produced for each series of examinations and is freely available to download from our public website (\href{http://www.ocr.org.uk}{www.ocr.org.uk}) after the live examination series.
    If OCR has unwittingly failed to correctly acknowledge or clear any third-party content in this assessment material, OCR will be happy to correct its mistake at the earliest possible opportunity.
    For queries or further information please contact the Copyright Team, First Floor, 9 Hills Road, Cambridge CB2 1GE.
    OCR is part of the Cambridge Assessment Group; Cambridge Assessment is the brand name of University of Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
OCR MEI S2 2013 June Q1
1 Salbutamol is a drug used to improve lung function. In a medical trial, a random sample of 60 people with impaired lung function was selected. The forced expiratory volume in one second (FEV1) was measured for each person, both before being given salbutamol and again after a two-week course of the drug. The variables \(x\) and \(y\), measured in suitable units, represent FEV1 before and after the two-week course respectively. The data are illustrated in the scatter diagram below, together with the summary statistics for these data.
\includegraphics[max width=\textwidth, alt={}, center]{f3690bc0-3392-4f29-86f7-797d33fab4f1-2_682_1024_502_516} Summary statistics: $$n = 60 , \quad \sum x = 43.62 , \quad \sum y = 55.15 , \quad \sum x ^ { 2 } = 32.68 , \quad \sum y ^ { 2 } = 51.44 , \quad \sum x y = 40.66$$
  1. Calculate the sample product moment correlation coefficient.
  2. Carry out a hypothesis test at the \(5 \%\) significance level to investigate whether there is positive correlation between FEV1 before and after the course.
  3. State the distributional assumption which is necessary for this test to be valid. State, with a reason, whether the assumption appears to be valid.
  4. Explain the meaning of the term 'significance level'.
  5. Calculate the values of the summary statistics if the data point \(x = 0.55 , y = 1.00\) had been incorrectly recorded as \(x = 1.00 , y = 0.55\).
OCR MEI S2 2013 June Q2
2 Suppose that 3\% of the population of a large city have red hair.
  1. A random sample of 10 people from the city is selected. Find the probability that there is at least one person with red hair in this sample. A random sample of 60 people from the city is selected. The random variable \(X\) represents the number of people in this sample who have red hair.
  2. Explain why the distribution of \(X\) may be approximated by a Poisson distribution. Write down the mean of this Poisson distribution.
  3. Hence find
    (A) \(\mathrm { P } ( X = 2 )\),
    (B) \(\mathrm { P } ( X > 2 )\).
  4. Discuss whether or not it would be appropriate to model \(X\) using a Normal approximating distribution. A random sample of 5000 people from the city is selected.
  5. State the exact distribution of the number of people with red hair in the sample.
  6. Use a suitable Normal approximating distribution to find the probability that there are at least 160 people with red hair in the sample.
OCR MEI S2 2013 June Q3
3 The scores, \(X\), in Paper 1 of an English examination have an underlying Normal distribution with mean 76 and standard deviation 12. The scores are reported as integer marks. So, for example, a score for which \(75.5 \leqslant X < 76.5\) is reported as 76 marks.
  1. Find the probability that a candidate's reported mark is 76 .
  2. Find the probability that a candidate's reported mark is at least 80 .
  3. Three candidates are chosen at random. Find the probability that exactly one of these three candidates' reported marks is at least 80 . The proportion of candidates who receive an A* grade (the highest grade) must not exceed \(10 \%\) but should be as close as possible to \(10 \%\).
  4. Find the lowest reported mark that should be awarded an A* grade. The scores in Paper 2 of the examination have an underlying Normal distribution with mean \(\mu\) and standard deviation 12.
  5. Given that \(20 \%\) of candidates receive a reported mark of 50 or less, find the value of \(\mu\).
OCR MEI S2 2013 June Q4
4 An art gallery is holding an exhibition. A random sample of 150 visitors to the exhibition is selected. The visitors are asked which of four artists they prefer. Their preferences, classified according to whether the visitor is female or male, are given in the table.
Artist preferred
\cline { 3 - 6 } \multicolumn{2}{|c|}{}MonetRenoirDegasCézanne
\multirow{2}{*}{Sex}Male8251819
\cline { 2 - 6 }Female18351017
  1. Carry out a test at the \(10 \%\) significance level to examine whether there is any association between artist preferred and sex of visitor. Your working should include a table showing the contributions of each cell to the test statistic.
  2. For each artist, comment briefly on how the preferences of each sex compare with what would be expected if there were no association.
OCR MEI S2 2014 June Q1
1 A medical student is investigating the claim that young adults with high diastolic blood pressure tend to have high systolic blood pressure. The student measures the diastolic and systolic blood pressures of a random sample of ten young adults. The data are shown in the table and illustrated in the scatter diagram.
Diastolic blood pressure60616263737684879095
Systolic blood pressure98121118114108112132130134139
\includegraphics[max width=\textwidth, alt={}, center]{17e474c4-f5be-4ca1-b7c3-e444b46c3bec-2_865_809_593_628}
  1. Calculate the value of Spearman's rank correlation coefficient for these data.
  2. Carry out a hypothesis test at the \(5 \%\) significance level to examine whether there is positive association between diastolic blood pressure and systolic blood pressure in the population of young adults.
  3. Explain why, in the light of the scatter diagram, it might not be valid to carry out a test based on the product moment correlation coefficient. The product moment correlation coefficient between the diastolic and systolic blood pressures of a random sample of 10 athletes is 0.707 .
  4. Carry out a hypothesis test at the \(1 \%\) significance level to investigate whether there appears to be positive correlation between these two variables in the population of athletes. You may assume that in this case such a test is valid.
OCR MEI S2 2014 June Q2
2 Manufacturing defects occur in a particular type of aluminium sheeting randomly, independently and at a constant average rate of 1.7 defects per square metre.
  1. Explain the meaning of the term 'independently' and name the distribution that models this situation.
  2. Find the probability that there are exactly 2 defects in a sheet of area 1 square metre.
  3. Find the probability that there are exactly 12 defects in a sheet of area 7 square metres. In another type of aluminium sheet, defects occur randomly, independently and at a constant average rate of 0.8 defects per square metre.
  4. A large box is made from 2 square metres of the first type of sheet and 2 square metres of the second type of sheet, chosen independently. Show that the probability that there are at least 8 defects altogether in the box is 0.1334 . A random sample of 100 of these boxes is selected.
  5. State the exact distribution of the number of boxes which have at least 8 defects.
  6. Use a suitable approximating distribution to find the probability that there are at least 20 boxes in the sample which have at least 8 defects.
OCR MEI S2 2014 June Q3
3 The wing lengths of native English male blackbirds, measured in mm , are Normally distributed with mean 130.5 and variance 11.84.
  1. Find the probability that a randomly selected native English male blackbird has a wing length greater than 135 mm .
  2. Given that \(1 \%\) of native English male blackbirds have wing length more than \(k \mathrm {~mm}\), find the value of \(k\).
  3. Find the probability that a randomly selected native English male blackbird has a wing length which is 131 mm correct to the nearest millimetre. It is suspected that Scandinavian male blackbirds have, on average, longer wings than native English male blackbirds. A random sample of 20 Scandinavian male blackbirds has mean wing length 132.4 mm . You may assume that wing lengths in this population are Normally distributed with variance \(11.84 \mathrm {~mm} ^ { 2 }\).
  4. Carry out an appropriate hypothesis test, at the \(5 \%\) significance level.
  5. Discuss briefly one advantage and one disadvantage of using a \(10 \%\) significance level rather than a \(5 \%\) significance level in hypothesis testing in general.
OCR MEI S2 2014 June Q4
4 A researcher at a large company thinks that there may be some relationship between the numbers of working days lost due to illness per year and the ages of the workers in the company. The researcher selects a random sample of 190 workers. The ages of the workers and numbers of days lost for a period of 1 year are summarised below.
\cline { 3 - 5 } \multicolumn{2}{c|}{}Working days lost
\cline { 3 - 5 } \multicolumn{2}{c|}{}0 to 45 to 910 or more
\multirow{3}{*}{Age}Under 3531274
\cline { 2 - 5 }35 to 5028328
\cline { 2 - 5 }Over 50162816
  1. Carry out a test at the \(1 \%\) significance level to investigate whether the researcher's belief appears to be true. Your working should include a table showing the contributions of each cell to the test statistic.
  2. For the 'Over 50' age group, comment briefly on how the working days lost compare with what would be expected if there were no association.
  3. A student decides to reclassify the 'working days lost' into two groups, ' 0 to 4 ' and ' 5 or more', but leave the age groups as before. The test statistic with this classification is 7.08 . Carry out the test at the \(1 \%\) level with this new classification, using the same hypotheses as for the original test.
  4. Comment on the results of the two tests. \section*{END OF QUESTION PAPER}
OCR MEI S2 2015 June Q1
1 A random sample of wheat seedlings is planted and their growth is measured. The table shows their average growth, \(y \mathrm {~mm}\), at half-day intervals.
Time \(t\) days00.511.522.53
Average growth \(y \mathrm {~mm}\)072133455662
  1. Draw a scatter diagram to illustrate these data.
  2. Calculate the equation of the regression line of \(y\) on \(t\).
  3. Calculate the value of the residual for the data point at which \(t = 2\).
  4. Use the equation of the regression line to calculate an estimate of the average growth after 5 days for wheat seedlings. Comment on the reliability of this estimate. It is suggested that it would be better to replace the regression line by a line which passes through the origin. You are given that the equation of such a line is \(y = a t\), where \(a = \frac { \sum y t } { \sum t ^ { 2 } }\).
  5. Find the equation of this line and plot the line on your scatter diagram.
OCR MEI S2 2015 June Q2
2 It was stated in 2012 that \(3 \%\) of \(\pounds 1\) coins were fakes. Throughout this question, you should assume that this is still the case.
  1. Find the probability that, in a random selection of \(25 \pounds 1\) coins, there is exactly one fake coin. A random sample of \(250 \pounds 1\) coins is selected.
  2. Explain why a Poisson distribution is an appropriate approximating distribution for the number of fake coins in the sample.
  3. Use a Poisson distribution to find the probability that, in this sample, there are
    (A) exactly 10 fake coins,
    (B) at least 10 fake coins.
  4. Use a suitable approximating distribution to find the probability that there are at least 50 fake coins in a sample of 2000 coins. It is known that \(0.2 \%\) of another type of coin are fakes.
  5. A random sample of size \(n\) of these coins is taken. Using a Poisson approximating distribution, show that the probability of at most one fake coin in the sample is equal to \(\mathrm { e } ^ { - \lambda } + \lambda \mathrm { e } ^ { - \lambda }\), where \(\lambda = 0.002 n\).
  6. Use the approximation \(\mathrm { e } ^ { - \lambda } + \lambda \mathrm { e } ^ { - \lambda } \approx 1 - \frac { \lambda ^ { 2 } } { 2 }\) for small values of \(\lambda\) to estimate the value of \(n\) for which the probability in part ( \(\mathbf { v }\) ) is equal to 0.995 .
OCR MEI S2 2015 June Q3
3 The random variable \(X\) represents the weight in kg of a randomly selected male dog of a particular breed. \(X\) is Normally distributed with mean 30.7 and standard deviation 3.5.
  1. Find
    (A) \(\mathrm { P } ( X < 30 )\),
    (B) \(P ( 25 < X < 35 )\).
  2. Five of these dogs are chosen at random. Find the probability that each of them weighs at least 30 kg .
  3. The weights of females of the same breed of dog are Normally distributed with mean 26.8 kg . Given that \(5 \%\) of female dogs of this breed weigh more than 30 kg , find the standard deviation of their weights.
  4. Sketch the distributions of the weights of male and female dogs of this breed on a single diagram.
OCR MEI S2 2015 June Q4
4
  1. As part of an investigation into smoking, a random sample of 120 students was selected. The students were asked whether they were smokers, and also whether either of their parents were smokers. The results are summarised in the table below. Test, at the \(5 \%\) significance level, whether there is any association between the smoking habits of the students and their parents.
    At least one
    parent smokes
    Neither parent
    smokes
    Student smokes2127
    Student does not smoke1755
  2. The manufacturer of a particular brand of cigarette claims that the nicotine content of these cigarettes is Normally distributed with mean 0.87 mg . A researcher suspects that the mean nicotine content of this brand is higher than the value claimed by the manufacturer. The nicotine content, \(x \mathrm { mg }\), is measured for a random sample of 100 cigarettes. The data are summarised as follows. $$\sum x = 88.20 \quad \sum x ^ { 2 } = 78.68$$ Carry out a test at the \(1 \%\) significance level to investigate the researcher's belief. \section*{END OF QUESTION PAPER}