Expected frequencies partially provided

A question is this type if and only if some expected frequencies or contributions to the test statistic are given in a table and the student must complete the table and/or verify specific values before carrying out the test.

22 questions · Standard +0.2

5.06a Chi-squared: contingency tables
Sort by: Default | Easiest first | Hardest first
OCR MEI S2 2008 June Q4
18 marks Standard +0.3
4 A student is investigating whether there is any association between the species of shellfish that occur on a rocky shore and where they are located. A random sample of 160 shellfish is selected and the numbers of shellfish in each category are summarised in the table below.
Location
\cline { 3 - 5 } \multicolumn{2}{|c|}{}ExposedShelteredPool
\multirow{3}{*}{Species}Limpet243216
\cline { 2 - 5 }Mussel24113
\cline { 2 - 5 }Other52223
  1. Write down null and alternative hypotheses for a test to examine whether there is any association between species and location. The contributions to the test statistic for the usual \(\chi ^ { 2 }\) test are shown in the table below.
    ContributionLocation
    \cline { 3 - 5 }ExposedShelteredPool
    \multirow{3}{*}{Species}Limpet0.00090.25850.4450
    \cline { 2 - 5 }Mussel10.34721.27564.8773
    \cline { 2 - 5 }Other8.07190.14027.4298
    The sum of these contributions is 32.85 .
  2. Calculate the expected frequency for mussels in pools. Verify the corresponding contribution 4.8773 to the test statistic.
  3. Carry out the test at the \(5 \%\) level of significance, stating your conclusion clearly.
  4. For each species, comment briefly on how its distribution compares with what would be expected if there were no association.
  5. If 3 of the 160 shellfish are selected at random, one from each of the 3 types of location, find the probability that all 3 of them are limpets.
OCR S3 2007 January Q7
15 marks Standard +0.3
7 It is thought that a person's eye colour is related to the reaction of the person's skin to ultra-violet light. As part of a study, a random sample of 140 people were treated with a standard dose of ultra-violet light. The degree of reaction was classified as None, Mild or Strong. The results are given in Table 1. The corresponding expected frequencies for a \(\chi ^ { 2 }\) test of association between eye colour and reaction are shown in Table 2. \begin{table}[h]
\captionsetup{labelformat=empty} \caption{Table 1
Observed frequencies}
Eye colour
BlueBrownOtherTotal
None12171039
ReactionMild31211163
Strong2241238
Total654233140
\end{table} \begin{table}[h]
\captionsetup{labelformat=empty} \caption{Table 2
Expected frequencies}
Eye colour
BlueBrownOther
None18.1111.709.19
ReactionMild29.2518.9014.85
Strong17.6411.408.96
\end{table}
  1. (a) State suitable hypotheses for the test.
    (b) Show how the expected frequency of 18.11 in Table 2 is obtained.
    (c) Show that the three cells in the top row together contribute 4.53 to the calculated value of \(\chi ^ { 2 }\), correct to 2 decimal places.
    (d) You are given that the total calculated value of \(\chi ^ { 2 }\) is 12.78 , correct to 2 decimal places. Give the smallest value of \(\alpha\) obtained from the tables for which the null hypothesis would be rejected at the \(\alpha \%\) significance level.
  2. Test, at the \(5 \%\) significance level, whether the proportions of people in the whole population with blue eyes, brown eyes and other colours are in the ratios \(2 : 2 : 1\).
OCR S3 2008 January Q6
15 marks Standard +0.3
6 The Research and Development department of a paint manufacturer has produced paint of three different shades of grey, \(G _ { 1 } , G _ { 2 }\) and \(G _ { 3 }\). In order to find the reaction of the public to these shades, each of a random sample of 120 people was asked to state which shade they preferred. The results, classified by gender, are shown in Table 1. \begin{table}[h]
Shade
\cline { 2 - 5 }\(G _ { 1 }\)\(G _ { 2 }\)\(G _ { 3 }\)
\cline { 2 - 5 } GenderMale112423
Female181331
\cline { 2 - 5 }
\cline { 2 - 5 }
\captionsetup{labelformat=empty} \caption{Table 1}
\end{table} Table 2 shows the corresponding expected values, correct to 2 decimal places, for a test of independence. \begin{table}[h]
Shade
\cline { 2 - 5 }\(G _ { 1 }\)\(G _ { 2 }\)\(G _ { 3 }\)
\cline { 2 - 5 } GenderMale14.0217.8826.10
Female14.9819.1227.90
\cline { 2 - 5 }
\cline { 2 - 5 }
\captionsetup{labelformat=empty} \caption{Table 2}
\end{table}
  1. Show how the value 17.88 for Male, \(G _ { 2 }\) was obtained.
  2. Test, at the \(5 \%\) significance level, whether gender and preferred shade are independent.
  3. Determine the smallest significance level obtained from tables or calculator for which there is evidence that not all shades are equally preferred by people in general, irrespective of gender.
OCR S3 2006 June Q5
9 marks Moderate -0.3
5 Gloria is a market trader who sells jeans. She trades on Mondays, Wednesdays and Fridays. Wishing to investigate whether the volume of trade depends on the day of the week, Gloria analysed a random sample of 150 days' sales and classified them by day and volume (low, medium and high). The results are given in the table below.
Day
MondayWednesdayFriday
\multirow{3}{*}{Volume}Low15132
Medium232623
High12927
Gloria asked a statistician to perform a suitable test of independence and, as part of this test, expected frequencies were calculated. These are shown in the table below.
Day
MondayWednesdayFriday
Low10.009.6010.40
VolumeMedium24.0023.0424.96
High16.0015.3616.64
  1. Show how the value 23.04 for medium volume on Wednesday has been obtained.
  2. State, giving a reason, if it is necessary to combine any rows or columns in order to carry out the test. The value of the test statistic is found to be 21.15, correct to 2 decimal places.
  3. Stating suitable hypotheses for the test, give its conclusion using a \(1 \%\) significance level. Gloria wishes to hold a sale and asks the statistician to advise her on which day to hold it in order to sell as much as possible.
  4. State the day that the statistician should advise and give a reason for the choice.
OCR MEI S2 2010 January Q4
18 marks Moderate -0.3
4 A council provides waste paper recycling services for local businesses. Some businesses use the standard service for recycling paper, others use a special service for dealing with confidential documents, and others use both. Businesses are classified as small or large. A survey of a random sample of 285 businesses gives the following data for size of business and recycling service.
Recycling Service
\cline { 3 - 5 } \multicolumn{2}{|c|}{}StandardSpecialBoth
Size of
business
Small352644
Large555273
  1. Write down null and alternative hypotheses for a test to examine whether there is any association between size of business and recycling service used. The contributions to the test statistic for the usual \(\chi ^ { 2 }\) test are shown in the table below.
    Recycling Service
    \cline { 3 - 5 } \multicolumn{2}{|c|}{}StandardSpecialBoth
    Size of
    business
    Small0.10230.26070.0186
    Large0.05970.15200.0108
    The sum of these contributions is 0.6041 .
  2. Calculate the expected frequency for large businesses using the special service. Verify the corresponding contribution 0.1520 to the test statistic.
  3. Carry out the test at the \(5 \%\) level of significance, stating your conclusion clearly. The council is also investigating the weight of rubbish in domestic dustbins. In 2008 the average weight of rubbish in bins was 32.8 kg . The council has now started a recycling initiative and wishes to determine whether there has been a reduction in the weight of rubbish in bins. A random sample of 50 domestic dustbins is selected and it is found that the mean weight of rubbish per bin is now 30.9 kg , and the standard deviation is 3.4 kg .
  4. Carry out a test at the \(5 \%\) level to investigate whether the mean weight of rubbish has been reduced in comparison with 2008 . State carefully your null and alternative hypotheses. www.ocr.org.uk after the live examination series.
    If OCR has unwittingly failed to correctly acknowledge or clear any third-party content in this assessment material, OCR will be happy to correct its mistake at the earliest possible opportunity.
    For queries or further information please contact the Copyright Team, First Floor, 9 Hills Road, Cambridge CB2 1 GE.
    OCR is part of the
OCR MEI S2 2012 January Q4
17 marks Moderate -0.3
4 Birds are observed at feeding stations in three different places - woodland, farm and garden. The numbers of finches, thrushes and tits observed at each site are summarised in the table. The birds observed are regarded as a random sample from the population of birds of these species that use these feeding stations.
\multirow{2}{*}{Observed Frequency}Place
FarmGardenWoodlandTotals
\multirow{4}{*}{Species}Thrushes1174792
Tits702688184
Finches1721029
Totals98102105305
The expected frequencies under the null hypothesis for the usual \(\chi ^ { 2 }\) test are shown in the table below.
\multirow{2}{*}{Expected Frequency}Place
FarmGardenWoodland
\multirow{3}{*}{Species}Thrushes29.560730.767231.6721
Tits59.121361.534463.3443
Finches9.31809.69849.9836
  1. Verify that the entry 9.3180 is correct. The corresponding contributions to the test statistic are shown in the table below.
    \multirow{2}{*}{Contribution}Place
    FarmGardenWoodland
    \multirow{3}{*}{Species}Thrushes11.653960.748919.2192
    Tits2.001720.52019.5969
    Finches6.33326.11080.0000
  2. Verify that the entry 6.3332 is correct.
  3. Carry out the test at the \(1 \%\) level of significance.
  4. For each place, use the table of contributions to comment briefly on the differences between the observed and expected distributions of species.
OCR MEI S2 2011 June Q4
18 marks Standard +0.3
4
  1. In a survey on internet usage, a random sample of 200 people is selected. The people are asked how much they have spent on internet shopping during the last three months. The results, classified by amount spent and sex, are shown in the table.
    \multirow{2}{*}{}Sex\multirow{2}{*}{Row totals}
    MaleFemale
    \multirow{5}{*}{Amount spent}Nothing283462
    Less than £50172138
    £50 up to £200222648
    £200 up to £1000231639
    £1000 or more8513
    Column totals98102200
    1. Write down null and alternative hypotheses for a test to examine whether there is any association between amount spent and sex of person. The contributions to the test statistic for the usual \(\chi ^ { 2 }\) test are shown in the table below.
      \multirow{2}{*}{}Sex
      MaleFemale
      \multirow{5}{*}{Amount spent}Nothing0.18650.1791
      Less than £500.14090.1354
      £50 up to £2000.09820.0944
      £200 up to £10000.79180.7608
      £1000 or more0.41710.4007
      The sum of these contributions, correct to 3 decimal places, is 3.205.
    2. Calculate the expected frequency for females spending nothing. Verify the corresponding contribution, 0.1791 , to the test statistic.
    3. Carry out the test at the \(5 \%\) level of significance, stating your conclusion clearly.
  2. A bakery sells loaves specified as having a mean weight of 400 grams. It is known that the weights of these loaves are Normally distributed and that the standard deviation is 5.7 grams. An inspector suspects that the true mean weight may be less than 400 grams. In order to test this, the inspector takes a random sample of 6 loaves. Carry out a suitable test at the \(5 \%\) level, given that the weights, in grams, of the 6 loaves are as follows. \(\begin{array} { l l l l l l } 392.1 & 405.8 & 401.3 & 387.4 & 391.8 & 400.6 \end{array}\) RECOGNISING ACHIEVEMENT
OCR MEI S2 2012 June Q4
17 marks Standard +0.3
4
  1. Mary is opening a cake shop. As part of her market research, she carries out a survey into which type of cake people like best. She offers people 4 types of cake to taste: chocolate, carrot, lemon and ginger. She selects a random sample of 150 people and she classifies the people as children and adults. The results are as follows.
    \multirow{2}{*}{}Classification of person\multirow{2}{*}{Row totals}
    ChildAdult
    \multirow{4}{*}{Type of cake}Chocolate342357
    Carrot161834
    Lemon41822
    Ginger132437
    Column totals6783150
    The contributions to the test statistic for the usual \(\chi ^ { 2 }\) test are shown in the table below.
    Classification of person
    \cline { 3 - 4 } \multicolumn{2}{|c|}{}ChildAdult
    \multirow{3}{*}{
    Type
    of
    cake
    }
    Chocolate2.86462.3124
    \cline { 2 - 4 }Carrot0.04360.0352
    \cline { 2 - 4 }Lemon3.45492.7889
    \cline { 2 - 4 }Ginger0.75260.6075
    The sum of these contributions, correct to 2 decimal places, is 12.86 .
    1. Calculate the expected frequency for children preferring chocolate cake. Verify the corresponding contribution, 2.8646, to the test statistic.
    2. Carry out the test at the \(1 \%\) level of significance.
  2. Mary buys flour in bags which are labelled as containing 5 kg . She suspects that the average contents of these bags may be less than 5 kg . In order to test this, she selects a random sample of 8 bags and weighs their contents. Assuming that weights are Normally distributed with standard deviation 0.0072 kg , carry out a test at the \(5 \%\) level, given that the weights of the 8 bags in kg are as follows.
    4.992
    4.981
    4.982
    4.996
    4.991
    5.006
    5.009
    5.003
    [0pt] [9] }{www.ocr.org.uk}) after the live examination series.
    If OCR has unwittingly failed to correctly acknowledge or clear any third-party content in this assessment material, OCR will be happy to correct its mistake at the earliest possible opportunity.
    For queries or further information please contact the Copyright Team, First Floor, 9 Hills Road, Cambridge CB2 1GE.
    OCR is part of the
OCR MEI S2 2016 June Q4
20 marks Moderate -0.3
4
  1. A random sample of 80 GCSE students was selected to take part in an investigation into whether attitudes to mathematics differ between girls and boys. The students were asked if they agreed with the statement 'Mathematics is one of my favourite subjects'. They were given three options 'Agree', 'Disagree', 'Neither agree nor disagree'. The results, classified according to sex, are summarised in the table below.
    AgreeDisagreeNeither
    Male17138
    Female121119
    The contributions to the test statistic for the usual \(\chi ^ { 2 }\) test are shown in the table below.
    AgreeDisagreeNeither
    Male0.75500.22461.8153
    Female0.68310.20321.6424
    1. Calculate the expected frequency for females who agree. Verify the corresponding contribution, 0.6831 , to the test statistic.
    2. Carry out the test at the \(5 \%\) level of significance.
  2. The level of radioactivity in limpets (a type of shellfish) in the sea near to a nuclear power station is regularly monitored. Over a period of years it has been found that the level (measured in suitable units) is Normally distributed with mean 5.64. Following an incident at the power station, a researcher suspects that the mean level of radioactivity in limpets may have increased. The researcher selects a random sample of 60 limpets. Their levels of radioactivity, \(x\) (measured in the same units), are summarised as follows. $$\sum x = 373 \quad \sum x ^ { 2 } = 2498$$ Carry out a test at the \(5 \%\) significance level to investigate the researcher's belief.
AQA Further AS Paper 2 Statistics Specimen Q7
9 marks Standard +0.3
7 A dairy industry researcher, Robyn, decided to investigate the milk yield, classified as low, medium or high, obtained from four different breeds of cow, \(\mathrm { A } , \mathrm { B } , \mathrm { C }\) and D . The milk yield of a sample of 105 cows was monitored and the results are summarised in contingency Table 1.
\multirow{2}{*}{Table 1}Yield
LowMediumHighTotal
\multirow{4}{*}{Breed}A451221
B106420
C817732
D520732
Total274830105
The sample of cows may be regarded as random.
Robyn decides to carry out a \(\chi ^ { 2 }\)-test for association between milk yield and breed using the information given in Table 1. 7
  1. Contingency Table 2 gives some of the expected frequencies for this test.
    Complete Table 2 with the missing expected values.
    \multirow[t]{2}{*}{Table 2}Yield
    LowMediumHigh
    \multirow{4}{*}{Breed}A6
    B5.149.145.71
    C
    D8.2314.639.14
    7
  2. (i) For Robyn's test, the test statistic \(\sum \frac { ( O - E ) ^ { 2 } } { E } = 19.4\) correct to three significant figures.
    Use this information to carry out Robyn's test, using the \(1 \%\) level of significance.
    7 (b) (ii) By considering the observed frequencies given in Table 1 with the expected frequencies in Table 2, interpret, in context, the association, if any, between milk yield and breed.
OCR Further Statistics AS 2024 June Q2
8 marks Standard +0.3
2 For a random sample of 160 employees of a large company, the principal method of transport for getting to work, arranged according to grade of employee, is shown in the table.
GradeWalk or cyclePrivate motorised transportPublic transport
A9136
B164341
C11813
A test is carried out at the \(5 \%\) significance level of whether there is association between grade of employee and method of transport.
  1. State appropriate hypotheses for the test. The contributions to the test statistic are shown in the following table, correct to 3 decimal places.
    GradeWalk or cyclePrivate motorised transportPublic transport
    A1.1570.2891.929
    B1.8780.2250.327
    C2.0061.8000.083
  2. Show how the value 0.225 is obtained.
  3. Complete the test, stating the conclusion.
  4. Which combination of grade of employee and method of transport most strongly suggests association? Justify your answer.
Edexcel S3 2013 June Q4
12 marks Standard +0.3
  1. John thinks that a person's eye colour is related to their hair colour. He takes a random sample of 600 people and records their eye and hair colours. The results are shown in Table 1.
\begin{table}[h]
\multirow{2}{*}{}Hair colour
BlackBrownRedBlondeTotal
\multirow{5}{*}{Eye colour}Brown451251558243
Blue34901058192
Hazel20381626100
Green62972365
Total10528248165600
\captionsetup{labelformat=empty} \caption{Table 1}
\end{table} John carries out a \(\chi ^ { 2 }\) test in order to test whether eye colour and hair colour are related. He calculates the expected frequencies shown in Table 2. \begin{table}[h]
\multirow{2}{*}{}Hair colour
BlackBrownRedBlonde
\multirow{4}{*}{Eye colour}Brown42.5114.219.466.8
Blue33.690.215.452.8
Hazel17.547827.5
Green11.430.65.217.9
\captionsetup{labelformat=empty} \caption{Table 2}
\end{table}
  1. Show how the value 47 in Table 2 has been calculated.
  2. Write down the number of degrees of freedom John should use in this \(\chi ^ { 2 }\) test. Given that the value of the \(\chi ^ { 2 }\) statistic is 20.6 , to 3 significant figures,
  3. find the smallest value of \(\alpha\) for which the null hypothesis will be rejected at the \(\alpha \%\) level of significance.
  4. Use the data from Table 1 to test at the \(5 \%\) level of significance whether or not the proportions of people in the population with black, brown, red and blonde hair are in the ratio 2:6:1:3 State your hypotheses clearly.
Edexcel S3 2014 June Q2
7 marks Standard +0.3
  1. A survey asked a random sample of 200 people their age and the main use of their mobile phone.
The results are shown in Table 1 below. \begin{table}[h]
\multirow{2}{*}{}Main use of their mobile phone
InternetTextsPhone calls
\multirow{3}{*}{Age}Under 2027149
From 20 to 40323429
Over 40151921
\captionsetup{labelformat=empty} \caption{Table 1}
\end{table} The data are to be used to test whether or not age and main use of their mobile phone are independent. Table 2 shows the expected frequencies for each group, assuming people's age and main use of their mobile phone are independent. \begin{table}[h]
\multirow{2}{*}{}Main use of their mobile phone
InternetTextsPhone calls
\multirow{3}{*}{Age}Under 2018.516.7514.75
From 20 to 4035.1531.82528.025
Over 4020.3518.42516.225
\captionsetup{labelformat=empty} \caption{Table 2}
\end{table}
  1. For users under 20 choosing the Internet as the main use of their mobile phone,
    1. verify that the expected frequency is 18.5
    2. show that the contribution to the \(\chi ^ { 2 }\) test statistic is 3.91 to 3 significant figures.
  2. Given that the \(\chi ^ { 2 }\) test statistic for the data is 9.893 to 3 decimal places, test at the \(5 \%\) level of significance whether or not age and main use of their mobile phone are independent. State your hypotheses clearly.
Edexcel S3 2017 June Q4
14 marks Standard +0.3
4. A psychologist carries out a survey of the perceived body weight of 150 randomly chosen people. He asks them if they think they are underweight, about right or overweight. His results are summarised in the table below.
\cline { 2 - 4 } \multicolumn{1}{c|}{}UnderweightAbout rightOverweight
Male202230
Female162834
The psychologist calculates two of the expected frequencies, to 2 decimal places, for a test of independence between perceived body weight and gender. These results are shown in the table below.
\cline { 2 - 4 } \multicolumn{1}{c|}{}UnderweightAbout rightOverweight
Male17.28
Female18.72
  1. Complete the table of expected frequencies shown above.
  2. Test, at the \(10 \%\) level of significance, whether or not perceived body weight is independent of gender. State your hypotheses clearly. The psychologist now combines the male and female data to test whether or not body weight types are chosen equally.
  3. Find the smallest significance level, from the tables in the formula booklet, for which there is evidence of a preference.
OCR MEI Further Statistics Major 2022 June Q10
13 marks Standard +0.3
10 A scientist is researching dietary fat intake and cholesterol level. A random sample of 60 people is selected and their dietary fat intakes and cholesterol levels are measured. Dietary fat intakes are classified as low, medium and high, and cholesterol levels are classified as normal and high. The scientist decides to carry out a chi-squared test to investigate whether there is any association between dietary fat intake and cholesterol level. Tables \(\mathbf { 1 0 . 1 }\) and \(\mathbf { 1 0 . 2 }\) show the data and some of the expected frequencies for the test. \begin{table}[h]
\multirow{2}{*}{}Dietary fat intake
LowMediumHighTotal
\multirow{2}{*}{Cholesterol level}Normal918532
High3131228
Total12311760
\captionsetup{labelformat=empty} \caption{Table 10.1}
\end{table} \begin{table}[h]
Expected frequencyDietary fat intake
\cline { 3 - 5 }LowMediumHigh
\multirow{2}{*}{
Cholesterol
level
}
Normal9.0667
\cline { 2 - 5 }High7.9333
\captionsetup{labelformat=empty} \caption{Table 10.2}
\end{table}
  1. Complete the table of expected frequencies in the Printed Answer Booklet.
  2. Determine the contribution to the chi-squared test statistic for people with normal cholesterol level and high dietary fat intake, giving your answer to \(\mathbf { 4 }\) decimal places. The contributions to the chi-squared test statistic for the remaining categories are shown in Table 10.3. \begin{table}[h]
    Dietary fat intake
    \cline { 2 - 5 }LowMediumHigh
    \multirow{2}{*}{
    Cholesterol
    level
    }
    Normal1.05630.1301
    \cline { 2 - 5 }High1.20710.14872.0846
    \captionsetup{labelformat=empty} \caption{Table 10.3} \end{table}
  3. In this question you must show detailed reasoning. Carry out the test at the 5\% significance level.
  4. For each level of dietary fat intake, give a brief interpretation of what the data suggest about the level of cholesterol.
  5. OCR MEI Further Statistics Major 2024 June Q9
    13 marks Standard +0.3
    9 A cyclist has 3 bicycles, a road bike, a gravel bike and an electric bike. She wishes to know if the bicycle which she is riding makes any difference to whether she reaches a speed of 25 mph or greater on a journey. She selects a random sample of 120 journeys and notes the bicycle and whether or not her maximum speed was 25 mph or greater. She decides to carry out a chisquared test to investigate whether there is any association between bicycle type and whether her maximum speed is 25 mph or greater. Tables 9.1 and 9.2 show the data and some of the expected frequencies for the test. \begin{table}[h]
    \captionsetup{labelformat=empty} \caption{Table 9.1}
    \multirow{2}{*}{}Bicycle
    RoadGravelElectricTotal
    \multirow{2}{*}{Maximum speed}Less than 25 mph2211942
    25 mph or greater13471878
    Total156837120
    \end{table} \begin{table}[h]
    \captionsetup{labelformat=empty} \caption{Table 9.2}
    \multirow{2}{*}{Expected frequency}Bicycle
    RoadGravelElectric
    \multirow{2}{*}{Maximum speed}Less than 25 mph12.95
    25 mph or greater24.05
    \end{table}
    1. Complete the table of expected frequencies in the Printed Answer Booklet.
    2. Determine the contribution to the chi-squared test statistic for the Electric bicycle and maximum speed 25 mph or greater. Give your answer correct to 4 decimal places. The contributions to the chi-squared test statistic for the remaining categories are shown in Table 9.3. \begin{table}[h]
      \captionsetup{labelformat=empty} \caption{Table 9.3}
      \multirow{2}{*}{Contribution to the test statistic}Bicycle
      RoadGravelElectric
      \multirow{2}{*}{Maximum speed}Less than 25 mph2.01190.32942.8264
      25 mph or greater1.08330.1774
      \end{table}
    3. In this question you must show detailed reasoning. Carry out the test at the 5\% significance level.
    4. For each type of bicycle, give a brief interpretation of what the data suggest about maximum speed.
    WJEC Further Unit 2 2022 June Q6
    11 marks Standard +0.3
    6. An online survey on the use of social media asked the following question: \begin{displayquote} "Do you use any form of social media?" \end{displayquote} The results for a total of 1953 respondents are shown in the table below.
    Age in years
    Use social media18-2930-4950-6465 or olderTotal
    Yes3104123481961266
    No42116196333687
    Total3525285445291953
    To test whether there is a relationship between social media use and age, a significance test is carried out at the \(5 \%\) level.
    1. State the null and alternative hypotheses.
    2. Show how the expected frequency \(228 \cdot 18\) is calculated in the table below.
      Expected valuesAge in years
      Use social media18-2930-4950-6465 or older
      Yes\(228 \cdot 18\)\(342 \cdot 27\)352.64342.92
      No123.82185.73191.36186.08
    3. Determine the value of \(s\) in the table below.
      Chi-squared contributionsAge in years
      Use social media18-2930-4950-6465 or older
      Yes29.34\(s\)0.0662.94
      No54.0726-180.11115.99
    4. Complete the significance test, showing all your working.
    5. A student, analysing these data on a spreadsheet, obtains the following output. \includegraphics[max width=\textwidth, alt={}, center]{77fd7ad7-f5a3-4947-afc6-e5ef45bef7a8-5_202_1271_445_415} Explain why the student must have made an error in calculating the \(p\)-value.
    Edexcel FS1 AS 2018 June Q4
    7 marks Standard +0.3
    1. Abram carried out a survey of two treatments for a plant fungus. The contingency table below shows the results of a survey of a random sample of 125 plants with the fungus.
    \multirow{2}{*}{}Treatment
    No actionPlant sprayed oncePlant sprayed every day
    \multirow{3}{*}{Outcome}Plant died within a month151625
    Plant survived for 1-6 months82510
    Plant survived beyond 6 months7145
    Abram calculates expected frequencies to carry out a suitable test. Seven of these are given in the partly-completed table below.
    \multirow{2}{*}{}Treatment
    No actionPlant sprayed oncePlant sprayed every day
    \multirow{3}{*}{Outcome}Plant died within a month17.92
    Plant survived for 1-6 months10.3218.9213.76
    Plant survived beyond 6 months6.2411.448.32
    The value of \(\sum \frac { ( O - E ) ^ { 2 } } { E }\) for the 7 given values is 8.29
    Test at the \(2.5 \%\) level of significance, whether or not there is an association between the treatment of the plants and their survival. State your hypotheses and conclusion clearly.
    Edexcel FS1 2024 June Q3
    6 marks Standard +0.3
    1. Tisam took a survey of students' favourite colours. The results are summarised in the table below.
    \multirow{2}{*}{}Colour
    RedBlueGreenYellowBlackTotal
    \multirow{3}{*}{Year group}1-534151422388
    6-92332129884
    10-12528198868
    Total6275453919240
    Tisam carries out a suitable test to see if there is any association between favourite colour and year group.
    1. Write down the hypotheses for a suitable test. For her table, Tisam only needs to check one cell to show that none of the expected frequencies are less than 5
      1. Identify this cell, giving your reason.
      2. Calculate the expected frequency for this cell. The test statistic for Tisam's test is 38.449
    2. Using a \(1 \%\) level of significance, complete the test. You should state your critical value and conclusion clearly.
    AQA Further AS Paper 2 Statistics 2018 June Q8
    10 marks Standard +0.3
    8 An insurance company groups its vehicle insurance policies into two categories, car insurance and motorbike insurance. The number of claims in a random sample of 80 policies was monitored and the results summarised in contingency Table 1. \begin{table}[h]
    \captionsetup{labelformat=empty} \caption{Table 1}
    \multirow{2}{*}{}Number of claims
    0123 or moreTotal
    \multirow[b]{3}{*}{Type of insurance policy}Car91011535
    Motorbike19138545
    Total2823191080
    \end{table} The insurance company decides to carry out a \(\chi ^ { 2 }\)-test for association between number of claims and type of insurance policy using the information given in Table 1. 8
    1. The contingency table shown in Table 2 gives some of the exact expected frequencies for this test. Complete Table 2 with the missing exact expected values. \begin{table}[h]
      \captionsetup{labelformat=empty} \caption{Table 2}
      \multirow{2}{*}{}Number of claims
      0123 or more
      \multirow{2}{*}{Type of insurance policy}Car10.06254.375
      Motorbike10.6875
      \end{table} 8
    2. Carry out the insurance company's test, using the \(10 \%\) level of significance. \includegraphics[max width=\textwidth, alt={}, center]{313cd5ce-07ff-4781-a134-565b8b221145-12_2488_1719_219_150} Additional page, if required.
      Write the question numbers in the left-hand margin. Additional page, if required.
      Write the question numbers in the left-hand margin. Additional page, if required.
      Write the question numbers in the left-hand margin.
    AQA Further AS Paper 2 Statistics 2022 June Q7
    8 marks Standard +0.3
    7 Wade and Odelia are investigating whether there is an association between the region where a person lives and the brand of washing powder they use. They decide to conduct a \(\chi ^ { 2 }\)-test for association and survey a random sample of 200 people. The expected frequencies for the test have been calculated and are shown in the contingency table below.
    WJEC Further Unit 2 2018 June Q6
    10 marks Moderate -0.3
    A student, considering options for the future, collects data on education and salary. The table below shows the highest level of education attained and the salary bracket of a random sample of 664 people.
    Fewer than 5 GCSE5 or more GCSE3 A LevelsUniversity degreePost graduate qualificationTotal
    Less than £20 0001832202810108
    £20 000 to £60 000509511215550462
    More than £60 0003222935594
    Total7114916121865664
    By conducting a chi-squared test for independence, the student investigates the relationship between the highest level of education attained and the salary earned.
    1. State the null and alternative hypotheses. [1]
    2. The table below shows the expected values. Calculate the value of \(k\). [2]
      Expected valuesFewer than 5 GCSE5 or more GCSE3 A LevelsUniversity degreePost graduate qualification
      Less than £20 000\(k\)24·2326·1935·4610·57
      £20 000 to £60 00049·40103·67112·02151·6845·23
      More than £60 00010·0521·0922·7930·869·20
    3. The following computer output is obtained. Calculate the values of \(m\) and \(n\). [2]
      Chi Squared ContributionsFewer than 5 GCSE5 or more GCSE3 A LevelsUniversity degreePost graduate qualification
      Less than £20 0003·604530799\(m\)1·461651·56860·03098
      £20 000 to £60 0000·0072727350·725354E-060·072640·50396
      More than £60 0004·9466198630·038971·690810·55498\(n\)
      X-squared = 19·61301, df = 8, p-value = 0·0119
      1. Without carrying out any further calculations, explain how X-squared = 19·61301 (the \(\chi^2\) test statistic) was calculated. [2]
      2. Comment on the values in the "Fewer than 5 GCSE" column of the table in part (c). [2]
    4. The student says that the highest levels of education lead to the highest paying jobs. Comment on the accuracy of the student's statement. [1]