Edexcel S1 (Statistics 1)

Question 1
View details
  1. Briefly describe what is meant by
    1. a statistical model,
    2. a refinement of a model.
    3. The random variable \(X\) has the discrete uniform distribution and takes the values \(\{ 1 , \ldots , n \}\). The standard deviation of of \(X\) is \(2 \sqrt { 6 }\). Find
    4. the mean of \(X\),
    5. \(\mathrm { P } \left( 3 \leq X < \frac { 1 } { 2 } n \right)\).
    6. The rainfall at a weather station was recorded every day of the twentieth century. One year is selected at random from the records and the total rainfall, in cm , in January of that year is denoted by \(R\) Assuming that \(R\) can be modelled by a normal distribution with standard deviation \(12 \cdot 6\), and given that \(\mathrm { P } ( R > 100 ) = 0 \cdot 0764\),
    7. find the mean of \(R\),
    8. calculate \(\mathrm { P } ( 75 < R < 80 )\).
    9. The length of time, in minutes, that visitors queued for a tourist attraction is given by the following table, where, for example, ' 20 - ' means from 20 up to but not including 30 minutes.
    Queuing time (mins)\(0 -\)\(10 -\)\(15 -\)\(20 -\)\(30 -\)\(40 - 60\)
    Number of visitors1524\(x\)1310\(y\)
  2. State the upper class boundary of the first class. A histogram is drawn to represent this data. The total area under the histogram is \(36 \mathrm {~cm} ^ { 2 }\). The ' 10 - ' bar has width 1 cm and height 9.6 cm . The ' 15 - ' bar is ten times as high as the '40-60' bar.
  3. Find the values of \(x\) and \(y\).
  4. On graph paper, construct the histogram accurately.
Question 5
View details
5. The discrete random variable \(X\) takes only the values 4, 5, 6, 7, 8 and 9. The probabilities of these values are given in the table:
\(x\)456789
\(\mathrm { P } ( X = x )\)\(p\)0.1\(q\)\(q\)0.30.2
It is known that \(\mathrm { E } ( X ) = 6 \cdot 7\). Find
  1. the values of \(p\) and \(q\),
  2. the value of \(a\) for which \(\mathrm { E } ( 2 X + a ) = 0\),
  3. \(\operatorname { Var } ( X )\). \section*{STATISTICS 1 (A) TEST PAPER 9 Page 2}
Question 6
View details
  1. The marks out of 75 obtained by a group of ten students in their first and second Statistics modules were as follows:
Student\(A\)\(B\)\(C\)\(D\)\(E\)\(F\)\(G\)\(H\)\(I\)\(J\)
Module 1 \(( x )\)54334271602739465964
Module 2 \(( y )\)50224458421935465560
  1. Find \(\sum x\) and \(\sum y\). Given that \(\sum x ^ { 2 } = 26353\) and \(\sum x y = 22991\),
  2. obtain the equation of the regression line of \(y\) on \(x\).
  3. Estimate the Module 2 result of a student whose mark in Module 1 was Explain why one of these estimates is less reliable than the other. The equation of the regression line of \(x\) on \(y\) is \(x = 0.921 y + 9.81\).
  4. Deduce the product moment correlation coefficient between \(x\) and \(y\), and briefly interpret its value.
Question 7
View details
7. Among the families with two children in a large city, the probability that the elder child is a boy is \(\frac { 5 } { 12 }\) and the probability that the younger child is a boy is \(\frac { 9 } { 16 }\). The probability that the younger child is a girl, given that the elder child is a girl, is \(\frac { 1 } { 4 }\).
One of the families is chosen at random. Using a tree diagram, or otherwise,
  1. show that the probability that both children are boys is \(\frac { 1 } { 8 }\). Find the probability that
  2. one child is a boy and the other is a girl,
  3. one child is a boy given that the other is a girl. If three of the families are chosen at random,
  4. find the probability that exactly two of the families have two boys.
  5. State an assumption that you have made in answering part (d).