OCR S1 (Statistics 1) 2013 January

Mark scheme PDF ↗

Question 1 7 marks
View details
When a four-sided spinner is spun, the number on which it lands is denoted by \(X\), where \(X\) is a random variable taking values 2, 4, 6 and 8. The spinner is biased so that P(\(X = x\)) = \(kx\), where \(k\) is a constant.
  1. Show that P(\(X = 6\)) = \(\frac{3}{10}\). [2]
  2. Find E(\(X\)) and Var(\(X\)). [5]
Question 2 6 marks
View details
  1. Kathryn is allowed three attempts at a high jump. If she succeeds on any attempt, she does not jump again. The probability that she succeeds on her first attempt is \(\frac{1}{4}\). If she fails on her first attempt, the probability that she succeeds on her second attempt is \(\frac{1}{3}\). If she fails on her first two attempts, the probability that she succeeds on her third attempt is \(\frac{1}{2}\). Find the probability that she succeeds. [3]
  2. Khaled is allowed two attempts to pass an examination. If he succeeds on his first attempt, he does not make a second attempt. The probability that he passes at the first attempt is 0.4 and the probability that he passes on either the first or second attempt is 0.58. Find the probability that he passes on the second attempt, given that he failed on the first attempt. [3]
Question 3 12 marks
View details
The Gross Domestic Product per Capita (GDP), \(x\) dollars, and the Infant Mortality Rate per thousand (IMR), \(y\), of 6 African countries were recorded and summarised as follows. \(n = 6\) \quad \(\sum x = 7000\) \quad \(\sum x^2 = 8700000\) \quad \(\sum y = 456\) \quad \(\sum y^2 = 36262\) \quad \(\sum xy = 509900\)
  1. Calculate the equation of the regression line of \(y\) on \(x\) for these 6 countries. [4]
The original data were plotted on a scatter diagram and the regression line of \(y\) on \(x\) was drawn, as shown below. \includegraphics{figure_3}
  1. The GDP for another country, Tanzania, is 1300 dollars. Use the regression line in the diagram to estimate the IMR of Tanzania. [1]
  2. The GDP for Nigeria is 2400 dollars. Give two reasons why the regression line is unlikely to give a reliable estimate for the IMR for Nigeria. [2]
  3. The actual value of the IMR for Tanzania is 96. The data for Tanzania (\(x = 1300, y = 96\)) is now included with the original 6 countries. Calculate the value of the product moment correlation coefficient, \(r\), for all 7 countries. [4]
  4. The IMR is now redefined as the infant mortality rate per hundred instead of per thousand, and the value of \(r\) is recalculated for all 7 countries. Without calculation state what effect, if any, this would have on the value of \(r\) found in part (iv). [1]
Question 4 10 marks
View details
  1. How many different 3-digit numbers can be formed using the digits 1, 2 and 3 when
    1. no repetitions are allowed, [1]
    2. any repetitions are allowed, [2]
    3. each digit may be included at most twice? [2]
  2. How many different 4-digit numbers can be formed using the digits 1, 2 and 3 when each digit may be included at most twice? [5]
Question 5 10 marks
View details
A random variable \(X\) has the distribution B\((5, \frac{1}{4})\).
  1. Find
    1. E(\(X\)), [1]
    2. P(\(X = 2\)). [2]
  2. Two values of \(X\) are chosen at random. Find the probability that their sum is less than 2. [4]
  3. 10 values of \(X\) are chosen at random. Use an appropriate formula to find the probability that exactly 3 of these values are 2s. [3]
Question 6 7 marks
View details
The masses, \(x\) grams, of 800 apples are summarised in the histogram. \includegraphics{figure_6}
  1. On the frequency density axis, 1 cm represents \(a\) units. Find the value of \(a\). [3]
  2. Find an estimate of the median mass of the apples. [4]
Question 7 7 marks
View details
  1. Two judges rank \(n\) competitors, where \(n\) is an even number. Judge 2 reverses each consecutive pair of ranks given by Judge 1, as shown.
    Competitor\(C_1\)\(C_2\)\(C_3\)\(C_4\)\(C_5\)\(C_6\)\(\ldots\)\(C_{n-1}\)\(C_n\)
    Judge 1 rank123456\(\ldots\)\(n-1\)\(n\)
    Judge 2 rank214365\(\ldots\)\(n\)\(n-1\)
    Given that the value of Spearman's coefficient of rank correlation is \(\frac{63}{65}\), find \(n\). [4]
  2. An experiment produced some data from a bivariate distribution. The product moment correlation coefficient is denoted by \(r\), and Spearman's rank correlation coefficient is denoted by \(r_s\).
    1. Explain whether the statement $$r = 1 \Rightarrow r_s = 1$$ is true or false. [1]
    2. Use a diagram to explain whether the statement $$r \neq 1 \Rightarrow r_s \neq 1$$ is true or false. [2]
Question 8 13 marks
View details
Sandra makes repeated, independent attempts to hit a target. On each attempt, the probability that she succeeds is 0.1.
  1. Find the probability that
    1. the first time she succeeds is on her 5th attempt, [2]
    2. the first time she succeeds is after her 5th attempt, [2]
    3. the second time she succeeds is before her 4th attempt. [4]
    Jill also makes repeated attempts to hit the target. Each attempt of either Jill or Sandra is independent. Each time that Jill attempts to hit the target, the probability that she succeeds is 0.2. Sandra and Jill take turns attempting to hit the target, with Sandra going first.
  2. Find the probability that the first person to hit the target is Sandra, on her
    1. 2nd attempt, [2]
    2. 10th attempt. [3]