Edexcel S1 (Statistics 1)

Question 1
View details
  1. (a) Explain briefly what you understand by a statistical model.
    (2 marks)
    A zoologist is analysing data on the weights of adult female otters.
    (b) Name a distribution that you think might be suitable for modelling such data.
    (1 mark)
    (c) Describe two features that you would expect to find in the distribution of the weights of adult female otters and that led to your choice in part (b).
    (2 marks)
    (d) Why might your choice in part (b) not be suitable for modelling the weights of all adult otters?
    (1 mark)
  2. For a geography project a student studied weather records kept by her school since 1993. To see if there was any evidence of global warming she worked out the mean temperature in degrees Celsius at noon for the month of June in each year.
Her results are shown in the table below.
Year19931994199519961997199819992000
Mean temperature
\(\left( { } ^ { \circ } \mathrm { C } \right)\)
21.924.120.723.024.222.122.623.9
Question 2
View details
  1. Plot a scatter diagram showing these data. The student wanted to investigate further whether or not her data provided evidence of an increase in temperature in June each year. Using \(Y\) for the number of years since 1993 and \(T\) for the mean temperature, she calculated the following summary statistics. $$\Sigma Y = 28 , \quad \Sigma T = 182.5 , \quad \Sigma Y ^ { 2 } = 140 , \quad \Sigma T ^ { 2 } = 4173.93 , \quad \Sigma Y T = 644.7 .$$
  2. Calculate the product moment correlation coefficient for these data.
  3. Comment on your result in relation to the student's enquiry.
Question 3
View details
3. In a study of 120 pet-owners it was found that 57 owned at least one dog and of these 16 also owned at least one cat. There were 35 people in the group who didn't own any cats or dogs. As an incentive to take part in the study, one participant is chosen at random to win a year's free supply of pet food. Find the probability that the winner of this prize
  1. owns a dog but does not own a cat,
  2. owns a cat,
  3. does not own a cat given that they do not own a dog.
Question 4
View details
4. An internet service provider runs a series of television adverts at weekly intervals. To investigate the effectiveness of the adverts the company recorded the viewing figures in millions, \(v\), for the programme in which the advert was shown, and the number of new customers, \(c\), who signed up for their service the next day. The results are summarised as follows. $$\bar { v } = 4.92 , \quad \bar { c } = 104.4 , \quad S _ { v c } = 594.05 , \quad S _ { v v } = 85.44 .$$
  1. Calculate the equation of the regression line of \(c\) on \(v\) in the form \(c = a + b v\).
  2. Give an interpretation of the constants \(a\) and \(b\) in this context.
  3. Estimate the number of customers that will sign up with the company the day after an advert is shown during a programme watched by 3.7 million viewers.
  4. State two other factors besides viewing figures that will affect the success of an advert in gaining new customers for the company.
Question 5
View details
5. The time taken in minutes, \(T\), for a mechanic to service a bicycle follows a normal distribution with a mean of 25 minutes and a variance of 16 minutes \(^ { 2 }\). Find
  1. \(\mathrm { P } ( T < 28 )\),
  2. \(\quad \mathrm { P } ( | T - 25 | < 5 )\). One afternoon the mechanic has 3 bicycles to service.
  3. Find the probability that he will take less than 23 minutes on each of the three bicycles.
    (4 marks)
Question 6
View details
6. The number of people visiting a new art gallery each day is recorded over a three-month period and the results are summarised in the table below.
Number of visitorsNumber of days
400-4593
460-4798
480-49913
500-51912
520-53918
540-55911
560-5999
600-6995
  1. Draw a histogram on graph paper to illustrate these data. In order to calculate summary statistics for the data it is coded using \(y = \frac { x - 509.5 } { 10 }\), where \(x\) is the mid-point of each class.
  2. Find \(\sum\) fy. You may assume that \(\sum f y ^ { 2 } = 2041\).
  3. Using these values for \(\sum f y\) and \(\sum f y ^ { 2 }\), calculate estimates of the mean and standard deviation of the number of visitors per day.
    (6 marks)
Question 7
View details
7. A bag contains 4 red and 2 blue balls, all of the same size. A ball is selected at random and removed from the bag. This is repeated until a blue ball is pulled out of the bag. The random variable \(B\) is the number of balls that have been removed from the bag.
  1. Show that \(\mathrm { P } ( B = 2 ) = \frac { 4 } { 15 }\).
  2. Find the probability distribution of \(B\).
  3. Find \(\mathrm { E } ( B )\). The bag and the same 6 balls are used in a game at a funfair. One ball is removed from the bag at a time and a contestant wins 50 pence if one of the first two balls picked out is blue.
  4. What are the expected winnings from playing this game once? For \(\pounds 1\), a contestant gets to play the game three times.
  5. What is the expected profit or loss from the three games?