Find cumulative distribution F(x)

A question is this type if and only if it asks to find or use the cumulative distribution function F(x) = P(X ≤ x).

6 questions

Edexcel S1 2016 January Q1
  1. The discrete random variable \(X\) has the probability distribution given in the table below.
\(x\)- 21346
\(\mathrm { P } ( X = x )\)\(\frac { 1 } { 4 }\)\(\frac { 1 } { 6 }\)\(\frac { 1 } { 3 }\)\(\frac { 1 } { 12 }\)\(\frac { 1 } { 6 }\)
  1. Write down the value of \(\mathrm { F } ( 5 )\)
  2. Find \(\mathrm { E } ( X )\)
  3. Find \(\operatorname { Var } ( X )\) The random variable \(Y = 7 - 2 X\)
  4. Find
    1. \(\mathrm { E } ( Y )\)
    2. \(\operatorname { Var } ( Y )\)
    3. \(\mathrm { P } ( Y > X )\) \includegraphics[max width=\textwidth, alt={}, center]{70137e9a-0a6b-48b5-8dd4-c436cb063351-03_2261_47_313_37}
Edexcel S1 2015 June Q1
  1. The discrete random variable \(X\) can only take the values \(1,2,3\) and 4 For these values the cumulative distribution function is defined by
$$\mathrm { F } ( x ) = k x ^ { 2 } \text { for } x = 1,2,3,4$$ where \(k\) is a constant.
  1. Find the value of \(k\).
  2. Find the probability distribution of \(X\).
Edexcel S1 2018 October Q5
  1. The discrete random variable \(X\) is defined by the cumulative distribution function
\(x\)12345
\(\mathrm {~F} ( x )\)\(\frac { 3 k } { 2 }\)\(4 k\)\(\frac { 15 k } { 2 }\)\(12 k\)\(\frac { 35 k } { 2 }\)
where \(k\) is a constant.
  1. Find the probability distribution of \(X\).
  2. Find \(\mathrm { P } ( 1.5 < X \leqslant 3.5 )\) The random variable \(Y = 12 - 7 X\)
  3. Calculate Var(Y)
  4. Calculate \(\mathrm { P } ( 4 X \leqslant | Y | )\)
Edexcel S1 2009 January Q3
3. When Rohit plays a game, the number of points he receives is given by the discrete random variable \(X\) with the following probability distribution.
\(x\)0123
\(\mathrm { P } ( X = x )\)0.40.30.20.1
  1. Find \(\mathrm { E } ( X )\).
  2. Find \(\mathrm { F } ( 1.5 )\).
  3. Show that \(\operatorname { Var } ( X ) = 1\)
  4. Find \(\operatorname { Var } ( 5 - 3 X )\). Rohit can win a prize if the total number of points he has scored after 5 games is at least 10. After 3 games he has a total of 6 points. You may assume that games are independent.
  5. Find the probability that Rohit wins the prize.
Edexcel S1 2021 October Q5
  1. The discrete random variable \(Y\) has the following probability distribution
\(y\)- 9- 5059
\(\mathrm { P } ( Y = y )\)\(q\)\(r\)\(u\)\(r\)\(q\)
where \(q , r\) and \(u\) are probabilities.
  1. Write down the value of \(\mathrm { E } ( Y )\) The cumulative distribution function of \(Y\) is \(\mathrm { F } ( y )\)
    Given that \(F ( 0 ) = \frac { 19 } { 30 }\)
  2. show that the value of \(u\) is \(\frac { 4 } { 15 }\) Given also that \(\operatorname { Var } ( Y ) = 37\)
  3. find the value of \(q\) and the value of \(r\) The coordinates of a point \(P\) are \(( 12 , Y )\) The random variable \(D\) represents the length of \(O P\)
  4. Find the probability distribution of \(D\)
Edexcel S1 Q7
7. A music teacher monitored the sight-reading ability of one of her pupils over a 10 week period. At the end of each week, the pupil was given a new piece to sight-read and the teacher noted the number of errors \(y\). She also recorded the number of hours \(x\) that the pupil had practised each week. The data are shown in the table below.
  1. Given that \(\mathrm { E } ( X ) = - 0.2\), find the value of \(\alpha\) and the value of \(\beta\).
  2. Write down \(\mathrm { F } ( 0.8 )\).
  3. Evaluate \(\operatorname { Var } ( X )\). Find the value of
  4. \(\mathrm { E } ( 3 X - 2 )\),
  5. \(\operatorname { Var } ( 2 X + 6 )\).
    7. The following stem and leaf diagram shows the aptitude scores \(x\) obtained by all the applicants for a particular job. 3| 1 means 31 \begin{table}[h]
    \captionsetup{labelformat=empty} \caption{Aptitude score}
    3129
    424689
    51335679
    60133356889
    71222455568888
    801235889
    9012
    9
    \end{table}
  6. Write down the modal aptitude score.
  7. Find the three quartiles for these data. Outliers can be defined to be outside the limits \(\mathrm { Q } _ { 1 } - 1.0 \left( \mathrm { Q } _ { 3 } - \mathrm { Q } _ { 1 } \right)\) and \(\mathrm { Q } _ { 3 } + 1.0 \left( \mathrm { Q } _ { 3 } - \mathrm { Q } _ { 1 } \right)\).
  8. On a graph paper, draw a box plot to represent these data. For these data, \(\Sigma x = 3363\) and \(\Sigma x ^ { 2 } = 238305\).
  9. Calculate, to 2 decimal places, the mean and the standard deviation for these data.
  10. Use two different methods to show that these data are negatively skewed. END \section*{Advanced/Advanced Subsidiary} \section*{Wednesday 15 January 2003 - Morning} Answer Book (AB16)
    Graph Paper (ASG2)
    Mathematical Formulae (Lilac) Candidates may use any calculator EXCEPT those with the facility for symbolic Nil algebra, differentiation and/or integration. Thus candidates may NOT use calculators such as the Texas Instruments TI 89, TI 92, Casio CFX 9970G, Hewlett Packard HP 48G. In the boxes on the answer book, write the name of the examining body (Edexcel), your centre number, candidate number, the unit title (Statistics S1), the paper reference (6683), your surname, other name and signature.
    Values from the statistical tables should be quoted in full. When a calculator is used, the answer should be given to an appropriate degree of accuracy. A booklet 'Mathematical Formulae and Statistical Tables' is provided.
    Full marks may be obtained for answers to ALL questions.
    This paper has seven questions. Pages 6, 7 and 8 are blank. You must ensure that your answers to parts of questions are clearly labelled.
    You must show sufficient working to make your methods clear to the Examiner. Answers without working may gain no credit.
    1. The total amount of time a secretary spent on the telephone in a working day was recorded to the nearest minute. The data collected over 40 days are summarised in the table below.
    Draw a histogram to illustrate these data.
    2. The lifetimes of batteries used for a computer game have a mean of 12 hours and a standard deviation of 3 hours. Battery lifetimes may be assumed to be normally distributed. Find the lifetime, \(t\) hours, of a battery such that 1 battery in 5 will have a lifetime longer than \(t\).
    3. A company owns two petrol stations \(P\) and \(Q\) along a main road. Total daily sales in the same week for \(P ( \pounds p )\) and for \(Q ( \pounds q )\) are summarised in the table below. A science teacher believes that students' marks in physics depend upon their mathematical ability. The teacher decides to investigate this relationship using the test marks.
  11. Write down which is the explanatory variable in this investigation.
  12. Draw a scatter diagram to illustrate these data.
  13. Showing your working, find the equation of the regression line of \(p\) on \(m\).
  14. Draw the regression line on your scatter diagram. A ninth student was absent for the physics test, but she sat the mathematics test and scored 15 .
  15. Using this model, estimate the mark she would have scored in the physics test. \section*{END} Materials required for examination
    Answer Book (AB16)
    Graph Paper (ASG2)
    Mathematical Formulae (Lilac)
    Candidates may use any calculator EXCEPT those with the facility for symbolic algebra, differentiation and/or integration. Thus candidates may NOT use calculators such as the Texas Instruments TI 89, TI 92, Casio CFX 9970G, Hewlett Packard HP 48G. Paper Reference(s)
    6683 \section*{Advanced/Advanced Subsidiary} \section*{Tuesday 4 November 2003 - Morning} In the boxes on the answer book, write the name of the examining body (Edexcel), your centre number, candidate number, the unit title (Statistics S1), the paper reference (6683), your surname, other name and signature.
    Values from the statistical tables should be quoted in full. When a calculator is used, the answer should be given to an appropriate degree of accuracy. A booklet 'Mathematical Formulae and Statistical Tables' is provided.
    Full marks may be obtained for answers to ALL questions.
    This paper has six questions. You must ensure that your answers to parts of questions are clearly labelled.
    You must show sufficient working to make your methods clear to the Examiner. Answers without working may gain no credit.
    1. A company wants to pay its employees according to their performance at work. The performance score \(x\) and the annual salary, \(y\) in \(\pounds 100\) s, for a random sample of 10 of its employees for last year were recorded. The results are shown in the table below.
    [You may use \(\left. \Sigma h ^ { 2 } = 272094 , \Sigma c ^ { 2 } = 2878966 , \Sigma h c = 884484 \right]\)
  16. Draw a scatter diagram to illustrate these data.
  17. Find exact values of \(S _ { h c } S _ { h h }\) and \(S _ { c c }\).
  18. Calculate the value of the product moment correlation coefficient for these data.
  19. Give an interpretation of your correlation coefficient.
  20. Calculate the equation of the regression line of \(c\) on \(h\) in the form \(c = a + b h\).
  21. Estimate the level of confidence of a person of height 180 cm .
  22. State the range of values of \(h\) for which estimates of \(c\) are reliable. A booklet 'Mathematical Formulae and Statistical Tables' is provided.
    Full marks may be obtained for answers to ALL questions.
    This paper has six questions. You must ensure that your answers to parts of questions are clearly labelled.
    You must show sufficient working to make your methods clear to the Examiner. Answers without working may gain no credit.
    3. A discrete random variable \(X\) has a probability function as shown in the table below, where \(a\) and \(b\) are constants. Key: \(0 \quad | 18 | 4\) means 180 for Keith and 184 for Asif
    The quartiles for these two distributions are summarised in the table below. \(\quad 00\) means 10 Totals
  23. Find the three quartiles of these data. During the same month, the least number of caravans on Northcliffe caravan site was 31. The maximum number of caravans on this site on any night that month was 72 . The three quartiles for this site were 38, 45 and 52 respectively.
  24. On graph paper and using the same scale, draw box plots to represent the data for both caravan sites. You may assume that there are no outliers.
  25. Compare and contrast these two box plots.
  26. Give an interpretation to the upper quartiles of these two distributions.
    3. The following table shows the height \(x\), to the nearest cm , and the weight \(y\), to the nearest kg , of a random sample of 12 students.
  27. Give a reason to justify the use of a histogram to represent these data.
  28. Calculate the frequency densities needed to draw a histogram for these data.
    (DO NOT DRAW THE HISTOGRAM)
  29. Use interpolation to estimate the median \(Q _ { 2 }\), the lower quartile \(Q _ { 1 }\), and the upper quartile \(Q _ { 3 }\) of these data. The mid-point of each class is represented by \(x\) and the corresponding frequency by \(f\). Calculations then give the following values $$\sum f x = 8379.5 \text { and } \sum f x ^ { 2 } = 557489.75$$
  30. Calculate an estimate of the mean and an estimate of the standard deviation for these data. One coefficient of skewness is given by $$\frac { Q _ { 3 } - 2 Q _ { 2 } + Q _ { 1 } } { Q _ { 3 } - Q _ { 1 } } .$$
  31. Evaluate this coefficient and comment on the skewness of these data.
  32. Give another justification of your comment in part (e).
    3. A long distance lorry driver recorded the distance travelled, \(m\) miles, and the amount of fuel used, \(f\) litres, each day. Summarised below are data from the driver's records for a random sample of 8 days. The data are coded such that \(x = m - 250\) and \(y = f - 100\). $$\sum x = 130 \quad \sum y = 48 \quad \sum x y = 8880 \quad S _ { x x } = 20487.5$$
  33. Find the equation of the regression line of \(y\) on \(x\) in the form \(y = a + b x\).
  34. Hence find the equation of the regression line of \(f\) on \(m\).
  35. Predict the amount of fuel used on a journey of 235 miles.
    4. Aeroplanes fly from City \(A\) to City \(B\). Over a long period of time the number of minutes delay in take-off from City \(A\) was recorded. The minimum delay was 5 minutes and the maximum delay was 63 minutes. A quarter of all delays were at most 12 minutes, half were at most 17 minutes and \(75 \%\) were at most 28 minutes. Only one of the delays was longer than 45 minutes. An outlier is an observation that falls either \(1.5 \times\) (interquartile range) above the upper quartile or \(1.5 \times\) (interquartile range) below the lower quartile.
  36. On graph paper, draw a box plot to represent these data.
  37. Comment on the distribution of delays. Justify your answer.
  38. Suggest how the distribution might be interpreted by a passenger who frequently flies from City \(A\) to City \(B\).
    5. The random variable \(X\) has probability function $$\mathrm { P } ( X = x ) = \begin{cases} k x , & x = 1,2,3
    k ( x + 1 ) , & x = 4,5 \end{cases}$$ where \(k\) is a constant.
  39. Find the value of \(k\).
  40. Find the exact value of \(\mathrm { E } ( X )\).
  41. Show that, to 3 significant figures, \(\operatorname { Var } ( X ) = 1.47\).
  42. Find, to 1 decimal place, \(\operatorname { Var } ( 4 - 3 X )\).
    6. A scientist found that the time taken, \(M\) minutes, to carry out an experiment can be modelled by a normal random variable with mean 155 minutes and standard deviation 3.5 minutes. Find
  43. \(\mathrm { P } ( M > 160 )\),
  44. \(\mathrm { P } ( 150 \leq M \leq 157 )\),
  45. the value of \(m\), to 1 decimal place, such that \(\mathrm { P } ( M \leq m ) = 0.30\).
    7. In a school there are 148 students in Years 12 and 13 studying Science, Humanities or Arts subjects. Of these students, 89 wear glasses and the others do not. There are 30 Science students of whom 18 wear glasses. The corresponding figures for the Humanities students are 68 and 44 respectively. A student is chosen at random.
    Find the probability that this student
  46. is studying Arts subjects,
  47. does not wear glasses, given that the student is studying Arts subjects. Amongst the Science students, \(80 \%\) are right-handed. Corresponding percentages for Humanities and Arts students are \(75 \%\) and \(70 \%\) respectively. A student is again chosen at random.
  48. Find the probability that this student is right-handed. \section*{TOTAL FOR PAPER:75 MARKS}
  49. Given that this student is right-handed, find the probability that the student is studying Science subjects. Materials required for examination
    Mathematical Formulae (Green or Lilac) Items included with question papers
    Nil Paper Reference(s)
    6683/01 \section*{Advanced/Advanced Subsidiary} Monday 16 January 2006 - Morning
    Time: 1 hour 30 minutes Candidates may use any calculator EXCEPT those with the facility for symbolic algebra, differentiation and/or integration. Thus candidates may NOT use calculators such as the Texas Instruments TI 89, TI 92, Casio CFX 9970G, Hewlett Packard HP 48G. In the boxes on the answer book, write the name of the examining body (Edexcel), your centre number, candidate number, the unit title (Statistics S1), the paper reference (6683), your surname, other name and signature.
    Values from the statistical tables should be quoted in full. When a calculator is used, the answer should be given to an appropriate degree of accuracy. A booklet 'Mathematical Formulae and Statistical Tables' is provided.
    Full marks may be obtained for answers to ALL questions.
    The marks for individual questions and the parts of questions are shown in round brackets: e.g. (2). There are 7 questions on this paper. The total mark for this paper is 75 . You must ensure that your answers to parts of questions are clearly labelled.
    You must show sufficient working to make your methods clear to the Examiner. Answers without working may gain no credit.
    1. Over a period of time, the number of people \(x\) leaving a hotel each morning was recorded. These data are summarised in the stem and leaf diagram below.
    Two of the conversations were chosen at random.
  50. Find the probability that both of them were longer than 24.5 minutes. The mid-point of each class was represented by \(x\) and its corresponding frequency by \(f\), giving \(\sum f x = 1060\).
  51. Calculate an estimate of the mean time spent on their conversations. During the following 25 weeks they monitored their weekly conversation and found that at the end of the 80 weeks their overall mean length of conversation was 21 minutes.
  52. Find the mean time spent in conversation during these 25 weeks.
  53. Comment on these two mean values.
    3. A metallurgist measured the length, \(l \mathrm {~mm}\), of a copper rod at various temperatures, \(t ^ { \circ } \mathrm { C }\), and recorded the following results. (You may use: \(\sum x = 315 , \sum x ^ { 2 } = 15225 , \sum y = 620 , \sum y ^ { 2 } = 56550 , \sum x y = 28750\) )
  54. Draw a scatter diagram to represent these data.
  55. Show that \(S _ { x y } = 4337.5\) and find \(S _ { x x }\). The student believes that a linear relationship of the form \(y = a + b x\) could be used to describe these data.
  56. Use linear regression to find the value of \(a\) and the value of \(b\), giving your answers to 1 decimal place.
  57. Draw the regression line on your diagram. The student believes that one brand of chocolate is overpriced.
  58. Use the scatter diagram to
    1. state which brand is overpriced,
    2. suggest a fair price for this brand. Give reasons for both your answers.
      4. A survey of the reading habits of some students revealed that, on a regular basis, \(25 \%\) read quality newspapers, \(45 \%\) read tabloid newspapers and \(40 \%\) do not read newspapers at all.
  59. Find the proportion of students who read both quality and tabloid newspapers.
  60. Draw a Venn diagram to represent this information. A student is selected at random. Given that this student reads newspapers on a regular basis,
  61. find the probability that this student only reads quality newspapers.
    5. \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{e8afd947-55ac-424b-8db5-d5aa856ef4d7-046_494_926_258_1706} \captionsetup{labelformat=empty} \caption{Figure 2}
    \end{figure} Figure 2 shows a histogram for the variable \(t\) which represents the time taken, in minutes, by a group of people to swim 500 m .
  62. Copy and complete the frequency table for \(t\). $$\text { [You may use } \sum x ^ { 2 } = 60475 , \sum y ^ { 2 } = 53122 , \sum x y = 56076 \text { ] }$$
  63. Showing your working clearly, calculate the product moment correlation coefficient between the interview test and the performance after one year. The product moment correlation coefficient between the skills assessment and the performance after one year is - 0.156 to 3 significant figures.
  64. Use your answer to part (a) to comment on whether or not the interview test and skills assessment are a guide to the performance after one year. Give clear reasons for your answers.
    2. Cotinine is a chemical that is made by the body from nicotine which is found in cigarette smoke. A doctor tested the blood of 12 patients, who claimed to smoke a packet of cigarettes a day, for cotinine. The results, in appropriate units, are shown below. For the Balmoral Hotel,
  65. write down the mode of the age of the residents,
  66. find the values of the lower quartile, the median and the upper quartile.
    1. Find the mean, \(\bar { x }\), of the age of the residents.
    2. Given that \(\sum _ { x } x ^ { 2 } = 81213\), find the standard deviation of the age of the residents. One measure of skewness is found using $$\frac { \text { mean - mode } } { \text { standard deviation } }$$
  67. Evaluate this measure for the Balmoral Hotel. For the Abbey Hotel, the mode is 39 , the mean is 33.2 , the standard deviation is 12.7 and the measure of skewness is - 0.454 .
  68. Compare the two age distributions of the residents of each hotel.
    3. The random variable \(X\) has probability distribution given in the table below. A histogram was drawn and the bar representing the \(10 - 15\) class has a width of 2 cm and a height of 5 cm . For the 16-18 class find
  69. the width,
  70. the height
    of the bar representing this class.
    4. A researcher measured the foot lengths of a random sample of 120 ten-year-old children. The lengths are summarised in the table below.
    Foot length, \(\boldsymbol { l } , ( \mathbf { c m } )\)Number of children
    \(10 \leq l < 12\)5
    \(12 \leq l < 17\)53
    \(17 \leq l < 19\)29
    \(19 \leq l < 21\)15
    \(21 \leq l < 23\)11
    \(23 \leq l < 25\)7
  71. Use interpolation to estimate the median of this distribution.
  72. Calculate estimates for the mean and the standard deviation of these data. One measure of skewness is given by $$\text { Coefficient of skewness } = \frac { 3 ( \text { mean } - \text { median } ) } { \text { standard deviation } }$$
  73. Evaluate this coefficient and comment on the skewness of these data. Greg suggests that a normal distribution is a suitable model for the foot lengths of ten-year-old children.
  74. Using the value found in part (c), comment on Greg's suggestion, giving a reason for your answer.
    5. The weight, \(w\) grams, and the length, \(l \mathrm {~mm}\), of 10 randomly selected newborn turtles are given in the table below.
    \(l\)49.052.053.054.554.153.450.051.649.551.2
    \(w\)29323439383530312930
    $$\text { (You may use } S _ { l l } = 33.381 \quad S _ { w l } = 59.99 \quad S _ { w w } = 120.1 \text { ) }$$
  75. Find the equation of the regression line of \(w\) on \(l\) in the form \(w = a + b l\).
  76. Use your regression line to estimate the weight of a newborn turtle of length 60 mm .
  77. Comment on the reliability of your estimate giving a reason for your answer.
    6. The discrete random variable \(X\) has probability function $$\mathrm { P } ( X = x ) = \left\{ \begin{array} { c l } a ( 3 - x ) & x = 0,1,2
    b & x = 3 \end{array} \right.$$
  78. Find \(\mathrm { P } ( X = 2 )\) and copy and complete the table below.
    \(x\)0123
    \(\mathrm { P } ( X = x )\)\(3 a\)\(2 a\)\(b\)
    Given that \(\mathrm { E } ( X ) = 1.6\),
  79. find the value of \(a\) and the value of \(b\). Find
  80. \(\mathrm { P } ( 0.5 < X < 3 )\),
  81. \(\mathrm { E } ( 3 X - 2 )\).
  82. Show that the \(\operatorname { Var } ( X ) = 1.64\)
  83. Calculate \(\operatorname { Var } ( 3 X - 2 )\).
    7. (a) Given that \(\mathrm { P } ( A ) = a\) and \(\mathrm { P } ( B ) = b\) express \(\mathrm { P } ( A \cup B )\) in terms of \(a\) and \(b\) when
    1. \(A\) and \(B\) are mutually exclusive,
    2. \(A\) and \(B\) are independent. Two events \(R\) and \(Q\) are such that $$\mathrm { P } ( R \cap Q \square ) = 0.15 , \quad \mathrm { P } ( Q ) = 0.35 \quad \text { and } \quad \mathrm { P } ( R \mid Q ) = 0.1$$ Find the value of
  84. \(\mathrm { P } ( R \cup Q )\),
  85. \(\mathrm { P } ( R \cap Q )\),
  86. \(\mathrm { P } ( R )\).