Hypothesis test for correlation

A question is this type if and only if it asks to perform a formal hypothesis test to determine if correlation is significant (positive, negative, or non-zero).

6 questions

CAIE Further Paper 4 2020 Specimen Q4
4 Th m b r, \(x , 6\) a certain ty sea sh ll was co ed at \(\theta\) rach ly cb en sites, each \(\mathbf { a }\) metre seq re, alg th co stlie in co ry \(A\). Th \(m \quad \mathbf { b } , y , \boldsymbol { 6 }\) th same \(\quad \mathbf { 6 }\) sea sh ll was co ed at \(\theta\) rach ly cb en sites, each \(\mathbf {} { } _ { \text {t } }\) metre sq re, alog th co stlie in co ry \(B\). Tb results are sm marised as fb lw s,w b re \(\bar { x }\) ad \(\bar { y } \mathbf { d } \mathbf { h }\) e th samp e meas \(\mathbf { 6 } x\) ad \(y\) resp ctiv ly. $$\bar { x } = 9 \quad \Sigma ( x - \bar { x } ) ^ { 2 } = \mathbf { 3 } \quad \bar { y } = \mathbf { 4 } \quad \Sigma ( y - \bar { y } ) ^ { 2 } = \text { 日 }$$ \includegraphics[max width=\textwidth, alt={}]{0df58f9d-6700-46cc-bcf0-903e94cccc02-06_58_1667_539_239} metre,it b co stlin sirc \(\mathbf { b }\) ry \(A\) ad inc \(\mathbf { b }\) ry \(B\).
CAIE FP2 2013 November Q10
10 The lengths, \(x \mathrm {~m}\), and masses, \(y \mathrm {~kg}\), of 12 randomly chosen babies born at a particular hospital last year are summarised as follows. $$\Sigma x = 7.50 \quad \Sigma x ^ { 2 } = 4.73 \quad \Sigma y = 38.6 \quad \Sigma y ^ { 2 } = 124.84 \quad \Sigma x y = 24.25$$ Find the value of the product moment correlation coefficient for this sample. Obtain an estimate for the mass of a baby, born last year at the hospital, whose length is 0.64 m . Test, at the \(2 \%\) significance level, whether there is non-zero correlation between the two variables.
OCR MEI AS Paper 2 2021 November Q9
9 Arun, Beth and Charlie are investigating whether there is any association between death rate per 1000 and physician density per 1000. They each collect a random sample of size 10. Arun’s sample is shown in Fig.9.1. \begin{table}[h]
death rate per 1000physician density per 1000
Canberra7.23.62
Dhaka5.30.49
Brasilia6.82.23
Yaounde9.30.08
Zagreb12.53.08
Tehran5.41.16
Rome10.74.14
Tripoli3.82.09
Oslo7.94.51
Abuja9.70.35
\captionsetup{labelformat=empty} \caption{Fig. 9.1}
\end{table}
  1. Explain whether or not Arun collected his data from the pre-release material, or whether it is not possible to say. Beth and Charlie collected their samples from the pre-release material. Each of them drew a scatter diagram for their samples. The samples and scatter diagrams are shown in Figs. 9.2 and 9.3.
    Beth's sampledeath rate per 1000physician density per 1000
    Sudan6.70.41
    Cambodia7.40.17
    Gabon6.20.36
    Seychelles70.95
    Mexico5.42.25
    Kuwait2.32.58
    Haiti7.50.23
    Maldives41.04
    Nauru5.91.24
    Jordan3.42.34
    \includegraphics[max width=\textwidth, alt={}]{2b9ce212-84e2-4817-be94-98e2adff12a3-08_545_1024_340_918}
    \begin{table}[h]
    Charlie's sampledeath rate per 1000physician density per 1000
    Vanuata40.17
    Solomon Islands3.80.2
    N. Mariana Islands4.90.36
    Nauru5.91.24
    United Kingdom9.42.81
    Portugal10.63.34
    North Macedonia9.62.87
    Faroe Islands8.82.62
    Bulgaria14.53.99
    St. Kitts and Nevis7.22.52
    \captionsetup{labelformat=empty} \caption{Fig. 9.3}
    \end{table} \begin{figure}[h]
    \captionsetup{labelformat=empty} \caption{Fig. 9.2} \includegraphics[alt={},max width=\textwidth]{2b9ce212-84e2-4817-be94-98e2adff12a3-08_572_899_1400_1041}
    \end{figure} Arun states that Charlie's sample and Beth's sample cannot both be random for the following reasons.
    • Both samples include Nauru - there should not be any common values.
    • Beth's diagram suggests a negative association between death rate and physician density, whereas Charlie's diagram suggests a positive association. If both samples are random the same relationship would be suggested.
    • - Explain whether Arun’s reasons are valid.
    • State whether or not Arun is correct, or whether it is not possible to say.
    Kofi collects a sample of 10 African countries and 10 European countries. The scatter diagram for his results is shown in Fig. 9.4. \begin{figure}[h]
    \includegraphics[alt={},max width=\textwidth]{2b9ce212-84e2-4817-be94-98e2adff12a3-09_485_903_902_260} \captionsetup{labelformat=empty} \caption{Fig. 9.4}
    \end{figure}
  2. On the copy of Fig. 9.4 in the Printed Answer Booklet, use your knowledge of the pre-release material to identify the points representing the 10 European countries, justifying your choice.
Edexcel S1 Q6
6. The Principal of a school believes that more students are absent on days when the temperature is lower. Over a two-week period in December she records the percentage of students who are absent, \(A \%\), and the temperature, \(T ^ { \circ } \mathrm { C }\), at 9 am each morning giving these results.
\(T \left( { } ^ { \circ } \mathrm { C } \right)\)4\({ } ^ { - } 3\)\({ } ^ { - } 2\)\({ } ^ { - } 6\)037\({ } ^ { - } 1\)32
\(A ( \% )\)8.514.117.020.317.915.512.412.813.711.6
  1. Represent these data on a scatter diagram. You may use $$\Sigma T = 7 , \quad \Sigma A = 143.8 , \quad \Sigma T ^ { 2 } = 137 , \quad \Sigma A ^ { 2 } = 2172.66 , \quad \Sigma T A = 20.7$$
  2. Calculate the product moment correlation coefficient for these data and comment on the Principal’s hypothesis.
  3. Find an equation of the regression line of \(A\) on \(T\) in the form \(A = p + q T\).
  4. Draw the regression line on your scatter diagram.
SPS SPS FM Statistics 2022 February Q2
2. A shopper estimates the cost, \(\pounds X\) per item, of each of 12 items in a supermarket. The shopper's estimates are compared with the actual cost, \(\pounds Y\) per item, of each item. The results are summarised as follows.
\(n = 12\)
\(\sum x ^ { 2 } = 28127\)
\(\sum x = 399\)
\(\Sigma y ^ { 2 } = 116509.0212\)
\(\Sigma y = 623.88\)
\(\sum x y = 45006.01\) Test at the \(1 \%\) significance level whether the shopper's estimates are positively correlated with the actual cost of the items.
[0pt] [BLANK PAGE]
Edexcel Paper 3 2018 June Q2
  1. Tessa owns a small clothes shop in a seaside town. She records the weekly sales figures, \(\pounds w\), and the average weekly temperature, \(t ^ { \circ } \mathrm { C }\), for 8 weeks during the summer.
    The product moment correlation coefficient for these data is - 0.915
    1. Stating your hypotheses clearly and using a \(5 \%\) level of significance, test whether or not the correlation between sales figures and average weekly temperature is negative.
    2. Suggest a possible reason for this correlation.
    Tessa suggests that a linear regression model could be used to model these data.
  2. State, giving a reason, whether or not the correlation coefficient is consistent with Tessa’s suggestion.
  3. State, giving a reason, which variable would be the explanatory variable. Tessa calculated the linear regression equation as \(w = 10755 - 171 t\)
  4. Give an interpretation of the gradient of this regression equation.