219 questions · 30 question types identified
Questions that provide raw bivariate data in a table and ask to find the regression line of y on x.
| \(x\) | 1 | 2 | 4 | 5 | 8 |
| \(y\) | 7 | 5 | 8 | 6 | 4 |
| 2450 | 2480 | 2540 | 2420 | 2350 | 2290 | 2400 | 2460 | ||
| 1370 | 1350 | 1400 | 1330 | 1270 | 1210 | 1330 | 1350 |
| \(x\) | 4 | 6 | 6 | 8 | 2 | 7 | 12 | 14 | 9 | 5 |
| \(y\) | 2 | 4 | 6 | 8 | 6 | 10 | 9 | 8 | 6 | 5 |
Questions that provide summary statistics (sums, means, variances, Sxx, Sxy, etc.) and ask to find the regression line of y on x.
| Mean | Variance | |
| \(x\) | 3.3125 | 3.3086 |
| \(y\) | 6.7375 | 7.9473 |
| \(P\) (£) | 75 | 65 | 55 | 45 | 35 |
| \multirow[t]{5}{*}{\(H\) (hundred)} | 27 | 27 | 27 | 26 | 15 |
| 27 | 27 | 20 | 21 | 12 | |
| 22 | 18 | 16 | 9 | ||
| 19 | 18 | 13 | |||
| 12 | 16 | 9 |
Questions that require finding the regression equation in coded variables and then converting it to original variables, or vice versa, using the coding transformations.
A question is this type if and only if it asks to interpret the meaning of the gradient, intercept, or other feature of a regression line in context.
A question is this type if and only if it asks to identify which variable is the independent/explanatory/controlled variable and which is the dependent/response variable.
| Units of alcohol consumed | 2 | 3 | 3 | 4 | 4.5 | 5.5 | 6 | 6 | 7 | 8 | 8 | 9 |
| Reaction time (seconds) | 1 | 2 | 5 | 5 | 3.8 | 5.5 | 4.8 | 8.5 | 7.2 | 6.8 | 9 | 8 |
| \(t\) | 20 | 35 | 40 | 25 | 45 | 70 | 75 | 90 |
| \(h\) | 88 | 85 | 77 | 75 | 71 | 66 | 60 | 54 |
A question is this sub-type if and only if the student must first calculate the regression line equation from summary statistics (using formulas for gradient and intercept) before making a prediction.
Questions that provide summary statistics (such as Sxx, Syy, Sxy, sums of x, y, x², y², xy) and require calculating the product moment correlation coefficient using these given values.
| Sign of residual | + | + | + | + | - | - | + | - | - | - | - | - | - | - | - | + | + | + | + | + |
A question is this type if and only if it requires finding unknown data values given the regression line equation and some of the data points.
| \(x\) | 3 | 4 | 4 | 6 | 8 |
| \(y\) | 5 | 7 | \(q\) | 6 | 7 |
| \(x\) | 3 | 4 | 4 | 6 | 8 |
| \(y\) | 5 | 7 | \(q\) | 6 | 7 |
A question is this sub-type if and only if it provides a scatter diagram and requires interpretation of its features such as correlation strength, outliers, or relationship patterns without requiring drawing.
| Country | Life expectancy at birth in 2014 |
| Ethiopia | 60.8 |
| Sweden | 81.9 |
A question is this type if and only if it asks to find the correlation coefficient or other relationship given both regression line equations (y on x and x on y).
A question is this sub-type if and only if it provides summary statistics (such as Σx, Σy, Σx², Σy², Σxy, n) and asks to calculate Sxx, Syy, or Sxy using the standard formulas.
A question is this type if and only if it involves transforming a non-linear relationship (e.g., y = Ca^x) into linear form by taking logarithms or other transformations to enable linear regression.
A question is this type if and only if it asks to find the mean values of x and y given the equations of both regression lines (using the fact that both pass through the mean point).
Questions that provide raw bivariate data in a table and require calculating the product moment correlation coefficient directly from the individual data values.
| Account | A | B | C | D | E | F | G | H |
| \(p\) | 1.6 | 2.1 | 2.4 | 2.7 | 2.8 | 3.3 | 5.2 | 8.4 |
| \(q\) | 1.6 | 2.3 | 2.2 | 2.2 | 3.1 | 2.9 | 7.6 | 4.8 |
Questions that require testing whether the population correlation coefficient is zero (or equivalently, whether there is significant correlation) using the product moment correlation coefficient and t-distribution or critical value tables.
Questions that ask whether a regression line provides reliable estimates for a given value, whether extrapolation is appropriate, or to comment on the validity/reliability of using the model for a specific prediction.
A question is this type if and only if it asks to explain what is meant by 'least squares' in the context of regression, typically requiring reference to minimizing sum of squared residuals.
Questions that ask to find the regression line of x on y (the reverse regression), either from summary statistics or raw data.
A question is this type if and only if it asks to describe, interpret, or comment on the type, strength, or direction of correlation from a given correlation coefficient or scatter diagram.
| Year | 2007 | 2008 | 2009 | 2010 | 2011 |
| \(x\) | 250 | 270 | 264 | 290 | 292 |
| \(y\) | 4.2 | 3.7 | 3.2 | 3.5 | 3.0 |
A question is this sub-type if and only if it explicitly requires the student to draw or plot a scatter diagram from given data values.
| \(x\) | 3 | 5 | 6 | 8 | 10 | 12 | 13 | 15 | 16 | 18 |
| \(y\) | 36 | 50 | 53 | 61 | 69 | 79 | 82 | 90 | 88 | 96 |
A question is this sub-type if and only if it provides raw data values and asks to calculate Sxx, Syy, or Sxy directly from those values.
Questions that ask students to assess whether a linear regression model is appropriate based on contextual factors, scatter diagrams, or theoretical considerations (not residual plots).
A question is this type if and only if it involves algebraically minimizing an expression for the sum of squared residuals to derive regression line parameters.
A question is this sub-type if and only if the regression line equation is already provided in the question and the task is simply to substitute a value to make a prediction.
A question is this type if and only if it asks to calculate the variance of x or y from given summary statistics like Σx, Σx², and n.
A question is this sub-type if and only if it requires constructing a confidence interval or prediction interval around the predicted value, involving variance and distributional assumptions.
Questions that require testing whether the regression slope coefficient is significantly different from zero using regression output, standard errors, and t-tests to assess the significance of the relationship.
Questions that provide residual plots and ask students to interpret them or assess model appropriateness based on residual patterns.
Questions that require converting summary statistics (like Sxx, Sxy, Syy, or correlation coefficient) between coded and original variables using properties of linear transformations.
A question is this sub-type if and only if it provides summary statistics where one of Sxx, Syy, or Sxy is already given and asks to calculate one or both of the remaining S-values.