212 questions · 28 question types identified
Questions that provide raw bivariate data in a table and ask to find the regression line of y on x.
| \(l\) | 49.0 | 52.0 | 53.0 | 54.5 | 54.1 | 53.4 | 50.0 | 51.6 | 49.5 | 51.2 |
| \(w\) | 29 | 32 | 34 | 39 | 38 | 35 | 30 | 31 | 29 | 30 |
Questions that require finding the regression equation in coded variables and then converting it to original variables, or vice versa, using the coding transformations.
A question is this type if and only if it asks whether a regression line provides reliable estimates, whether extrapolation is appropriate, or to comment on the validity of using the model for prediction.
Questions that provide summary statistics (sums, means, variances, Sxx, Sxy, etc.) and ask to find the regression line of y on x.
| Mean | Variance | |
| \(x\) | 3.3125 | 3.3086 |
| \(y\) | 6.7375 | 7.9473 |
A question is this type if and only if it asks to interpret the meaning of the gradient, intercept, or other feature of a regression line in context.
A question is this type if and only if it asks to find the correlation coefficient or other relationship given both regression line equations (y on x and x on y).
A question is this type if and only if it asks to identify which variable is the independent/explanatory/controlled variable and which is the dependent/response variable.
Questions that provide summary statistics (such as Sxx, Syy, Sxy, sums of x, y, x², y², xy) and require calculating the product moment correlation coefficient using these given values.
A question is this sub-type if and only if the student must first calculate the regression line equation from summary statistics (using formulas for gradient and intercept) before making a prediction.
A question is this type if and only if it requires finding unknown data values given the regression line equation and some of the data points.
| \(x\) | 1 | 2 | 3 | 4 | 5 |
| \(y\) | 4 | \(p\) | \(q\) | 2 | 1 |
A question is this sub-type if and only if it provides a scatter diagram and requires interpretation of its features such as correlation strength, outliers, or relationship patterns without requiring drawing.
A question is this type if and only if it involves transforming a non-linear relationship (e.g., y = Ca^x) into linear form by taking logarithms or other transformations to enable linear regression.
| \(x\) | 0.9 | 1.6 | 2.4 | 3.2 |
| \(\ln y\) | 1.7 | 1.9 | 2.3 | 2.6 |
Questions that provide raw bivariate data in a table and require calculating the product moment correlation coefficient directly from the individual data values.
| Account | A | B | C | D | E | F | G | H |
| \(p\) | 1.6 | 2.1 | 2.4 | 2.7 | 2.8 | 3.3 | 5.2 | 8.4 |
| \(q\) | 1.6 | 2.3 | 2.2 | 2.2 | 3.1 | 2.9 | 7.6 | 4.8 |
A question is this sub-type if and only if it provides summary statistics (such as Σx, Σy, Σx², Σy², Σxy, n) and asks to calculate Sxx, Syy, or Sxy using the standard formulas.
A question is this type if and only if it asks to explain what is meant by 'least squares' in the context of regression, typically requiring reference to minimizing sum of squared residuals.
Questions that require testing whether the population correlation coefficient is zero (or equivalently, whether there is significant correlation) using the product moment correlation coefficient and t-distribution or critical value tables.
| \(x\) | 1 | 2 | 4 | 5 | 8 |
| \(y\) | 7 | 5 | 8 | 6 | 4 |
A question is this sub-type if and only if it provides raw data values and asks to calculate Sxx, Syy, or Sxy directly from those values.
A question is this type if and only if it asks to find the mean values of x and y given the equations of both regression lines (using the fact that both pass through the mean point).
A question is this type if and only if it asks to describe, interpret, or comment on the type, strength, or direction of correlation from a given correlation coefficient or scatter diagram.
| Year | 2007 | 2008 | 2009 | 2010 | 2011 |
| \(x\) | 250 | 270 | 264 | 290 | 292 |
| \(y\) | 4.2 | 3.7 | 3.2 | 3.5 | 3.0 |
Questions that ask to find the regression line of x on y (the reverse regression), either from summary statistics or raw data.
| 5.1 | 5.8 | 6.5 | 7.1 | 7.6 | 8.4 | 9.5 | 10.5 | ||
| 6.2 | 6.1 | 5.9 | 5.6 | 5.3 | 5.4 | 5.3 | 5.1 |
A question is this sub-type if and only if the regression line equation is already provided in the question and the task is simply to substitute a value to make a prediction.
A question is this sub-type if and only if it explicitly requires the student to draw or plot a scatter diagram from given data values.
| \(x\) | 3 | 5 | 6 | 8 | 10 | 12 | 13 | 15 | 16 | 18 |
| \(y\) | 36 | 50 | 53 | 61 | 69 | 79 | 82 | 90 | 88 | 96 |
A question is this type if and only if it asks to calculate the variance of x or y from given summary statistics like Σx, Σx², and n.
A question is this type if and only if it involves algebraically minimizing an expression for the sum of squared residuals to derive regression line parameters.
A question is this sub-type if and only if it requires constructing a confidence interval or prediction interval around the predicted value, involving variance and distributional assumptions.
Questions that require testing whether the regression slope coefficient is significantly different from zero using regression output, standard errors, and t-tests to assess the significance of the relationship.
Questions that require converting summary statistics (like Sxx, Sxy, Syy, or correlation coefficient) between coded and original variables using properties of linear transformations.
A question is this sub-type if and only if it provides summary statistics where one of Sxx, Syy, or Sxy is already given and asks to calculate one or both of the remaining S-values.