Interpret scatter diagrams and regression lines for bivariate data, including recognition of scatter diagrams which include distinct sections of the population (calculations involving regression lines are excluded).
Students should be familiar with the terms explanatory (independent) and response (dependent) variables.
Use of interpolation and the dangers of extrapolation. Variables other than x and y may be used.
Use to make predictions within the range of values of the explanatory variable.
Change of variable may be required, e.g. using knowledge of logarithms to reduce a relationship of the form y=axn or y=kbx into linear form to estimate a and n or k and b.
Understand informal interpretation of correlation.
Use of terms such as positive, negative, zero, strong and weak are expected.
Understand that correlation does not imply causation.
Recognise and interpret possible outliers in data sets and statistical diagrams.
Any rule needed to identify outliers will be specified in the question. For example, use of Q1−1.5×IQR and Q3+1.5×IQR or mean ±3×SD.
Select or critique data presentation techniques in the context of a statistical problem.
Students will be expected to draw simple inferences and give interpretations to measures of central tendency and variation. Significance tests, other than those mentioned in Section 5, will not be expected.
Be able to clean data, including dealing with missing data, errors and outliers.
For example, students may be asked to identify possible outliers on a box plot or scatter diagram.