12.10 Outliers and cleaning data

AQA Edexcel OCR A OCR B (MEI)
An outlier can be defined in terms of quartiles, as any value which is:

either \(> Q_3 + 1.5 \times \text{IQR}\)

or \(< Q_1 - 1.5 \times \text{IQR}\)
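The quartile rule can be checked with a short calculation. The sketch below is illustrative only: the data values and the function name `quartile_outliers` are made up, and it assumes Python's "inclusive" quartile method, which may not match the convention your exam board uses for every data set.

```python
import statistics


def quartile_outliers(data, k=1.5):
    """Return (lower_bound, upper_bound, outliers) for the rule
    value < Q1 - k*IQR  or  value > Q3 + k*IQR."""
    # Assumption: quartiles are found with the 'inclusive' method.
    # Other quartile conventions can give slightly different values.
    q1, _median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    lower = q1 - k * iqr
    upper = q3 + k * iqr
    outliers = [x for x in data if x < lower or x > upper]
    return lower, upper, outliers


data = [2, 4, 5, 6, 7, 8, 9, 10, 30]  # made-up values for illustration
lower, upper, outliers = quartile_outliers(data)
print(lower, upper, outliers)
```

For this data \(Q_1 = 5\) and \(Q_3 = 9\), so \(\text{IQR} = 4\) and the boundaries are \(5 - 6 = -1\) and \(9 + 6 = 15\). The only value outside them is 30.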

Alternatively, an outlier can be defined in terms of the mean and standard deviation, as any value which is:

either \(> \bar{x} + 2\sigma \)

or \(< \bar{x} - 2\sigma \)

Questions may specify a different multiple of the standard deviation, such as \(\bar{x} \pm 3\sigma\).
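The mean and standard deviation rule can be sketched in the same way. This is a minimal example under stated assumptions: the data values and the function name `sd_outliers` are hypothetical, \(\sigma\) is taken to be the population standard deviation (`statistics.pstdev`), and the multiplier `k` stands in for whatever value the question specifies.

```python
import statistics


def sd_outliers(data, k=2):
    """Return the values lying more than k standard deviations from the mean."""
    mean = statistics.mean(data)
    # Assumption: sigma is the population standard deviation (divide by n).
    sigma = statistics.pstdev(data)
    lower = mean - k * sigma
    upper = mean + k * sigma
    return [x for x in data if x < lower or x > upper]


data = [10, 12, 11, 13, 12, 11, 12, 40]  # made-up values for illustration
print(sd_outliers(data))        # 2 standard deviations
print(sd_outliers(data, k=3))   # 3 standard deviations
```

Here the mean is 15.125 and \(\sigma \approx 9.44\). With \(k = 2\) the upper boundary is about 34.0, so 40 is an outlier. With \(k = 3\) the boundary rises to about 43.4 and 40 is no longer an outlier, which is why the multiplier stated in the question matters.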

Cleaning data is the process of removing outliers from a data set.
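As a rough illustration of cleaning, the sketch below removes every value outside a pair of outlier boundaries and compares the mean before and after. The data, the boundaries and the function name `clean` are made up for the example. The boundaries come from the quartile rule with \(Q_1 = 5\) and \(Q_3 = 9\).

```python
import statistics


def clean(data, lower, upper):
    """Return a copy of data with values outside [lower, upper] removed."""
    return [x for x in data if lower <= x <= upper]


data = [2, 4, 5, 6, 7, 8, 9, 10, 30]  # made-up values for illustration

# Boundaries from the quartile rule: Q1 = 5, Q3 = 9, so IQR = 4.
lower = 5 - 1.5 * 4   # -1
upper = 9 + 1.5 * 4   # 15

cleaned = clean(data, lower, upper)
print(cleaned)
print(statistics.mean(data), statistics.mean(cleaned))
```

Removing the single outlier, 30, changes the mean from 9 to 6.375, which shows how strongly one extreme value can affect the mean.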