The Kuiper metric for calibration

Mark Tygert's homepage

Kuiper's metric of calibration

We want a single scalar number to summarize the differences between n pairs of numbers (R₁, S₁), (R₂, S₂), …, (R_n, S_n), where R_k is either 0 or 1 and S_k can take any real value from 0 to 1 (inclusive), for k = 1, 2, …, n. Perfect calibration is when the expected value of R_k is equal to S_k, for k = 1, 2, …, n.

The Kuiper metric is the absolute value of the sum of (R_k – S_k) / n, summing over only those k for which S_k falls in an interval. The interval is chosen such that the absolute value of the sum is greatest.

The reason for restricting to this worst-case interval is to minimize possible cancellation between positive and negative differences. The Kuiper statistic can take values ranging from 0 to 1 (inclusive).