Precision: Calculations for Qualitative

## Definitions of Performance Characteristics

##

## Calculating Repeatability

## Calculating Intermediate Precision and Reproducibility

## Reporting and Interpreting Results

The following calculations are used to communicate to end users the precision of a qualitative test method. Qualitative test methods are those that give a positive or negative diagnosis, regardless of whether the assay produces results with numerical values. The methodology used to calculate precision differs depending on whether the test method produces binary data (presence/absence) or numerical results (Ct values, OD values, etc). In either case, the precision performance characteristic can be expressed in terms of a standard deviation.

**KEY**

CV = Coefficient of Variation (Also known as Relative Standard Deviation)

s = standard deviation

*x̄* = average of
*x _{i}*'s

x_{i} = ith numeric test result

n_{i} = number of binary results for replicate i

*TP _{i}* = # of True Positives among n

*TP _{i}* = # of True Negatives among n

p_{i} = proportion of correct results for the ith replicate

N = total number of measurements

CI = Confidence Interval

**Repeatability**

Consistency in results between technical replicates from a single experiment

**Intermediate Precision**

Consistency in results between experiments when more than one variable is added. Variables could include multiple operators, instruments, days, and/or any one other factor.

**Reproducibility**

Consistency in results across laboratories where multiple variables can be assessed.

The calculations used for all three precision characteristics are the same (see below). The difference lies in the experimental design and resulting data used as input. A measurement, such as the average, percent coefficient of variation, or percent of correct determinations (binary), is evaluated by its range of error.

**Data Input - Numerical Data**

Total number of positive samples tested = ____ (used to generate table)

**Data Input - Binary Data**

Total number of samples tested = ____ (used to generate table)

Total number of replicates per sample = ___ (used to generate table)

**Data Input - Numerical Data**

Total number of positive samples tested = ____ (used to generate table)

Total number of conditions (analysts and instruments) = ___(used to generate table)

**Data Input – Binary Data**

Total number of samples tested = ___ (used to generate table)

Total number of replicates per sample = __ (used to generate table)

Total number of operators or instruments = __ (used to generate table)

Total number of labs = __ (used to generate table)

For tests that produce a numerical value, such as a Ct value or OD value, the precision may be reported as both a CV percentage and the associated error (µ +/- CI at α = 0.05)

**Example:**

*The test had a repeatability CV of 2.3% (21.5 +/- 0.31 at α = 0.05, n=5). Intermediate precision with 3 operators was determined to be 2.8% (21.5 +/- 0.30 at α = 0.05, n=10). Reproducibility CV between the 5 laboratories was 4.5% (21.7 +/- 0.27 at α = 0.05, n=20).*

For tests that produce a binary result, such as a positive or negative diagnosis, the precision is reported as the precision estimate and the associated error (p̂ +/- CI)

**Example: **

*The test had a repeatability estimate of 80% +/- 9%, n=5. Intermediate precision with 2 operators was 87% +/- 11%, n=10. Reproducibility p̂ between the 3 laboratories was 86% +/- 7%, n=20.*

Where possible, it is recommended to design a validation experiment to include the candidate test method and a “gold" standard or reference method. Some validation studies are structured to compare tests to observe improvements. Others compare tests to confirm equivalence or comparability. Comparisons inform analysts on the suitability of a test for broader use.

Example, improved measurements:

- Test method A has a repeatability mean of 0.06 +/- 0.01 OD, n=20, and reproducibility mean of 0.09 +/- 0.02 OD, n=50.
- Test method B has a repeatability mean of 0.17 +/- 0.02 OD, n=20, and reproducibility mean of 0.18 +/- 0.04 OD, n=20.

Interpretation: Test method A is statistically different to test method B. This is desirable if test A is an improvement from test B.

Example, comparable measurements:

- Test method A has a repeatability standard deviation of 0.13 +/- 0.05 OD, n=20, and reproducibility standard deviation of 0.14 +/- 0.06 OD, n=50.
- Test method B has a repeatability standard deviation of 0.12 +/- 0.02 OD, n=20, and reproducibility standard deviation of 0.18 +/- 0.04 OD, n=20.

Interpretation: Test method A is statistically comparable to test method B. This is desirable when:

- Introducing a process improvement (e.g., faster or less expensive) that does not change the outcome of a test measurement.
- There is demand for different test format options, and the options must produce similar measurement outcomes.
- Test A and test B are the same test, but test A was designed in a different lab and test B is being verified to produce similar measurements between labs.