Relative Potency and Parallelism In Potency Bioassays

Relative Potency Determination

The relative potency of a test sample is the amount of biological activity it produces compared to an equal amount of a reference standard under the same conditions. Relative potency is measured between a dilution series of doses from both materials. The responses from both dilution curves are constrained to fit regression curves having an identical curve shape that provides the best fit for both curves. The distance between the two curves on the log dose axis is the log of the relative potency, and its antilog is the relative potency. This relative potency is equivalent to the ratio of the IC50s between the two curves. Since the two curves are constrained to the same shape, all ICnn ratios will be the same as the IC50 ratio (e.g. black bars between the constrained curves in graph at right). Determining the relative potency from unconstrained curves is not as accurate since the two curves are never precisely the same shape and ICnn ratios will vary throughout the length of the curves.

Relative potency (black bars) of test sample (blue curve) compared to reference standard (red curve).

Confidence Limits of Relative Potency

An estimation of the confidence limits of the relative potency determination between the two dilution curves is required to measure its reliability and for averaging multiple independent determinations. The confidence limits of the relative potency from nonlinear curves can be determined using the Profile method, which estimates the asymptotic limits numerically, or the Monte Carlo method, which estimate the probability density limits from simulation estimates. The confidence limits of the relative potency from parallel lines can be determined using linear approximation.

Parallelism (Similarity) between Test Sample and Reference Standard

Testing for the similarity (parallelism) of the two regression curves obtained from each dilution series is a prerequisite for determining the relative potency of two bioactive substances in biological systems. When the two substances are not parallel (not similar), there is no meaningful relative potency between the reference standard and the test sample. Parallelism testing is also used for matrix effects (linearity), cross-reactivity, interfering substances, concentration estimation, and inhibition studies.

The dilution series can span a full dose response nonlinear curve or a more limited linear set of doses. The advantage of a full dose response curve is that some differences between substances only appear at high or low doses, and relative potency is more accurately determined. But the more limited number of doses needed for a linear comparison is an advantage for animal studies and requires simpler math computations.

The three main approaches to assessing the parallelism between substances are the RSSE (Chi-Square) method (a direct measure of parallelism), the F Test method (a hypothesis test), and Equivalence method (an empirical test). The first two methods utilize the residual (Residual Sum of Squares Error, RSSE) method from regression statistics (also called the Extra Sum of Squares method), and the latter method compares the confidence intervals of the regression coefficients. All of these methods are cited in the USP 1032, 1033, 1034 guidelines.

3 Methods for Determining Parallelism

RSSE (Chi-Square) Method for Parallelism

In regression statistics, the similarity of regressions (parallelism) is often determined with the Extra Sum of Squares method used in the RSSE (Chi-Square) and F Test methods. The RSSE (Chi-Square) method is a direct measure of the similarity between the weighted residuals² of the individual dilutions of two regression curves (see manuscript: Determining Parallelism and Relative Potency in Immunoassay and Bioassay Data). This residual method, like the F Test method, can use any appropriately weighted least squares regression model (e.g. linear, 3PL, 4PL, 5PL) that provide an adequate fit to the dilution series data. One or both of the dilution series can be a partial dose-response curve. Residual methods are also effective with ill-behaved test methods because the weighting estimates match the test behavior. The RSSE method is sometimes called the Chi-Square method because the RSSEnonparallel result, like all appropriately weighted RSSEs, is chi-square distributed, and a chi-square probability of the RSSEnonparallel result can be used.

The RSSE (Chi-Square) method measures the difference in the residual sum of squares error (RSSE) of unconstrained (independent) curves (RSSEunconstrained) and curves constrained to the same shape (RSSEconstrained) to determine parallelism. When appropriately weighted for that test, the difference between the RSSEconstrained and the RSSEunconstrained is a direct measure (RSSEnonparallel) of the amount of non-parallelism between the assayed data points of the two curves. This RSSEnonparallel result becomes progressively larger the more nonparallel the two curves are.

Unconstrained Curves

Relative Potency Curves in STATLIA MATRIX

Constrained Curves

Standard and Test Sample

Residuals² (RSSEunconstrained)

Standard and Test Sample

Residuals² (RSSEconstrained)

A RSSE threshold can be established empirically that includes an acceptable amount of nonsimilarity for that test. Since the differences between the residuals of each assayed data point are measured individually, the RSSEnonpar allows individual dose regions to be examined. This can be an important benefit because some differences between the test sample and the reference standard only appear at high or low doses.

Importance of Weighting for Parallelism Tests

Statistical curve fitting uses a method called least squares regression fitting, which derives the one set of coefficients that has the smallest sum of squared residuals (RSSE) for that curve model. A squared residual is the vertical distance between the observed point and the curve, squared, divided by the estimated variance at that point. Weighting the squared residual errors with their estimated variances normalizes each point and allows all the points to contribute equally to the regression curve. Accurately estimated variances of the data points are necessary to obtain the Maximum Likelihood Estimate (MLE) of the true underlying regression curve. Adequate weighting can be determined from a single assay, but more accurate weighting estimates are obtained from the responses of pooled assays (see the Tech Note: Curve Weighting). Accurately weighted residuals are necessary for the RSSE (Chi-Square) method but not for the F Test method. However, accurate weighting of the F Test regressions partially offset the issues observed with very good and very poor curve fits.

Unconstrained Curves and Constrained Curves

In the unconstrained curves on the left in the graphs above, the data from both curves are computed independently using separate (5PL) curve fits. The individual weighted residuals²between each observed point and its respective curve are plotted on the weighted residuals² graphs next to the curves. The sum of these weighted residuals² is the RSSEunconstrained. Since the unconstrained sample responses fit their curves independently, the RSSE’s are not affected by any nonsimilarity (nonparallelism) between the curves.

With the constrained curves on the right in the graphs above, the responses from both curves are forced to fit one identical curve shape that provides the best fit for both curves. Since the constrained curves both use the same shape, the RSSEconstrained is affected by the amount of nonsimilarity between the two curves, and consequently have a higher RSSE than the unconstrained curves.

RSSE(Chi-Square) Method Provides Direct Measure of Similarity Between Curves

The difference between the RSSEconstrained and the RSSEunconstrained shown above is a direct measure of the amount of nonsimilarity (RSSEnonparallel) between the two curves. In this example, the two materials tested were the same material so the RSSEnonparallel result is minimal. The parallelism threshold for the RSSEnonparallel result can be set to include any amount of nonsimilarity appropriate for your test.

Parallel and Nonparallel Test Samples

In the examples below, two test samples from the same assay were each compared to the same reference standard from that assay.

Example 1

In the first example, the test sample was a control having the same material as the reference standard. A 5PL curve was fit to both curves in the unconstrained and constrained fits. The RSSEunconstrained shows good fits for each independently fit curve. Because the two materials were the same, their constrained shapes were very similar to their unconstrained shapes and the RSSEnonparallel result was minimal.