ADMET and DMPK, Vol. 9 No. 1, 2021.
Reminescences
https://doi.org/10.5599/admet.888
Do you know your r2?
Alex Avdeef
; in-ADME Research
Abstract
The prediction of solubility of drugs usually calls on the use of several open-source/commercially-available computer programs in the various calculation steps. Popular statistics to indicate the strength of the prediction model include the coefficient of determination (r2), Pearson’s linear correlation coefficient (rPearson), and the root-mean-square error (RMSE), among many others. When a program calculates these statistics, slightly different definitions may be used. This commentary briefly reviews the definitions of three types of r2 and RMSE statistics (model validation, bias compensation, and Pearson) and how systematic errors due to shortcomings in solubility prediction models can be differently indicated by the choice of statistical indices. The indices we have employed in recently published papers on the prediction of solubility of druglike molecules were unclear, especially in cases of drugs from ‘beyond the Rule of 5’ chemical space, as simple prediction models showed distinctive ‘bias-tilt’ systematic type scatter.
Keywords
coefficient of determination; linear correction coefficient; root-mean-square error; linear regression
Hrčak ID:
247491
URI
Publication date:
8.12.2020.
Visits: 1.835 *