Skip to the main content

Reminescences

https://doi.org/10.5599/admet.888

Do you know your r2?

Alex Avdeef ; in-ADME Research


Full text: english PDF 911 Kb

page 69-74

downloads: 767

cite


Abstract

The prediction of solubility of drugs usually calls on the use of several open-source/commercially-available computer programs in the various calculation steps. Popular statistics to indicate the strength of the prediction model include the coefficient of determination (r2), Pearson’s linear correlation coefficient (rPearson), and the root-mean-square error (RMSE), among many others. When a program calculates these statistics, slightly different definitions may be used. This commentary briefly reviews the definitions of three types of r2 and RMSE statistics (model validation, bias compensation, and Pearson) and how systematic errors due to shortcomings in solubility prediction models can be differently indicated by the choice of statistical indices. The indices we have employed in recently published papers on the prediction of solubility of druglike molecules were unclear, especially in cases of drugs from ‘beyond the Rule of 5’ chemical space, as simple prediction models showed distinctive ‘bias-tilt’ systematic type scatter.

Keywords

coefficient of determination; linear correction coefficient; root-mean-square error; linear regression

Hrčak ID:

247491

URI

https://hrcak.srce.hr/247491

Publication date:

8.12.2020.

Visits: 1.885 *