Using a training set of 191 drug-like compounds extracted from the AQUASOL database a quantitative structure-property relationship (QSPR) study was conducted employing a set of simple structural and physicochemical properties to predict aqueous solubility. The resultant regression model comprised five parameters (ClogP, molecular weight, indicator variable for aliphatic amine groups, number of rotatable bonds and number of aromatic rings) and demonstrated acceptable statistics (r(2) = 0.87, s = 0.51, F = 243.6, n = 191). The model was applied to two test sets consisting of a drug-like set of compounds (r2 = 0.80, s = 0.68, n = 174) and a set of agrochemicals (r(2) = 0.88, s = 0.65, n = 200). Using the established general solubility equation (GSE) on the training and drug-like test set gave poorer results than the current study. The agrochemical test set was predicted with equal accuracy using the GSE and the QSPR equation. The results of this study suggest that increasing molecular size, rigidity and lipophilicity decrease solubility whereas increasing conformational flexibility and the presence of a non-conjugated amine group increase the solubility of drug-like compounds. Indeed, the proposed structural parameters make physical sense and provide simple guidelines for modifying solubility during lead optimisation.
|Pages (from-to)||191 - 212|
|Number of pages||22|
|Journal||SAR and QSAR in Environmental Research|
|Publication status||Published - 2008|