Brereton, Andrew E and MacKinnon, Stephen and Safikhani, Zhaleh and Reeves, Shawn and Alwash, Sana and Shahani, Vijay and Windemuth, Andreas (2020) Predicting drug properties with parameter-free machine learning: pareto-optimal embedded modeling (POEM). Machine Learning: Science and Technology, 1 (2). 025008. ISSN 2632-2153
Brereton_2020_Mach._Learn.__Sci._Technol._1_025008.pdf - Published Version
Download (1MB)
Abstract
The prediction of absorption, distribution, metabolism, excretion, and toxicity (ADMET) of small molecules from their molecular structure is a central problem in medicinal chemistry with great practical importance in drug discovery. Creating predictive models conventionally requires substantial trial-and-error for the selection of molecular representations, machine learning (ML) algorithms, and hyperparameter tuning. A generally applicable method that performs well on all datasets without tuning would be of great value but is currently lacking. Here, we describe pareto-optimal embedded modeling (POEM), a similarity-based method for predicting molecular properties. POEM is a non-parametric, supervised ML algorithm developed to generate reliable predictive models without need for optimization. POEM's predictive strength is obtained by combining multiple different representations of molecular structures in a context-specific manner, while maintaining low dimensionality. We benchmark POEM relative to industry-standard ML algorithms and published results across 17 classifications tasks. POEM performs well in all cases and reduces the risk of overfitting.
Item Type: | Article |
---|---|
Subjects: | Article Archives > Multidisciplinary |
Depositing User: | Unnamed user with email support@articlearchives.org |
Date Deposited: | 29 Jun 2023 04:21 |
Last Modified: | 11 May 2024 09:35 |
URI: | http://archive.paparesearch.co.in/id/eprint/1745 |