Loading…
Optimization and validation of drug solubility by development of advanced artificial intelligence models
•Solubility enhancement through advanced artificial intelligence model.•Applying different modeless of machine learning such as Random Forest, Extremely Random Trees, and Gradient Boosting Trees.•The GBRT has shown the less error rate and can be used as the best model. Over the last ten years, the a...
Saved in:
Published in: | Journal of molecular liquids 2023-02, Vol.372, p.121113, Article 121113 |
---|---|
Main Authors: | , , , , , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •Solubility enhancement through advanced artificial intelligence model.•Applying different modeless of machine learning such as Random Forest, Extremely Random Trees, and Gradient Boosting Trees.•The GBRT has shown the less error rate and can be used as the best model.
Over the last ten years, the application of novel mathematical models of Machine Learning employed to model the solubility of drugs especially anticancer drugs, in supercritical carbon dioxide (ScCO2) system has gained remarkable popularity. In this research, three distinct ensemble models have been employed on the data as a novel method for busulfan as anticancer drug for the first time, based on decision trees, including Random Forest (RF), Gradient Boosting Trees (GBRT), and Extremely Randomized Tree (ERT) to predict the solubility of busulfan as an anticancer drug. The dataset has two input parameters, T = Temperature and P = Pressure, and Y = Solubility is the single output. After implementing and tuning these ensemble models' hyper parameters, the performance has been assessed through several metrics. All three models show R-squared score of more than 0.9, but in terms of RMSE, the error rates are 1.80E-04, 1.72E-04, and 1.03E-04 for RF, ERT, and GBRT models, respectively. Also, MAPE metrics 4.51E-01, 4.87E-01, and 3.62E-01 errors had found for RF, ERT, and GBRT models, respectively. GBRT has been selected as the best model due to the less rate of RMSE and MAPE. An analysis has also been performed to find the optimal amount of solubility, which can be considered the (x1 = 38.3, x2 = 333.1, Y = 1.36E-03) vector. |
---|---|
ISSN: | 0167-7322 1873-3166 |
DOI: | 10.1016/j.molliq.2022.121113 |