Loading…

Optimization and validation of drug solubility by development of advanced artificial intelligence models

•Solubility enhancement through advanced artificial intelligence model.•Applying different modeless of machine learning such as Random Forest, Extremely Random Trees, and Gradient Boosting Trees.•The GBRT has shown the less error rate and can be used as the best model. Over the last ten years, the a...

Full description

Saved in:
Bibliographic Details
Published in:Journal of molecular liquids 2023-02, Vol.372, p.121113, Article 121113
Main Authors: Liu, Yaoyang, Ahmed Smait, Drai, Yaseen Naser, Abbas, M. A. Altalbawy, Farag, Bahri, Hala, Abdul Kadhim Ruhaima, Ali, Zayad Fathallah, Thura, Hadrawi, Salema K., Alsaddon, Refad E., Alshetaili, Abdullah, Alsubaiyel, Amal M.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•Solubility enhancement through advanced artificial intelligence model.•Applying different modeless of machine learning such as Random Forest, Extremely Random Trees, and Gradient Boosting Trees.•The GBRT has shown the less error rate and can be used as the best model. Over the last ten years, the application of novel mathematical models of Machine Learning employed to model the solubility of drugs especially anticancer drugs, in supercritical carbon dioxide (ScCO2) system has gained remarkable popularity. In this research, three distinct ensemble models have been employed on the data as a novel method for busulfan as anticancer drug for the first time, based on decision trees, including Random Forest (RF), Gradient Boosting Trees (GBRT), and Extremely Randomized Tree (ERT) to predict the solubility of busulfan as an anticancer drug. The dataset has two input parameters, T = Temperature and P = Pressure, and Y = Solubility is the single output. After implementing and tuning these ensemble models' hyper parameters, the performance has been assessed through several metrics. All three models show R-squared score of more than 0.9, but in terms of RMSE, the error rates are 1.80E-04, 1.72E-04, and 1.03E-04 for RF, ERT, and GBRT models, respectively. Also, MAPE metrics 4.51E-01, 4.87E-01, and 3.62E-01 errors had found for RF, ERT, and GBRT models, respectively. GBRT has been selected as the best model due to the less rate of RMSE and MAPE. An analysis has also been performed to find the optimal amount of solubility, which can be considered the (x1 = 38.3, x2 = 333.1, Y = 1.36E-03) vector.
ISSN:0167-7322
1873-3166
DOI:10.1016/j.molliq.2022.121113