Loading…

Comparison of machine learning methods for predicting the methane production from anaerobic digestion of lignocellulosic biomass

Biogas derived from the anaerobic digestion of biomass can provide a carbon-neutral resource for green energy supply in the future. The biochemical methane potential (BMP) test has been widely applied to assess the characteristics of methane production from anaerobic digestion in batch mode. However...

Full description

Saved in:

Bibliographic Details
Published in:	Energy (Oxford) 2023-01, Vol.263, p.125883, Article 125883
Main Authors:	Wang, Zhengxin, Peng, Xinggan, Xia, Ao, Shah, Akeel A., Yan, Huchao, Huang, Yun, Zhu, Xianqing, Zhu, Xun, Liao, Qiang
Format:	Article
Language:	English
Subjects:	Anaerobic digestion Lignocellulosic biomass Machine learning Prediction Specific methane yield
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Biogas derived from the anaerobic digestion of biomass can provide a carbon-neutral resource for green energy supply in the future. The biochemical methane potential (BMP) test has been widely applied to assess the characteristics of methane production from anaerobic digestion in batch mode. However, the determination of key parameters in the BMP test, such as specific methane yield (SMY), usually requires long-term experiments, especially for lignocellulosic feedstocks with slow degradation rates. This study aims to propose an appropriate data-driven model for the efficient prediction of the SMY using data from 277 samples of various lignocellulosic biomass materials by evaluating ten different machine learning (ML) methods. The Pearson coefficient matrix indicates that the chemical components are more relevant as attributes for the ML models, compared to element compositions, and the content of lignin has a strong linear correlation with SMY. Classic nonlinear ML methods (R2 ≥ 0.61) perform better than linear methods (R2 ≤ 0.56), and an ensemble learning model (R2 = 0.71) is better than a single learner (R2 ≤ 0.67). A K-nearest neighbor (KNN) model using leave-one-out cross-validation (LOOCV) obtains the best performance (R2 = 0.75, MAE = 30.2 mL/gVS). The generalization performance of the best model is found to have an average relative error of 10.05%. [Display omitted] •Ten machine learning (ML) algorithms were used for biomethane yield prediction.•Chemical components of 277 lignocellulosic biomass samples were collected.•Nonlinear ML methods can better capture complex data patterns than linear methods.•The mean absolute error of optimal K-nearest neighbor model was 30.2 mL/gVS.•Generalization performance of the best model had average relative error of 10.05%.
ISSN:	0360-5442
DOI:	10.1016/j.energy.2022.125883