Loading…
Comparison of machine learning methods for predicting the methane production from anaerobic digestion of lignocellulosic biomass
Biogas derived from the anaerobic digestion of biomass can provide a carbon-neutral resource for green energy supply in the future. The biochemical methane potential (BMP) test has been widely applied to assess the characteristics of methane production from anaerobic digestion in batch mode. However...
Saved in:
Published in: | Energy (Oxford) 2023-01, Vol.263, p.125883, Article 125883 |
---|---|
Main Authors: | , , , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Biogas derived from the anaerobic digestion of biomass can provide a carbon-neutral resource for green energy supply in the future. The biochemical methane potential (BMP) test has been widely applied to assess the characteristics of methane production from anaerobic digestion in batch mode. However, the determination of key parameters in the BMP test, such as specific methane yield (SMY), usually requires long-term experiments, especially for lignocellulosic feedstocks with slow degradation rates. This study aims to propose an appropriate data-driven model for the efficient prediction of the SMY using data from 277 samples of various lignocellulosic biomass materials by evaluating ten different machine learning (ML) methods. The Pearson coefficient matrix indicates that the chemical components are more relevant as attributes for the ML models, compared to element compositions, and the content of lignin has a strong linear correlation with SMY. Classic nonlinear ML methods (R2 ≥ 0.61) perform better than linear methods (R2 ≤ 0.56), and an ensemble learning model (R2 = 0.71) is better than a single learner (R2 ≤ 0.67). A K-nearest neighbor (KNN) model using leave-one-out cross-validation (LOOCV) obtains the best performance (R2 = 0.75, MAE = 30.2 mL/gVS). The generalization performance of the best model is found to have an average relative error of 10.05%.
[Display omitted]
•Ten machine learning (ML) algorithms were used for biomethane yield prediction.•Chemical components of 277 lignocellulosic biomass samples were collected.•Nonlinear ML methods can better capture complex data patterns than linear methods.•The mean absolute error of optimal K-nearest neighbor model was 30.2 mL/gVS.•Generalization performance of the best model had average relative error of 10.05%. |
---|---|
ISSN: | 0360-5442 |
DOI: | 10.1016/j.energy.2022.125883 |