Loading…

The accuracy of machine learning models relies on hyperparameter tuning: student result classification using random forest, randomized search, grid search, bayesian, genetic, and optuna algorithms

Hyperparameters play a critical role in analyzing predictive performance in machine learning models. They serve to strike a balance between overfitting and underfitting of research-independent features to prevent extremes. Manual tuning and automated techniques are employed to identify the optimal c...

Full description

Saved in:
Bibliographic Details
Published in:Multimedia tools and applications 2024-02, Vol.83 (30), p.74349-74364
Main Authors: Rimal, Yagyanath, Sharma, Navneet, Alsadoon, Abeer
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c319t-61bb23b5febe80cefa6abe901d60a8c5775860f20baba75fe3da65ddb749f6fa3
cites cdi_FETCH-LOGICAL-c319t-61bb23b5febe80cefa6abe901d60a8c5775860f20baba75fe3da65ddb749f6fa3
container_end_page 74364
container_issue 30
container_start_page 74349
container_title Multimedia tools and applications
container_volume 83
creator Rimal, Yagyanath
Sharma, Navneet
Alsadoon, Abeer
description Hyperparameters play a critical role in analyzing predictive performance in machine learning models. They serve to strike a balance between overfitting and underfitting of research-independent features to prevent extremes. Manual tuning and automated techniques are employed to identify the optimal combination and permutation to achieve the best model performance. This study explores the pursuit of the best fit through various hyperparameters. Following Logistic Regression analysis, this research compared Random Forest, Randomized search, Grid search, Genetic, Bayesian, and Optuna machine learning model tuning for the best accuracy of prediction the student The model accuracy was further assessed using confusion matrices and Receiver Operating Characteristic—Area Under the Curve (ROC-AUC) curves for student grade classification. The genetic algorithm's recommended hyperparameter tuning yielded the highest accuracy (82.5%) and AUC-ROC score (90%) for student result classification. Manual tuning with an estimator of 300, criterion entropy, max features of sqrt, and a minimum sample leaf of 10 achieved an accuracy of 81.1%, which closely resembled the performance randomized search cross-validation algorithm. The default random forest model scored the least accuracy (78%). However, this manual tuning process took a lesser time (3.66 s) to fit the model while grid search CV tuned 941.5 s. Hence, this research made significant contributions to optimizing various machine learning models using a range of hyperparameters for grade classification.
doi_str_mv 10.1007/s11042-024-18426-2
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3099943833</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3099943833</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-61bb23b5febe80cefa6abe901d60a8c5775860f20baba75fe3da65ddb749f6fa3</originalsourceid><addsrcrecordid>eNp9kc2KFTEQhRtRcBx9AVcFbm9rfvrXnQyOCgNuxnWopCu3M3QnbZJeXJ_PBzPXOzCuXFVV-M45hFNVbzl7zxnrPyTOWSNqJpqaD43oavGsuuJtL-u-F_z5P_vL6lVKD4zxrhXNVfX7fiZAY_aI5gTBwopmdp5gIYze-SOsYaIlQaTFUYLgYT5tFDeMuFKmCHk_Yx8h5X0inwuY9iWDWTAlZ53B7IpoT2eviH4KK9hQoHx4PN0vmiCVODMf4Bjd06HxRMmhL8_kKTtzgKKAsJVMBFyOIbo8r-l19cLikujN47yuftx-vr_5Wt99__Lt5tNdbSQfc91xrYXUrSVNAzNksUNNI-NTx3Awbd-3Q8esYBo19gWTE3btNOm-GW1nUV5X7y6-Www_9_IF9RD26Eukkmwcx0YOUhZKXCgTQ0qRrNqiWzGeFGfq3Ja6tKVKW-pvW0oUkbyIUoH9keKT9X9UfwBK3p5H</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3099943833</pqid></control><display><type>article</type><title>The accuracy of machine learning models relies on hyperparameter tuning: student result classification using random forest, randomized search, grid search, bayesian, genetic, and optuna algorithms</title><source>Springer Nature</source><creator>Rimal, Yagyanath ; Sharma, Navneet ; Alsadoon, Abeer</creator><creatorcontrib>Rimal, Yagyanath ; Sharma, Navneet ; Alsadoon, Abeer</creatorcontrib><description>Hyperparameters play a critical role in analyzing predictive performance in machine learning models. They serve to strike a balance between overfitting and underfitting of research-independent features to prevent extremes. Manual tuning and automated techniques are employed to identify the optimal combination and permutation to achieve the best model performance. This study explores the pursuit of the best fit through various hyperparameters. Following Logistic Regression analysis, this research compared Random Forest, Randomized search, Grid search, Genetic, Bayesian, and Optuna machine learning model tuning for the best accuracy of prediction the student The model accuracy was further assessed using confusion matrices and Receiver Operating Characteristic—Area Under the Curve (ROC-AUC) curves for student grade classification. The genetic algorithm's recommended hyperparameter tuning yielded the highest accuracy (82.5%) and AUC-ROC score (90%) for student result classification. Manual tuning with an estimator of 300, criterion entropy, max features of sqrt, and a minimum sample leaf of 10 achieved an accuracy of 81.1%, which closely resembled the performance randomized search cross-validation algorithm. The default random forest model scored the least accuracy (78%). However, this manual tuning process took a lesser time (3.66 s) to fit the model while grid search CV tuned 941.5 s. Hence, this research made significant contributions to optimizing various machine learning models using a range of hyperparameters for grade classification.</description><identifier>ISSN: 1573-7721</identifier><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-024-18426-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Accuracy ; Bayesian analysis ; Classification ; Computer Communication Networks ; Computer Science ; Data Structures and Information Theory ; Genetic algorithms ; Machine learning ; Multimedia Information Systems ; Optimization ; Performance prediction ; Permutations ; Regression analysis ; Regression models ; Searching ; Special Purpose and Application-Based Systems ; Tuning</subject><ispartof>Multimedia tools and applications, 2024-02, Vol.83 (30), p.74349-74364</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-61bb23b5febe80cefa6abe901d60a8c5775860f20baba75fe3da65ddb749f6fa3</citedby><cites>FETCH-LOGICAL-c319t-61bb23b5febe80cefa6abe901d60a8c5775860f20baba75fe3da65ddb749f6fa3</cites><orcidid>0000-0003-1045-7728</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Rimal, Yagyanath</creatorcontrib><creatorcontrib>Sharma, Navneet</creatorcontrib><creatorcontrib>Alsadoon, Abeer</creatorcontrib><title>The accuracy of machine learning models relies on hyperparameter tuning: student result classification using random forest, randomized search, grid search, bayesian, genetic, and optuna algorithms</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>Hyperparameters play a critical role in analyzing predictive performance in machine learning models. They serve to strike a balance between overfitting and underfitting of research-independent features to prevent extremes. Manual tuning and automated techniques are employed to identify the optimal combination and permutation to achieve the best model performance. This study explores the pursuit of the best fit through various hyperparameters. Following Logistic Regression analysis, this research compared Random Forest, Randomized search, Grid search, Genetic, Bayesian, and Optuna machine learning model tuning for the best accuracy of prediction the student The model accuracy was further assessed using confusion matrices and Receiver Operating Characteristic—Area Under the Curve (ROC-AUC) curves for student grade classification. The genetic algorithm's recommended hyperparameter tuning yielded the highest accuracy (82.5%) and AUC-ROC score (90%) for student result classification. Manual tuning with an estimator of 300, criterion entropy, max features of sqrt, and a minimum sample leaf of 10 achieved an accuracy of 81.1%, which closely resembled the performance randomized search cross-validation algorithm. The default random forest model scored the least accuracy (78%). However, this manual tuning process took a lesser time (3.66 s) to fit the model while grid search CV tuned 941.5 s. Hence, this research made significant contributions to optimizing various machine learning models using a range of hyperparameters for grade classification.</description><subject>Accuracy</subject><subject>Bayesian analysis</subject><subject>Classification</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Genetic algorithms</subject><subject>Machine learning</subject><subject>Multimedia Information Systems</subject><subject>Optimization</subject><subject>Performance prediction</subject><subject>Permutations</subject><subject>Regression analysis</subject><subject>Regression models</subject><subject>Searching</subject><subject>Special Purpose and Application-Based Systems</subject><subject>Tuning</subject><issn>1573-7721</issn><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kc2KFTEQhRtRcBx9AVcFbm9rfvrXnQyOCgNuxnWopCu3M3QnbZJeXJ_PBzPXOzCuXFVV-M45hFNVbzl7zxnrPyTOWSNqJpqaD43oavGsuuJtL-u-F_z5P_vL6lVKD4zxrhXNVfX7fiZAY_aI5gTBwopmdp5gIYze-SOsYaIlQaTFUYLgYT5tFDeMuFKmCHk_Yx8h5X0inwuY9iWDWTAlZ53B7IpoT2eviH4KK9hQoHx4PN0vmiCVODMf4Bjd06HxRMmhL8_kKTtzgKKAsJVMBFyOIbo8r-l19cLikujN47yuftx-vr_5Wt99__Lt5tNdbSQfc91xrYXUrSVNAzNksUNNI-NTx3Awbd-3Q8esYBo19gWTE3btNOm-GW1nUV5X7y6-Www_9_IF9RD26Eukkmwcx0YOUhZKXCgTQ0qRrNqiWzGeFGfq3Ja6tKVKW-pvW0oUkbyIUoH9keKT9X9UfwBK3p5H</recordid><startdate>20240215</startdate><enddate>20240215</enddate><creator>Rimal, Yagyanath</creator><creator>Sharma, Navneet</creator><creator>Alsadoon, Abeer</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0003-1045-7728</orcidid></search><sort><creationdate>20240215</creationdate><title>The accuracy of machine learning models relies on hyperparameter tuning: student result classification using random forest, randomized search, grid search, bayesian, genetic, and optuna algorithms</title><author>Rimal, Yagyanath ; Sharma, Navneet ; Alsadoon, Abeer</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-61bb23b5febe80cefa6abe901d60a8c5775860f20baba75fe3da65ddb749f6fa3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Bayesian analysis</topic><topic>Classification</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Genetic algorithms</topic><topic>Machine learning</topic><topic>Multimedia Information Systems</topic><topic>Optimization</topic><topic>Performance prediction</topic><topic>Permutations</topic><topic>Regression analysis</topic><topic>Regression models</topic><topic>Searching</topic><topic>Special Purpose and Application-Based Systems</topic><topic>Tuning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rimal, Yagyanath</creatorcontrib><creatorcontrib>Sharma, Navneet</creatorcontrib><creatorcontrib>Alsadoon, Abeer</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rimal, Yagyanath</au><au>Sharma, Navneet</au><au>Alsadoon, Abeer</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The accuracy of machine learning models relies on hyperparameter tuning: student result classification using random forest, randomized search, grid search, bayesian, genetic, and optuna algorithms</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2024-02-15</date><risdate>2024</risdate><volume>83</volume><issue>30</issue><spage>74349</spage><epage>74364</epage><pages>74349-74364</pages><issn>1573-7721</issn><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>Hyperparameters play a critical role in analyzing predictive performance in machine learning models. They serve to strike a balance between overfitting and underfitting of research-independent features to prevent extremes. Manual tuning and automated techniques are employed to identify the optimal combination and permutation to achieve the best model performance. This study explores the pursuit of the best fit through various hyperparameters. Following Logistic Regression analysis, this research compared Random Forest, Randomized search, Grid search, Genetic, Bayesian, and Optuna machine learning model tuning for the best accuracy of prediction the student The model accuracy was further assessed using confusion matrices and Receiver Operating Characteristic—Area Under the Curve (ROC-AUC) curves for student grade classification. The genetic algorithm's recommended hyperparameter tuning yielded the highest accuracy (82.5%) and AUC-ROC score (90%) for student result classification. Manual tuning with an estimator of 300, criterion entropy, max features of sqrt, and a minimum sample leaf of 10 achieved an accuracy of 81.1%, which closely resembled the performance randomized search cross-validation algorithm. The default random forest model scored the least accuracy (78%). However, this manual tuning process took a lesser time (3.66 s) to fit the model while grid search CV tuned 941.5 s. Hence, this research made significant contributions to optimizing various machine learning models using a range of hyperparameters for grade classification.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-024-18426-2</doi><tpages>16</tpages><orcidid>https://orcid.org/0000-0003-1045-7728</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1573-7721
ispartof Multimedia tools and applications, 2024-02, Vol.83 (30), p.74349-74364
issn 1573-7721
1380-7501
1573-7721
language eng
recordid cdi_proquest_journals_3099943833
source Springer Nature
subjects Accuracy
Bayesian analysis
Classification
Computer Communication Networks
Computer Science
Data Structures and Information Theory
Genetic algorithms
Machine learning
Multimedia Information Systems
Optimization
Performance prediction
Permutations
Regression analysis
Regression models
Searching
Special Purpose and Application-Based Systems
Tuning
title The accuracy of machine learning models relies on hyperparameter tuning: student result classification using random forest, randomized search, grid search, bayesian, genetic, and optuna algorithms
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T23%3A48%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20accuracy%20of%20machine%20learning%20models%20relies%20on%20hyperparameter%20tuning:%20student%20result%20classification%20using%20random%20forest,%20randomized%20search,%20grid%20search,%20bayesian,%20genetic,%20and%20optuna%20algorithms&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Rimal,%20Yagyanath&rft.date=2024-02-15&rft.volume=83&rft.issue=30&rft.spage=74349&rft.epage=74364&rft.pages=74349-74364&rft.issn=1573-7721&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-024-18426-2&rft_dat=%3Cproquest_cross%3E3099943833%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-61bb23b5febe80cefa6abe901d60a8c5775860f20baba75fe3da65ddb749f6fa3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3099943833&rft_id=info:pmid/&rfr_iscdi=true