Loading…

Comparison of four machine learning methods for occupational accidents based on national data on metal sector in Turkey

•Machine learning models (RF, KNN, GBM, RPART) are used to predict occupational accident causes and consequence.•10-fold cross validation method was used for model validationwhich increased the accuracy of the models.•RF showed best performance on accuracy with respect to GBM, RPART and KNN. Occupat...

Full description

Saved in:
Bibliographic Details
Published in:Safety science 2024-06, Vol.174, p.106468, Article 106468
Main Authors: Özkan, Ekin Karakaya, Ulaş, Hasan Basri
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c251t-abbe0951d42a3e03796af5a79ce444988f1ec56ec852353b0d84a88fc9bbd96e3
container_end_page
container_issue
container_start_page 106468
container_title Safety science
container_volume 174
creator Özkan, Ekin Karakaya
Ulaş, Hasan Basri
description •Machine learning models (RF, KNN, GBM, RPART) are used to predict occupational accident causes and consequence.•10-fold cross validation method was used for model validationwhich increased the accuracy of the models.•RF showed best performance on accuracy with respect to GBM, RPART and KNN. Occupational accidents are one of the main problems in production system especially in metal sector. The aim of this study is to develop a predictive framework using machine learning (ML) to identify the causes of fatalities and amputations in the metal sector based on occupational accident data collected by the Turkish Ministry of Labor and Social Security (MLSS) from 2013 to 2018. Researchers have used a variety of strategies to investigate factors and create effective prediction frameworks for lowering occupational accidents. We used random forest (RF), k-nearest neighbour (KNN), gradient boosting method (GBM) and recursive partitioning and regression trees (RPART) to predict accident causes and consequence. Accuracy, precision, recall and f-score is used to measure the performance of ML frameworks. For model validation 10-fold cross validation method was used which increased the accuracy of the frameworks considerably. We extracted important factors which affected the causes of accident at metal sector using feature importance. Analysis proved RF as the best performing framework with highest classification results with 0.9172 accuracy, 0.9618 precision, 0.9518 recall and 0.9568 f-score using all features as compared to other techniques classification of occupational accident severity. To implement preventive controls and interventions in a more targeted way, it is recommended to use the predictive RF algorithm in the analysis of occupational accidents. With these studies, preventive measures can be taken by predicting occupational accidents that may occur in the future.
doi_str_mv 10.1016/j.ssci.2024.106468
format article
fullrecord <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_ssci_2024_106468</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0925753524000584</els_id><sourcerecordid>S0925753524000584</sourcerecordid><originalsourceid>FETCH-LOGICAL-c251t-abbe0951d42a3e03796af5a79ce444988f1ec56ec852353b0d84a88fc9bbd96e3</originalsourceid><addsrcrecordid>eNp9kN1KAzEQhYMoWKsv4FVeYGuSTfYHvJHiHxS8qddhNpm1qd2kJFulb2-W6q1Xw5wz3zBzCLnlbMEZr-62i5SMWwgmZBYqWTVnZMabui04k-KczFgrVFGrUl2Sq5S2jDFeVnxGvpdh2EN0KXgaetqHQ6QDmI3zSHcI0Tv_QQccN8Gm7EYajDnsYXTBw46CMc6iHxPtIKGleYn_8yyMMAkZzl1CM2baebo-xE88XpOLHnYJb37rnLw_Pa6XL8Xq7fl1-bAqjFB8LKDrkLWKWymgRFbWbQW9gro1KKVsm6bnaFSFplGiVGXHbCMhq6btOttWWM6JOO01MaQUsdf76AaIR82ZnqLTWz1Fp6fo9Cm6DN2fIMyXfTmMOk-gN2hdzH9oG9x_-A_eK3px</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Comparison of four machine learning methods for occupational accidents based on national data on metal sector in Turkey</title><source>ScienceDirect Freedom Collection</source><creator>Özkan, Ekin Karakaya ; Ulaş, Hasan Basri</creator><creatorcontrib>Özkan, Ekin Karakaya ; Ulaş, Hasan Basri</creatorcontrib><description>•Machine learning models (RF, KNN, GBM, RPART) are used to predict occupational accident causes and consequence.•10-fold cross validation method was used for model validationwhich increased the accuracy of the models.•RF showed best performance on accuracy with respect to GBM, RPART and KNN. Occupational accidents are one of the main problems in production system especially in metal sector. The aim of this study is to develop a predictive framework using machine learning (ML) to identify the causes of fatalities and amputations in the metal sector based on occupational accident data collected by the Turkish Ministry of Labor and Social Security (MLSS) from 2013 to 2018. Researchers have used a variety of strategies to investigate factors and create effective prediction frameworks for lowering occupational accidents. We used random forest (RF), k-nearest neighbour (KNN), gradient boosting method (GBM) and recursive partitioning and regression trees (RPART) to predict accident causes and consequence. Accuracy, precision, recall and f-score is used to measure the performance of ML frameworks. For model validation 10-fold cross validation method was used which increased the accuracy of the frameworks considerably. We extracted important factors which affected the causes of accident at metal sector using feature importance. Analysis proved RF as the best performing framework with highest classification results with 0.9172 accuracy, 0.9618 precision, 0.9518 recall and 0.9568 f-score using all features as compared to other techniques classification of occupational accident severity. To implement preventive controls and interventions in a more targeted way, it is recommended to use the predictive RF algorithm in the analysis of occupational accidents. With these studies, preventive measures can be taken by predicting occupational accidents that may occur in the future.</description><identifier>ISSN: 0925-7535</identifier><identifier>EISSN: 1879-1042</identifier><identifier>DOI: 10.1016/j.ssci.2024.106468</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Gradient Boosting Method ; K-nearest Neighbour ; Occupational Accident ; Random forest ; Recursive Partitioning and Regression Trees</subject><ispartof>Safety science, 2024-06, Vol.174, p.106468, Article 106468</ispartof><rights>2024 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c251t-abbe0951d42a3e03796af5a79ce444988f1ec56ec852353b0d84a88fc9bbd96e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Özkan, Ekin Karakaya</creatorcontrib><creatorcontrib>Ulaş, Hasan Basri</creatorcontrib><title>Comparison of four machine learning methods for occupational accidents based on national data on metal sector in Turkey</title><title>Safety science</title><description>•Machine learning models (RF, KNN, GBM, RPART) are used to predict occupational accident causes and consequence.•10-fold cross validation method was used for model validationwhich increased the accuracy of the models.•RF showed best performance on accuracy with respect to GBM, RPART and KNN. Occupational accidents are one of the main problems in production system especially in metal sector. The aim of this study is to develop a predictive framework using machine learning (ML) to identify the causes of fatalities and amputations in the metal sector based on occupational accident data collected by the Turkish Ministry of Labor and Social Security (MLSS) from 2013 to 2018. Researchers have used a variety of strategies to investigate factors and create effective prediction frameworks for lowering occupational accidents. We used random forest (RF), k-nearest neighbour (KNN), gradient boosting method (GBM) and recursive partitioning and regression trees (RPART) to predict accident causes and consequence. Accuracy, precision, recall and f-score is used to measure the performance of ML frameworks. For model validation 10-fold cross validation method was used which increased the accuracy of the frameworks considerably. We extracted important factors which affected the causes of accident at metal sector using feature importance. Analysis proved RF as the best performing framework with highest classification results with 0.9172 accuracy, 0.9618 precision, 0.9518 recall and 0.9568 f-score using all features as compared to other techniques classification of occupational accident severity. To implement preventive controls and interventions in a more targeted way, it is recommended to use the predictive RF algorithm in the analysis of occupational accidents. With these studies, preventive measures can be taken by predicting occupational accidents that may occur in the future.</description><subject>Gradient Boosting Method</subject><subject>K-nearest Neighbour</subject><subject>Occupational Accident</subject><subject>Random forest</subject><subject>Recursive Partitioning and Regression Trees</subject><issn>0925-7535</issn><issn>1879-1042</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kN1KAzEQhYMoWKsv4FVeYGuSTfYHvJHiHxS8qddhNpm1qd2kJFulb2-W6q1Xw5wz3zBzCLnlbMEZr-62i5SMWwgmZBYqWTVnZMabui04k-KczFgrVFGrUl2Sq5S2jDFeVnxGvpdh2EN0KXgaetqHQ6QDmI3zSHcI0Tv_QQccN8Gm7EYajDnsYXTBw46CMc6iHxPtIKGleYn_8yyMMAkZzl1CM2baebo-xE88XpOLHnYJb37rnLw_Pa6XL8Xq7fl1-bAqjFB8LKDrkLWKWymgRFbWbQW9gro1KKVsm6bnaFSFplGiVGXHbCMhq6btOttWWM6JOO01MaQUsdf76AaIR82ZnqLTWz1Fp6fo9Cm6DN2fIMyXfTmMOk-gN2hdzH9oG9x_-A_eK3px</recordid><startdate>202406</startdate><enddate>202406</enddate><creator>Özkan, Ekin Karakaya</creator><creator>Ulaş, Hasan Basri</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>202406</creationdate><title>Comparison of four machine learning methods for occupational accidents based on national data on metal sector in Turkey</title><author>Özkan, Ekin Karakaya ; Ulaş, Hasan Basri</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c251t-abbe0951d42a3e03796af5a79ce444988f1ec56ec852353b0d84a88fc9bbd96e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Gradient Boosting Method</topic><topic>K-nearest Neighbour</topic><topic>Occupational Accident</topic><topic>Random forest</topic><topic>Recursive Partitioning and Regression Trees</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Özkan, Ekin Karakaya</creatorcontrib><creatorcontrib>Ulaş, Hasan Basri</creatorcontrib><collection>CrossRef</collection><jtitle>Safety science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Özkan, Ekin Karakaya</au><au>Ulaş, Hasan Basri</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparison of four machine learning methods for occupational accidents based on national data on metal sector in Turkey</atitle><jtitle>Safety science</jtitle><date>2024-06</date><risdate>2024</risdate><volume>174</volume><spage>106468</spage><pages>106468-</pages><artnum>106468</artnum><issn>0925-7535</issn><eissn>1879-1042</eissn><abstract>•Machine learning models (RF, KNN, GBM, RPART) are used to predict occupational accident causes and consequence.•10-fold cross validation method was used for model validationwhich increased the accuracy of the models.•RF showed best performance on accuracy with respect to GBM, RPART and KNN. Occupational accidents are one of the main problems in production system especially in metal sector. The aim of this study is to develop a predictive framework using machine learning (ML) to identify the causes of fatalities and amputations in the metal sector based on occupational accident data collected by the Turkish Ministry of Labor and Social Security (MLSS) from 2013 to 2018. Researchers have used a variety of strategies to investigate factors and create effective prediction frameworks for lowering occupational accidents. We used random forest (RF), k-nearest neighbour (KNN), gradient boosting method (GBM) and recursive partitioning and regression trees (RPART) to predict accident causes and consequence. Accuracy, precision, recall and f-score is used to measure the performance of ML frameworks. For model validation 10-fold cross validation method was used which increased the accuracy of the frameworks considerably. We extracted important factors which affected the causes of accident at metal sector using feature importance. Analysis proved RF as the best performing framework with highest classification results with 0.9172 accuracy, 0.9618 precision, 0.9518 recall and 0.9568 f-score using all features as compared to other techniques classification of occupational accident severity. To implement preventive controls and interventions in a more targeted way, it is recommended to use the predictive RF algorithm in the analysis of occupational accidents. With these studies, preventive measures can be taken by predicting occupational accidents that may occur in the future.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.ssci.2024.106468</doi></addata></record>
fulltext fulltext
identifier ISSN: 0925-7535
ispartof Safety science, 2024-06, Vol.174, p.106468, Article 106468
issn 0925-7535
1879-1042
language eng
recordid cdi_crossref_primary_10_1016_j_ssci_2024_106468
source ScienceDirect Freedom Collection
subjects Gradient Boosting Method
K-nearest Neighbour
Occupational Accident
Random forest
Recursive Partitioning and Regression Trees
title Comparison of four machine learning methods for occupational accidents based on national data on metal sector in Turkey
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T21%3A09%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparison%20of%20four%20machine%20learning%20methods%20for%20occupational%20accidents%20based%20on%20national%20data%20on%20metal%20sector%20in%20Turkey&rft.jtitle=Safety%20science&rft.au=%C3%96zkan,%20Ekin%20Karakaya&rft.date=2024-06&rft.volume=174&rft.spage=106468&rft.pages=106468-&rft.artnum=106468&rft.issn=0925-7535&rft.eissn=1879-1042&rft_id=info:doi/10.1016/j.ssci.2024.106468&rft_dat=%3Celsevier_cross%3ES0925753524000584%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c251t-abbe0951d42a3e03796af5a79ce444988f1ec56ec852353b0d84a88fc9bbd96e3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true