Loading…

Expectation–Maximization (EM) Clustering as a Preprocessing Method for Clinical Pathway Mining

Hospital information systems (HIS) are service-oriented systems that focus on payment for medical services. Because all HIS coding for diseases and clinical processes are payment-oriented, they may differ from clinicians’ concepts of diseases and processes. HIS in large-scale hospitals in Japan util...

Full description

Saved in:
Bibliographic Details
Published in:The review of socionetwork strategies 2022, Vol.16 (1), p.25-52
Main Authors: Tsumoto, Shusaku, Kimura, Tomohiro, Hirano, Shoji
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c355t-cd04a9f2a8463af00bb193efa44925544b9f4af05eceeb73d83ce0c88d0ed5373
cites cdi_FETCH-LOGICAL-c355t-cd04a9f2a8463af00bb193efa44925544b9f4af05eceeb73d83ce0c88d0ed5373
container_end_page 52
container_issue 1
container_start_page 25
container_title The review of socionetwork strategies
container_volume 16
creator Tsumoto, Shusaku
Kimura, Tomohiro
Hirano, Shoji
description Hospital information systems (HIS) are service-oriented systems that focus on payment for medical services. Because all HIS coding for diseases and clinical processes are payment-oriented, they may differ from clinicians’ concepts of diseases and processes. HIS in large-scale hospitals in Japan utilize Diagnostic Procedure Combination (DPC) codes, a disease-coding system that focuses on the use of medical resources. Although DPC codes are very precise for diseases requiring surgery, such as cataracts and lung cancer, classification codes for diseases that do not require surgery, such as cerebral infarction, are less precise, with a single category often covering many subtypes with different clinical courses. This paper proposes a preprocessing method that splits DPC codes into subgroups prior to the application of dual clustering-based clinical pathway mining. This method applies expectation–maximization (EM) clustering to the length of patient stay in the hospital using Akaike Information Criteria (AIC) to select the number of clusters. A dual mining method is subsequently applied to the datasets of subgroups and the meanings of subtype clusters are explored using a text mining method. The proposed method was evaluated using datasets from an HIS at Shimane University hospital as preprocessing for clinical pathway mining. The experimental results showed that the proposed method correctly generated subgroups from the more generalized DPC codes and that the clinical pathways identified after this preprocessing capture the characteristics of processes in real clinical settings.
doi_str_mv 10.1007/s12626-021-00100-w
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2654373667</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2654373667</sourcerecordid><originalsourceid>FETCH-LOGICAL-c355t-cd04a9f2a8463af00bb193efa44925544b9f4af05eceeb73d83ce0c88d0ed5373</originalsourceid><addsrcrecordid>eNp9kE1OwzAQhS0EElXpBVhZYgMLg_-TLFFVClIjuoC1cRynTdUmwU7VlhV34IacBKdBYsdsRvP03szoA-CS4FuCcXTnCZVUIkwJwjgoaHcCBiSWEWKUyVMwoIIyxEjEzsHI-xUOxWgUSzIAb5N9Y02r27Kuvj-_Ur0vN-XHcYTXk_QGjtdb31pXVguoPdRw7mzjamO976TUtss6h0XtgrGsSqPXcK7b5U4fYBrmanEBzgq99nb024fg9WHyMn5Es-fp0_h-hgwTokUmx1wnBdUxl0wXGGcZSZgtNOcJFYLzLCl40IU11mYRy2NmLDZxnGObCxaxIbjq94bv3rfWt2pVb10VTioqBQ8OKTsX7V3G1d47W6jGlRvtDopg1cFUPUwVYKojTLULIdaHfNNxsO5v9T-pHzF3eT8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2654373667</pqid></control><display><type>article</type><title>Expectation–Maximization (EM) Clustering as a Preprocessing Method for Clinical Pathway Mining</title><source>Springer Nature</source><creator>Tsumoto, Shusaku ; Kimura, Tomohiro ; Hirano, Shoji</creator><creatorcontrib>Tsumoto, Shusaku ; Kimura, Tomohiro ; Hirano, Shoji</creatorcontrib><description>Hospital information systems (HIS) are service-oriented systems that focus on payment for medical services. Because all HIS coding for diseases and clinical processes are payment-oriented, they may differ from clinicians’ concepts of diseases and processes. HIS in large-scale hospitals in Japan utilize Diagnostic Procedure Combination (DPC) codes, a disease-coding system that focuses on the use of medical resources. Although DPC codes are very precise for diseases requiring surgery, such as cataracts and lung cancer, classification codes for diseases that do not require surgery, such as cerebral infarction, are less precise, with a single category often covering many subtypes with different clinical courses. This paper proposes a preprocessing method that splits DPC codes into subgroups prior to the application of dual clustering-based clinical pathway mining. This method applies expectation–maximization (EM) clustering to the length of patient stay in the hospital using Akaike Information Criteria (AIC) to select the number of clusters. A dual mining method is subsequently applied to the datasets of subgroups and the meanings of subtype clusters are explored using a text mining method. The proposed method was evaluated using datasets from an HIS at Shimane University hospital as preprocessing for clinical pathway mining. The experimental results showed that the proposed method correctly generated subgroups from the more generalized DPC codes and that the clinical pathways identified after this preprocessing capture the characteristics of processes in real clinical settings.</description><identifier>ISSN: 2523-3173</identifier><identifier>EISSN: 1867-3236</identifier><identifier>DOI: 10.1007/s12626-021-00100-w</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Business and Management ; Cataracts ; Clustering ; Coding ; Data mining ; Datasets ; Health services ; Hospitals ; Information systems ; Information Systems Applications (incl.Internet) ; IT in Business ; Maximization ; Optimization ; Patient care planning ; Payment systems ; Preprocessing ; Simulation and Modeling ; Subgroups ; Surgery</subject><ispartof>The review of socionetwork strategies, 2022, Vol.16 (1), p.25-52</ispartof><rights>Springer Japan KK, part of Springer Nature 2022</rights><rights>Springer Japan KK, part of Springer Nature 2022.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c355t-cd04a9f2a8463af00bb193efa44925544b9f4af05eceeb73d83ce0c88d0ed5373</citedby><cites>FETCH-LOGICAL-c355t-cd04a9f2a8463af00bb193efa44925544b9f4af05eceeb73d83ce0c88d0ed5373</cites><orcidid>0000-0001-6651-976X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Tsumoto, Shusaku</creatorcontrib><creatorcontrib>Kimura, Tomohiro</creatorcontrib><creatorcontrib>Hirano, Shoji</creatorcontrib><title>Expectation–Maximization (EM) Clustering as a Preprocessing Method for Clinical Pathway Mining</title><title>The review of socionetwork strategies</title><addtitle>Rev Socionetwork Strat</addtitle><description>Hospital information systems (HIS) are service-oriented systems that focus on payment for medical services. Because all HIS coding for diseases and clinical processes are payment-oriented, they may differ from clinicians’ concepts of diseases and processes. HIS in large-scale hospitals in Japan utilize Diagnostic Procedure Combination (DPC) codes, a disease-coding system that focuses on the use of medical resources. Although DPC codes are very precise for diseases requiring surgery, such as cataracts and lung cancer, classification codes for diseases that do not require surgery, such as cerebral infarction, are less precise, with a single category often covering many subtypes with different clinical courses. This paper proposes a preprocessing method that splits DPC codes into subgroups prior to the application of dual clustering-based clinical pathway mining. This method applies expectation–maximization (EM) clustering to the length of patient stay in the hospital using Akaike Information Criteria (AIC) to select the number of clusters. A dual mining method is subsequently applied to the datasets of subgroups and the meanings of subtype clusters are explored using a text mining method. The proposed method was evaluated using datasets from an HIS at Shimane University hospital as preprocessing for clinical pathway mining. The experimental results showed that the proposed method correctly generated subgroups from the more generalized DPC codes and that the clinical pathways identified after this preprocessing capture the characteristics of processes in real clinical settings.</description><subject>Business and Management</subject><subject>Cataracts</subject><subject>Clustering</subject><subject>Coding</subject><subject>Data mining</subject><subject>Datasets</subject><subject>Health services</subject><subject>Hospitals</subject><subject>Information systems</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>IT in Business</subject><subject>Maximization</subject><subject>Optimization</subject><subject>Patient care planning</subject><subject>Payment systems</subject><subject>Preprocessing</subject><subject>Simulation and Modeling</subject><subject>Subgroups</subject><subject>Surgery</subject><issn>2523-3173</issn><issn>1867-3236</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kE1OwzAQhS0EElXpBVhZYgMLg_-TLFFVClIjuoC1cRynTdUmwU7VlhV34IacBKdBYsdsRvP03szoA-CS4FuCcXTnCZVUIkwJwjgoaHcCBiSWEWKUyVMwoIIyxEjEzsHI-xUOxWgUSzIAb5N9Y02r27Kuvj-_Ur0vN-XHcYTXk_QGjtdb31pXVguoPdRw7mzjamO976TUtss6h0XtgrGsSqPXcK7b5U4fYBrmanEBzgq99nb024fg9WHyMn5Es-fp0_h-hgwTokUmx1wnBdUxl0wXGGcZSZgtNOcJFYLzLCl40IU11mYRy2NmLDZxnGObCxaxIbjq94bv3rfWt2pVb10VTioqBQ8OKTsX7V3G1d47W6jGlRvtDopg1cFUPUwVYKojTLULIdaHfNNxsO5v9T-pHzF3eT8</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Tsumoto, Shusaku</creator><creator>Kimura, Tomohiro</creator><creator>Hirano, Shoji</creator><general>Springer Nature Singapore</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-6651-976X</orcidid></search><sort><creationdate>2022</creationdate><title>Expectation–Maximization (EM) Clustering as a Preprocessing Method for Clinical Pathway Mining</title><author>Tsumoto, Shusaku ; Kimura, Tomohiro ; Hirano, Shoji</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c355t-cd04a9f2a8463af00bb193efa44925544b9f4af05eceeb73d83ce0c88d0ed5373</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Business and Management</topic><topic>Cataracts</topic><topic>Clustering</topic><topic>Coding</topic><topic>Data mining</topic><topic>Datasets</topic><topic>Health services</topic><topic>Hospitals</topic><topic>Information systems</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>IT in Business</topic><topic>Maximization</topic><topic>Optimization</topic><topic>Patient care planning</topic><topic>Payment systems</topic><topic>Preprocessing</topic><topic>Simulation and Modeling</topic><topic>Subgroups</topic><topic>Surgery</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tsumoto, Shusaku</creatorcontrib><creatorcontrib>Kimura, Tomohiro</creatorcontrib><creatorcontrib>Hirano, Shoji</creatorcontrib><collection>CrossRef</collection><jtitle>The review of socionetwork strategies</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tsumoto, Shusaku</au><au>Kimura, Tomohiro</au><au>Hirano, Shoji</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Expectation–Maximization (EM) Clustering as a Preprocessing Method for Clinical Pathway Mining</atitle><jtitle>The review of socionetwork strategies</jtitle><stitle>Rev Socionetwork Strat</stitle><date>2022</date><risdate>2022</risdate><volume>16</volume><issue>1</issue><spage>25</spage><epage>52</epage><pages>25-52</pages><issn>2523-3173</issn><eissn>1867-3236</eissn><abstract>Hospital information systems (HIS) are service-oriented systems that focus on payment for medical services. Because all HIS coding for diseases and clinical processes are payment-oriented, they may differ from clinicians’ concepts of diseases and processes. HIS in large-scale hospitals in Japan utilize Diagnostic Procedure Combination (DPC) codes, a disease-coding system that focuses on the use of medical resources. Although DPC codes are very precise for diseases requiring surgery, such as cataracts and lung cancer, classification codes for diseases that do not require surgery, such as cerebral infarction, are less precise, with a single category often covering many subtypes with different clinical courses. This paper proposes a preprocessing method that splits DPC codes into subgroups prior to the application of dual clustering-based clinical pathway mining. This method applies expectation–maximization (EM) clustering to the length of patient stay in the hospital using Akaike Information Criteria (AIC) to select the number of clusters. A dual mining method is subsequently applied to the datasets of subgroups and the meanings of subtype clusters are explored using a text mining method. The proposed method was evaluated using datasets from an HIS at Shimane University hospital as preprocessing for clinical pathway mining. The experimental results showed that the proposed method correctly generated subgroups from the more generalized DPC codes and that the clinical pathways identified after this preprocessing capture the characteristics of processes in real clinical settings.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s12626-021-00100-w</doi><tpages>28</tpages><orcidid>https://orcid.org/0000-0001-6651-976X</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 2523-3173
ispartof The review of socionetwork strategies, 2022, Vol.16 (1), p.25-52
issn 2523-3173
1867-3236
language eng
recordid cdi_proquest_journals_2654373667
source Springer Nature
subjects Business and Management
Cataracts
Clustering
Coding
Data mining
Datasets
Health services
Hospitals
Information systems
Information Systems Applications (incl.Internet)
IT in Business
Maximization
Optimization
Patient care planning
Payment systems
Preprocessing
Simulation and Modeling
Subgroups
Surgery
title Expectation–Maximization (EM) Clustering as a Preprocessing Method for Clinical Pathway Mining
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T21%3A12%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Expectation%E2%80%93Maximization%20(EM)%20Clustering%20as%20a%20Preprocessing%20Method%20for%20Clinical%20Pathway%20Mining&rft.jtitle=The%20review%20of%20socionetwork%20strategies&rft.au=Tsumoto,%20Shusaku&rft.date=2022&rft.volume=16&rft.issue=1&rft.spage=25&rft.epage=52&rft.pages=25-52&rft.issn=2523-3173&rft.eissn=1867-3236&rft_id=info:doi/10.1007/s12626-021-00100-w&rft_dat=%3Cproquest_cross%3E2654373667%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c355t-cd04a9f2a8463af00bb193efa44925544b9f4af05eceeb73d83ce0c88d0ed5373%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2654373667&rft_id=info:pmid/&rfr_iscdi=true