
Low-Rank Sparse Feature Selection for Patient Similarity Learning

Comparing and identifying similar patients is a fundamental task in medical domains: an efficient technique can, for example, help doctors to track patient cohorts, compare the effectiveness of treatments, or predict medical outcomes. The goal of patient similarity learning is to derive a clinicall...

Bibliographic Details
Main Authors: Mengting Zhan, Shilei Cao, Buyue Qian, Shiyu Chang, Jishang Wei
Format: Conference Proceeding
Language: English
container_end_page 1340
container_start_page 1335
creator Mengting Zhan
Shilei Cao
Buyue Qian
Shiyu Chang
Jishang Wei
description Comparing and identifying similar patients is a fundamental task in medical domains: an efficient technique can, for example, help doctors to track patient cohorts, compare the effectiveness of treatments, or predict medical outcomes. The goal of patient similarity learning is to derive a clinically meaningful measure of the similarity among patients, who are represented by their key clinical indicators. However, learning such a similarity is challenging, because medical data are usually high-dimensional, heterogeneous, and complex. In addition, a desirable patient similarity depends on the particular clinical setting, which implies that a supervised learning scheme is more useful in medical domains. To address these challenges, in this paper we present a novel similarity learning approach formulated as a generalized Mahalanobis similarity function with pairwise constraints. Because some features are always non-discriminative or contain redundant information, we encode a low-rank structure in our similarity function to perform feature selection. We evaluate the proposed model on both UCI benchmarks and a real clinical dataset for several medical tasks, including patient retrieval, classification, and cohort discovery. The results show that our similarity model significantly outperforms many state-of-the-art baselines and is effective at removing noisy or redundant features.
doi_str_mv 10.1109/ICDM.2016.0182
format conference_proceeding
eisbn 9781509054732
1509054731
coden IEEPAD
publisher IEEE
tpages 6
identifier EISSN: 2374-8486
ispartof 2016 IEEE 16th International Conference on Data Mining (ICDM), 2016, p.1335-1340
issn 2374-8486
language eng
recordid cdi_ieee_primary_7837995
source IEEE Xplore All Conference Series
subjects Algorithm design and analysis
Electronic mail
Linear programming
Measurement
Medical diagnostic imaging
Medical services
Sparse matrices
title Low-Rank Sparse Feature Selection for Patient Similarity Learning
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T10%3A06%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Low-Rank%20Sparse%20Feature%20Selection%20for%20Patient%20Similarity%20Learning&rft.btitle=2016%20IEEE%2016th%20International%20Conference%20on%20Data%20Mining%20(ICDM)&rft.au=Mengting%20Zhan&rft.date=2016-12&rft.spage=1335&rft.epage=1340&rft.pages=1335-1340&rft.eissn=2374-8486&rft.coden=IEEPAD&rft_id=info:doi/10.1109/ICDM.2016.0182&rft.eisbn=9781509054732&rft.eisbn_list=1509054731&rft_dat=%3Cieee_CHZPO%3E7837995%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c655-81602b5bcb67f29f3fcc6b93b6fed15ab3465f30f281dc7034ba72f8e1aed4323%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=7837995&rfr_iscdi=true