Loading…
Learning to rank for censored survival data
Survival analysis is a type of semi-supervised ranking task where the target output (the survival time) is often right-censored. Utilizing this information is a challenge because it is not obvious how to correctly incorporate these censored examples into a model. We study how three categories of los...
Saved in:
Published in: | arXiv.org 2018-06 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Luck, Margaux Sylvain, Tristan Cohen, Joseph Paul Cardinal, Heloise Lodi, Andrea Bengio, Yoshua |
description | Survival analysis is a type of semi-supervised ranking task where the target output (the survival time) is often right-censored. Utilizing this information is a challenge because it is not obvious how to correctly incorporate these censored examples into a model. We study how three categories of loss functions, namely partial likelihood methods, rank methods, and our classification method based on a Wasserstein metric (WM) and the non-parametric Kaplan Meier estimate of the probability density to impute the labels of censored examples, can take advantage of this information. The proposed method allows us to have a model that predict the probability distribution of an event. If a clinician had access to the detailed probability of an event over time this would help in treatment planning. For example, determining if the risk of kidney graft rejection is constant or peaked after some time. Also, we demonstrate that this approach directly optimizes the expected C-index which is the most common evaluation metric for ranking survival models. |
format | article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2073368080</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2073368080</sourcerecordid><originalsourceid>FETCH-proquest_journals_20733680803</originalsourceid><addsrcrecordid>eNpjYuA0MjY21LUwMTLiYOAtLs4yMDAwMjM3MjU15mTQ9klNLMrLzEtXKMlXKErMy1ZIyy9SSE7NK84vSk1RKC4tKsssS8xRSEksSeRhYE1LzClO5YXS3AzKbq4hzh66BUX5haWpxSXxWfmlRXlAqXgjA3NjYzMLAwsDY-JUAQBFSzEp</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2073368080</pqid></control><display><type>article</type><title>Learning to rank for censored survival data</title><source>ProQuest - Publicly Available Content Database</source><creator>Luck, Margaux ; Sylvain, Tristan ; Cohen, Joseph Paul ; Cardinal, Heloise ; Lodi, Andrea ; Bengio, Yoshua</creator><creatorcontrib>Luck, Margaux ; Sylvain, Tristan ; Cohen, Joseph Paul ; Cardinal, Heloise ; Lodi, Andrea ; Bengio, Yoshua</creatorcontrib><description>Survival analysis is a type of semi-supervised ranking task where the target output (the survival time) is often right-censored. Utilizing this information is a challenge because it is not obvious how to correctly incorporate these censored examples into a model. We study how three categories of loss functions, namely partial likelihood methods, rank methods, and our classification method based on a Wasserstein metric (WM) and the non-parametric Kaplan Meier estimate of the probability density to impute the labels of censored examples, can take advantage of this information. The proposed method allows us to have a model that predict the probability distribution of an event. If a clinician had access to the detailed probability of an event over time this would help in treatment planning. For example, determining if the risk of kidney graft rejection is constant or peaked after some time. Also, we demonstrate that this approach directly optimizes the expected C-index which is the most common evaluation metric for ranking survival models.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Ranking ; Survival ; Survival analysis</subject><ispartof>arXiv.org, 2018-06</ispartof><rights>2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2073368080?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,37012,44590</link.rule.ids></links><search><creatorcontrib>Luck, Margaux</creatorcontrib><creatorcontrib>Sylvain, Tristan</creatorcontrib><creatorcontrib>Cohen, Joseph Paul</creatorcontrib><creatorcontrib>Cardinal, Heloise</creatorcontrib><creatorcontrib>Lodi, Andrea</creatorcontrib><creatorcontrib>Bengio, Yoshua</creatorcontrib><title>Learning to rank for censored survival data</title><title>arXiv.org</title><description>Survival analysis is a type of semi-supervised ranking task where the target output (the survival time) is often right-censored. Utilizing this information is a challenge because it is not obvious how to correctly incorporate these censored examples into a model. We study how three categories of loss functions, namely partial likelihood methods, rank methods, and our classification method based on a Wasserstein metric (WM) and the non-parametric Kaplan Meier estimate of the probability density to impute the labels of censored examples, can take advantage of this information. The proposed method allows us to have a model that predict the probability distribution of an event. If a clinician had access to the detailed probability of an event over time this would help in treatment planning. For example, determining if the risk of kidney graft rejection is constant or peaked after some time. Also, we demonstrate that this approach directly optimizes the expected C-index which is the most common evaluation metric for ranking survival models.</description><subject>Ranking</subject><subject>Survival</subject><subject>Survival analysis</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNpjYuA0MjY21LUwMTLiYOAtLs4yMDAwMjM3MjU15mTQ9klNLMrLzEtXKMlXKErMy1ZIyy9SSE7NK84vSk1RKC4tKsssS8xRSEksSeRhYE1LzClO5YXS3AzKbq4hzh66BUX5haWpxSXxWfmlRXlAqXgjA3NjYzMLAwsDY-JUAQBFSzEp</recordid><startdate>20180608</startdate><enddate>20180608</enddate><creator>Luck, Margaux</creator><creator>Sylvain, Tristan</creator><creator>Cohen, Joseph Paul</creator><creator>Cardinal, Heloise</creator><creator>Lodi, Andrea</creator><creator>Bengio, Yoshua</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20180608</creationdate><title>Learning to rank for censored survival data</title><author>Luck, Margaux ; Sylvain, Tristan ; Cohen, Joseph Paul ; Cardinal, Heloise ; Lodi, Andrea ; Bengio, Yoshua</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20733680803</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Ranking</topic><topic>Survival</topic><topic>Survival analysis</topic><toplevel>online_resources</toplevel><creatorcontrib>Luck, Margaux</creatorcontrib><creatorcontrib>Sylvain, Tristan</creatorcontrib><creatorcontrib>Cohen, Joseph Paul</creatorcontrib><creatorcontrib>Cardinal, Heloise</creatorcontrib><creatorcontrib>Lodi, Andrea</creatorcontrib><creatorcontrib>Bengio, Yoshua</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>ProQuest - Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Luck, Margaux</au><au>Sylvain, Tristan</au><au>Cohen, Joseph Paul</au><au>Cardinal, Heloise</au><au>Lodi, Andrea</au><au>Bengio, Yoshua</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Learning to rank for censored survival data</atitle><jtitle>arXiv.org</jtitle><date>2018-06-08</date><risdate>2018</risdate><eissn>2331-8422</eissn><abstract>Survival analysis is a type of semi-supervised ranking task where the target output (the survival time) is often right-censored. Utilizing this information is a challenge because it is not obvious how to correctly incorporate these censored examples into a model. We study how three categories of loss functions, namely partial likelihood methods, rank methods, and our classification method based on a Wasserstein metric (WM) and the non-parametric Kaplan Meier estimate of the probability density to impute the labels of censored examples, can take advantage of this information. The proposed method allows us to have a model that predict the probability distribution of an event. If a clinician had access to the detailed probability of an event over time this would help in treatment planning. For example, determining if the risk of kidney graft rejection is constant or peaked after some time. Also, we demonstrate that this approach directly optimizes the expected C-index which is the most common evaluation metric for ranking survival models.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2018-06 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2073368080 |
source | ProQuest - Publicly Available Content Database |
subjects | Ranking Survival Survival analysis |
title | Learning to rank for censored survival data |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T09%3A05%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Learning%20to%20rank%20for%20censored%20survival%20data&rft.jtitle=arXiv.org&rft.au=Luck,%20Margaux&rft.date=2018-06-08&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2073368080%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_20733680803%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2073368080&rft_id=info:pmid/&rfr_iscdi=true |