Loading…

Alternative Analysis Methods for Time to Event Endpoints under Non-proportional Hazards: A Comparative Analysis

The log-rank test is most powerful under proportional hazards (PH). In practice, non-PH patterns are often observed in clinical trials, such as in immuno-oncology; therefore, alternative methods are needed to restore the efficiency of statistical testing. Three categories of testing methods were eva...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2019-09
Main Authors: Lin, Ray S, Lin, Ji, Roychoudhury, Satrajit, Anderson, Keaven M, Hu, Tianle, Huang, Bo, Leon, Larry F, Liao, Jason JZ, Liu, Rong, Luo, Xiaodong, Mukhopadhyay, Pralay, Qin, Rui, Tatsuoka, Kay, Wang, Xuejing, Wang, Yang, Zhu, Jian, Tai-Tsang, Chen, Iacona, Renee, Cross-Pharma Non-proportional Hazards Working Group
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Lin, Ray S
Lin, Ji
Roychoudhury, Satrajit
Anderson, Keaven M
Hu, Tianle
Huang, Bo
Leon, Larry F
Liao, Jason JZ
Liu, Rong
Luo, Xiaodong
Mukhopadhyay, Pralay
Qin, Rui
Tatsuoka, Kay
Wang, Xuejing
Wang, Yang
Zhu, Jian
Tai-Tsang, Chen
Iacona, Renee
Cross-Pharma Non-proportional Hazards Working Group
description The log-rank test is most powerful under proportional hazards (PH). In practice, non-PH patterns are often observed in clinical trials, such as in immuno-oncology; therefore, alternative methods are needed to restore the efficiency of statistical testing. Three categories of testing methods were evaluated, including weighted log-rank tests, Kaplan-Meier curve-based tests (including weighted Kaplan-Meier and Restricted Mean Survival Time, RMST), and combination tests (including Breslow test, Lee's combo test, and MaxCombo test). Nine scenarios representing the PH and various non-PH patterns were simulated. The power, type I error, and effect estimates of each method were compared. In general, all tests control type I error well. There is not a single most powerful test across all scenarios. In the absence of prior knowledge regarding the PH or non-PH patterns, the MaxCombo test is relatively robust across patterns. Since the treatment effect changes overtime under non-PH, the overall profile of the treatment effect may not be represented comprehensively based on a single measure. Thus, multiple measures of the treatment effect should be pre-specified as sensitivity analyses to evaluate the totality of the data.
doi_str_mv 10.48550/arxiv.1909.09467
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2295838481</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2295838481</sourcerecordid><originalsourceid>FETCH-LOGICAL-a521-ef781281fc4cc2a85303d06d43d7c67171242cc26cb186926c6bd0830e46e77a3</originalsourceid><addsrcrecordid>eNpVj89LwzAAhYMgOOb-AG8Bz635ndRbKdMNpl56H1mTYkaX1CQt6l9vQS-evsPjfbwHwB1GJVOcowcdP91c4gpVJaqYkFdgRSjFhWKE3IBNSmeEEBGScE5XINRDttHr7GYLa6-Hr-QSfLH5PZgE-xBh6y4W5gC3s_UZbr0Zg_M5wckbG-Fr8MUYwxhidmGpw53-1tGkR1jDJlxGHf-rb8F1r4dkN39cg_Zp2za74vD2vG_qQ6E5wYXtpcJE4b5jXUe04hRRg4Rh1MhOSCwxYWRJRHfCSlQLxckgRZFlwkqp6Rrc_2qXbR-TTfl4DtNyc0hHQiquqGIK0x8ijluS</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2295838481</pqid></control><display><type>article</type><title>Alternative Analysis Methods for Time to Event Endpoints under Non-proportional Hazards: A Comparative Analysis</title><source>Publicly Available Content Database</source><creator>Lin, Ray S ; Lin, Ji ; Roychoudhury, Satrajit ; Anderson, Keaven M ; Hu, Tianle ; Huang, Bo ; Leon, Larry F ; Liao, Jason JZ ; Liu, Rong ; Luo, Xiaodong ; Mukhopadhyay, Pralay ; Qin, Rui ; Tatsuoka, Kay ; Wang, Xuejing ; Wang, Yang ; Zhu, Jian ; Tai-Tsang, Chen ; Iacona, Renee ; Cross-Pharma Non-proportional Hazards Working Group</creator><creatorcontrib>Lin, Ray S ; Lin, Ji ; Roychoudhury, Satrajit ; Anderson, Keaven M ; Hu, Tianle ; Huang, Bo ; Leon, Larry F ; Liao, Jason JZ ; Liu, Rong ; Luo, Xiaodong ; Mukhopadhyay, Pralay ; Qin, Rui ; Tatsuoka, Kay ; Wang, Xuejing ; Wang, Yang ; Zhu, Jian ; Tai-Tsang, Chen ; Iacona, Renee ; Cross-Pharma Non-proportional Hazards Working Group</creatorcontrib><description>The log-rank test is most powerful under proportional hazards (PH). In practice, non-PH patterns are often observed in clinical trials, such as in immuno-oncology; therefore, alternative methods are needed to restore the efficiency of statistical testing. Three categories of testing methods were evaluated, including weighted log-rank tests, Kaplan-Meier curve-based tests (including weighted Kaplan-Meier and Restricted Mean Survival Time, RMST), and combination tests (including Breslow test, Lee's combo test, and MaxCombo test). Nine scenarios representing the PH and various non-PH patterns were simulated. The power, type I error, and effect estimates of each method were compared. In general, all tests control type I error well. There is not a single most powerful test across all scenarios. In the absence of prior knowledge regarding the PH or non-PH patterns, the MaxCombo test is relatively robust across patterns. Since the treatment effect changes overtime under non-PH, the overall profile of the treatment effect may not be represented comprehensively based on a single measure. Thus, multiple measures of the treatment effect should be pre-specified as sensitivity analyses to evaluate the totality of the data.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.1909.09467</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Hazards ; Rank tests ; Sensitivity analysis</subject><ispartof>arXiv.org, 2019-09</ispartof><rights>2019. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2295838481?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,27902,36989,44566</link.rule.ids></links><search><creatorcontrib>Lin, Ray S</creatorcontrib><creatorcontrib>Lin, Ji</creatorcontrib><creatorcontrib>Roychoudhury, Satrajit</creatorcontrib><creatorcontrib>Anderson, Keaven M</creatorcontrib><creatorcontrib>Hu, Tianle</creatorcontrib><creatorcontrib>Huang, Bo</creatorcontrib><creatorcontrib>Leon, Larry F</creatorcontrib><creatorcontrib>Liao, Jason JZ</creatorcontrib><creatorcontrib>Liu, Rong</creatorcontrib><creatorcontrib>Luo, Xiaodong</creatorcontrib><creatorcontrib>Mukhopadhyay, Pralay</creatorcontrib><creatorcontrib>Qin, Rui</creatorcontrib><creatorcontrib>Tatsuoka, Kay</creatorcontrib><creatorcontrib>Wang, Xuejing</creatorcontrib><creatorcontrib>Wang, Yang</creatorcontrib><creatorcontrib>Zhu, Jian</creatorcontrib><creatorcontrib>Tai-Tsang, Chen</creatorcontrib><creatorcontrib>Iacona, Renee</creatorcontrib><creatorcontrib>Cross-Pharma Non-proportional Hazards Working Group</creatorcontrib><title>Alternative Analysis Methods for Time to Event Endpoints under Non-proportional Hazards: A Comparative Analysis</title><title>arXiv.org</title><description>The log-rank test is most powerful under proportional hazards (PH). In practice, non-PH patterns are often observed in clinical trials, such as in immuno-oncology; therefore, alternative methods are needed to restore the efficiency of statistical testing. Three categories of testing methods were evaluated, including weighted log-rank tests, Kaplan-Meier curve-based tests (including weighted Kaplan-Meier and Restricted Mean Survival Time, RMST), and combination tests (including Breslow test, Lee's combo test, and MaxCombo test). Nine scenarios representing the PH and various non-PH patterns were simulated. The power, type I error, and effect estimates of each method were compared. In general, all tests control type I error well. There is not a single most powerful test across all scenarios. In the absence of prior knowledge regarding the PH or non-PH patterns, the MaxCombo test is relatively robust across patterns. Since the treatment effect changes overtime under non-PH, the overall profile of the treatment effect may not be represented comprehensively based on a single measure. Thus, multiple measures of the treatment effect should be pre-specified as sensitivity analyses to evaluate the totality of the data.</description><subject>Hazards</subject><subject>Rank tests</subject><subject>Sensitivity analysis</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNpVj89LwzAAhYMgOOb-AG8Bz635ndRbKdMNpl56H1mTYkaX1CQt6l9vQS-evsPjfbwHwB1GJVOcowcdP91c4gpVJaqYkFdgRSjFhWKE3IBNSmeEEBGScE5XINRDttHr7GYLa6-Hr-QSfLH5PZgE-xBh6y4W5gC3s_UZbr0Zg_M5wckbG-Fr8MUYwxhidmGpw53-1tGkR1jDJlxGHf-rb8F1r4dkN39cg_Zp2za74vD2vG_qQ6E5wYXtpcJE4b5jXUe04hRRg4Rh1MhOSCwxYWRJRHfCSlQLxckgRZFlwkqp6Rrc_2qXbR-TTfl4DtNyc0hHQiquqGIK0x8ijluS</recordid><startdate>20190920</startdate><enddate>20190920</enddate><creator>Lin, Ray S</creator><creator>Lin, Ji</creator><creator>Roychoudhury, Satrajit</creator><creator>Anderson, Keaven M</creator><creator>Hu, Tianle</creator><creator>Huang, Bo</creator><creator>Leon, Larry F</creator><creator>Liao, Jason JZ</creator><creator>Liu, Rong</creator><creator>Luo, Xiaodong</creator><creator>Mukhopadhyay, Pralay</creator><creator>Qin, Rui</creator><creator>Tatsuoka, Kay</creator><creator>Wang, Xuejing</creator><creator>Wang, Yang</creator><creator>Zhu, Jian</creator><creator>Tai-Tsang, Chen</creator><creator>Iacona, Renee</creator><creator>Cross-Pharma Non-proportional Hazards Working Group</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20190920</creationdate><title>Alternative Analysis Methods for Time to Event Endpoints under Non-proportional Hazards: A Comparative Analysis</title><author>Lin, Ray S ; Lin, Ji ; Roychoudhury, Satrajit ; Anderson, Keaven M ; Hu, Tianle ; Huang, Bo ; Leon, Larry F ; Liao, Jason JZ ; Liu, Rong ; Luo, Xiaodong ; Mukhopadhyay, Pralay ; Qin, Rui ; Tatsuoka, Kay ; Wang, Xuejing ; Wang, Yang ; Zhu, Jian ; Tai-Tsang, Chen ; Iacona, Renee ; Cross-Pharma Non-proportional Hazards Working Group</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a521-ef781281fc4cc2a85303d06d43d7c67171242cc26cb186926c6bd0830e46e77a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Hazards</topic><topic>Rank tests</topic><topic>Sensitivity analysis</topic><toplevel>online_resources</toplevel><creatorcontrib>Lin, Ray S</creatorcontrib><creatorcontrib>Lin, Ji</creatorcontrib><creatorcontrib>Roychoudhury, Satrajit</creatorcontrib><creatorcontrib>Anderson, Keaven M</creatorcontrib><creatorcontrib>Hu, Tianle</creatorcontrib><creatorcontrib>Huang, Bo</creatorcontrib><creatorcontrib>Leon, Larry F</creatorcontrib><creatorcontrib>Liao, Jason JZ</creatorcontrib><creatorcontrib>Liu, Rong</creatorcontrib><creatorcontrib>Luo, Xiaodong</creatorcontrib><creatorcontrib>Mukhopadhyay, Pralay</creatorcontrib><creatorcontrib>Qin, Rui</creatorcontrib><creatorcontrib>Tatsuoka, Kay</creatorcontrib><creatorcontrib>Wang, Xuejing</creatorcontrib><creatorcontrib>Wang, Yang</creatorcontrib><creatorcontrib>Zhu, Jian</creatorcontrib><creatorcontrib>Tai-Tsang, Chen</creatorcontrib><creatorcontrib>Iacona, Renee</creatorcontrib><creatorcontrib>Cross-Pharma Non-proportional Hazards Working Group</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lin, Ray S</au><au>Lin, Ji</au><au>Roychoudhury, Satrajit</au><au>Anderson, Keaven M</au><au>Hu, Tianle</au><au>Huang, Bo</au><au>Leon, Larry F</au><au>Liao, Jason JZ</au><au>Liu, Rong</au><au>Luo, Xiaodong</au><au>Mukhopadhyay, Pralay</au><au>Qin, Rui</au><au>Tatsuoka, Kay</au><au>Wang, Xuejing</au><au>Wang, Yang</au><au>Zhu, Jian</au><au>Tai-Tsang, Chen</au><au>Iacona, Renee</au><au>Cross-Pharma Non-proportional Hazards Working Group</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Alternative Analysis Methods for Time to Event Endpoints under Non-proportional Hazards: A Comparative Analysis</atitle><jtitle>arXiv.org</jtitle><date>2019-09-20</date><risdate>2019</risdate><eissn>2331-8422</eissn><abstract>The log-rank test is most powerful under proportional hazards (PH). In practice, non-PH patterns are often observed in clinical trials, such as in immuno-oncology; therefore, alternative methods are needed to restore the efficiency of statistical testing. Three categories of testing methods were evaluated, including weighted log-rank tests, Kaplan-Meier curve-based tests (including weighted Kaplan-Meier and Restricted Mean Survival Time, RMST), and combination tests (including Breslow test, Lee's combo test, and MaxCombo test). Nine scenarios representing the PH and various non-PH patterns were simulated. The power, type I error, and effect estimates of each method were compared. In general, all tests control type I error well. There is not a single most powerful test across all scenarios. In the absence of prior knowledge regarding the PH or non-PH patterns, the MaxCombo test is relatively robust across patterns. Since the treatment effect changes overtime under non-PH, the overall profile of the treatment effect may not be represented comprehensively based on a single measure. Thus, multiple measures of the treatment effect should be pre-specified as sensitivity analyses to evaluate the totality of the data.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.1909.09467</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2019-09
issn 2331-8422
language eng
recordid cdi_proquest_journals_2295838481
source Publicly Available Content Database
subjects Hazards
Rank tests
Sensitivity analysis
title Alternative Analysis Methods for Time to Event Endpoints under Non-proportional Hazards: A Comparative Analysis
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T12%3A37%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Alternative%20Analysis%20Methods%20for%20Time%20to%20Event%20Endpoints%20under%20Non-proportional%20Hazards:%20A%20Comparative%20Analysis&rft.jtitle=arXiv.org&rft.au=Lin,%20Ray%20S&rft.date=2019-09-20&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.1909.09467&rft_dat=%3Cproquest%3E2295838481%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-a521-ef781281fc4cc2a85303d06d43d7c67171242cc26cb186926c6bd0830e46e77a3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2295838481&rft_id=info:pmid/&rfr_iscdi=true