Loading…

Pooling Biospecimens for Efficient Exposure Assessment When Using Case-Cohort Analysis in Cohort Studies

Large prospective cohort studies have been fruitful for identifying exposure-disease associations. In a cohort where biospecimens (e.g., blood, urine) were collected at enrollment, analysts can exploit a case-cohort approach: Biospecimens from a random sample of cohort participants, called the "...

Full description

Saved in:
Bibliographic Details
Published in:Environmental health perspectives 2024-12, Vol.132 (12), p.127004
Main Authors: Shi, Min, Umbach, David M, Weinberg, Clarice R
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c1450-d18d308b2bff0d1e966818b632bcce97e6a0e407c4c9a4316e8d0af4355e789d3
container_end_page
container_issue 12
container_start_page 127004
container_title Environmental health perspectives
container_volume 132
creator Shi, Min
Umbach, David M
Weinberg, Clarice R
description Large prospective cohort studies have been fruitful for identifying exposure-disease associations. In a cohort where biospecimens (e.g., blood, urine) were collected at enrollment, analysts can exploit a case-cohort approach: Biospecimens from a random sample of cohort participants, called the "subcohort," plus a sample of incident cases that were not part of the subcohort are assayed. Reusing subcohort data for multiple disease outcomes can reduce costs and conserve specimen archives. Pooling biospecimen samples before assay could both save money and reduce depletion of the archive but has not been studied for cohort studies. We develop and evaluate a biospecimen pooling strategy for case-cohort analyses that relate an exposure to risk of a rare disease. Our approach involves constructing pooling sets for cases not in the subcohort after grouping them according to time of diagnosis (e.g., age). In contrast, members of the subcohort are grouped by age at entry before constructing pooling sets. The analyst then fits a logistic regression model that jointly stratifies by age at risk and pooling set size and adjusts for confounders. We used simulations (288 sampling scenarios with 1,000 simulated datasets each) to evaluate the performance of this approach for several sizes of pooling sets and illustrated its application to environmental epidemiologic studies by reanalyzing Sister Study data. Parameter estimates were nearly unbiased, and 95% confidence intervals constructed using a bootstrap estimate of the standard error performed well. In statistical tests also based on the bootstrap standard error, pooling up to 8 specimens per pool caused only modest loss of power. Assigning more cohort members to the subcohort and commensurately increasing the number of specimens per pool improved power and precision substantially while reducing the number of assays. When using case-cohort analysis to study disease outcomes in relation to exposures assessed using biospecimens in a cohort study, epidemiologists should consider biospecimen pooling as a way to improve statistical power, conserve irreplaceable archives, and save money. https://doi.org/10.1289/EHP14476.
doi_str_mv 10.1289/EHP14476
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_11668240</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3148840284</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1450-d18d308b2bff0d1e966818b632bcce97e6a0e407c4c9a4316e8d0af4355e789d3</originalsourceid><addsrcrecordid>eNpVUV1LwzAUDaLo_AB_geTRl2rSpmnyJHNMJwwUVHwMaXrrIm0zc1fRf--G29SnC-eeLziEnHJ2wVOlL8eTBy5EIXfIgOd5mmidil0yYEzzRBYyPyCHiG-MMa6k3CcHmS64yoUckNlDCI3vXum1DzgH51vokNYh0nFde-ehW9Dx5zxgH4EOEQGxXWEvM-joM66UI4uQjMIsxAUddrb5Qo_Ud3QNPS76ygMek73aNggn63tEnm_GT6NJMr2_vRsNp4njImdJxVWVMVWmZV2zioOWUnFVyiwtnQNdgLQMBCuccNqKjEtQFbO1yPIcCqWr7Ihc_fjO-7KFyi3LRtuYefStjV8mWG_-fzo_M6_hw3C-jEoFWzqcrx1ieO8BF6b16KBpbAehR5NxoZRgqRK_VBcDYoR6m8OZWS1jNsssqWd_e22Jmymyb0KQipI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3148840284</pqid></control><display><type>article</type><title>Pooling Biospecimens for Efficient Exposure Assessment When Using Case-Cohort Analysis in Cohort Studies</title><source>GreenFILE</source><source>Publicly Available Content Database</source><source>ABI/INFORM Global</source><source>PubMed Central</source><creator>Shi, Min ; Umbach, David M ; Weinberg, Clarice R</creator><creatorcontrib>Shi, Min ; Umbach, David M ; Weinberg, Clarice R</creatorcontrib><description>Large prospective cohort studies have been fruitful for identifying exposure-disease associations. In a cohort where biospecimens (e.g., blood, urine) were collected at enrollment, analysts can exploit a case-cohort approach: Biospecimens from a random sample of cohort participants, called the "subcohort," plus a sample of incident cases that were not part of the subcohort are assayed. Reusing subcohort data for multiple disease outcomes can reduce costs and conserve specimen archives. Pooling biospecimen samples before assay could both save money and reduce depletion of the archive but has not been studied for cohort studies. We develop and evaluate a biospecimen pooling strategy for case-cohort analyses that relate an exposure to risk of a rare disease. Our approach involves constructing pooling sets for cases not in the subcohort after grouping them according to time of diagnosis (e.g., age). In contrast, members of the subcohort are grouped by age at entry before constructing pooling sets. The analyst then fits a logistic regression model that jointly stratifies by age at risk and pooling set size and adjusts for confounders. We used simulations (288 sampling scenarios with 1,000 simulated datasets each) to evaluate the performance of this approach for several sizes of pooling sets and illustrated its application to environmental epidemiologic studies by reanalyzing Sister Study data. Parameter estimates were nearly unbiased, and 95% confidence intervals constructed using a bootstrap estimate of the standard error performed well. In statistical tests also based on the bootstrap standard error, pooling up to 8 specimens per pool caused only modest loss of power. Assigning more cohort members to the subcohort and commensurately increasing the number of specimens per pool improved power and precision substantially while reducing the number of assays. When using case-cohort analysis to study disease outcomes in relation to exposures assessed using biospecimens in a cohort study, epidemiologists should consider biospecimen pooling as a way to improve statistical power, conserve irreplaceable archives, and save money. https://doi.org/10.1289/EHP14476.</description><identifier>ISSN: 0091-6765</identifier><identifier>ISSN: 1552-9924</identifier><identifier>EISSN: 1552-9924</identifier><identifier>DOI: 10.1289/EHP14476</identifier><identifier>PMID: 39718546</identifier><language>eng</language><publisher>United States: Environmental Health Perspectives</publisher><subject>Cohort Studies ; Environmental Exposure - statistics &amp; numerical data ; Humans ; Specimen Handling - methods</subject><ispartof>Environmental health perspectives, 2024-12, Vol.132 (12), p.127004</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1450-d18d308b2bff0d1e966818b632bcce97e6a0e407c4c9a4316e8d0af4355e789d3</cites><orcidid>0000-0002-7713-8556 ; 0000-0002-7142-2812 ; 0000-0001-8265-5307</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC11668240/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC11668240/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,27923,27924,36060,37012,53790,53792</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/39718546$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Shi, Min</creatorcontrib><creatorcontrib>Umbach, David M</creatorcontrib><creatorcontrib>Weinberg, Clarice R</creatorcontrib><title>Pooling Biospecimens for Efficient Exposure Assessment When Using Case-Cohort Analysis in Cohort Studies</title><title>Environmental health perspectives</title><addtitle>Environ Health Perspect</addtitle><description>Large prospective cohort studies have been fruitful for identifying exposure-disease associations. In a cohort where biospecimens (e.g., blood, urine) were collected at enrollment, analysts can exploit a case-cohort approach: Biospecimens from a random sample of cohort participants, called the "subcohort," plus a sample of incident cases that were not part of the subcohort are assayed. Reusing subcohort data for multiple disease outcomes can reduce costs and conserve specimen archives. Pooling biospecimen samples before assay could both save money and reduce depletion of the archive but has not been studied for cohort studies. We develop and evaluate a biospecimen pooling strategy for case-cohort analyses that relate an exposure to risk of a rare disease. Our approach involves constructing pooling sets for cases not in the subcohort after grouping them according to time of diagnosis (e.g., age). In contrast, members of the subcohort are grouped by age at entry before constructing pooling sets. The analyst then fits a logistic regression model that jointly stratifies by age at risk and pooling set size and adjusts for confounders. We used simulations (288 sampling scenarios with 1,000 simulated datasets each) to evaluate the performance of this approach for several sizes of pooling sets and illustrated its application to environmental epidemiologic studies by reanalyzing Sister Study data. Parameter estimates were nearly unbiased, and 95% confidence intervals constructed using a bootstrap estimate of the standard error performed well. In statistical tests also based on the bootstrap standard error, pooling up to 8 specimens per pool caused only modest loss of power. Assigning more cohort members to the subcohort and commensurately increasing the number of specimens per pool improved power and precision substantially while reducing the number of assays. When using case-cohort analysis to study disease outcomes in relation to exposures assessed using biospecimens in a cohort study, epidemiologists should consider biospecimen pooling as a way to improve statistical power, conserve irreplaceable archives, and save money. https://doi.org/10.1289/EHP14476.</description><subject>Cohort Studies</subject><subject>Environmental Exposure - statistics &amp; numerical data</subject><subject>Humans</subject><subject>Specimen Handling - methods</subject><issn>0091-6765</issn><issn>1552-9924</issn><issn>1552-9924</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpVUV1LwzAUDaLo_AB_geTRl2rSpmnyJHNMJwwUVHwMaXrrIm0zc1fRf--G29SnC-eeLziEnHJ2wVOlL8eTBy5EIXfIgOd5mmidil0yYEzzRBYyPyCHiG-MMa6k3CcHmS64yoUckNlDCI3vXum1DzgH51vokNYh0nFde-ehW9Dx5zxgH4EOEQGxXWEvM-joM66UI4uQjMIsxAUddrb5Qo_Ud3QNPS76ygMek73aNggn63tEnm_GT6NJMr2_vRsNp4njImdJxVWVMVWmZV2zioOWUnFVyiwtnQNdgLQMBCuccNqKjEtQFbO1yPIcCqWr7Ihc_fjO-7KFyi3LRtuYefStjV8mWG_-fzo_M6_hw3C-jEoFWzqcrx1ieO8BF6b16KBpbAehR5NxoZRgqRK_VBcDYoR6m8OZWS1jNsssqWd_e22Jmymyb0KQipI</recordid><startdate>202412</startdate><enddate>202412</enddate><creator>Shi, Min</creator><creator>Umbach, David M</creator><creator>Weinberg, Clarice R</creator><general>Environmental Health Perspectives</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-7713-8556</orcidid><orcidid>https://orcid.org/0000-0002-7142-2812</orcidid><orcidid>https://orcid.org/0000-0001-8265-5307</orcidid></search><sort><creationdate>202412</creationdate><title>Pooling Biospecimens for Efficient Exposure Assessment When Using Case-Cohort Analysis in Cohort Studies</title><author>Shi, Min ; Umbach, David M ; Weinberg, Clarice R</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1450-d18d308b2bff0d1e966818b632bcce97e6a0e407c4c9a4316e8d0af4355e789d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Cohort Studies</topic><topic>Environmental Exposure - statistics &amp; numerical data</topic><topic>Humans</topic><topic>Specimen Handling - methods</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shi, Min</creatorcontrib><creatorcontrib>Umbach, David M</creatorcontrib><creatorcontrib>Weinberg, Clarice R</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Environmental health perspectives</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shi, Min</au><au>Umbach, David M</au><au>Weinberg, Clarice R</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Pooling Biospecimens for Efficient Exposure Assessment When Using Case-Cohort Analysis in Cohort Studies</atitle><jtitle>Environmental health perspectives</jtitle><addtitle>Environ Health Perspect</addtitle><date>2024-12</date><risdate>2024</risdate><volume>132</volume><issue>12</issue><spage>127004</spage><pages>127004-</pages><issn>0091-6765</issn><issn>1552-9924</issn><eissn>1552-9924</eissn><abstract>Large prospective cohort studies have been fruitful for identifying exposure-disease associations. In a cohort where biospecimens (e.g., blood, urine) were collected at enrollment, analysts can exploit a case-cohort approach: Biospecimens from a random sample of cohort participants, called the "subcohort," plus a sample of incident cases that were not part of the subcohort are assayed. Reusing subcohort data for multiple disease outcomes can reduce costs and conserve specimen archives. Pooling biospecimen samples before assay could both save money and reduce depletion of the archive but has not been studied for cohort studies. We develop and evaluate a biospecimen pooling strategy for case-cohort analyses that relate an exposure to risk of a rare disease. Our approach involves constructing pooling sets for cases not in the subcohort after grouping them according to time of diagnosis (e.g., age). In contrast, members of the subcohort are grouped by age at entry before constructing pooling sets. The analyst then fits a logistic regression model that jointly stratifies by age at risk and pooling set size and adjusts for confounders. We used simulations (288 sampling scenarios with 1,000 simulated datasets each) to evaluate the performance of this approach for several sizes of pooling sets and illustrated its application to environmental epidemiologic studies by reanalyzing Sister Study data. Parameter estimates were nearly unbiased, and 95% confidence intervals constructed using a bootstrap estimate of the standard error performed well. In statistical tests also based on the bootstrap standard error, pooling up to 8 specimens per pool caused only modest loss of power. Assigning more cohort members to the subcohort and commensurately increasing the number of specimens per pool improved power and precision substantially while reducing the number of assays. When using case-cohort analysis to study disease outcomes in relation to exposures assessed using biospecimens in a cohort study, epidemiologists should consider biospecimen pooling as a way to improve statistical power, conserve irreplaceable archives, and save money. https://doi.org/10.1289/EHP14476.</abstract><cop>United States</cop><pub>Environmental Health Perspectives</pub><pmid>39718546</pmid><doi>10.1289/EHP14476</doi><orcidid>https://orcid.org/0000-0002-7713-8556</orcidid><orcidid>https://orcid.org/0000-0002-7142-2812</orcidid><orcidid>https://orcid.org/0000-0001-8265-5307</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0091-6765
ispartof Environmental health perspectives, 2024-12, Vol.132 (12), p.127004
issn 0091-6765
1552-9924
1552-9924
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_11668240
source GreenFILE; Publicly Available Content Database; ABI/INFORM Global; PubMed Central
subjects Cohort Studies
Environmental Exposure - statistics & numerical data
Humans
Specimen Handling - methods
title Pooling Biospecimens for Efficient Exposure Assessment When Using Case-Cohort Analysis in Cohort Studies
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T20%3A32%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Pooling%20Biospecimens%20for%20Efficient%20Exposure%20Assessment%20When%20Using%20Case-Cohort%20Analysis%20in%20Cohort%20Studies&rft.jtitle=Environmental%20health%20perspectives&rft.au=Shi,%20Min&rft.date=2024-12&rft.volume=132&rft.issue=12&rft.spage=127004&rft.pages=127004-&rft.issn=0091-6765&rft.eissn=1552-9924&rft_id=info:doi/10.1289/EHP14476&rft_dat=%3Cproquest_pubme%3E3148840284%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1450-d18d308b2bff0d1e966818b632bcce97e6a0e407c4c9a4316e8d0af4355e789d3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3148840284&rft_id=info:pmid/39718546&rfr_iscdi=true