
Infinite Hidden Markov Models for Multiple Multivariate Time Series with Missing Data

Exposure to air pollution is associated with increased morbidity and mortality. Recent technological advancements permit the collection of time-resolved personal exposure data. Such data are often incomplete with missing observations and exposures below the limit of detection, which limit their use...


Bibliographic Details
Published in: arXiv.org 2022-04
Main Authors: Hoskovec, Lauren, Koslovsky, Matthew D, Koehler, Kirsten, Good, Nicholas, Peel, Jennifer L, Volckens, John, Wilson, Ander
Format: Article
Language: English
container_title arXiv.org
creator Hoskovec, Lauren
Koslovsky, Matthew D
Koehler, Kirsten
Good, Nicholas
Peel, Jennifer L
Volckens, John
Wilson, Ander
description Exposure to air pollution is associated with increased morbidity and mortality. Recent technological advancements permit the collection of time-resolved personal exposure data. Such data are often incomplete with missing observations and exposures below the limit of detection, which limit their use in health effects studies. In this paper we develop an infinite hidden Markov model for multiple asynchronous multivariate time series with missing data. Our model is designed to include covariates that can inform transitions among hidden states. We implement beam sampling, a combination of slice sampling and dynamic programming, to sample the hidden states, and a Bayesian multiple imputation algorithm to impute missing data. In simulation studies, our model excels in estimating hidden states and state-specific means and imputing observations that are missing at random or below the limit of detection. We validate our imputation approach on data from the Fort Collins Commuter Study. We show that the estimated hidden states improve imputations for data that are missing at random compared to existing approaches. In a case study of the Fort Collins Commuter Study, we describe the inferential gains obtained from our model including improved imputation of missing data and the ability to identify shared patterns in activity and exposure among repeated sampling days for individuals and among distinct individuals.
format article
date 2022-04-13
oa free_for_read
publisher Ithaca: Cornell University Library, arXiv.org
rights 2022. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
identifier EISSN: 2331-8422
ispartof arXiv.org, 2022-04
issn 2331-8422
language eng
recordid cdi_proquest_journals_2650321916
source Publicly Available Content Database
subjects Algorithms
Dynamic programming
Exposure
Markov chains
Missing data
Multivariate analysis
Sampling
Time series
title Infinite Hidden Markov Models for Multiple Multivariate Time Series with Missing Data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T12%3A23%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Infinite%20Hidden%20Markov%20Models%20for%20Multiple%20Multivariate%20Time%20Series%20with%20Missing%20Data&rft.jtitle=arXiv.org&rft.au=Hoskovec,%20Lauren&rft.date=2022-04-13&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2650321916%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_26503219163%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2650321916&rft_id=info:pmid/&rfr_iscdi=true