Loading…

Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels

Abstract Motivation The abundance of omics data has facilitated integrative analyses of single and multiple molecular layers with genome-wide association studies focusing on common variants. Built on its successes, we propose a general analysis framework to leverage multi-omics data with sequencing...

Full description

Saved in:
Bibliographic Details
Published in:Bioinformatics 2021-01, Vol.36 (21), p.5223-5228
Main Authors: Yang, Tianzhong, Wei, Peng, Pan, Wei
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933
cites cdi_FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933
container_end_page 5228
container_issue 21
container_start_page 5223
container_title Bioinformatics
container_volume 36
creator Yang, Tianzhong
Wei, Peng
Pan, Wei
description Abstract Motivation The abundance of omics data has facilitated integrative analyses of single and multiple molecular layers with genome-wide association studies focusing on common variants. Built on its successes, we propose a general analysis framework to leverage multi-omics data with sequencing data to improve the statistical power of discovering new associations and understanding of the disease susceptibility due to low-frequency variants. The proposed test features its robustness to model misspecification, high power across a wide range of scenarios and the potential of offering insights into the underlying genetic architecture and disease mechanisms. Results Using the Framingham Heart Study data, we show that low-frequency variants are predictive of DNA methylation, even after conditioning on the nearby common variants. In addition, DNA methylation and gene expression provide complementary information to functional genomics. In the Avon Longitudinal Study of Parents and Children with a sample size of 1497, one gene CLPTM1 is identified to be associated with low-density lipoprotein cholesterol levels by the proposed powerful adaptive gene-based test integrating information from gene expression, methylation and enhancer–promoter interactions. It is further replicated in the TwinsUK study with 1706 samples. The signal is driven by both low-frequency and common variants. Availability and implementation Models are available at https://github.com/ytzhong/DNAm. Supplementary information Supplementary data are available at Bioinformatics online.
doi_str_mv 10.1093/bioinformatics/btaa898
format article
fullrecord <record><control><sourceid>proquest_TOX</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7850048</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/bioinformatics/btaa898</oup_id><sourcerecordid>2452099344</sourcerecordid><originalsourceid>FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933</originalsourceid><addsrcrecordid>eNqNkcFuFSEUhidGY2v1FRqWbsbCwMzAxsQ0aps0caNrcmY4cy-GgRGYae4r9KlF721jd64g4TvfD_xVdcnoB0YVvxpssH4KcYZsx3Q1ZACp5IvqnImO1g1t1cuy511fC0n5WfUmpZ-UtkwI8bo645z2lMnmvHq49Rl3sVg2JODBHZJNJExkXl22dZiLnRjIQEoYMTaNYcNo_Y64cF9PEX-t6McD2SBa8DkRSCmMFjIacm_z_i9m0CebD8TZJSwxZLSejPvgMGWMwRGHG7r0tno1gUv47rReVD--fP5-fVPffft6e_3prh5F2-W6aZWSZpqgYUOr-NDLzkw4mEnKkcrWUMG6HqCRYDrToJSKdYJLBQoAlOL8ovp49C7rMKMZ0ecITi_RzhAPOoDVz0-83etd2HQvW0qFLIL3J0EM5fkp67n8CzoHHsOadCPahpYkIQraHdExhpQiTk8xjOo_RernRepTkWXw8t9LPo09NlcAdgTCuvyv9Dfgj7fN</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2452099344</pqid></control><display><type>article</type><title>Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels</title><source>Open Access: Oxford University Press Open Journals</source><creator>Yang, Tianzhong ; Wei, Peng ; Pan, Wei</creator><contributor>Schwartz, Russell</contributor><creatorcontrib>Yang, Tianzhong ; Wei, Peng ; Pan, Wei ; Schwartz, Russell</creatorcontrib><description>Abstract Motivation The abundance of omics data has facilitated integrative analyses of single and multiple molecular layers with genome-wide association studies focusing on common variants. Built on its successes, we propose a general analysis framework to leverage multi-omics data with sequencing data to improve the statistical power of discovering new associations and understanding of the disease susceptibility due to low-frequency variants. The proposed test features its robustness to model misspecification, high power across a wide range of scenarios and the potential of offering insights into the underlying genetic architecture and disease mechanisms. Results Using the Framingham Heart Study data, we show that low-frequency variants are predictive of DNA methylation, even after conditioning on the nearby common variants. In addition, DNA methylation and gene expression provide complementary information to functional genomics. In the Avon Longitudinal Study of Parents and Children with a sample size of 1497, one gene CLPTM1 is identified to be associated with low-density lipoprotein cholesterol levels by the proposed powerful adaptive gene-based test integrating information from gene expression, methylation and enhancer–promoter interactions. It is further replicated in the TwinsUK study with 1706 samples. The signal is driven by both low-frequency and common variants. Availability and implementation Models are available at https://github.com/ytzhong/DNAm. Supplementary information Supplementary data are available at Bioinformatics online.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btaa898</identifier><identifier>PMID: 33070182</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Original Papers</subject><ispartof>Bioinformatics, 2021-01, Vol.36 (21), p.5223-5228</ispartof><rights>The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com 2020</rights><rights>The Author(s) (2020). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933</citedby><cites>FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933</cites><orcidid>0000-0002-0162-7740 ; 0000-0001-7758-6116</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850048/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850048/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,1604,27924,27925,53791,53793</link.rule.ids><linktorsrc>$$Uhttps://dx.doi.org/10.1093/bioinformatics/btaa898$$EView_record_in_Oxford_University_Press$$FView_record_in_$$GOxford_University_Press</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33070182$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Schwartz, Russell</contributor><creatorcontrib>Yang, Tianzhong</creatorcontrib><creatorcontrib>Wei, Peng</creatorcontrib><creatorcontrib>Pan, Wei</creatorcontrib><title>Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Abstract Motivation The abundance of omics data has facilitated integrative analyses of single and multiple molecular layers with genome-wide association studies focusing on common variants. Built on its successes, we propose a general analysis framework to leverage multi-omics data with sequencing data to improve the statistical power of discovering new associations and understanding of the disease susceptibility due to low-frequency variants. The proposed test features its robustness to model misspecification, high power across a wide range of scenarios and the potential of offering insights into the underlying genetic architecture and disease mechanisms. Results Using the Framingham Heart Study data, we show that low-frequency variants are predictive of DNA methylation, even after conditioning on the nearby common variants. In addition, DNA methylation and gene expression provide complementary information to functional genomics. In the Avon Longitudinal Study of Parents and Children with a sample size of 1497, one gene CLPTM1 is identified to be associated with low-density lipoprotein cholesterol levels by the proposed powerful adaptive gene-based test integrating information from gene expression, methylation and enhancer–promoter interactions. It is further replicated in the TwinsUK study with 1706 samples. The signal is driven by both low-frequency and common variants. Availability and implementation Models are available at https://github.com/ytzhong/DNAm. Supplementary information Supplementary data are available at Bioinformatics online.</description><subject>Original Papers</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqNkcFuFSEUhidGY2v1FRqWbsbCwMzAxsQ0aps0caNrcmY4cy-GgRGYae4r9KlF721jd64g4TvfD_xVdcnoB0YVvxpssH4KcYZsx3Q1ZACp5IvqnImO1g1t1cuy511fC0n5WfUmpZ-UtkwI8bo645z2lMnmvHq49Rl3sVg2JODBHZJNJExkXl22dZiLnRjIQEoYMTaNYcNo_Y64cF9PEX-t6McD2SBa8DkRSCmMFjIacm_z_i9m0CebD8TZJSwxZLSejPvgMGWMwRGHG7r0tno1gUv47rReVD--fP5-fVPffft6e_3prh5F2-W6aZWSZpqgYUOr-NDLzkw4mEnKkcrWUMG6HqCRYDrToJSKdYJLBQoAlOL8ovp49C7rMKMZ0ecITi_RzhAPOoDVz0-83etd2HQvW0qFLIL3J0EM5fkp67n8CzoHHsOadCPahpYkIQraHdExhpQiTk8xjOo_RernRepTkWXw8t9LPo09NlcAdgTCuvyv9Dfgj7fN</recordid><startdate>20210129</startdate><enddate>20210129</enddate><creator>Yang, Tianzhong</creator><creator>Wei, Peng</creator><creator>Pan, Wei</creator><general>Oxford University Press</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-0162-7740</orcidid><orcidid>https://orcid.org/0000-0001-7758-6116</orcidid></search><sort><creationdate>20210129</creationdate><title>Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels</title><author>Yang, Tianzhong ; Wei, Peng ; Pan, Wei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Original Papers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yang, Tianzhong</creatorcontrib><creatorcontrib>Wei, Peng</creatorcontrib><creatorcontrib>Pan, Wei</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Yang, Tianzhong</au><au>Wei, Peng</au><au>Pan, Wei</au><au>Schwartz, Russell</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2021-01-29</date><risdate>2021</risdate><volume>36</volume><issue>21</issue><spage>5223</spage><epage>5228</epage><pages>5223-5228</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><abstract>Abstract Motivation The abundance of omics data has facilitated integrative analyses of single and multiple molecular layers with genome-wide association studies focusing on common variants. Built on its successes, we propose a general analysis framework to leverage multi-omics data with sequencing data to improve the statistical power of discovering new associations and understanding of the disease susceptibility due to low-frequency variants. The proposed test features its robustness to model misspecification, high power across a wide range of scenarios and the potential of offering insights into the underlying genetic architecture and disease mechanisms. Results Using the Framingham Heart Study data, we show that low-frequency variants are predictive of DNA methylation, even after conditioning on the nearby common variants. In addition, DNA methylation and gene expression provide complementary information to functional genomics. In the Avon Longitudinal Study of Parents and Children with a sample size of 1497, one gene CLPTM1 is identified to be associated with low-density lipoprotein cholesterol levels by the proposed powerful adaptive gene-based test integrating information from gene expression, methylation and enhancer–promoter interactions. It is further replicated in the TwinsUK study with 1706 samples. The signal is driven by both low-frequency and common variants. Availability and implementation Models are available at https://github.com/ytzhong/DNAm. Supplementary information Supplementary data are available at Bioinformatics online.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>33070182</pmid><doi>10.1093/bioinformatics/btaa898</doi><tpages>6</tpages><orcidid>https://orcid.org/0000-0002-0162-7740</orcidid><orcidid>https://orcid.org/0000-0001-7758-6116</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1367-4803
ispartof Bioinformatics, 2021-01, Vol.36 (21), p.5223-5228
issn 1367-4803
1460-2059
1367-4811
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7850048
source Open Access: Oxford University Press Open Journals
subjects Original Papers
title Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T05%3A50%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_TOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Integrative%20analysis%20of%20multi-omics%20data%20for%20discovering%20low-frequency%20variants%20associated%20with%20low-density%20lipoprotein%20cholesterol%20levels&rft.jtitle=Bioinformatics&rft.au=Yang,%20Tianzhong&rft.date=2021-01-29&rft.volume=36&rft.issue=21&rft.spage=5223&rft.epage=5228&rft.pages=5223-5228&rft.issn=1367-4803&rft.eissn=1460-2059&rft_id=info:doi/10.1093/bioinformatics/btaa898&rft_dat=%3Cproquest_TOX%3E2452099344%3C/proquest_TOX%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2452099344&rft_id=info:pmid/33070182&rft_oup_id=10.1093/bioinformatics/btaa898&rfr_iscdi=true