Loading…
Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels
Abstract Motivation The abundance of omics data has facilitated integrative analyses of single and multiple molecular layers with genome-wide association studies focusing on common variants. Built on its successes, we propose a general analysis framework to leverage multi-omics data with sequencing...
Saved in:
Published in: | Bioinformatics 2021-01, Vol.36 (21), p.5223-5228 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933 |
---|---|
cites | cdi_FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933 |
container_end_page | 5228 |
container_issue | 21 |
container_start_page | 5223 |
container_title | Bioinformatics |
container_volume | 36 |
creator | Yang, Tianzhong Wei, Peng Pan, Wei |
description | Abstract
Motivation
The abundance of omics data has facilitated integrative analyses of single and multiple molecular layers with genome-wide association studies focusing on common variants. Built on its successes, we propose a general analysis framework to leverage multi-omics data with sequencing data to improve the statistical power of discovering new associations and understanding of the disease susceptibility due to low-frequency variants. The proposed test features its robustness to model misspecification, high power across a wide range of scenarios and the potential of offering insights into the underlying genetic architecture and disease mechanisms.
Results
Using the Framingham Heart Study data, we show that low-frequency variants are predictive of DNA methylation, even after conditioning on the nearby common variants. In addition, DNA methylation and gene expression provide complementary information to functional genomics. In the Avon Longitudinal Study of Parents and Children with a sample size of 1497, one gene CLPTM1 is identified to be associated with low-density lipoprotein cholesterol levels by the proposed powerful adaptive gene-based test integrating information from gene expression, methylation and enhancer–promoter interactions. It is further replicated in the TwinsUK study with 1706 samples. The signal is driven by both low-frequency and common variants.
Availability and implementation
Models are available at https://github.com/ytzhong/DNAm.
Supplementary information
Supplementary data are available at Bioinformatics online. |
doi_str_mv | 10.1093/bioinformatics/btaa898 |
format | article |
fullrecord | <record><control><sourceid>proquest_TOX</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7850048</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/bioinformatics/btaa898</oup_id><sourcerecordid>2452099344</sourcerecordid><originalsourceid>FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933</originalsourceid><addsrcrecordid>eNqNkcFuFSEUhidGY2v1FRqWbsbCwMzAxsQ0aps0caNrcmY4cy-GgRGYae4r9KlF721jd64g4TvfD_xVdcnoB0YVvxpssH4KcYZsx3Q1ZACp5IvqnImO1g1t1cuy511fC0n5WfUmpZ-UtkwI8bo645z2lMnmvHq49Rl3sVg2JODBHZJNJExkXl22dZiLnRjIQEoYMTaNYcNo_Y64cF9PEX-t6McD2SBa8DkRSCmMFjIacm_z_i9m0CebD8TZJSwxZLSejPvgMGWMwRGHG7r0tno1gUv47rReVD--fP5-fVPffft6e_3prh5F2-W6aZWSZpqgYUOr-NDLzkw4mEnKkcrWUMG6HqCRYDrToJSKdYJLBQoAlOL8ovp49C7rMKMZ0ecITi_RzhAPOoDVz0-83etd2HQvW0qFLIL3J0EM5fkp67n8CzoHHsOadCPahpYkIQraHdExhpQiTk8xjOo_RernRepTkWXw8t9LPo09NlcAdgTCuvyv9Dfgj7fN</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2452099344</pqid></control><display><type>article</type><title>Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels</title><source>Open Access: Oxford University Press Open Journals</source><creator>Yang, Tianzhong ; Wei, Peng ; Pan, Wei</creator><contributor>Schwartz, Russell</contributor><creatorcontrib>Yang, Tianzhong ; Wei, Peng ; Pan, Wei ; Schwartz, Russell</creatorcontrib><description>Abstract
Motivation
The abundance of omics data has facilitated integrative analyses of single and multiple molecular layers with genome-wide association studies focusing on common variants. Built on its successes, we propose a general analysis framework to leverage multi-omics data with sequencing data to improve the statistical power of discovering new associations and understanding of the disease susceptibility due to low-frequency variants. The proposed test features its robustness to model misspecification, high power across a wide range of scenarios and the potential of offering insights into the underlying genetic architecture and disease mechanisms.
Results
Using the Framingham Heart Study data, we show that low-frequency variants are predictive of DNA methylation, even after conditioning on the nearby common variants. In addition, DNA methylation and gene expression provide complementary information to functional genomics. In the Avon Longitudinal Study of Parents and Children with a sample size of 1497, one gene CLPTM1 is identified to be associated with low-density lipoprotein cholesterol levels by the proposed powerful adaptive gene-based test integrating information from gene expression, methylation and enhancer–promoter interactions. It is further replicated in the TwinsUK study with 1706 samples. The signal is driven by both low-frequency and common variants.
Availability and implementation
Models are available at https://github.com/ytzhong/DNAm.
Supplementary information
Supplementary data are available at Bioinformatics online.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btaa898</identifier><identifier>PMID: 33070182</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Original Papers</subject><ispartof>Bioinformatics, 2021-01, Vol.36 (21), p.5223-5228</ispartof><rights>The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com 2020</rights><rights>The Author(s) (2020). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933</citedby><cites>FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933</cites><orcidid>0000-0002-0162-7740 ; 0000-0001-7758-6116</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850048/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850048/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,1604,27924,27925,53791,53793</link.rule.ids><linktorsrc>$$Uhttps://dx.doi.org/10.1093/bioinformatics/btaa898$$EView_record_in_Oxford_University_Press$$FView_record_in_$$GOxford_University_Press</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33070182$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Schwartz, Russell</contributor><creatorcontrib>Yang, Tianzhong</creatorcontrib><creatorcontrib>Wei, Peng</creatorcontrib><creatorcontrib>Pan, Wei</creatorcontrib><title>Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Abstract
Motivation
The abundance of omics data has facilitated integrative analyses of single and multiple molecular layers with genome-wide association studies focusing on common variants. Built on its successes, we propose a general analysis framework to leverage multi-omics data with sequencing data to improve the statistical power of discovering new associations and understanding of the disease susceptibility due to low-frequency variants. The proposed test features its robustness to model misspecification, high power across a wide range of scenarios and the potential of offering insights into the underlying genetic architecture and disease mechanisms.
Results
Using the Framingham Heart Study data, we show that low-frequency variants are predictive of DNA methylation, even after conditioning on the nearby common variants. In addition, DNA methylation and gene expression provide complementary information to functional genomics. In the Avon Longitudinal Study of Parents and Children with a sample size of 1497, one gene CLPTM1 is identified to be associated with low-density lipoprotein cholesterol levels by the proposed powerful adaptive gene-based test integrating information from gene expression, methylation and enhancer–promoter interactions. It is further replicated in the TwinsUK study with 1706 samples. The signal is driven by both low-frequency and common variants.
Availability and implementation
Models are available at https://github.com/ytzhong/DNAm.
Supplementary information
Supplementary data are available at Bioinformatics online.</description><subject>Original Papers</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqNkcFuFSEUhidGY2v1FRqWbsbCwMzAxsQ0aps0caNrcmY4cy-GgRGYae4r9KlF721jd64g4TvfD_xVdcnoB0YVvxpssH4KcYZsx3Q1ZACp5IvqnImO1g1t1cuy511fC0n5WfUmpZ-UtkwI8bo645z2lMnmvHq49Rl3sVg2JODBHZJNJExkXl22dZiLnRjIQEoYMTaNYcNo_Y64cF9PEX-t6McD2SBa8DkRSCmMFjIacm_z_i9m0CebD8TZJSwxZLSejPvgMGWMwRGHG7r0tno1gUv47rReVD--fP5-fVPffft6e_3prh5F2-W6aZWSZpqgYUOr-NDLzkw4mEnKkcrWUMG6HqCRYDrToJSKdYJLBQoAlOL8ovp49C7rMKMZ0ecITi_RzhAPOoDVz0-83etd2HQvW0qFLIL3J0EM5fkp67n8CzoHHsOadCPahpYkIQraHdExhpQiTk8xjOo_RernRepTkWXw8t9LPo09NlcAdgTCuvyv9Dfgj7fN</recordid><startdate>20210129</startdate><enddate>20210129</enddate><creator>Yang, Tianzhong</creator><creator>Wei, Peng</creator><creator>Pan, Wei</creator><general>Oxford University Press</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-0162-7740</orcidid><orcidid>https://orcid.org/0000-0001-7758-6116</orcidid></search><sort><creationdate>20210129</creationdate><title>Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels</title><author>Yang, Tianzhong ; Wei, Peng ; Pan, Wei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Original Papers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yang, Tianzhong</creatorcontrib><creatorcontrib>Wei, Peng</creatorcontrib><creatorcontrib>Pan, Wei</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Yang, Tianzhong</au><au>Wei, Peng</au><au>Pan, Wei</au><au>Schwartz, Russell</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2021-01-29</date><risdate>2021</risdate><volume>36</volume><issue>21</issue><spage>5223</spage><epage>5228</epage><pages>5223-5228</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><abstract>Abstract
Motivation
The abundance of omics data has facilitated integrative analyses of single and multiple molecular layers with genome-wide association studies focusing on common variants. Built on its successes, we propose a general analysis framework to leverage multi-omics data with sequencing data to improve the statistical power of discovering new associations and understanding of the disease susceptibility due to low-frequency variants. The proposed test features its robustness to model misspecification, high power across a wide range of scenarios and the potential of offering insights into the underlying genetic architecture and disease mechanisms.
Results
Using the Framingham Heart Study data, we show that low-frequency variants are predictive of DNA methylation, even after conditioning on the nearby common variants. In addition, DNA methylation and gene expression provide complementary information to functional genomics. In the Avon Longitudinal Study of Parents and Children with a sample size of 1497, one gene CLPTM1 is identified to be associated with low-density lipoprotein cholesterol levels by the proposed powerful adaptive gene-based test integrating information from gene expression, methylation and enhancer–promoter interactions. It is further replicated in the TwinsUK study with 1706 samples. The signal is driven by both low-frequency and common variants.
Availability and implementation
Models are available at https://github.com/ytzhong/DNAm.
Supplementary information
Supplementary data are available at Bioinformatics online.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>33070182</pmid><doi>10.1093/bioinformatics/btaa898</doi><tpages>6</tpages><orcidid>https://orcid.org/0000-0002-0162-7740</orcidid><orcidid>https://orcid.org/0000-0001-7758-6116</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1367-4803 |
ispartof | Bioinformatics, 2021-01, Vol.36 (21), p.5223-5228 |
issn | 1367-4803 1460-2059 1367-4811 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7850048 |
source | Open Access: Oxford University Press Open Journals |
subjects | Original Papers |
title | Integrative analysis of multi-omics data for discovering low-frequency variants associated with low-density lipoprotein cholesterol levels |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T05%3A50%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_TOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Integrative%20analysis%20of%20multi-omics%20data%20for%20discovering%20low-frequency%20variants%20associated%20with%20low-density%20lipoprotein%20cholesterol%20levels&rft.jtitle=Bioinformatics&rft.au=Yang,%20Tianzhong&rft.date=2021-01-29&rft.volume=36&rft.issue=21&rft.spage=5223&rft.epage=5228&rft.pages=5223-5228&rft.issn=1367-4803&rft.eissn=1460-2059&rft_id=info:doi/10.1093/bioinformatics/btaa898&rft_dat=%3Cproquest_TOX%3E2452099344%3C/proquest_TOX%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c456t-25998dffa21b593b786dfebdf88c085d04167aa28ad6d2e889164389a9aaa9933%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2452099344&rft_id=info:pmid/33070182&rft_oup_id=10.1093/bioinformatics/btaa898&rfr_iscdi=true |