Loading…

Harmonising and linking biomedical and clinical data across disparate data archives to enable integrative cross-biobank research

A wealth of biospecimen samples are stored in modern globally distributed biobanks. Biomedical researchers worldwide need to be able to combine the available resources to improve the power of large-scale studies. A prerequisite for this effort is to be able to search and access phenotypic, clinical...

Full description

Saved in:
Bibliographic Details
Main Authors: Spjuth, Ola, Krestyaninova, Maria, Hastings, Janna, Langhammer, Arnulf, Harris, Jennifer, Shen, Huei-Yi, Heikkinen, Jani, Waldenberger, Melanie, Ladenvall, Claes, Esko, Tõnu, Persson, Mats-Åke, Heggland, Jon, Dietrich, Joern, Ose, Sandra, Gieger, Christian, Ried, Janina S, Peters, Annette, Fortier, Isabel, de Geus, Eco JC, Klovins, Janis, Zaharenko, Linda, Willemsen, Gonneke, Hottenga, Jouke-Jan, Litton, Jan-Eric, Karvanen, Juha, Boomsma, Dorret I, Groop, Leif, Rung, Johan, Palmgren, Juni, Pedersen, Nancy L, McCarthy, Mark I, van Duijn, Cornelia M, Hveem, Kristian, Metspalu, Andres, Ripatti, Samuli, Prokopenko, Inga
Format: Article
Language:English
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Spjuth, Ola
Krestyaninova, Maria
Hastings, Janna
Langhammer, Arnulf
Harris, Jennifer
Shen, Huei-Yi
Heikkinen, Jani
Waldenberger, Melanie
Ladenvall, Claes
Esko, Tõnu
Persson, Mats-Åke
Heggland, Jon
Dietrich, Joern
Ose, Sandra
Gieger, Christian
Ried, Janina S
Peters, Annette
Fortier, Isabel
de Geus, Eco JC
Klovins, Janis
Zaharenko, Linda
Willemsen, Gonneke
Hottenga, Jouke-Jan
Litton, Jan-Eric
Karvanen, Juha
Boomsma, Dorret I
Groop, Leif
Rung, Johan
Palmgren, Juni
Pedersen, Nancy L
McCarthy, Mark I
van Duijn, Cornelia M
Hveem, Kristian
Metspalu, Andres
Ripatti, Samuli
Prokopenko, Inga
description A wealth of biospecimen samples are stored in modern globally distributed biobanks. Biomedical researchers worldwide need to be able to combine the available resources to improve the power of large-scale studies. A prerequisite for this effort is to be able to search and access phenotypic, clinical and other information about samples that are currently stored at biobanks in an integrated manner. However, privacy issues together with heterogeneous information systems and the lack of agreed-upon vocabularies have made specimen searching across multiple biobanks extremely challenging. We describe three case studies where we have linked samples and sample descriptions in order to facilitate global searching of available samples for research. The use cases include the ENGAGE (European Network for Genetic and Genomic Epidemiology) consortium comprising at least 39 cohorts, the SUMMIT (surrogate markers for micro- and macro-vascular hard endpoints for innovative diabetes tools) consortium and a pilot for data integration between a Swedish clinical health registry and a biobank. We used the Sample avAILability (SAIL) method for data linking: first, created harmonised variables and then annotated and made searchable information on the number of specimens available in individual biobanks for various phenotypic categories. By operating on this categorised availability data we sidestep many obstacles related to privacy that arise when handling real values and show that harmonised and annotated records about data availability across disparate biomedical archives provide a key methodological advance in pre-analysis exchange of information between biobanks, that is, during the project planning phase.
format article
fullrecord <record><control><sourceid>cristin_3HK</sourceid><recordid>TN_cdi_cristin_nora_11250_2388239</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>11250_2388239</sourcerecordid><originalsourceid>FETCH-cristin_nora_11250_23882393</originalsourceid><addsrcrecordid>eNqNjMEKwjAQRHvxIOo_rB9QsC1CPYvSD_Aetslal6YbyQbPfrpp6Qd4mpnHzGyLb4dxCsLKMgCKA88yzr7nMJFji37BNvMlOEwIaGNQBcf6xoiJVhrtiz-kkAKQYO8JWBINuZExLJsy__YoI0RSmgf7YvNEr3RYdVcc77fHtSttZE0sRkJEU1X1-WTqpm3r5tL80_kBVjZJSQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Harmonising and linking biomedical and clinical data across disparate data archives to enable integrative cross-biobank research</title><source>NORA - Norwegian Open Research Archives</source><creator>Spjuth, Ola ; Krestyaninova, Maria ; Hastings, Janna ; Langhammer, Arnulf ; Harris, Jennifer ; Shen, Huei-Yi ; Heikkinen, Jani ; Waldenberger, Melanie ; Ladenvall, Claes ; Esko, Tõnu ; Persson, Mats-Åke ; Heggland, Jon ; Dietrich, Joern ; Ose, Sandra ; Gieger, Christian ; Ried, Janina S ; Peters, Annette ; Fortier, Isabel ; de Geus, Eco JC ; Klovins, Janis ; Zaharenko, Linda ; Willemsen, Gonneke ; Hottenga, Jouke-Jan ; Litton, Jan-Eric ; Karvanen, Juha ; Boomsma, Dorret I ; Groop, Leif ; Rung, Johan ; Palmgren, Juni ; Pedersen, Nancy L ; McCarthy, Mark I ; van Duijn, Cornelia M ; Hveem, Kristian ; Metspalu, Andres ; Ripatti, Samuli ; Prokopenko, Inga</creator><creatorcontrib>Spjuth, Ola ; Krestyaninova, Maria ; Hastings, Janna ; Langhammer, Arnulf ; Harris, Jennifer ; Shen, Huei-Yi ; Heikkinen, Jani ; Waldenberger, Melanie ; Ladenvall, Claes ; Esko, Tõnu ; Persson, Mats-Åke ; Heggland, Jon ; Dietrich, Joern ; Ose, Sandra ; Gieger, Christian ; Ried, Janina S ; Peters, Annette ; Fortier, Isabel ; de Geus, Eco JC ; Klovins, Janis ; Zaharenko, Linda ; Willemsen, Gonneke ; Hottenga, Jouke-Jan ; Litton, Jan-Eric ; Karvanen, Juha ; Boomsma, Dorret I ; Groop, Leif ; Rung, Johan ; Palmgren, Juni ; Pedersen, Nancy L ; McCarthy, Mark I ; van Duijn, Cornelia M ; Hveem, Kristian ; Metspalu, Andres ; Ripatti, Samuli ; Prokopenko, Inga</creatorcontrib><description>A wealth of biospecimen samples are stored in modern globally distributed biobanks. Biomedical researchers worldwide need to be able to combine the available resources to improve the power of large-scale studies. A prerequisite for this effort is to be able to search and access phenotypic, clinical and other information about samples that are currently stored at biobanks in an integrated manner. However, privacy issues together with heterogeneous information systems and the lack of agreed-upon vocabularies have made specimen searching across multiple biobanks extremely challenging. We describe three case studies where we have linked samples and sample descriptions in order to facilitate global searching of available samples for research. The use cases include the ENGAGE (European Network for Genetic and Genomic Epidemiology) consortium comprising at least 39 cohorts, the SUMMIT (surrogate markers for micro- and macro-vascular hard endpoints for innovative diabetes tools) consortium and a pilot for data integration between a Swedish clinical health registry and a biobank. We used the Sample avAILability (SAIL) method for data linking: first, created harmonised variables and then annotated and made searchable information on the number of specimens available in individual biobanks for various phenotypic categories. By operating on this categorised availability data we sidestep many obstacles related to privacy that arise when handling real values and show that harmonised and annotated records about data availability across disparate biomedical archives provide a key methodological advance in pre-analysis exchange of information between biobanks, that is, during the project planning phase.</description><language>eng</language><publisher>Nature Publishing Group</publisher><creationdate>2015</creationdate><rights>info:eu-repo/semantics/openAccess</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,780,885,26567</link.rule.ids><linktorsrc>$$Uhttp://hdl.handle.net/11250/2388239$$EView_record_in_NORA$$FView_record_in_$$GNORA$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Spjuth, Ola</creatorcontrib><creatorcontrib>Krestyaninova, Maria</creatorcontrib><creatorcontrib>Hastings, Janna</creatorcontrib><creatorcontrib>Langhammer, Arnulf</creatorcontrib><creatorcontrib>Harris, Jennifer</creatorcontrib><creatorcontrib>Shen, Huei-Yi</creatorcontrib><creatorcontrib>Heikkinen, Jani</creatorcontrib><creatorcontrib>Waldenberger, Melanie</creatorcontrib><creatorcontrib>Ladenvall, Claes</creatorcontrib><creatorcontrib>Esko, Tõnu</creatorcontrib><creatorcontrib>Persson, Mats-Åke</creatorcontrib><creatorcontrib>Heggland, Jon</creatorcontrib><creatorcontrib>Dietrich, Joern</creatorcontrib><creatorcontrib>Ose, Sandra</creatorcontrib><creatorcontrib>Gieger, Christian</creatorcontrib><creatorcontrib>Ried, Janina S</creatorcontrib><creatorcontrib>Peters, Annette</creatorcontrib><creatorcontrib>Fortier, Isabel</creatorcontrib><creatorcontrib>de Geus, Eco JC</creatorcontrib><creatorcontrib>Klovins, Janis</creatorcontrib><creatorcontrib>Zaharenko, Linda</creatorcontrib><creatorcontrib>Willemsen, Gonneke</creatorcontrib><creatorcontrib>Hottenga, Jouke-Jan</creatorcontrib><creatorcontrib>Litton, Jan-Eric</creatorcontrib><creatorcontrib>Karvanen, Juha</creatorcontrib><creatorcontrib>Boomsma, Dorret I</creatorcontrib><creatorcontrib>Groop, Leif</creatorcontrib><creatorcontrib>Rung, Johan</creatorcontrib><creatorcontrib>Palmgren, Juni</creatorcontrib><creatorcontrib>Pedersen, Nancy L</creatorcontrib><creatorcontrib>McCarthy, Mark I</creatorcontrib><creatorcontrib>van Duijn, Cornelia M</creatorcontrib><creatorcontrib>Hveem, Kristian</creatorcontrib><creatorcontrib>Metspalu, Andres</creatorcontrib><creatorcontrib>Ripatti, Samuli</creatorcontrib><creatorcontrib>Prokopenko, Inga</creatorcontrib><title>Harmonising and linking biomedical and clinical data across disparate data archives to enable integrative cross-biobank research</title><description>A wealth of biospecimen samples are stored in modern globally distributed biobanks. Biomedical researchers worldwide need to be able to combine the available resources to improve the power of large-scale studies. A prerequisite for this effort is to be able to search and access phenotypic, clinical and other information about samples that are currently stored at biobanks in an integrated manner. However, privacy issues together with heterogeneous information systems and the lack of agreed-upon vocabularies have made specimen searching across multiple biobanks extremely challenging. We describe three case studies where we have linked samples and sample descriptions in order to facilitate global searching of available samples for research. The use cases include the ENGAGE (European Network for Genetic and Genomic Epidemiology) consortium comprising at least 39 cohorts, the SUMMIT (surrogate markers for micro- and macro-vascular hard endpoints for innovative diabetes tools) consortium and a pilot for data integration between a Swedish clinical health registry and a biobank. We used the Sample avAILability (SAIL) method for data linking: first, created harmonised variables and then annotated and made searchable information on the number of specimens available in individual biobanks for various phenotypic categories. By operating on this categorised availability data we sidestep many obstacles related to privacy that arise when handling real values and show that harmonised and annotated records about data availability across disparate biomedical archives provide a key methodological advance in pre-analysis exchange of information between biobanks, that is, during the project planning phase.</description><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>3HK</sourceid><recordid>eNqNjMEKwjAQRHvxIOo_rB9QsC1CPYvSD_Aetslal6YbyQbPfrpp6Qd4mpnHzGyLb4dxCsLKMgCKA88yzr7nMJFji37BNvMlOEwIaGNQBcf6xoiJVhrtiz-kkAKQYO8JWBINuZExLJsy__YoI0RSmgf7YvNEr3RYdVcc77fHtSttZE0sRkJEU1X1-WTqpm3r5tL80_kBVjZJSQ</recordid><startdate>2015</startdate><enddate>2015</enddate><creator>Spjuth, Ola</creator><creator>Krestyaninova, Maria</creator><creator>Hastings, Janna</creator><creator>Langhammer, Arnulf</creator><creator>Harris, Jennifer</creator><creator>Shen, Huei-Yi</creator><creator>Heikkinen, Jani</creator><creator>Waldenberger, Melanie</creator><creator>Ladenvall, Claes</creator><creator>Esko, Tõnu</creator><creator>Persson, Mats-Åke</creator><creator>Heggland, Jon</creator><creator>Dietrich, Joern</creator><creator>Ose, Sandra</creator><creator>Gieger, Christian</creator><creator>Ried, Janina S</creator><creator>Peters, Annette</creator><creator>Fortier, Isabel</creator><creator>de Geus, Eco JC</creator><creator>Klovins, Janis</creator><creator>Zaharenko, Linda</creator><creator>Willemsen, Gonneke</creator><creator>Hottenga, Jouke-Jan</creator><creator>Litton, Jan-Eric</creator><creator>Karvanen, Juha</creator><creator>Boomsma, Dorret I</creator><creator>Groop, Leif</creator><creator>Rung, Johan</creator><creator>Palmgren, Juni</creator><creator>Pedersen, Nancy L</creator><creator>McCarthy, Mark I</creator><creator>van Duijn, Cornelia M</creator><creator>Hveem, Kristian</creator><creator>Metspalu, Andres</creator><creator>Ripatti, Samuli</creator><creator>Prokopenko, Inga</creator><general>Nature Publishing Group</general><scope>3HK</scope></search><sort><creationdate>2015</creationdate><title>Harmonising and linking biomedical and clinical data across disparate data archives to enable integrative cross-biobank research</title><author>Spjuth, Ola ; Krestyaninova, Maria ; Hastings, Janna ; Langhammer, Arnulf ; Harris, Jennifer ; Shen, Huei-Yi ; Heikkinen, Jani ; Waldenberger, Melanie ; Ladenvall, Claes ; Esko, Tõnu ; Persson, Mats-Åke ; Heggland, Jon ; Dietrich, Joern ; Ose, Sandra ; Gieger, Christian ; Ried, Janina S ; Peters, Annette ; Fortier, Isabel ; de Geus, Eco JC ; Klovins, Janis ; Zaharenko, Linda ; Willemsen, Gonneke ; Hottenga, Jouke-Jan ; Litton, Jan-Eric ; Karvanen, Juha ; Boomsma, Dorret I ; Groop, Leif ; Rung, Johan ; Palmgren, Juni ; Pedersen, Nancy L ; McCarthy, Mark I ; van Duijn, Cornelia M ; Hveem, Kristian ; Metspalu, Andres ; Ripatti, Samuli ; Prokopenko, Inga</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-cristin_nora_11250_23882393</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Spjuth, Ola</creatorcontrib><creatorcontrib>Krestyaninova, Maria</creatorcontrib><creatorcontrib>Hastings, Janna</creatorcontrib><creatorcontrib>Langhammer, Arnulf</creatorcontrib><creatorcontrib>Harris, Jennifer</creatorcontrib><creatorcontrib>Shen, Huei-Yi</creatorcontrib><creatorcontrib>Heikkinen, Jani</creatorcontrib><creatorcontrib>Waldenberger, Melanie</creatorcontrib><creatorcontrib>Ladenvall, Claes</creatorcontrib><creatorcontrib>Esko, Tõnu</creatorcontrib><creatorcontrib>Persson, Mats-Åke</creatorcontrib><creatorcontrib>Heggland, Jon</creatorcontrib><creatorcontrib>Dietrich, Joern</creatorcontrib><creatorcontrib>Ose, Sandra</creatorcontrib><creatorcontrib>Gieger, Christian</creatorcontrib><creatorcontrib>Ried, Janina S</creatorcontrib><creatorcontrib>Peters, Annette</creatorcontrib><creatorcontrib>Fortier, Isabel</creatorcontrib><creatorcontrib>de Geus, Eco JC</creatorcontrib><creatorcontrib>Klovins, Janis</creatorcontrib><creatorcontrib>Zaharenko, Linda</creatorcontrib><creatorcontrib>Willemsen, Gonneke</creatorcontrib><creatorcontrib>Hottenga, Jouke-Jan</creatorcontrib><creatorcontrib>Litton, Jan-Eric</creatorcontrib><creatorcontrib>Karvanen, Juha</creatorcontrib><creatorcontrib>Boomsma, Dorret I</creatorcontrib><creatorcontrib>Groop, Leif</creatorcontrib><creatorcontrib>Rung, Johan</creatorcontrib><creatorcontrib>Palmgren, Juni</creatorcontrib><creatorcontrib>Pedersen, Nancy L</creatorcontrib><creatorcontrib>McCarthy, Mark I</creatorcontrib><creatorcontrib>van Duijn, Cornelia M</creatorcontrib><creatorcontrib>Hveem, Kristian</creatorcontrib><creatorcontrib>Metspalu, Andres</creatorcontrib><creatorcontrib>Ripatti, Samuli</creatorcontrib><creatorcontrib>Prokopenko, Inga</creatorcontrib><collection>NORA - Norwegian Open Research Archives</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Spjuth, Ola</au><au>Krestyaninova, Maria</au><au>Hastings, Janna</au><au>Langhammer, Arnulf</au><au>Harris, Jennifer</au><au>Shen, Huei-Yi</au><au>Heikkinen, Jani</au><au>Waldenberger, Melanie</au><au>Ladenvall, Claes</au><au>Esko, Tõnu</au><au>Persson, Mats-Åke</au><au>Heggland, Jon</au><au>Dietrich, Joern</au><au>Ose, Sandra</au><au>Gieger, Christian</au><au>Ried, Janina S</au><au>Peters, Annette</au><au>Fortier, Isabel</au><au>de Geus, Eco JC</au><au>Klovins, Janis</au><au>Zaharenko, Linda</au><au>Willemsen, Gonneke</au><au>Hottenga, Jouke-Jan</au><au>Litton, Jan-Eric</au><au>Karvanen, Juha</au><au>Boomsma, Dorret I</au><au>Groop, Leif</au><au>Rung, Johan</au><au>Palmgren, Juni</au><au>Pedersen, Nancy L</au><au>McCarthy, Mark I</au><au>van Duijn, Cornelia M</au><au>Hveem, Kristian</au><au>Metspalu, Andres</au><au>Ripatti, Samuli</au><au>Prokopenko, Inga</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Harmonising and linking biomedical and clinical data across disparate data archives to enable integrative cross-biobank research</atitle><date>2015</date><risdate>2015</risdate><abstract>A wealth of biospecimen samples are stored in modern globally distributed biobanks. Biomedical researchers worldwide need to be able to combine the available resources to improve the power of large-scale studies. A prerequisite for this effort is to be able to search and access phenotypic, clinical and other information about samples that are currently stored at biobanks in an integrated manner. However, privacy issues together with heterogeneous information systems and the lack of agreed-upon vocabularies have made specimen searching across multiple biobanks extremely challenging. We describe three case studies where we have linked samples and sample descriptions in order to facilitate global searching of available samples for research. The use cases include the ENGAGE (European Network for Genetic and Genomic Epidemiology) consortium comprising at least 39 cohorts, the SUMMIT (surrogate markers for micro- and macro-vascular hard endpoints for innovative diabetes tools) consortium and a pilot for data integration between a Swedish clinical health registry and a biobank. We used the Sample avAILability (SAIL) method for data linking: first, created harmonised variables and then annotated and made searchable information on the number of specimens available in individual biobanks for various phenotypic categories. By operating on this categorised availability data we sidestep many obstacles related to privacy that arise when handling real values and show that harmonised and annotated records about data availability across disparate biomedical archives provide a key methodological advance in pre-analysis exchange of information between biobanks, that is, during the project planning phase.</abstract><pub>Nature Publishing Group</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_cristin_nora_11250_2388239
source NORA - Norwegian Open Research Archives
title Harmonising and linking biomedical and clinical data across disparate data archives to enable integrative cross-biobank research
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T11%3A14%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-cristin_3HK&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Harmonising%20and%20linking%20biomedical%20and%20clinical%20data%20across%20disparate%20data%20archives%20to%20enable%20integrative%20cross-biobank%20research&rft.au=Spjuth,%20Ola&rft.date=2015&rft_id=info:doi/&rft_dat=%3Ccristin_3HK%3E11250_2388239%3C/cristin_3HK%3E%3Cgrp_id%3Ecdi_FETCH-cristin_nora_11250_23882393%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true