A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data

Bibliographic Details
Published in: Nucleic acids research 2010-01, Vol.38 (3), p.e17-e17
Main Authors: Barbosa-Morais, Nuno L, Dunning, Mark J, Samarajiwa, Shamith A, Darot, Jeremy F.J, Ritchie, Matthew E, Lynch, Andy G, Tavaré, Simon
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c499t-eb0e95f63f81bad0ef08c158c63383b85a438a0dde7fd07fbbbb21cf01e5bc253
cites cdi_FETCH-LOGICAL-c499t-eb0e95f63f81bad0ef08c158c63383b85a438a0dde7fd07fbbbb21cf01e5bc253
container_end_page e17
container_issue 3
container_start_page e17
container_title Nucleic acids research
container_volume 38
creator Barbosa-Morais, Nuno L
Dunning, Mark J
Samarajiwa, Shamith A
Darot, Jeremy F.J
Ritchie, Matthew E
Lynch, Andy G
Tavaré, Simon
description Illumina BeadArrays are among the most popular and reliable platforms for gene expression profiling. However, little external scrutiny has been given to the design, selection and annotation of BeadArray probes, which is a fundamental issue in data quality and interpretation. Here we present a pipeline for the complete genomic and transcriptomic re-annotation of Illumina probe sequences, also applicable to other platforms, with its output available through a Web interface and incorporated into Bioconductor packages. We have identified several problems with the design of individual probes and we show the benefits of probe re-annotation on the analysis of BeadArray gene expression data sets. We discuss the importance of aspects such as probe coverage of individual transcripts, alternative messenger RNA splicing, single-nucleotide polymorphisms, repeat sequences, RNA degradation biases and probes targeting genomic regions with no known transcription. We conclude that many of the Illumina probes have unreliable original annotation and that our re-annotation allows analyses to focus on the good quality probes, which form the majority, and also to expand the scope of biological information that can be extracted.
doi_str_mv 10.1093/nar/gkp942
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_2817484</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>746012255</sourcerecordid><originalsourceid>FETCH-LOGICAL-c499t-eb0e95f63f81bad0ef08c158c63383b85a438a0dde7fd07fbbbb21cf01e5bc253</originalsourceid><addsrcrecordid>eNqFkU1v1EAMhkcIRJfChR8AuSEhhXq-sjMcKpWKj0qVOEDPIyfxpAPJTJjJVvTfk9Wu-DhhHyzZj1_Zehl7zuENByvPIuaz4ftslXjANlw2ola2EQ_ZBiTomoMyJ-xJKd8AuOJaPWYn3Foh19ywPlONMaYFl5BiNYeZxhCp8ilXV-O4m0LE6h1hf5Ez3pe3VZjmnO5CHKrllqoQF8pzpuN68tVA6zb9XHul7Fs9LviUPfI4Fnp2rKfs5sP7r5ef6uvPH68uL67rTlm71NQCWe0b6Q1vsQfyYDquTddIaWRrNCppEPqetr6HrW_XELzzwEm3ndDylJ0fdOddO1HfUVwyjm7OYcJ87xIG9-8khls3pDsnDN8qo1aBV0eBnH7sqCxuCqWjccRIaVfcVjXAhdD6_6SUWmhQe83XB7LLqZRM_vc9HNzeP7f65w7-rfCLvz_4gx4NW4GXB8BjcjjkUNzNFwFcAjcgrNHyF_4QpAU</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>733525044</pqid></control><display><type>article</type><title>re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data</title><source>PubMed (Medline)</source><source>Oxford Open Access Journals</source><creator>Barbosa-Morais, Nuno L ; Dunning, Mark J ; Samarajiwa, Shamith A ; Darot, Jeremy F.J ; Ritchie, Matthew E ; Lynch, Andy G ; Tavaré, Simon</creator><creatorcontrib>Barbosa-Morais, Nuno L ; Dunning, Mark J ; Samarajiwa, Shamith A ; Darot, Jeremy F.J ; Ritchie, Matthew E ; Lynch, Andy G ; Tavaré, Simon</creatorcontrib><description>Illumina BeadArrays are among the most popular and reliable platforms for gene expression profiling. However, little external scrutiny has been given to the design, selection and annotation of BeadArray probes, which is a fundamental issue in data quality and interpretation. 
Here we present a pipeline for the complete genomic and transcriptomic re-annotation of Illumina probe sequences, also applicable to other platforms, with its output available through a Web interface and incorporated into Bioconductor packages. We have identified several problems with the design of individual probes and we show the benefits of probe re-annotation on the analysis of BeadArray gene expression data sets. We discuss the importance of aspects such as probe coverage of individual transcripts, alternative messenger RNA splicing, single-nucleotide polymorphisms, repeat sequences, RNA degradation biases and probes targeting genomic regions with no known transcription. We conclude that many of the Illumina probes have unreliable original annotation and that our re-annotation allows analyses to focus on the good quality probes, which form the majority, and also to expand the scope of biological information that can be extracted.</description><identifier>ISSN: 0305-1048</identifier><identifier>EISSN: 1362-4962</identifier><identifier>DOI: 10.1093/nar/gkp942</identifier><identifier>PMID: 19923232</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Alternative Splicing ; Base Pair Mismatch ; Gene Expression Profiling - methods ; Humans ; Methods Online ; Oligonucleotide Array Sequence Analysis - methods ; Oligonucleotide Probes - chemistry ; Polymorphism, Single Nucleotide ; Repetitive Sequences, Nucleic Acid ; Software</subject><ispartof>Nucleic acids research, 2010-01, Vol.38 (3), p.e17-e17</ispartof><rights>The Author(s) 2009. Published by Oxford University Press. 
2009</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c499t-eb0e95f63f81bad0ef08c158c63383b85a438a0dde7fd07fbbbb21cf01e5bc253</citedby><cites>FETCH-LOGICAL-c499t-eb0e95f63f81bad0ef08c158c63383b85a438a0dde7fd07fbbbb21cf01e5bc253</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817484/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817484/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,724,777,781,882,27905,27906,53772,53774</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/19923232$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Barbosa-Morais, Nuno L</creatorcontrib><creatorcontrib>Dunning, Mark J</creatorcontrib><creatorcontrib>Samarajiwa, Shamith A</creatorcontrib><creatorcontrib>Darot, Jeremy F.J</creatorcontrib><creatorcontrib>Ritchie, Matthew E</creatorcontrib><creatorcontrib>Lynch, Andy G</creatorcontrib><creatorcontrib>Tavaré, Simon</creatorcontrib><title>re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data</title><title>Nucleic acids research</title><addtitle>Nucleic Acids Res</addtitle><description>Illumina BeadArrays are among the most popular and reliable platforms for gene expression profiling. However, little external scrutiny has been given to the design, selection and annotation of BeadArray probes, which is a fundamental issue in data quality and interpretation. 
Here we present a pipeline for the complete genomic and transcriptomic re-annotation of Illumina probe sequences, also applicable to other platforms, with its output available through a Web interface and incorporated into Bioconductor packages. We have identified several problems with the design of individual probes and we show the benefits of probe re-annotation on the analysis of BeadArray gene expression data sets. We discuss the importance of aspects such as probe coverage of individual transcripts, alternative messenger RNA splicing, single-nucleotide polymorphisms, repeat sequences, RNA degradation biases and probes targeting genomic regions with no known transcription. We conclude that many of the Illumina probes have unreliable original annotation and that our re-annotation allows analyses to focus on the good quality probes, which form the majority, and also to expand the scope of biological information that can be extracted.</description><subject>Alternative Splicing</subject><subject>Base Pair Mismatch</subject><subject>Gene Expression Profiling - methods</subject><subject>Humans</subject><subject>Methods Online</subject><subject>Oligonucleotide Array Sequence Analysis - methods</subject><subject>Oligonucleotide Probes - chemistry</subject><subject>Polymorphism, Single Nucleotide</subject><subject>Repetitive Sequences, Nucleic 
Acid</subject><subject>Software</subject><issn>0305-1048</issn><issn>1362-4962</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNqFkU1v1EAMhkcIRJfChR8AuSEhhXq-sjMcKpWKj0qVOEDPIyfxpAPJTJjJVvTfk9Wu-DhhHyzZj1_Zehl7zuENByvPIuaz4ftslXjANlw2ola2EQ_ZBiTomoMyJ-xJKd8AuOJaPWYn3Foh19ywPlONMaYFl5BiNYeZxhCp8ilXV-O4m0LE6h1hf5Ez3pe3VZjmnO5CHKrllqoQF8pzpuN68tVA6zb9XHul7Fs9LviUPfI4Fnp2rKfs5sP7r5ef6uvPH68uL67rTlm71NQCWe0b6Q1vsQfyYDquTddIaWRrNCppEPqetr6HrW_XELzzwEm3ndDylJ0fdOddO1HfUVwyjm7OYcJ87xIG9-8khls3pDsnDN8qo1aBV0eBnH7sqCxuCqWjccRIaVfcVjXAhdD6_6SUWmhQe83XB7LLqZRM_vc9HNzeP7f65w7-rfCLvz_4gx4NW4GXB8BjcjjkUNzNFwFcAjcgrNHyF_4QpAU</recordid><startdate>20100101</startdate><enddate>20100101</enddate><creator>Barbosa-Morais, Nuno L</creator><creator>Dunning, Mark J</creator><creator>Samarajiwa, Shamith A</creator><creator>Darot, Jeremy F.J</creator><creator>Ritchie, Matthew E</creator><creator>Lynch, Andy G</creator><creator>Tavaré, Simon</creator><general>Oxford University Press</general><scope>FBQ</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7TM</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope><scope>5PM</scope></search><sort><creationdate>20100101</creationdate><title>re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data</title><author>Barbosa-Morais, Nuno L ; Dunning, Mark J ; Samarajiwa, Shamith A ; Darot, Jeremy F.J ; Ritchie, Matthew E ; Lynch, Andy G ; Tavaré, Simon</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c499t-eb0e95f63f81bad0ef08c158c63383b85a438a0dde7fd07fbbbb21cf01e5bc253</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Alternative Splicing</topic><topic>Base 
Pair Mismatch</topic><topic>Gene Expression Profiling - methods</topic><topic>Humans</topic><topic>Methods Online</topic><topic>Oligonucleotide Array Sequence Analysis - methods</topic><topic>Oligonucleotide Probes - chemistry</topic><topic>Polymorphism, Single Nucleotide</topic><topic>Repetitive Sequences, Nucleic Acid</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Barbosa-Morais, Nuno L</creatorcontrib><creatorcontrib>Dunning, Mark J</creatorcontrib><creatorcontrib>Samarajiwa, Shamith A</creatorcontrib><creatorcontrib>Darot, Jeremy F.J</creatorcontrib><creatorcontrib>Ritchie, Matthew E</creatorcontrib><creatorcontrib>Lynch, Andy G</creatorcontrib><creatorcontrib>Tavaré, Simon</creatorcontrib><collection>AGRIS</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Nucleic Acids Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Nucleic acids research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Barbosa-Morais, Nuno L</au><au>Dunning, Mark J</au><au>Samarajiwa, Shamith A</au><au>Darot, Jeremy F.J</au><au>Ritchie, Matthew E</au><au>Lynch, Andy G</au><au>Tavaré, Simon</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data</atitle><jtitle>Nucleic acids research</jtitle><addtitle>Nucleic Acids 
Res</addtitle><date>2010-01-01</date><risdate>2010</risdate><volume>38</volume><issue>3</issue><spage>e17</spage><epage>e17</epage><pages>e17-e17</pages><issn>0305-1048</issn><eissn>1362-4962</eissn><abstract>Illumina BeadArrays are among the most popular and reliable platforms for gene expression profiling. However, little external scrutiny has been given to the design, selection and annotation of BeadArray probes, which is a fundamental issue in data quality and interpretation. Here we present a pipeline for the complete genomic and transcriptomic re-annotation of Illumina probe sequences, also applicable to other platforms, with its output available through a Web interface and incorporated into Bioconductor packages. We have identified several problems with the design of individual probes and we show the benefits of probe re-annotation on the analysis of BeadArray gene expression data sets. We discuss the importance of aspects such as probe coverage of individual transcripts, alternative messenger RNA splicing, single-nucleotide polymorphisms, repeat sequences, RNA degradation biases and probes targeting genomic regions with no known transcription. We conclude that many of the Illumina probes have unreliable original annotation and that our re-annotation allows analyses to focus on the good quality probes, which form the majority, and also to expand the scope of biological information that can be extracted.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>19923232</pmid><doi>10.1093/nar/gkp942</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0305-1048
ispartof Nucleic acids research, 2010-01, Vol.38 (3), p.e17-e17
issn 0305-1048
1362-4962
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_2817484
source PubMed (Medline); Oxford Open Access Journals
subjects Alternative Splicing
Base Pair Mismatch
Gene Expression Profiling - methods
Humans
Methods Online
Oligonucleotide Array Sequence Analysis - methods
Oligonucleotide Probes - chemistry
Polymorphism, Single Nucleotide
Repetitive Sequences, Nucleic Acid
Software
title re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T15%3A18%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=re-annotation%20pipeline%20for%20Illumina%20BeadArrays:%20improving%20the%20interpretation%20of%20gene%20expression%20data&rft.jtitle=Nucleic%20acids%20research&rft.au=Barbosa-Morais,%20Nuno%20L&rft.date=2010-01-01&rft.volume=38&rft.issue=3&rft.spage=e17&rft.epage=e17&rft.pages=e17-e17&rft.issn=0305-1048&rft.eissn=1362-4962&rft_id=info:doi/10.1093/nar/gkp942&rft_dat=%3Cproquest_pubme%3E746012255%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c499t-eb0e95f63f81bad0ef08c158c63383b85a438a0dde7fd07fbbbb21cf01e5bc253%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=733525044&rft_id=info:pmid/19923232&rfr_iscdi=true