Loading…

Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips

Affymetrix GeneChips are a popular platform for performing whole-genome experiments on the transcriptome. There are a range of different calibration steps, and users are presented with choices of different background subtractions, normalisations and expression measures. We wished to establish which...

Full description

Saved in:
Bibliographic Details
Published in:BMC bioinformatics 2007-06, Vol.8 (1), p.195-195, Article 195
Main Authors: Harrison, Andrew P, Johnston, Caroline E, Orengo, Christine A
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-b648t-d1deaf69d32934c577c065129506a4a2cf47cc0acd5c422c2e559c9dd5db42ec3
cites cdi_FETCH-LOGICAL-b648t-d1deaf69d32934c577c065129506a4a2cf47cc0acd5c422c2e559c9dd5db42ec3
container_end_page 195
container_issue 1
container_start_page 195
container_title BMC bioinformatics
container_volume 8
creator Harrison, Andrew P
Johnston, Caroline E
Orengo, Christine A
description Affymetrix GeneChips are a popular platform for performing whole-genome experiments on the transcriptome. There are a range of different calibration steps, and users are presented with choices of different background subtractions, normalisations and expression measures. We wished to establish which of the calibration steps resulted in the biggest uncertainty in the sets of genes reported to be differentially expressed. Our results indicate that the sets of genes identified as being most significantly differentially expressed, as estimated by the z-score of fold change, is relatively insensitive to the choice of background subtraction and normalisation. However, the contents of the gene list are most sensitive to the choice of expression measure. This is irrespective of whether the experiment uses a rat, mouse or human chip and whether the chip definition is made using probe mappings from Unigene, RefSeq, Entrez Gene or the original Affymetrix definitions. It is also irrespective of whether both Present and Absent, or just Present, Calls from the MAS5 algorithm are used to filter genelists, and this conclusion holds for genes of differing intensities. We also reach the same conclusion after assigning genes to be differentially expressed using t-statistics, although this approach results in a large amount of false positives in the sets of genes identified due to the small numbers of replicates typically used in microarray experiments. The major calibration uncertainty that biologists need to consider when analysing Affymetrix data is how their multiple probe values are condensed into one expression measure.
doi_str_mv 10.1186/1471-2105-8-195
format article
fullrecord <record><control><sourceid>gale_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_a77a43234d7d44b88554aaa591999936</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A165790131</galeid><doaj_id>oai_doaj_org_article_a77a43234d7d44b88554aaa591999936</doaj_id><sourcerecordid>A165790131</sourcerecordid><originalsourceid>FETCH-LOGICAL-b648t-d1deaf69d32934c577c065129506a4a2cf47cc0acd5c422c2e559c9dd5db42ec3</originalsourceid><addsrcrecordid>eNqFkktr3DAQgE1paR7tubdiKBR6cKKnZV0K2yVNFwKFNj2LsSTvarGtreQN2X9fOV7SGFIqHyRmPn0WM5Nl7zC6wLgqLzETuCAY8aIqsOQvstPHyMsn55PsLMYtQlhUiL_OTrDgJUGoOs1ur-IAdevixvXrHPIOtj7kGvbR5r7JjYs62B30-pC7Ph82NuVaVwcYnO9HYtE0h84Owd3n17a3y43bxTfZqwbaaN8e9_Ps19er2-W34ub79Wq5uCnqklVDYbCx0JTSUCIp01wIjUqOieSoBAZEN0xojUAbrhkhmljOpZbGcFMzYjU9z1aT13jYql1wHYSD8uDUQ8CHtYIwON1aBUIAo4QyIwxjdVVxzgCASyzTomVyfZ5cu33dWaNtPwRoZ9J5pncbtfZ3CkvECKuS4MskqJ3_h2Ce0b5TY4fU2CFVJRFPko_HVwT_e2_joLrUAdu20Fu_j0qgUhDG6X9BgiiVHI_P-jCBa0hlcH3j08_1CKsFLrmQCFOcqItnqPQZ2znte9u4FJ9d-DS7kJjB3g_rNDhRrX7-mLOXE6uDjzHY5rEoGKlxjJ8pw_unzfjLH-eW_gHcJOt8</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>20339518</pqid></control><display><type>article</type><title>Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips</title><source>PubMed Central</source><creator>Harrison, Andrew P ; Johnston, Caroline E ; Orengo, Christine A</creator><creatorcontrib>Harrison, Andrew P ; Johnston, Caroline E ; Orengo, Christine A</creatorcontrib><description>Affymetrix GeneChips are a popular platform for performing whole-genome experiments on the transcriptome. There are a range of different calibration steps, and users are presented with choices of different background subtractions, normalisations and expression measures. We wished to establish which of the calibration steps resulted in the biggest uncertainty in the sets of genes reported to be differentially expressed. Our results indicate that the sets of genes identified as being most significantly differentially expressed, as estimated by the z-score of fold change, is relatively insensitive to the choice of background subtraction and normalisation. However, the contents of the gene list are most sensitive to the choice of expression measure. This is irrespective of whether the experiment uses a rat, mouse or human chip and whether the chip definition is made using probe mappings from Unigene, RefSeq, Entrez Gene or the original Affymetrix definitions. It is also irrespective of whether both Present and Absent, or just Present, Calls from the MAS5 algorithm are used to filter genelists, and this conclusion holds for genes of differing intensities. We also reach the same conclusion after assigning genes to be differentially expressed using t-statistics, although this approach results in a large amount of false positives in the sets of genes identified due to the small numbers of replicates typically used in microarray experiments. The major calibration uncertainty that biologists need to consider when analysing Affymetrix data is how their multiple probe values are condensed into one expression measure.</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/1471-2105-8-195</identifier><identifier>PMID: 17562008</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Algorithms ; Calibration ; Data Interpretation, Statistical ; DNA microarrays ; Evaluation ; Gene Expression Profiling - instrumentation ; Gene Expression Profiling - standards ; Oligonucleotide Array Sequence Analysis - instrumentation ; Oligonucleotide Array Sequence Analysis - standards ; Quality Control ; Reproducibility of Results ; Sensitivity and Specificity ; United Kingdom</subject><ispartof>BMC bioinformatics, 2007-06, Vol.8 (1), p.195-195, Article 195</ispartof><rights>COPYRIGHT 2007 BioMed Central Ltd.</rights><rights>Copyright © 2007 Harrison et al; licensee BioMed Central Ltd. 2007 Harrison et al; licensee BioMed Central Ltd.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-b648t-d1deaf69d32934c577c065129506a4a2cf47cc0acd5c422c2e559c9dd5db42ec3</citedby><cites>FETCH-LOGICAL-b648t-d1deaf69d32934c577c065129506a4a2cf47cc0acd5c422c2e559c9dd5db42ec3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC1904248/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC1904248/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/17562008$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Harrison, Andrew P</creatorcontrib><creatorcontrib>Johnston, Caroline E</creatorcontrib><creatorcontrib>Orengo, Christine A</creatorcontrib><title>Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips</title><title>BMC bioinformatics</title><addtitle>BMC Bioinformatics</addtitle><description>Affymetrix GeneChips are a popular platform for performing whole-genome experiments on the transcriptome. There are a range of different calibration steps, and users are presented with choices of different background subtractions, normalisations and expression measures. We wished to establish which of the calibration steps resulted in the biggest uncertainty in the sets of genes reported to be differentially expressed. Our results indicate that the sets of genes identified as being most significantly differentially expressed, as estimated by the z-score of fold change, is relatively insensitive to the choice of background subtraction and normalisation. However, the contents of the gene list are most sensitive to the choice of expression measure. This is irrespective of whether the experiment uses a rat, mouse or human chip and whether the chip definition is made using probe mappings from Unigene, RefSeq, Entrez Gene or the original Affymetrix definitions. It is also irrespective of whether both Present and Absent, or just Present, Calls from the MAS5 algorithm are used to filter genelists, and this conclusion holds for genes of differing intensities. We also reach the same conclusion after assigning genes to be differentially expressed using t-statistics, although this approach results in a large amount of false positives in the sets of genes identified due to the small numbers of replicates typically used in microarray experiments. The major calibration uncertainty that biologists need to consider when analysing Affymetrix data is how their multiple probe values are condensed into one expression measure.</description><subject>Algorithms</subject><subject>Calibration</subject><subject>Data Interpretation, Statistical</subject><subject>DNA microarrays</subject><subject>Evaluation</subject><subject>Gene Expression Profiling - instrumentation</subject><subject>Gene Expression Profiling - standards</subject><subject>Oligonucleotide Array Sequence Analysis - instrumentation</subject><subject>Oligonucleotide Array Sequence Analysis - standards</subject><subject>Quality Control</subject><subject>Reproducibility of Results</subject><subject>Sensitivity and Specificity</subject><subject>United Kingdom</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2007</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNqFkktr3DAQgE1paR7tubdiKBR6cKKnZV0K2yVNFwKFNj2LsSTvarGtreQN2X9fOV7SGFIqHyRmPn0WM5Nl7zC6wLgqLzETuCAY8aIqsOQvstPHyMsn55PsLMYtQlhUiL_OTrDgJUGoOs1ur-IAdevixvXrHPIOtj7kGvbR5r7JjYs62B30-pC7Ph82NuVaVwcYnO9HYtE0h84Owd3n17a3y43bxTfZqwbaaN8e9_Ps19er2-W34ub79Wq5uCnqklVDYbCx0JTSUCIp01wIjUqOieSoBAZEN0xojUAbrhkhmljOpZbGcFMzYjU9z1aT13jYql1wHYSD8uDUQ8CHtYIwON1aBUIAo4QyIwxjdVVxzgCASyzTomVyfZ5cu33dWaNtPwRoZ9J5pncbtfZ3CkvECKuS4MskqJ3_h2Ce0b5TY4fU2CFVJRFPko_HVwT_e2_joLrUAdu20Fu_j0qgUhDG6X9BgiiVHI_P-jCBa0hlcH3j08_1CKsFLrmQCFOcqItnqPQZ2znte9u4FJ9d-DS7kJjB3g_rNDhRrX7-mLOXE6uDjzHY5rEoGKlxjJ8pw_unzfjLH-eW_gHcJOt8</recordid><startdate>20070611</startdate><enddate>20070611</enddate><creator>Harrison, Andrew P</creator><creator>Johnston, Caroline E</creator><creator>Orengo, Christine A</creator><general>BioMed Central Ltd</general><general>BioMed Central</general><general>BMC</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>7QO</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20070611</creationdate><title>Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips</title><author>Harrison, Andrew P ; Johnston, Caroline E ; Orengo, Christine A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-b648t-d1deaf69d32934c577c065129506a4a2cf47cc0acd5c422c2e559c9dd5db42ec3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2007</creationdate><topic>Algorithms</topic><topic>Calibration</topic><topic>Data Interpretation, Statistical</topic><topic>DNA microarrays</topic><topic>Evaluation</topic><topic>Gene Expression Profiling - instrumentation</topic><topic>Gene Expression Profiling - standards</topic><topic>Oligonucleotide Array Sequence Analysis - instrumentation</topic><topic>Oligonucleotide Array Sequence Analysis - standards</topic><topic>Quality Control</topic><topic>Reproducibility of Results</topic><topic>Sensitivity and Specificity</topic><topic>United Kingdom</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Harrison, Andrew P</creatorcontrib><creatorcontrib>Johnston, Caroline E</creatorcontrib><creatorcontrib>Orengo, Christine A</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Science</collection><collection>Biotechnology Research Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Harrison, Andrew P</au><au>Johnston, Caroline E</au><au>Orengo, Christine A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips</atitle><jtitle>BMC bioinformatics</jtitle><addtitle>BMC Bioinformatics</addtitle><date>2007-06-11</date><risdate>2007</risdate><volume>8</volume><issue>1</issue><spage>195</spage><epage>195</epage><pages>195-195</pages><artnum>195</artnum><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>Affymetrix GeneChips are a popular platform for performing whole-genome experiments on the transcriptome. There are a range of different calibration steps, and users are presented with choices of different background subtractions, normalisations and expression measures. We wished to establish which of the calibration steps resulted in the biggest uncertainty in the sets of genes reported to be differentially expressed. Our results indicate that the sets of genes identified as being most significantly differentially expressed, as estimated by the z-score of fold change, is relatively insensitive to the choice of background subtraction and normalisation. However, the contents of the gene list are most sensitive to the choice of expression measure. This is irrespective of whether the experiment uses a rat, mouse or human chip and whether the chip definition is made using probe mappings from Unigene, RefSeq, Entrez Gene or the original Affymetrix definitions. It is also irrespective of whether both Present and Absent, or just Present, Calls from the MAS5 algorithm are used to filter genelists, and this conclusion holds for genes of differing intensities. We also reach the same conclusion after assigning genes to be differentially expressed using t-statistics, although this approach results in a large amount of false positives in the sets of genes identified due to the small numbers of replicates typically used in microarray experiments. The major calibration uncertainty that biologists need to consider when analysing Affymetrix data is how their multiple probe values are condensed into one expression measure.</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>17562008</pmid><doi>10.1186/1471-2105-8-195</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1471-2105
ispartof BMC bioinformatics, 2007-06, Vol.8 (1), p.195-195, Article 195
issn 1471-2105
1471-2105
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_a77a43234d7d44b88554aaa591999936
source PubMed Central
subjects Algorithms
Calibration
Data Interpretation, Statistical
DNA microarrays
Evaluation
Gene Expression Profiling - instrumentation
Gene Expression Profiling - standards
Oligonucleotide Array Sequence Analysis - instrumentation
Oligonucleotide Array Sequence Analysis - standards
Quality Control
Reproducibility of Results
Sensitivity and Specificity
United Kingdom
title Establishing a major cause of discrepancy in the calibration of Affymetrix GeneChips
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T11%3A42%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Establishing%20a%20major%20cause%20of%20discrepancy%20in%20the%20calibration%20of%20Affymetrix%20GeneChips&rft.jtitle=BMC%20bioinformatics&rft.au=Harrison,%20Andrew%20P&rft.date=2007-06-11&rft.volume=8&rft.issue=1&rft.spage=195&rft.epage=195&rft.pages=195-195&rft.artnum=195&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/1471-2105-8-195&rft_dat=%3Cgale_doaj_%3EA165790131%3C/gale_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-b648t-d1deaf69d32934c577c065129506a4a2cf47cc0acd5c422c2e559c9dd5db42ec3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=20339518&rft_id=info:pmid/17562008&rft_galeid=A165790131&rfr_iscdi=true