Guide for protein fold change and p-value calculation for non-experts in proteomics

Proteomics studies generate tables with thousands of entries. A significant component of being a proteomics scientist is the ability to process these tables to identify regulated proteins. Many bioinformatics tools are freely available for the community, some of which within reach for scientists wit...

Bibliographic Details
Published in: Molecular omics 2020-12, Vol.16 (6), p.573-582
Main Authors: Aguilan, Jennifer T, Kulej, Katarzyna, Sidoli, Simone
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c328t-ba315b524769c59b11119fafba463f6cfed5f6aad2828ef81faf872923472dcb3
cites cdi_FETCH-LOGICAL-c328t-ba315b524769c59b11119fafba463f6cfed5f6aad2828ef81faf872923472dcb3
container_end_page 582
container_issue 6
container_start_page 573
container_title Molecular omics
container_volume 16
creator Aguilan, Jennifer T
Kulej, Katarzyna
Sidoli, Simone
description Proteomics studies generate tables with thousands of entries. A significant component of being a proteomics scientist is the ability to process these tables to identify regulated proteins. Many bioinformatics tools are freely available for the community, some of which are within reach for scientists with limited or no background in programming and statistics. However, proteomics has become popular in most other biological and biomedical disciplines, resulting in more and more studies where data processing is delegated to specialists who are not lead authors of the scientific project. This creates a risk or at least a limiting factor, as the biological interpretation of a dataset is contingent on a third-party specialist transforming data without the input of the project leader. We acknowledge in advance that dedicated scripts and software have a higher level of sophistication; but we hereby claim that the approach we describe makes proteomics data processing immediately accessible to every scientist. In this paper, we describe key steps of the typical data transformation, normalization and statistics in proteomics data analysis using a simple spreadsheet. This manuscript aims to demonstrate to those who are not familiar with the math and statistics behind these workflows that a proteomics dataset can be processed, simplified and interpreted in software like Microsoft Excel. With this, we aim to reach the community of non-specialists in proteomics to find a common language and illustrate the basic steps of -omics data processing.
doi_str_mv 10.1039/d0mo00087f
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2445967606</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2445967606</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-ba315b524769c59b11119fafba463f6cfed5f6aad2828ef81faf872923472dcb3</originalsourceid><addsrcrecordid>eNpNkMtOwzAQRS0EolXphg9AWSKkgF9x7CUqtCAVdQGsI8cPCHLsECcI_h7TFsRsZqQ5987oAnCK4CWCRFxp2AYIIS_tAZjiAhU5RZwe_psnYB7jW2KQwBxjfgwmBAvGS0qm4HE1NtpkNvRZ14fBND7NTmfqVfoXk0mvsy7_kG40mZJOjU4OTfBb3gefm8_O9EPMkmwrD22j4gk4stJFM9_3GXhe3j4t7vL1ZnW_uF7nimA-5LUkqKgLTEsmVCFqlEpYaWtJGbFMWaMLy6TU6WtuLEdpx0ssMKEl1qomM3C-802n30cTh6ptojLOSW_CGCtMaSFYySBL6MUOVX2IsTe26vqmlf1XhWD1k2N1Ax822xyXCT7b-451a_Qf-psa-QaprG3t</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2445967606</pqid></control><display><type>article</type><title>Guide for protein fold change and p-value calculation for non-experts in proteomics</title><source>Royal Society of Chemistry</source><creator>Aguilan, Jennifer T ; Kulej, Katarzyna ; Sidoli, Simone</creator><creatorcontrib>Aguilan, Jennifer T ; Kulej, Katarzyna ; Sidoli, Simone</creatorcontrib><description>Proteomics studies generate tables with thousands of entries. A significant component of being a proteomics scientist is the ability to process these tables to identify regulated proteins. Many bioinformatics tools are freely available for the community, some of which within reach for scientists with limited or no background in programming and statistics. However, proteomics has become popular in most other biological and biomedical disciplines, resulting in more and more studies where data processing is delegated to specialists that are not lead authors of the scientific project. 
This creates a risk or at least a limiting factor, as the biological interpretation of a dataset is contingent of a third-party specialist transforming data without the input of the project leader. We acknowledge in advance that dedicated scripts and software have a higher level of sophistication; but we hereby claim that the approach we describe makes proteomics data processing immediately accessible to every scientist. In this paper, we describe key steps of the typical data transformation, normalization and statistics in proteomics data analysis using a simple spreadsheet. This manuscript aims to demonstrate to those who are not familiar with the math and statistics behind these workflows that a proteomics dataset can be processed, simplified and interpreted in software like Microsoft Excel. With this, we aim to reach the community of non-specialists in proteomics to find a common language and illustrate the basic steps of -omics data processing.</description><identifier>ISSN: 2515-4184</identifier><identifier>EISSN: 2515-4184</identifier><identifier>DOI: 10.1039/d0mo00087f</identifier><identifier>PMID: 32968743</identifier><language>eng</language><publisher>England</publisher><ispartof>Molecular omics, 2020-12, Vol.16 (6), p.573-582</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-ba315b524769c59b11119fafba463f6cfed5f6aad2828ef81faf872923472dcb3</citedby><cites>FETCH-LOGICAL-c328t-ba315b524769c59b11119fafba463f6cfed5f6aad2828ef81faf872923472dcb3</cites><orcidid>0000-0001-9073-6641 ; 0000-0002-0221-354X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32968743$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Aguilan, 
Jennifer T</creatorcontrib><creatorcontrib>Kulej, Katarzyna</creatorcontrib><creatorcontrib>Sidoli, Simone</creatorcontrib><title>Guide for protein fold change and p-value calculation for non-experts in proteomics</title><title>Molecular omics</title><addtitle>Mol Omics</addtitle><description>Proteomics studies generate tables with thousands of entries. A significant component of being a proteomics scientist is the ability to process these tables to identify regulated proteins. Many bioinformatics tools are freely available for the community, some of which within reach for scientists with limited or no background in programming and statistics. However, proteomics has become popular in most other biological and biomedical disciplines, resulting in more and more studies where data processing is delegated to specialists that are not lead authors of the scientific project. This creates a risk or at least a limiting factor, as the biological interpretation of a dataset is contingent of a third-party specialist transforming data without the input of the project leader. We acknowledge in advance that dedicated scripts and software have a higher level of sophistication; but we hereby claim that the approach we describe makes proteomics data processing immediately accessible to every scientist. In this paper, we describe key steps of the typical data transformation, normalization and statistics in proteomics data analysis using a simple spreadsheet. This manuscript aims to demonstrate to those who are not familiar with the math and statistics behind these workflows that a proteomics dataset can be processed, simplified and interpreted in software like Microsoft Excel. 
With this, we aim to reach the community of non-specialists in proteomics to find a common language and illustrate the basic steps of -omics data processing.</description><issn>2515-4184</issn><issn>2515-4184</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNpNkMtOwzAQRS0EolXphg9AWSKkgF9x7CUqtCAVdQGsI8cPCHLsECcI_h7TFsRsZqQ5987oAnCK4CWCRFxp2AYIIS_tAZjiAhU5RZwe_psnYB7jW2KQwBxjfgwmBAvGS0qm4HE1NtpkNvRZ14fBND7NTmfqVfoXk0mvsy7_kG40mZJOjU4OTfBb3gefm8_O9EPMkmwrD22j4gk4stJFM9_3GXhe3j4t7vL1ZnW_uF7nimA-5LUkqKgLTEsmVCFqlEpYaWtJGbFMWaMLy6TU6WtuLEdpx0ssMKEl1qomM3C-802n30cTh6ptojLOSW_CGCtMaSFYySBL6MUOVX2IsTe26vqmlf1XhWD1k2N1Ax822xyXCT7b-451a_Qf-psa-QaprG3t</recordid><startdate>20201201</startdate><enddate>20201201</enddate><creator>Aguilan, Jennifer T</creator><creator>Kulej, Katarzyna</creator><creator>Sidoli, Simone</creator><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-9073-6641</orcidid><orcidid>https://orcid.org/0000-0002-0221-354X</orcidid></search><sort><creationdate>20201201</creationdate><title>Guide for protein fold change and p-value calculation for non-experts in proteomics</title><author>Aguilan, Jennifer T ; Kulej, Katarzyna ; Sidoli, Simone</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-ba315b524769c59b11119fafba463f6cfed5f6aad2828ef81faf872923472dcb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Aguilan, Jennifer T</creatorcontrib><creatorcontrib>Kulej, Katarzyna</creatorcontrib><creatorcontrib>Sidoli, Simone</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Molecular omics</jtitle></facets><delivery><delcategory>Remote Search 
Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Aguilan, Jennifer T</au><au>Kulej, Katarzyna</au><au>Sidoli, Simone</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Guide for protein fold change and p-value calculation for non-experts in proteomics</atitle><jtitle>Molecular omics</jtitle><addtitle>Mol Omics</addtitle><date>2020-12-01</date><risdate>2020</risdate><volume>16</volume><issue>6</issue><spage>573</spage><epage>582</epage><pages>573-582</pages><issn>2515-4184</issn><eissn>2515-4184</eissn><abstract>Proteomics studies generate tables with thousands of entries. A significant component of being a proteomics scientist is the ability to process these tables to identify regulated proteins. Many bioinformatics tools are freely available for the community, some of which within reach for scientists with limited or no background in programming and statistics. However, proteomics has become popular in most other biological and biomedical disciplines, resulting in more and more studies where data processing is delegated to specialists that are not lead authors of the scientific project. This creates a risk or at least a limiting factor, as the biological interpretation of a dataset is contingent of a third-party specialist transforming data without the input of the project leader. We acknowledge in advance that dedicated scripts and software have a higher level of sophistication; but we hereby claim that the approach we describe makes proteomics data processing immediately accessible to every scientist. In this paper, we describe key steps of the typical data transformation, normalization and statistics in proteomics data analysis using a simple spreadsheet. This manuscript aims to demonstrate to those who are not familiar with the math and statistics behind these workflows that a proteomics dataset can be processed, simplified and interpreted in software like Microsoft Excel. 
With this, we aim to reach the community of non-specialists in proteomics to find a common language and illustrate the basic steps of -omics data processing.</abstract><cop>England</cop><pmid>32968743</pmid><doi>10.1039/d0mo00087f</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0001-9073-6641</orcidid><orcidid>https://orcid.org/0000-0002-0221-354X</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 2515-4184
ispartof Molecular omics, 2020-12, Vol.16 (6), p.573-582
issn 2515-4184
2515-4184
language eng
recordid cdi_proquest_miscellaneous_2445967606
source Royal Society of Chemistry
title Guide for protein fold change and p-value calculation for non-experts in proteomics
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T10%3A35%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Guide%20for%20protein%20fold%20change%20and%20p-value%20calculation%20for%20non-experts%20in%20proteomics&rft.jtitle=Molecular%20omics&rft.au=Aguilan,%20Jennifer%20T&rft.date=2020-12-01&rft.volume=16&rft.issue=6&rft.spage=573&rft.epage=582&rft.pages=573-582&rft.issn=2515-4184&rft.eissn=2515-4184&rft_id=info:doi/10.1039/d0mo00087f&rft_dat=%3Cproquest_cross%3E2445967606%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c328t-ba315b524769c59b11119fafba463f6cfed5f6aad2828ef81faf872923472dcb3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2445967606&rft_id=info:pmid/32968743&rfr_iscdi=true