Construction of Human Proteoform Families from 21 Tesla FT-ICR Mass Spectrometry Top-Down Proteomic Data
Identification of proteoforms, the different forms of a protein, is important to understand biological processes. A proteoform family is the set of different proteoforms from the same gene. We previously developed the software program Proteoform Suite, which constructs proteoform families and identi...
Published in: | Journal of proteome research 2020-10, Vol.20 (1), p.317-325 |
---|---|
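Main Authors: | Schaffer, Leah V.; Anderson, Lissa C.; Butcher, David S.; Shortreed, Michael R.; Miller, Rachel M.; Pavelec, Caitlin; Smith, Lloyd M. |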
Format: | Article |
Language: | English |
cited_by | |
---|---|
cites | |
container_end_page | 325 |
container_issue | 1 |
container_start_page | 317 |
container_title | Journal of proteome research |
container_volume | 20 |
creator | Schaffer, Leah V.; Anderson, Lissa C.; Butcher, David S.; Shortreed, Michael R.; Miller, Rachel M.; Pavelec, Caitlin; Smith, Lloyd M. |
description | Identification of proteoforms, the different forms of a protein, is important to understand biological processes. A proteoform family is the set of different proteoforms from the same gene. We previously developed the software program Proteoform Suite, which constructs proteoform families and identifies proteoforms by intact-mass analysis. Here, we have applied this approach to top-down proteomic data acquired at the National High Magnetic Field Laboratory 21 tesla FT-ICR mass spectrometer (data available on the MassIVE platform with identifier MSV000085978). We explored the ability to construct proteoform families and identify proteoforms from the high mass accuracy data that this instrument provides for a complex cell lysate sample from the MCF-7 human breast cancer cell line. 2830 experimental proteoforms were observed, of which 932 were identified, 44 were ambiguous, and 1854 were unidentified. Of the 932 unique identified proteoforms, 766 were identified by top-down MS2 analysis at 1% FDR using TDPortal and 166 were additional intact-mass identifications (~4.7% calculated global FDR) made using Proteoform Suite. We recently published a proteoform level schema to represent ambiguity in proteoform identifications. We implemented this proteoform level classification in Proteoform Suite for intact-mass identifications, which enables users to determine the ambiguity levels and sources of ambiguity for each intact-mass proteoform identification. |
doi_str_mv | 10.1021/acs.jproteome.0c00403 |
format | article |
fulltext | fulltext |
identifier | ISSN: 1535-3893 |
ispartof | Journal of proteome research, 2020-10, Vol.20 (1), p.317-325 |
issn | 1535-3893 1535-3907 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7775878 |
source | American Chemical Society:Jisc Collections:American Chemical Society Read & Publish Agreement 2022-2024 (Reading list) |
title | Construction of Human Proteoform Families from 21 Tesla FT-ICR Mass Spectrometry Top-Down Proteomic Data |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T20%3A45%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pubmedcentral&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Construction%20of%20Human%20Proteoform%20Families%20from%2021%20Tesla%20FT-ICR%20Mass%20Spectrometry%20Top-Down%20Proteomic%20Data&rft.jtitle=Journal%20of%20proteome%20research&rft.au=Schaffer,%20Leah%20V.&rft.date=2020-10-19&rft.volume=20&rft.issue=1&rft.spage=317&rft.epage=325&rft.pages=317-325&rft.issn=1535-3893&rft.eissn=1535-3907&rft_id=info:doi/10.1021/acs.jproteome.0c00403&rft_dat=%3Cpubmedcentral%3Epubmedcentral_primary_oai_pubmedcentral_nih_gov_7775878%3C/pubmedcentral%3E%3Cgrp_id%3Ecdi_FETCH-pubmedcentral_primary_oai_pubmedcentral_nih_gov_77758783%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/33074679&rfr_iscdi=true |