Loading…

Widespread stable noncanonical peptides identified by integrated analyses of ribosome profiling and ORF features

Studies have revealed dozens of functional peptides in putative ‘noncoding’ regions and raised the question of how many proteins are encoded by noncanonical open reading frames (ORFs). Here, we comprehensively annotate genome-wide translated ORFs across five eukaryotes ( human , mouse , zebrafish ,...

Full description

Saved in:
Bibliographic Details
Published in:Nature communications 2024-03, Vol.15 (1), p.1932-1932, Article 1932
Main Authors: Yang, Haiwang, Li, Qianru, Stroup, Emily K., Wang, Sheng, Ji, Zhe
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Studies have revealed dozens of functional peptides in putative ‘noncoding’ regions and raised the question of how many proteins are encoded by noncanonical open reading frames (ORFs). Here, we comprehensively annotate genome-wide translated ORFs across five eukaryotes ( human , mouse , zebrafish , worm , and yeast ) by analyzing ribosome profiling data. We develop a logistic regression model named PepScore based on ORF features (expected length, encoded domain, and conservation) to calculate the probability that the encoded peptide is stable in humans . Systematic ectopic expression validates PepScore and shows that stable complex-associating microproteins can be encoded in 5’/3’ untranslated regions and overlapping coding regions of mRNAs besides annotated noncoding RNAs. Stable noncanonical proteins follow conventional rules and localize to different subcellular compartments. Inhibition of proteasomal/lysosomal degradation pathways can stabilize some peptides especially those with moderate PepScores, but cannot rescue the expression of short ones with low PepScores suggesting they are directly degraded by cellular proteases. The majority of human noncanonical peptides with high PepScores show longer lengths but low conservation across species/mammals, and hundreds contain trait-associated genetic variants. Our study presents a statistical framework to identify stable noncanonical peptides in the genome and provides a valuable resource for functional characterization of noncanonical translation during development and disease. By developing computational algorithms, the authors annotated translated open reading frames in five eukaryotes and found many stable peptides are encoded by putative ‘noncoding’ regions of genomes.
ISSN:2041-1723
2041-1723
DOI:10.1038/s41467-024-46240-9