Loading…

CALANGO: A phylogeny-aware comparative genomics tool for discovering quantitative genotype-phenotype associations across species

Living species vary significantly in phenotype and genomic content. Sophisticated statistical methods linking genes with phenotypes within a species have led to breakthroughs in complex genetic diseases and genetic breeding. Despite the abundance of genomic and phenotypic data available for thousand...

Full description

Saved in:
Bibliographic Details
Published in:Patterns (New York, N.Y.) N.Y.), 2023-06, Vol.4 (6), p.100728-100728, Article 100728
Main Authors: Hongo, Jorge Augusto, de Castro, Giovanni Marques, Albuquerque Menezes, Alison Pelri, Rios Picorelli, Agnello César, Martins da Silva, Thieres Tayroni, Imada, Eddie Luidy, Marchionni, Luigi, Del-Bem, Luiz-Eduardo, Vieira Chaves, Anderson, Almeida, Gabriel Magno de Freitas, Campelo, Felipe, Lobo, Francisco Pereira
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Living species vary significantly in phenotype and genomic content. Sophisticated statistical methods linking genes with phenotypes within a species have led to breakthroughs in complex genetic diseases and genetic breeding. Despite the abundance of genomic and phenotypic data available for thousands of species, finding genotype-phenotype associations across species is challenging due to the non-independence of species data resulting from common ancestry. To address this, we present CALANGO (comparative analysis with annotation-based genomic components), a phylogeny-aware comparative genomics tool to find homologous regions and biological roles associated with quantitative phenotypes across species. In two case studies, CALANGO identified both known and previously unidentified genotype-phenotype associations. The first study revealed unknown aspects of the ecological interaction between Escherichia coli, its integrated bacteriophages, and the pathogenicity phenotype. The second identified an association between maximum height in angiosperms and the expansion of a reproductive mechanism that prevents inbreeding and increases genetic diversity, with implications for conservation biology and agriculture. [Display omitted] •Searches for genome-wide quantitative genotype-phenotype associations•Uses phylogeny-aware models to account for non-independence of species data•Detects associations of homologous regions and molecular functional convergences•Tested and documented open-source package Life is a complex and varied phenomenon with a wide range of phenotypic and genotypic variations. The search for the putative genetic mechanisms associated with—and eventually playing causal roles in—the phenotypic differences between species remains a key question in biology. We introduce CALANGO, a comparative genomics tool to search for genome-wide genotype-phenotype associations across species, taking advantage of the large amounts of phenotypic data available for species with complete genomes. Our tool uses phylogeny-aware linear models to account for the non-independence of species data and can be used to detect both homologous regions and molecular functional convergences associated with phenotypes. Through two case studies, we show how CALANGO can be used to investigate the genomic and functional evolution of distinct complex phenotypes and to select targets for experimental characterization. CALANGO is a comparative genomics tool that identifies genotype-phenotype ass
ISSN:2666-3899
2666-3899
DOI:10.1016/j.patter.2023.100728