Loading…

Use of a draft genome of coffee (C offea arabica ) to identify SNP s associated with caffeine content

Arabica coffee ( Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high‐ or low‐caffeine co...

Full description

Saved in:
Bibliographic Details
Published in:Plant biotechnology journal 2018-10, Vol.16 (10), p.1756-1766
Main Authors: Tran, Hue T.M., Ramaraj, Thiruvarangan, Furtado, Agnelo, Lee, Leonard Slade, Henry, Robert J.
Format: Article
Language:English
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Arabica coffee ( Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high‐ or low‐caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long‐read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCO s were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms ( SNP s) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway‐based analysis, 65 caffeine‐associated SNP s were discovered, among which 11 SNP s were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee.
ISSN:1467-7644
1467-7652
DOI:10.1111/pbi.12912