Loading…

Use of a draft genome of coffee (C offea arabica ) to identify SNP s associated with caffeine content

Arabica coffee ( Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high‐ or low‐caffeine co...

Full description

Saved in:
Bibliographic Details
Published in:Plant biotechnology journal 2018-10, Vol.16 (10), p.1756-1766
Main Authors: Tran, Hue T.M., Ramaraj, Thiruvarangan, Furtado, Agnelo, Lee, Leonard Slade, Henry, Robert J.
Format: Article
Language:English
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c742-8e3bf1dba4a3d83c767352d6e32ccf59819951dfba0ada0ca158c934787f03c03
cites cdi_FETCH-LOGICAL-c742-8e3bf1dba4a3d83c767352d6e32ccf59819951dfba0ada0ca158c934787f03c03
container_end_page 1766
container_issue 10
container_start_page 1756
container_title Plant biotechnology journal
container_volume 16
creator Tran, Hue T.M.
Ramaraj, Thiruvarangan
Furtado, Agnelo
Lee, Leonard Slade
Henry, Robert J.
description Arabica coffee ( Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high‐ or low‐caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long‐read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCO s were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms ( SNP s) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway‐based analysis, 65 caffeine‐associated SNP s were discovered, among which 11 SNP s were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee.
doi_str_mv 10.1111/pbi.12912
format article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1111_pbi_12912</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1111_pbi_12912</sourcerecordid><originalsourceid>FETCH-LOGICAL-c742-8e3bf1dba4a3d83c767352d6e32ccf59819951dfba0ada0ca158c934787f03c03</originalsourceid><addsrcrecordid>eNo9kEtLA0EQhAdRMEYP_oM-msPGee7sHmXxBUEF43npnYeOmJ0wMyD5926i2Jcqiuo6fIRcMrpk011vh7BkvGX8iMyYrHWla8WP_72Up-Qs509KOatVPSPuLTuIHhBsQl_g3Y1xc0hM9N45uOpgbxAw4RAMwgJKhGDdWILfwevTC2TAnKMJWJyF71A-wOD0EkY3jYxlap6TE49f2V386Zys727X3UO1er5_7G5WldGSV40Tg2d2QInCNsLoWgvFbe0EN8artmFtq5j1A1K0SA0y1ZhWSN1oT4WhYk4Wv7MmxZyT8_02hQ2mXc9ov8fTT3j6Ax7xA40IV5k</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Use of a draft genome of coffee (C offea arabica ) to identify SNP s associated with caffeine content</title><source>PubMed Central (Open Access)</source><source>Wiley Online Library Open Access</source><source>Publicly Available Content Database</source><creator>Tran, Hue T.M. ; Ramaraj, Thiruvarangan ; Furtado, Agnelo ; Lee, Leonard Slade ; Henry, Robert J.</creator><creatorcontrib>Tran, Hue T.M. ; Ramaraj, Thiruvarangan ; Furtado, Agnelo ; Lee, Leonard Slade ; Henry, Robert J.</creatorcontrib><description>Arabica coffee ( Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high‐ or low‐caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long‐read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCO s were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms ( SNP s) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway‐based analysis, 65 caffeine‐associated SNP s were discovered, among which 11 SNP s were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee.</description><identifier>ISSN: 1467-7644</identifier><identifier>EISSN: 1467-7652</identifier><identifier>DOI: 10.1111/pbi.12912</identifier><language>eng</language><ispartof>Plant biotechnology journal, 2018-10, Vol.16 (10), p.1756-1766</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c742-8e3bf1dba4a3d83c767352d6e32ccf59819951dfba0ada0ca158c934787f03c03</citedby><cites>FETCH-LOGICAL-c742-8e3bf1dba4a3d83c767352d6e32ccf59819951dfba0ada0ca158c934787f03c03</cites><orcidid>0000-0002-4060-0292 ; 0000-0003-0606-0525</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Tran, Hue T.M.</creatorcontrib><creatorcontrib>Ramaraj, Thiruvarangan</creatorcontrib><creatorcontrib>Furtado, Agnelo</creatorcontrib><creatorcontrib>Lee, Leonard Slade</creatorcontrib><creatorcontrib>Henry, Robert J.</creatorcontrib><title>Use of a draft genome of coffee (C offea arabica ) to identify SNP s associated with caffeine content</title><title>Plant biotechnology journal</title><description>Arabica coffee ( Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high‐ or low‐caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long‐read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCO s were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms ( SNP s) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway‐based analysis, 65 caffeine‐associated SNP s were discovered, among which 11 SNP s were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee.</description><issn>1467-7644</issn><issn>1467-7652</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNo9kEtLA0EQhAdRMEYP_oM-msPGee7sHmXxBUEF43npnYeOmJ0wMyD5926i2Jcqiuo6fIRcMrpk011vh7BkvGX8iMyYrHWla8WP_72Up-Qs509KOatVPSPuLTuIHhBsQl_g3Y1xc0hM9N45uOpgbxAw4RAMwgJKhGDdWILfwevTC2TAnKMJWJyF71A-wOD0EkY3jYxlap6TE49f2V386Zys727X3UO1er5_7G5WldGSV40Tg2d2QInCNsLoWgvFbe0EN8artmFtq5j1A1K0SA0y1ZhWSN1oT4WhYk4Wv7MmxZyT8_02hQ2mXc9ov8fTT3j6Ax7xA40IV5k</recordid><startdate>201810</startdate><enddate>201810</enddate><creator>Tran, Hue T.M.</creator><creator>Ramaraj, Thiruvarangan</creator><creator>Furtado, Agnelo</creator><creator>Lee, Leonard Slade</creator><creator>Henry, Robert J.</creator><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-4060-0292</orcidid><orcidid>https://orcid.org/0000-0003-0606-0525</orcidid></search><sort><creationdate>201810</creationdate><title>Use of a draft genome of coffee (C offea arabica ) to identify SNP s associated with caffeine content</title><author>Tran, Hue T.M. ; Ramaraj, Thiruvarangan ; Furtado, Agnelo ; Lee, Leonard Slade ; Henry, Robert J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c742-8e3bf1dba4a3d83c767352d6e32ccf59819951dfba0ada0ca158c934787f03c03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tran, Hue T.M.</creatorcontrib><creatorcontrib>Ramaraj, Thiruvarangan</creatorcontrib><creatorcontrib>Furtado, Agnelo</creatorcontrib><creatorcontrib>Lee, Leonard Slade</creatorcontrib><creatorcontrib>Henry, Robert J.</creatorcontrib><collection>CrossRef</collection><jtitle>Plant biotechnology journal</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tran, Hue T.M.</au><au>Ramaraj, Thiruvarangan</au><au>Furtado, Agnelo</au><au>Lee, Leonard Slade</au><au>Henry, Robert J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Use of a draft genome of coffee (C offea arabica ) to identify SNP s associated with caffeine content</atitle><jtitle>Plant biotechnology journal</jtitle><date>2018-10</date><risdate>2018</risdate><volume>16</volume><issue>10</issue><spage>1756</spage><epage>1766</epage><pages>1756-1766</pages><issn>1467-7644</issn><eissn>1467-7652</eissn><abstract>Arabica coffee ( Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high‐ or low‐caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long‐read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCO s were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms ( SNP s) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway‐based analysis, 65 caffeine‐associated SNP s were discovered, among which 11 SNP s were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee.</abstract><doi>10.1111/pbi.12912</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-4060-0292</orcidid><orcidid>https://orcid.org/0000-0003-0606-0525</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1467-7644
ispartof Plant biotechnology journal, 2018-10, Vol.16 (10), p.1756-1766
issn 1467-7644
1467-7652
language eng
recordid cdi_crossref_primary_10_1111_pbi_12912
source PubMed Central (Open Access); Wiley Online Library Open Access; Publicly Available Content Database
title Use of a draft genome of coffee (C offea arabica ) to identify SNP s associated with caffeine content
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T13%3A15%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Use%20of%20a%20draft%20genome%20of%20coffee%20(C%20offea%20arabica%20)%20to%20identify%20SNP%20s%20associated%20with%20caffeine%20content&rft.jtitle=Plant%20biotechnology%20journal&rft.au=Tran,%20Hue%20T.M.&rft.date=2018-10&rft.volume=16&rft.issue=10&rft.spage=1756&rft.epage=1766&rft.pages=1756-1766&rft.issn=1467-7644&rft.eissn=1467-7652&rft_id=info:doi/10.1111/pbi.12912&rft_dat=%3Ccrossref%3E10_1111_pbi_12912%3C/crossref%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c742-8e3bf1dba4a3d83c767352d6e32ccf59819951dfba0ada0ca158c934787f03c03%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true