Loading…

analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data

Data associated with germplasm collections are typically large and multivariate with a considerable number of descriptors measured on each of many accessions. Pattern analysis methods of clustering and ordination have been identified as techniques for statistically evaluating the available diversity...

Full description

Saved in:
Bibliographic Details
Published in:Euphytica 1997, Vol.95 (1), p.27-38
Main Authors: Harch, B.D, Basford, K.E, DeLacy, I.H, Lawrence, P.K
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 38
container_issue 1
container_start_page 27
container_title Euphytica
container_volume 95
creator Harch, B.D
Basford, K.E
DeLacy, I.H
Lawrence, P.K
description Data associated with germplasm collections are typically large and multivariate with a considerable number of descriptors measured on each of many accessions. Pattern analysis methods of clustering and ordination have been identified as techniques for statistically evaluating the available diversity in germplasm data. While used in many studies, the approaches have not dealt explicitly with the computational consequences of large data sets (i.e. greater than 5000 accessions). To consider the application of these techniques to germplasm evaluation data, 11328 accessions of groundnut (Arachis hypogaea L) from the International Research Institute for the Semi-Arid Tropics, Andhra Pradesh, India were examined. Data for nine quantitative descriptors measured in the rainy and post-rainy growing seasons were used. The ordination technique of principal component analysis was used to reduce the dimensionality of the germplasm data. The identification of phenotypically similar groups of accessions within large scale data via the computationally intensive hierarchical clustering techniques was not feasible and non-hierarchical techniques had to be used. Finite mixture models that maximise the likelihood of an accession belonging to a cluster were used to cluster the accessions in this collection. The patterns of response for the different growing seasons were found to be highly correlated. However, in relating the results to passport and other characterisation and evaluation descriptors, the observed patterns did not appear to be related to taxonomy or any other well known characteristics of groundnut.[PUBLICATION ABSTRACT]
doi_str_mv 10.1023/A:1002971207770
format article
fullrecord <record><control><sourceid>proquest_pasca</sourceid><recordid>TN_cdi_proquest_miscellaneous_16063798</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2106245981</sourcerecordid><originalsourceid>FETCH-LOGICAL-f179t-e51ecf2237cd682e7d0bc021f4194efd9a2c5929921a498925bed624486a21613</originalsourceid><addsrcrecordid>eNpd0UFv1DAQBeAIgcRSOHPEQqiCQ5bxOLFjbquKlkorcWh7jqaOnU1x4q3tsNp7f3iDtidOc_nmjfSmKD5yWHNA8X3zgwOgVhxBKQWvihWvlShrkPC6WAHwqkQh5NviXUoPAKBVDaviiSbyxzQkFhzzFHvLkiFvWUeZWKY_dmIuhpHlnWWHEH3H-hjmqZvmzL5uIpndsrs77kNPlth2_Y31No57T2lkJnhvTR7CtGbXa3Z7COWBjuxxpikPmfLw93TnffHGkU_2w8s8K-4uf95e_Cq3v6-uLzbb0nGlc2lrbo1DFMp0skGrOrg3gNxVXFfWdZrQ1Bq1Rk6VbjTW97aTWFWNJOSSi7Pi_JS7j-Fxtim345CM9Z4mG-bUcglSKN0s8PN_8CHMcWkqtariWCstYUFfXhD9q8xFmsyQ2n0cRorHFhsQTaUW9unEHIWW-riQuxsELgCb5U-NEs9YYIVd</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>741257960</pqid></control><display><type>article</type><title>analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data</title><source>Springer Nature</source><creator>Harch, B.D ; Basford, K.E ; DeLacy, I.H ; Lawrence, P.K</creator><creatorcontrib>Harch, B.D ; Basford, K.E ; DeLacy, I.H ; Lawrence, P.K</creatorcontrib><description>Data associated with germplasm collections are typically large and multivariate with a considerable number of descriptors measured on each of many accessions. Pattern analysis methods of clustering and ordination have been identified as techniques for statistically evaluating the available diversity in germplasm data. While used in many studies, the approaches have not dealt explicitly with the computational consequences of large data sets (i.e. greater than 5000 accessions). To consider the application of these techniques to germplasm evaluation data, 11328 accessions of groundnut (Arachis hypogaea L) from the International Research Institute for the Semi-Arid Tropics, Andhra Pradesh, India were examined. Data for nine quantitative descriptors measured in the rainy and post-rainy growing seasons were used. The ordination technique of principal component analysis was used to reduce the dimensionality of the germplasm data. The identification of phenotypically similar groups of accessions within large scale data via the computationally intensive hierarchical clustering techniques was not feasible and non-hierarchical techniques had to be used. Finite mixture models that maximise the likelihood of an accession belonging to a cluster were used to cluster the accessions in this collection. The patterns of response for the different growing seasons were found to be highly correlated. However, in relating the results to passport and other characterisation and evaluation descriptors, the observed patterns did not appear to be related to taxonomy or any other well known characteristics of groundnut.[PUBLICATION ABSTRACT]</description><identifier>ISSN: 0014-2336</identifier><identifier>EISSN: 1573-5060</identifier><identifier>DOI: 10.1023/A:1002971207770</identifier><identifier>CODEN: EUPHAA</identifier><language>eng</language><publisher>Dordrecht: Springer</publisher><subject>agronomic traits ; Agronomy. Soil science and plant productions ; Arachis hypogaea ; Biological and medical sciences ; descriptors ; emergence ; flowering date ; Fundamental and applied biological sciences. Psychology ; Generalities. Genetics. Plant material ; Genetic resources, diversity ; genetic variation ; Genetics ; Genetics and breeding of economic plants ; Germplasm ; Growing season ; leaves ; length ; Mathematical models ; Ordination ; Peanuts ; phenetics ; phenotype ; plant genetic resources ; Plant material ; pods ; post rainy season ; principal component analysis ; Principal components analysis ; quantitative traits ; seasonal variation ; seed weight ; seeds ; statistical analysis ; Studies ; taxonomy ; Tropical environments ; wet season ; width</subject><ispartof>Euphytica, 1997, Vol.95 (1), p.27-38</ispartof><rights>1997 INIST-CNRS</rights><rights>Kluwer Academic Publishers 1997</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4024,27923,27924,27925</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=2803847$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Harch, B.D</creatorcontrib><creatorcontrib>Basford, K.E</creatorcontrib><creatorcontrib>DeLacy, I.H</creatorcontrib><creatorcontrib>Lawrence, P.K</creatorcontrib><title>analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data</title><title>Euphytica</title><description>Data associated with germplasm collections are typically large and multivariate with a considerable number of descriptors measured on each of many accessions. Pattern analysis methods of clustering and ordination have been identified as techniques for statistically evaluating the available diversity in germplasm data. While used in many studies, the approaches have not dealt explicitly with the computational consequences of large data sets (i.e. greater than 5000 accessions). To consider the application of these techniques to germplasm evaluation data, 11328 accessions of groundnut (Arachis hypogaea L) from the International Research Institute for the Semi-Arid Tropics, Andhra Pradesh, India were examined. Data for nine quantitative descriptors measured in the rainy and post-rainy growing seasons were used. The ordination technique of principal component analysis was used to reduce the dimensionality of the germplasm data. The identification of phenotypically similar groups of accessions within large scale data via the computationally intensive hierarchical clustering techniques was not feasible and non-hierarchical techniques had to be used. Finite mixture models that maximise the likelihood of an accession belonging to a cluster were used to cluster the accessions in this collection. The patterns of response for the different growing seasons were found to be highly correlated. However, in relating the results to passport and other characterisation and evaluation descriptors, the observed patterns did not appear to be related to taxonomy or any other well known characteristics of groundnut.[PUBLICATION ABSTRACT]</description><subject>agronomic traits</subject><subject>Agronomy. Soil science and plant productions</subject><subject>Arachis hypogaea</subject><subject>Biological and medical sciences</subject><subject>descriptors</subject><subject>emergence</subject><subject>flowering date</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>Generalities. Genetics. Plant material</subject><subject>Genetic resources, diversity</subject><subject>genetic variation</subject><subject>Genetics</subject><subject>Genetics and breeding of economic plants</subject><subject>Germplasm</subject><subject>Growing season</subject><subject>leaves</subject><subject>length</subject><subject>Mathematical models</subject><subject>Ordination</subject><subject>Peanuts</subject><subject>phenetics</subject><subject>phenotype</subject><subject>plant genetic resources</subject><subject>Plant material</subject><subject>pods</subject><subject>post rainy season</subject><subject>principal component analysis</subject><subject>Principal components analysis</subject><subject>quantitative traits</subject><subject>seasonal variation</subject><subject>seed weight</subject><subject>seeds</subject><subject>statistical analysis</subject><subject>Studies</subject><subject>taxonomy</subject><subject>Tropical environments</subject><subject>wet season</subject><subject>width</subject><issn>0014-2336</issn><issn>1573-5060</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1997</creationdate><recordtype>article</recordtype><recordid>eNpd0UFv1DAQBeAIgcRSOHPEQqiCQ5bxOLFjbquKlkorcWh7jqaOnU1x4q3tsNp7f3iDtidOc_nmjfSmKD5yWHNA8X3zgwOgVhxBKQWvihWvlShrkPC6WAHwqkQh5NviXUoPAKBVDaviiSbyxzQkFhzzFHvLkiFvWUeZWKY_dmIuhpHlnWWHEH3H-hjmqZvmzL5uIpndsrs77kNPlth2_Y31No57T2lkJnhvTR7CtGbXa3Z7COWBjuxxpikPmfLw93TnffHGkU_2w8s8K-4uf95e_Cq3v6-uLzbb0nGlc2lrbo1DFMp0skGrOrg3gNxVXFfWdZrQ1Bq1Rk6VbjTW97aTWFWNJOSSi7Pi_JS7j-Fxtim345CM9Z4mG-bUcglSKN0s8PN_8CHMcWkqtariWCstYUFfXhD9q8xFmsyQ2n0cRorHFhsQTaUW9unEHIWW-riQuxsELgCb5U-NEs9YYIVd</recordid><startdate>1997</startdate><enddate>1997</enddate><creator>Harch, B.D</creator><creator>Basford, K.E</creator><creator>DeLacy, I.H</creator><creator>Lawrence, P.K</creator><general>Springer</general><general>Springer Nature B.V</general><scope>FBQ</scope><scope>IQODW</scope><scope>3V.</scope><scope>7SN</scope><scope>7SS</scope><scope>7T7</scope><scope>7TM</scope><scope>7X2</scope><scope>7XB</scope><scope>88I</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>M0K</scope><scope>M2P</scope><scope>M7N</scope><scope>P64</scope><scope>PATMY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PYCSY</scope><scope>Q9U</scope><scope>RC3</scope></search><sort><creationdate>1997</creationdate><title>analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data</title><author>Harch, B.D ; Basford, K.E ; DeLacy, I.H ; Lawrence, P.K</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-f179t-e51ecf2237cd682e7d0bc021f4194efd9a2c5929921a498925bed624486a21613</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1997</creationdate><topic>agronomic traits</topic><topic>Agronomy. Soil science and plant productions</topic><topic>Arachis hypogaea</topic><topic>Biological and medical sciences</topic><topic>descriptors</topic><topic>emergence</topic><topic>flowering date</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>Generalities. Genetics. Plant material</topic><topic>Genetic resources, diversity</topic><topic>genetic variation</topic><topic>Genetics</topic><topic>Genetics and breeding of economic plants</topic><topic>Germplasm</topic><topic>Growing season</topic><topic>leaves</topic><topic>length</topic><topic>Mathematical models</topic><topic>Ordination</topic><topic>Peanuts</topic><topic>phenetics</topic><topic>phenotype</topic><topic>plant genetic resources</topic><topic>Plant material</topic><topic>pods</topic><topic>post rainy season</topic><topic>principal component analysis</topic><topic>Principal components analysis</topic><topic>quantitative traits</topic><topic>seasonal variation</topic><topic>seed weight</topic><topic>seeds</topic><topic>statistical analysis</topic><topic>Studies</topic><topic>taxonomy</topic><topic>Tropical environments</topic><topic>wet season</topic><topic>width</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Harch, B.D</creatorcontrib><creatorcontrib>Basford, K.E</creatorcontrib><creatorcontrib>DeLacy, I.H</creatorcontrib><creatorcontrib>Lawrence, P.K</creatorcontrib><collection>AGRIS</collection><collection>Pascal-Francis</collection><collection>ProQuest Central (Corporate)</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Nucleic Acids Abstracts</collection><collection>Agricultural Science Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Science Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>Agriculture Science Database</collection><collection>ProQuest Science Journals</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Environmental Science Collection</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><jtitle>Euphytica</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Harch, B.D</au><au>Basford, K.E</au><au>DeLacy, I.H</au><au>Lawrence, P.K</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data</atitle><jtitle>Euphytica</jtitle><date>1997</date><risdate>1997</risdate><volume>95</volume><issue>1</issue><spage>27</spage><epage>38</epage><pages>27-38</pages><issn>0014-2336</issn><eissn>1573-5060</eissn><coden>EUPHAA</coden><abstract>Data associated with germplasm collections are typically large and multivariate with a considerable number of descriptors measured on each of many accessions. Pattern analysis methods of clustering and ordination have been identified as techniques for statistically evaluating the available diversity in germplasm data. While used in many studies, the approaches have not dealt explicitly with the computational consequences of large data sets (i.e. greater than 5000 accessions). To consider the application of these techniques to germplasm evaluation data, 11328 accessions of groundnut (Arachis hypogaea L) from the International Research Institute for the Semi-Arid Tropics, Andhra Pradesh, India were examined. Data for nine quantitative descriptors measured in the rainy and post-rainy growing seasons were used. The ordination technique of principal component analysis was used to reduce the dimensionality of the germplasm data. The identification of phenotypically similar groups of accessions within large scale data via the computationally intensive hierarchical clustering techniques was not feasible and non-hierarchical techniques had to be used. Finite mixture models that maximise the likelihood of an accession belonging to a cluster were used to cluster the accessions in this collection. The patterns of response for the different growing seasons were found to be highly correlated. However, in relating the results to passport and other characterisation and evaluation descriptors, the observed patterns did not appear to be related to taxonomy or any other well known characteristics of groundnut.[PUBLICATION ABSTRACT]</abstract><cop>Dordrecht</cop><pub>Springer</pub><doi>10.1023/A:1002971207770</doi><tpages>12</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0014-2336
ispartof Euphytica, 1997, Vol.95 (1), p.27-38
issn 0014-2336
1573-5060
language eng
recordid cdi_proquest_miscellaneous_16063798
source Springer Nature
subjects agronomic traits
Agronomy. Soil science and plant productions
Arachis hypogaea
Biological and medical sciences
descriptors
emergence
flowering date
Fundamental and applied biological sciences. Psychology
Generalities. Genetics. Plant material
Genetic resources, diversity
genetic variation
Genetics
Genetics and breeding of economic plants
Germplasm
Growing season
leaves
length
Mathematical models
Ordination
Peanuts
phenetics
phenotype
plant genetic resources
Plant material
pods
post rainy season
principal component analysis
Principal components analysis
quantitative traits
seasonal variation
seed weight
seeds
statistical analysis
Studies
taxonomy
Tropical environments
wet season
width
title analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T03%3A05%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pasca&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=analysis%20of%20large%20scale%20data%20taken%20from%20the%20world%20groundnut%20(Arachis%20hypogaea%20L.)%20germplasm%20collection.%20I.%20Two-way%20quantitative%20data&rft.jtitle=Euphytica&rft.au=Harch,%20B.D&rft.date=1997&rft.volume=95&rft.issue=1&rft.spage=27&rft.epage=38&rft.pages=27-38&rft.issn=0014-2336&rft.eissn=1573-5060&rft.coden=EUPHAA&rft_id=info:doi/10.1023/A:1002971207770&rft_dat=%3Cproquest_pasca%3E2106245981%3C/proquest_pasca%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-f179t-e51ecf2237cd682e7d0bc021f4194efd9a2c5929921a498925bed624486a21613%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=741257960&rft_id=info:pmid/&rfr_iscdi=true