Loading…
analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data
Data associated with germplasm collections are typically large and multivariate with a considerable number of descriptors measured on each of many accessions. Pattern analysis methods of clustering and ordination have been identified as techniques for statistically evaluating the available diversity...
Saved in:
Published in: | Euphytica 1997, Vol.95 (1), p.27-38 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | 38 |
container_issue | 1 |
container_start_page | 27 |
container_title | Euphytica |
container_volume | 95 |
creator | Harch, B.D Basford, K.E DeLacy, I.H Lawrence, P.K |
description | Data associated with germplasm collections are typically large and multivariate with a considerable number of descriptors measured on each of many accessions. Pattern analysis methods of clustering and ordination have been identified as techniques for statistically evaluating the available diversity in germplasm data. While used in many studies, the approaches have not dealt explicitly with the computational consequences of large data sets (i.e. greater than 5000 accessions). To consider the application of these techniques to germplasm evaluation data, 11328 accessions of groundnut (Arachis hypogaea L) from the International Research Institute for the Semi-Arid Tropics, Andhra Pradesh, India were examined. Data for nine quantitative descriptors measured in the rainy and post-rainy growing seasons were used. The ordination technique of principal component analysis was used to reduce the dimensionality of the germplasm data. The identification of phenotypically similar groups of accessions within large scale data via the computationally intensive hierarchical clustering techniques was not feasible and non-hierarchical techniques had to be used. Finite mixture models that maximise the likelihood of an accession belonging to a cluster were used to cluster the accessions in this collection. The patterns of response for the different growing seasons were found to be highly correlated. However, in relating the results to passport and other characterisation and evaluation descriptors, the observed patterns did not appear to be related to taxonomy or any other well known characteristics of groundnut.[PUBLICATION ABSTRACT] |
doi_str_mv | 10.1023/A:1002971207770 |
format | article |
fullrecord | <record><control><sourceid>proquest_pasca</sourceid><recordid>TN_cdi_proquest_miscellaneous_16063798</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2106245981</sourcerecordid><originalsourceid>FETCH-LOGICAL-f179t-e51ecf2237cd682e7d0bc021f4194efd9a2c5929921a498925bed624486a21613</originalsourceid><addsrcrecordid>eNpd0UFv1DAQBeAIgcRSOHPEQqiCQ5bxOLFjbquKlkorcWh7jqaOnU1x4q3tsNp7f3iDtidOc_nmjfSmKD5yWHNA8X3zgwOgVhxBKQWvihWvlShrkPC6WAHwqkQh5NviXUoPAKBVDaviiSbyxzQkFhzzFHvLkiFvWUeZWKY_dmIuhpHlnWWHEH3H-hjmqZvmzL5uIpndsrs77kNPlth2_Y31No57T2lkJnhvTR7CtGbXa3Z7COWBjuxxpikPmfLw93TnffHGkU_2w8s8K-4uf95e_Cq3v6-uLzbb0nGlc2lrbo1DFMp0skGrOrg3gNxVXFfWdZrQ1Bq1Rk6VbjTW97aTWFWNJOSSi7Pi_JS7j-Fxtim345CM9Z4mG-bUcglSKN0s8PN_8CHMcWkqtariWCstYUFfXhD9q8xFmsyQ2n0cRorHFhsQTaUW9unEHIWW-riQuxsELgCb5U-NEs9YYIVd</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>741257960</pqid></control><display><type>article</type><title>analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data</title><source>Springer Nature</source><creator>Harch, B.D ; Basford, K.E ; DeLacy, I.H ; Lawrence, P.K</creator><creatorcontrib>Harch, B.D ; Basford, K.E ; DeLacy, I.H ; Lawrence, P.K</creatorcontrib><description>Data associated with germplasm collections are typically large and multivariate with a considerable number of descriptors measured on each of many accessions. Pattern analysis methods of clustering and ordination have been identified as techniques for statistically evaluating the available diversity in germplasm data. While used in many studies, the approaches have not dealt explicitly with the computational consequences of large data sets (i.e. greater than 5000 accessions). To consider the application of these techniques to germplasm evaluation data, 11328 accessions of groundnut (Arachis hypogaea L) from the International Research Institute for the Semi-Arid Tropics, Andhra Pradesh, India were examined. Data for nine quantitative descriptors measured in the rainy and post-rainy growing seasons were used. The ordination technique of principal component analysis was used to reduce the dimensionality of the germplasm data. The identification of phenotypically similar groups of accessions within large scale data via the computationally intensive hierarchical clustering techniques was not feasible and non-hierarchical techniques had to be used. Finite mixture models that maximise the likelihood of an accession belonging to a cluster were used to cluster the accessions in this collection. The patterns of response for the different growing seasons were found to be highly correlated. However, in relating the results to passport and other characterisation and evaluation descriptors, the observed patterns did not appear to be related to taxonomy or any other well known characteristics of groundnut.[PUBLICATION ABSTRACT]</description><identifier>ISSN: 0014-2336</identifier><identifier>EISSN: 1573-5060</identifier><identifier>DOI: 10.1023/A:1002971207770</identifier><identifier>CODEN: EUPHAA</identifier><language>eng</language><publisher>Dordrecht: Springer</publisher><subject>agronomic traits ; Agronomy. Soil science and plant productions ; Arachis hypogaea ; Biological and medical sciences ; descriptors ; emergence ; flowering date ; Fundamental and applied biological sciences. Psychology ; Generalities. Genetics. Plant material ; Genetic resources, diversity ; genetic variation ; Genetics ; Genetics and breeding of economic plants ; Germplasm ; Growing season ; leaves ; length ; Mathematical models ; Ordination ; Peanuts ; phenetics ; phenotype ; plant genetic resources ; Plant material ; pods ; post rainy season ; principal component analysis ; Principal components analysis ; quantitative traits ; seasonal variation ; seed weight ; seeds ; statistical analysis ; Studies ; taxonomy ; Tropical environments ; wet season ; width</subject><ispartof>Euphytica, 1997, Vol.95 (1), p.27-38</ispartof><rights>1997 INIST-CNRS</rights><rights>Kluwer Academic Publishers 1997</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4024,27923,27924,27925</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=2803847$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Harch, B.D</creatorcontrib><creatorcontrib>Basford, K.E</creatorcontrib><creatorcontrib>DeLacy, I.H</creatorcontrib><creatorcontrib>Lawrence, P.K</creatorcontrib><title>analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data</title><title>Euphytica</title><description>Data associated with germplasm collections are typically large and multivariate with a considerable number of descriptors measured on each of many accessions. Pattern analysis methods of clustering and ordination have been identified as techniques for statistically evaluating the available diversity in germplasm data. While used in many studies, the approaches have not dealt explicitly with the computational consequences of large data sets (i.e. greater than 5000 accessions). To consider the application of these techniques to germplasm evaluation data, 11328 accessions of groundnut (Arachis hypogaea L) from the International Research Institute for the Semi-Arid Tropics, Andhra Pradesh, India were examined. Data for nine quantitative descriptors measured in the rainy and post-rainy growing seasons were used. The ordination technique of principal component analysis was used to reduce the dimensionality of the germplasm data. The identification of phenotypically similar groups of accessions within large scale data via the computationally intensive hierarchical clustering techniques was not feasible and non-hierarchical techniques had to be used. Finite mixture models that maximise the likelihood of an accession belonging to a cluster were used to cluster the accessions in this collection. The patterns of response for the different growing seasons were found to be highly correlated. However, in relating the results to passport and other characterisation and evaluation descriptors, the observed patterns did not appear to be related to taxonomy or any other well known characteristics of groundnut.[PUBLICATION ABSTRACT]</description><subject>agronomic traits</subject><subject>Agronomy. Soil science and plant productions</subject><subject>Arachis hypogaea</subject><subject>Biological and medical sciences</subject><subject>descriptors</subject><subject>emergence</subject><subject>flowering date</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>Generalities. Genetics. Plant material</subject><subject>Genetic resources, diversity</subject><subject>genetic variation</subject><subject>Genetics</subject><subject>Genetics and breeding of economic plants</subject><subject>Germplasm</subject><subject>Growing season</subject><subject>leaves</subject><subject>length</subject><subject>Mathematical models</subject><subject>Ordination</subject><subject>Peanuts</subject><subject>phenetics</subject><subject>phenotype</subject><subject>plant genetic resources</subject><subject>Plant material</subject><subject>pods</subject><subject>post rainy season</subject><subject>principal component analysis</subject><subject>Principal components analysis</subject><subject>quantitative traits</subject><subject>seasonal variation</subject><subject>seed weight</subject><subject>seeds</subject><subject>statistical analysis</subject><subject>Studies</subject><subject>taxonomy</subject><subject>Tropical environments</subject><subject>wet season</subject><subject>width</subject><issn>0014-2336</issn><issn>1573-5060</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1997</creationdate><recordtype>article</recordtype><recordid>eNpd0UFv1DAQBeAIgcRSOHPEQqiCQ5bxOLFjbquKlkorcWh7jqaOnU1x4q3tsNp7f3iDtidOc_nmjfSmKD5yWHNA8X3zgwOgVhxBKQWvihWvlShrkPC6WAHwqkQh5NviXUoPAKBVDaviiSbyxzQkFhzzFHvLkiFvWUeZWKY_dmIuhpHlnWWHEH3H-hjmqZvmzL5uIpndsrs77kNPlth2_Y31No57T2lkJnhvTR7CtGbXa3Z7COWBjuxxpikPmfLw93TnffHGkU_2w8s8K-4uf95e_Cq3v6-uLzbb0nGlc2lrbo1DFMp0skGrOrg3gNxVXFfWdZrQ1Bq1Rk6VbjTW97aTWFWNJOSSi7Pi_JS7j-Fxtim345CM9Z4mG-bUcglSKN0s8PN_8CHMcWkqtariWCstYUFfXhD9q8xFmsyQ2n0cRorHFhsQTaUW9unEHIWW-riQuxsELgCb5U-NEs9YYIVd</recordid><startdate>1997</startdate><enddate>1997</enddate><creator>Harch, B.D</creator><creator>Basford, K.E</creator><creator>DeLacy, I.H</creator><creator>Lawrence, P.K</creator><general>Springer</general><general>Springer Nature B.V</general><scope>FBQ</scope><scope>IQODW</scope><scope>3V.</scope><scope>7SN</scope><scope>7SS</scope><scope>7T7</scope><scope>7TM</scope><scope>7X2</scope><scope>7XB</scope><scope>88I</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>M0K</scope><scope>M2P</scope><scope>M7N</scope><scope>P64</scope><scope>PATMY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PYCSY</scope><scope>Q9U</scope><scope>RC3</scope></search><sort><creationdate>1997</creationdate><title>analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data</title><author>Harch, B.D ; Basford, K.E ; DeLacy, I.H ; Lawrence, P.K</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-f179t-e51ecf2237cd682e7d0bc021f4194efd9a2c5929921a498925bed624486a21613</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1997</creationdate><topic>agronomic traits</topic><topic>Agronomy. Soil science and plant productions</topic><topic>Arachis hypogaea</topic><topic>Biological and medical sciences</topic><topic>descriptors</topic><topic>emergence</topic><topic>flowering date</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>Generalities. Genetics. Plant material</topic><topic>Genetic resources, diversity</topic><topic>genetic variation</topic><topic>Genetics</topic><topic>Genetics and breeding of economic plants</topic><topic>Germplasm</topic><topic>Growing season</topic><topic>leaves</topic><topic>length</topic><topic>Mathematical models</topic><topic>Ordination</topic><topic>Peanuts</topic><topic>phenetics</topic><topic>phenotype</topic><topic>plant genetic resources</topic><topic>Plant material</topic><topic>pods</topic><topic>post rainy season</topic><topic>principal component analysis</topic><topic>Principal components analysis</topic><topic>quantitative traits</topic><topic>seasonal variation</topic><topic>seed weight</topic><topic>seeds</topic><topic>statistical analysis</topic><topic>Studies</topic><topic>taxonomy</topic><topic>Tropical environments</topic><topic>wet season</topic><topic>width</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Harch, B.D</creatorcontrib><creatorcontrib>Basford, K.E</creatorcontrib><creatorcontrib>DeLacy, I.H</creatorcontrib><creatorcontrib>Lawrence, P.K</creatorcontrib><collection>AGRIS</collection><collection>Pascal-Francis</collection><collection>ProQuest Central (Corporate)</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Nucleic Acids Abstracts</collection><collection>Agricultural Science Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Science Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Agricultural & Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>Agriculture Science Database</collection><collection>ProQuest Science Journals</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Environmental Science Collection</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><jtitle>Euphytica</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Harch, B.D</au><au>Basford, K.E</au><au>DeLacy, I.H</au><au>Lawrence, P.K</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data</atitle><jtitle>Euphytica</jtitle><date>1997</date><risdate>1997</risdate><volume>95</volume><issue>1</issue><spage>27</spage><epage>38</epage><pages>27-38</pages><issn>0014-2336</issn><eissn>1573-5060</eissn><coden>EUPHAA</coden><abstract>Data associated with germplasm collections are typically large and multivariate with a considerable number of descriptors measured on each of many accessions. Pattern analysis methods of clustering and ordination have been identified as techniques for statistically evaluating the available diversity in germplasm data. While used in many studies, the approaches have not dealt explicitly with the computational consequences of large data sets (i.e. greater than 5000 accessions). To consider the application of these techniques to germplasm evaluation data, 11328 accessions of groundnut (Arachis hypogaea L) from the International Research Institute for the Semi-Arid Tropics, Andhra Pradesh, India were examined. Data for nine quantitative descriptors measured in the rainy and post-rainy growing seasons were used. The ordination technique of principal component analysis was used to reduce the dimensionality of the germplasm data. The identification of phenotypically similar groups of accessions within large scale data via the computationally intensive hierarchical clustering techniques was not feasible and non-hierarchical techniques had to be used. Finite mixture models that maximise the likelihood of an accession belonging to a cluster were used to cluster the accessions in this collection. The patterns of response for the different growing seasons were found to be highly correlated. However, in relating the results to passport and other characterisation and evaluation descriptors, the observed patterns did not appear to be related to taxonomy or any other well known characteristics of groundnut.[PUBLICATION ABSTRACT]</abstract><cop>Dordrecht</cop><pub>Springer</pub><doi>10.1023/A:1002971207770</doi><tpages>12</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0014-2336 |
ispartof | Euphytica, 1997, Vol.95 (1), p.27-38 |
issn | 0014-2336 1573-5060 |
language | eng |
recordid | cdi_proquest_miscellaneous_16063798 |
source | Springer Nature |
subjects | agronomic traits Agronomy. Soil science and plant productions Arachis hypogaea Biological and medical sciences descriptors emergence flowering date Fundamental and applied biological sciences. Psychology Generalities. Genetics. Plant material Genetic resources, diversity genetic variation Genetics Genetics and breeding of economic plants Germplasm Growing season leaves length Mathematical models Ordination Peanuts phenetics phenotype plant genetic resources Plant material pods post rainy season principal component analysis Principal components analysis quantitative traits seasonal variation seed weight seeds statistical analysis Studies taxonomy Tropical environments wet season width |
title | analysis of large scale data taken from the world groundnut (Arachis hypogaea L.) germplasm collection. I. Two-way quantitative data |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T03%3A05%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pasca&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=analysis%20of%20large%20scale%20data%20taken%20from%20the%20world%20groundnut%20(Arachis%20hypogaea%20L.)%20germplasm%20collection.%20I.%20Two-way%20quantitative%20data&rft.jtitle=Euphytica&rft.au=Harch,%20B.D&rft.date=1997&rft.volume=95&rft.issue=1&rft.spage=27&rft.epage=38&rft.pages=27-38&rft.issn=0014-2336&rft.eissn=1573-5060&rft.coden=EUPHAA&rft_id=info:doi/10.1023/A:1002971207770&rft_dat=%3Cproquest_pasca%3E2106245981%3C/proquest_pasca%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-f179t-e51ecf2237cd682e7d0bc021f4194efd9a2c5929921a498925bed624486a21613%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=741257960&rft_id=info:pmid/&rfr_iscdi=true |