Loading…
Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data
A survey of presences and absences of specific species across multiple biogeographic units (or bioregions) are used in a broad area of biological studies from ecology to microbiology. Using binary presence-absence data, we evaluate species co-occurrences that help elucidate relationships among organ...
Saved in:
Published in: | BMC bioinformatics 2019-12, Vol.20 (Suppl 15), p.644-644, Article 644 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c559t-e1e27a281758b7a6b3820e34d3975d84e8064e5e08ff3c4913eef2a231f5daf13 |
---|---|
cites | cdi_FETCH-LOGICAL-c559t-e1e27a281758b7a6b3820e34d3975d84e8064e5e08ff3c4913eef2a231f5daf13 |
container_end_page | 644 |
container_issue | Suppl 15 |
container_start_page | 644 |
container_title | BMC bioinformatics |
container_volume | 20 |
creator | Chung, Neo Christopher Miasojedow, BłaŻej Startek, Michał Gambin, Anna |
description | A survey of presences and absences of specific species across multiple biogeographic units (or bioregions) are used in a broad area of biological studies from ecology to microbiology. Using binary presence-absence data, we evaluate species co-occurrences that help elucidate relationships among organisms and environments. To summarize similarity between occurrences of species, we routinely use the Jaccard/Tanimoto coefficient, which is the ratio of their intersection to their union. It is natural, then, to identify statistically significant Jaccard/Tanimoto coefficients, which suggest non-random co-occurrences of species. However, statistical hypothesis testing using this similarity coefficient has been seldom used or studied.
We introduce a hypothesis test for similarity for biological presence-absence data, using the Jaccard/Tanimoto coefficient. Several key improvements are presented including unbiased estimation of expectation and centered Jaccard/Tanimoto coefficients, that account for occurrence probabilities. The exact and asymptotic solutions are derived. To overcome a computational burden due to high-dimensionality, we propose the bootstrap and measurement concentration algorithms to efficiently estimate statistical significance of binary similarity. Comprehensive simulation studies demonstrate that our proposed methods produce accurate p-values and false discovery rates. The proposed estimation methods are orders of magnitude faster than the exact solution, particularly with an increasing dimensionality. We showcase their applications in evaluating co-occurrences of bird species in 28 islands of Vanuatu and fish species in 3347 freshwater habitats in France. The proposed methods are implemented in an open source R package called jaccard (https://cran.r-project.org/package=jaccard).
We introduce a suite of statistical methods for the Jaccard/Tanimoto similarity coefficient for binary data, that enable straightforward incorporation of probabilistic measures in analysis for species co-occurrences. Due to their generality, the proposed methods and implementations are applicable to a wide range of binary data arising from genomics, biochemistry, and other areas of science. |
doi_str_mv | 10.1186/s12859-019-3118-5 |
format | article |
fullrecord | <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_9cc3a82e01814c97a5157972e3e4e2cf</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_9cc3a82e01814c97a5157972e3e4e2cf</doaj_id><sourcerecordid>2340660144</sourcerecordid><originalsourceid>FETCH-LOGICAL-c559t-e1e27a281758b7a6b3820e34d3975d84e8064e5e08ff3c4913eef2a231f5daf13</originalsourceid><addsrcrecordid>eNpdkk9v1DAQxSMEoqXwAbigSFy4pPX4T2xfkFAFtKgSl3JDsmadydarJF5sb6V-e7zdUrWcxho___Rm_JrmPbBTANOfZeBG2Y6B7URtdOpFcwxSQ8eBqZdPzkfNm5w3jIE2TL1ujgQYLXtgx83vH-g9puHsGpcwxxLbHOYwYQrlri2US4vL0NYaZiwhLu1M5SYOuR1jalchTnEdPE7tNlGmxVOHq_vaDljwbfNqxCnTu4d60vz69vX6_KK7-vn98vzLVeeVsqUjIK6RG9DKrDT2K2E4IyEHYbUajCTDekmKmBlH4aUFQTRy5AJGNeAI4qS5PHCHiBu3TdVrunMRg7tvxLR2mErwEznrvUDDiYEB6a1GBUpbzUmQJO7Hyvp8YG13q5kGT0tJOD2DPr9Zwo1bx1vXW24FVxXw6QGQ4p9d3ZybQ_Y0TbhQ3GXHhWDKGtX3VfrxP-km7tJSV1VVkvU9AymrCg4qn2LOicZHM8DcPgfukANXc-D2OXB7Ex-eTvH44t_Hi780Fq60</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2340660144</pqid></control><display><type>article</type><title>Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data</title><source>Publicly Available Content Database</source><source>PubMed Central</source><creator>Chung, Neo Christopher ; Miasojedow, BłaŻej ; Startek, Michał ; Gambin, Anna</creator><creatorcontrib>Chung, Neo Christopher ; Miasojedow, BłaŻej ; Startek, Michał ; Gambin, Anna</creatorcontrib><description>A survey of presences and absences of specific species across multiple biogeographic units (or bioregions) are used in a broad area of biological studies from ecology to microbiology. Using binary presence-absence data, we evaluate species co-occurrences that help elucidate relationships among organisms and environments. To summarize similarity between occurrences of species, we routinely use the Jaccard/Tanimoto coefficient, which is the ratio of their intersection to their union. It is natural, then, to identify statistically significant Jaccard/Tanimoto coefficients, which suggest non-random co-occurrences of species. However, statistical hypothesis testing using this similarity coefficient has been seldom used or studied.
We introduce a hypothesis test for similarity for biological presence-absence data, using the Jaccard/Tanimoto coefficient. Several key improvements are presented including unbiased estimation of expectation and centered Jaccard/Tanimoto coefficients, that account for occurrence probabilities. The exact and asymptotic solutions are derived. To overcome a computational burden due to high-dimensionality, we propose the bootstrap and measurement concentration algorithms to efficiently estimate statistical significance of binary similarity. Comprehensive simulation studies demonstrate that our proposed methods produce accurate p-values and false discovery rates. The proposed estimation methods are orders of magnitude faster than the exact solution, particularly with an increasing dimensionality. We showcase their applications in evaluating co-occurrences of bird species in 28 islands of Vanuatu and fish species in 3347 freshwater habitats in France. The proposed methods are implemented in an open source R package called jaccard (https://cran.r-project.org/package=jaccard).
We introduce a suite of statistical methods for the Jaccard/Tanimoto similarity coefficient for binary data, that enable straightforward incorporation of probabilistic measures in analysis for species co-occurrences. Due to their generality, the proposed methods and implementations are applicable to a wide range of binary data arising from genomics, biochemistry, and other areas of science.</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/s12859-019-3118-5</identifier><identifier>PMID: 31874610</identifier><language>eng</language><publisher>England: BioMed Central</publisher><subject>Algorithms ; Animals ; Aquatic habitats ; Associations ; Asymptotic methods ; Binary data ; Binary similarity ; Biogeography ; Biometry ; Birds ; Co-occurrences ; Coefficients ; Computer applications ; Computer simulation ; Ecological monitoring ; Estimates ; Evaluation ; Exact solutions ; Expected values ; Fishes ; Freshwater Biology - methods ; Freshwater environments ; Freshwater fish ; Genomics ; Hypotheses ; Jaccard ; Microbiology ; P-value ; Presence-absence ; Probability ; Production methods ; Species ; Statistical analysis ; Statistical methods ; Statistical significance ; Statistics ; Tanimoto ; Test procedures</subject><ispartof>BMC bioinformatics, 2019-12, Vol.20 (Suppl 15), p.644-644, Article 644</ispartof><rights>2019. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>The Author(s) 2019</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c559t-e1e27a281758b7a6b3820e34d3975d84e8064e5e08ff3c4913eef2a231f5daf13</citedby><cites>FETCH-LOGICAL-c559t-e1e27a281758b7a6b3820e34d3975d84e8064e5e08ff3c4913eef2a231f5daf13</cites><orcidid>0000-0001-6798-8867 ; 0000-0003-3476-3017</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6929325/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2340660144?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,25733,27903,27904,36991,36992,44569,53769,53771</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/31874610$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Chung, Neo Christopher</creatorcontrib><creatorcontrib>Miasojedow, BłaŻej</creatorcontrib><creatorcontrib>Startek, Michał</creatorcontrib><creatorcontrib>Gambin, Anna</creatorcontrib><title>Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data</title><title>BMC bioinformatics</title><addtitle>BMC Bioinformatics</addtitle><description>A survey of presences and absences of specific species across multiple biogeographic units (or bioregions) are used in a broad area of biological studies from ecology to microbiology. Using binary presence-absence data, we evaluate species co-occurrences that help elucidate relationships among organisms and environments. To summarize similarity between occurrences of species, we routinely use the Jaccard/Tanimoto coefficient, which is the ratio of their intersection to their union. It is natural, then, to identify statistically significant Jaccard/Tanimoto coefficients, which suggest non-random co-occurrences of species. However, statistical hypothesis testing using this similarity coefficient has been seldom used or studied.
We introduce a hypothesis test for similarity for biological presence-absence data, using the Jaccard/Tanimoto coefficient. Several key improvements are presented including unbiased estimation of expectation and centered Jaccard/Tanimoto coefficients, that account for occurrence probabilities. The exact and asymptotic solutions are derived. To overcome a computational burden due to high-dimensionality, we propose the bootstrap and measurement concentration algorithms to efficiently estimate statistical significance of binary similarity. Comprehensive simulation studies demonstrate that our proposed methods produce accurate p-values and false discovery rates. The proposed estimation methods are orders of magnitude faster than the exact solution, particularly with an increasing dimensionality. We showcase their applications in evaluating co-occurrences of bird species in 28 islands of Vanuatu and fish species in 3347 freshwater habitats in France. The proposed methods are implemented in an open source R package called jaccard (https://cran.r-project.org/package=jaccard).
We introduce a suite of statistical methods for the Jaccard/Tanimoto similarity coefficient for binary data, that enable straightforward incorporation of probabilistic measures in analysis for species co-occurrences. Due to their generality, the proposed methods and implementations are applicable to a wide range of binary data arising from genomics, biochemistry, and other areas of science.</description><subject>Algorithms</subject><subject>Animals</subject><subject>Aquatic habitats</subject><subject>Associations</subject><subject>Asymptotic methods</subject><subject>Binary data</subject><subject>Binary similarity</subject><subject>Biogeography</subject><subject>Biometry</subject><subject>Birds</subject><subject>Co-occurrences</subject><subject>Coefficients</subject><subject>Computer applications</subject><subject>Computer simulation</subject><subject>Ecological monitoring</subject><subject>Estimates</subject><subject>Evaluation</subject><subject>Exact solutions</subject><subject>Expected values</subject><subject>Fishes</subject><subject>Freshwater Biology - methods</subject><subject>Freshwater environments</subject><subject>Freshwater fish</subject><subject>Genomics</subject><subject>Hypotheses</subject><subject>Jaccard</subject><subject>Microbiology</subject><subject>P-value</subject><subject>Presence-absence</subject><subject>Probability</subject><subject>Production methods</subject><subject>Species</subject><subject>Statistical analysis</subject><subject>Statistical methods</subject><subject>Statistical significance</subject><subject>Statistics</subject><subject>Tanimoto</subject><subject>Test procedures</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNpdkk9v1DAQxSMEoqXwAbigSFy4pPX4T2xfkFAFtKgSl3JDsmadydarJF5sb6V-e7zdUrWcxho___Rm_JrmPbBTANOfZeBG2Y6B7URtdOpFcwxSQ8eBqZdPzkfNm5w3jIE2TL1ujgQYLXtgx83vH-g9puHsGpcwxxLbHOYwYQrlri2US4vL0NYaZiwhLu1M5SYOuR1jalchTnEdPE7tNlGmxVOHq_vaDljwbfNqxCnTu4d60vz69vX6_KK7-vn98vzLVeeVsqUjIK6RG9DKrDT2K2E4IyEHYbUajCTDekmKmBlH4aUFQTRy5AJGNeAI4qS5PHCHiBu3TdVrunMRg7tvxLR2mErwEznrvUDDiYEB6a1GBUpbzUmQJO7Hyvp8YG13q5kGT0tJOD2DPr9Zwo1bx1vXW24FVxXw6QGQ4p9d3ZybQ_Y0TbhQ3GXHhWDKGtX3VfrxP-km7tJSV1VVkvU9AymrCg4qn2LOicZHM8DcPgfukANXc-D2OXB7Ex-eTvH44t_Hi780Fq60</recordid><startdate>20191224</startdate><enddate>20191224</enddate><creator>Chung, Neo Christopher</creator><creator>Miasojedow, BłaŻej</creator><creator>Startek, Michał</creator><creator>Gambin, Anna</creator><general>BioMed Central</general><general>BMC</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QO</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>L7M</scope><scope>LK8</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-6798-8867</orcidid><orcidid>https://orcid.org/0000-0003-3476-3017</orcidid></search><sort><creationdate>20191224</creationdate><title>Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data</title><author>Chung, Neo Christopher ; Miasojedow, BłaŻej ; Startek, Michał ; Gambin, Anna</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c559t-e1e27a281758b7a6b3820e34d3975d84e8064e5e08ff3c4913eef2a231f5daf13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Animals</topic><topic>Aquatic habitats</topic><topic>Associations</topic><topic>Asymptotic methods</topic><topic>Binary data</topic><topic>Binary similarity</topic><topic>Biogeography</topic><topic>Biometry</topic><topic>Birds</topic><topic>Co-occurrences</topic><topic>Coefficients</topic><topic>Computer applications</topic><topic>Computer simulation</topic><topic>Ecological monitoring</topic><topic>Estimates</topic><topic>Evaluation</topic><topic>Exact solutions</topic><topic>Expected values</topic><topic>Fishes</topic><topic>Freshwater Biology - methods</topic><topic>Freshwater environments</topic><topic>Freshwater fish</topic><topic>Genomics</topic><topic>Hypotheses</topic><topic>Jaccard</topic><topic>Microbiology</topic><topic>P-value</topic><topic>Presence-absence</topic><topic>Probability</topic><topic>Production methods</topic><topic>Species</topic><topic>Statistical analysis</topic><topic>Statistical methods</topic><topic>Statistical significance</topic><topic>Statistics</topic><topic>Tanimoto</topic><topic>Test procedures</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chung, Neo Christopher</creatorcontrib><creatorcontrib>Miasojedow, BłaŻej</creatorcontrib><creatorcontrib>Startek, Michał</creatorcontrib><creatorcontrib>Gambin, Anna</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Biological Sciences</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chung, Neo Christopher</au><au>Miasojedow, BłaŻej</au><au>Startek, Michał</au><au>Gambin, Anna</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data</atitle><jtitle>BMC bioinformatics</jtitle><addtitle>BMC Bioinformatics</addtitle><date>2019-12-24</date><risdate>2019</risdate><volume>20</volume><issue>Suppl 15</issue><spage>644</spage><epage>644</epage><pages>644-644</pages><artnum>644</artnum><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>A survey of presences and absences of specific species across multiple biogeographic units (or bioregions) are used in a broad area of biological studies from ecology to microbiology. Using binary presence-absence data, we evaluate species co-occurrences that help elucidate relationships among organisms and environments. To summarize similarity between occurrences of species, we routinely use the Jaccard/Tanimoto coefficient, which is the ratio of their intersection to their union. It is natural, then, to identify statistically significant Jaccard/Tanimoto coefficients, which suggest non-random co-occurrences of species. However, statistical hypothesis testing using this similarity coefficient has been seldom used or studied.
We introduce a hypothesis test for similarity for biological presence-absence data, using the Jaccard/Tanimoto coefficient. Several key improvements are presented including unbiased estimation of expectation and centered Jaccard/Tanimoto coefficients, that account for occurrence probabilities. The exact and asymptotic solutions are derived. To overcome a computational burden due to high-dimensionality, we propose the bootstrap and measurement concentration algorithms to efficiently estimate statistical significance of binary similarity. Comprehensive simulation studies demonstrate that our proposed methods produce accurate p-values and false discovery rates. The proposed estimation methods are orders of magnitude faster than the exact solution, particularly with an increasing dimensionality. We showcase their applications in evaluating co-occurrences of bird species in 28 islands of Vanuatu and fish species in 3347 freshwater habitats in France. The proposed methods are implemented in an open source R package called jaccard (https://cran.r-project.org/package=jaccard).
We introduce a suite of statistical methods for the Jaccard/Tanimoto similarity coefficient for binary data, that enable straightforward incorporation of probabilistic measures in analysis for species co-occurrences. Due to their generality, the proposed methods and implementations are applicable to a wide range of binary data arising from genomics, biochemistry, and other areas of science.</abstract><cop>England</cop><pub>BioMed Central</pub><pmid>31874610</pmid><doi>10.1186/s12859-019-3118-5</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0001-6798-8867</orcidid><orcidid>https://orcid.org/0000-0003-3476-3017</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1471-2105 |
ispartof | BMC bioinformatics, 2019-12, Vol.20 (Suppl 15), p.644-644, Article 644 |
issn | 1471-2105 1471-2105 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_9cc3a82e01814c97a5157972e3e4e2cf |
source | Publicly Available Content Database; PubMed Central |
subjects | Algorithms Animals Aquatic habitats Associations Asymptotic methods Binary data Binary similarity Biogeography Biometry Birds Co-occurrences Coefficients Computer applications Computer simulation Ecological monitoring Estimates Evaluation Exact solutions Expected values Fishes Freshwater Biology - methods Freshwater environments Freshwater fish Genomics Hypotheses Jaccard Microbiology P-value Presence-absence Probability Production methods Species Statistical analysis Statistical methods Statistical significance Statistics Tanimoto Test procedures |
title | Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T16%3A40%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Jaccard/Tanimoto%20similarity%20test%20and%20estimation%20methods%20for%20biological%20presence-absence%20data&rft.jtitle=BMC%20bioinformatics&rft.au=Chung,%20Neo%20Christopher&rft.date=2019-12-24&rft.volume=20&rft.issue=Suppl%2015&rft.spage=644&rft.epage=644&rft.pages=644-644&rft.artnum=644&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/s12859-019-3118-5&rft_dat=%3Cproquest_doaj_%3E2340660144%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c559t-e1e27a281758b7a6b3820e34d3975d84e8064e5e08ff3c4913eef2a231f5daf13%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2340660144&rft_id=info:pmid/31874610&rfr_iscdi=true |