Loading…

Consensus building for interlaboratory studies, key comparisons, and meta-analysis

Interlaboratory studies in measurement science, including key comparisons, and meta-analyses in several fields, including medicine, serve to intercompare measurement results obtained independently, and typically produce a consensus value for the common measurand that blends the values measured by th...

Full description

Saved in:
Bibliographic Details
Published in:Metrologia 2017-06, Vol.54 (3), p.3
Main Authors: Koepke, Amanda, Lafarge, Thomas, Possolo, Antonio, Toman, Blaza
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue 3
container_start_page 3
container_title Metrologia
container_volume 54
creator Koepke, Amanda
Lafarge, Thomas
Possolo, Antonio
Toman, Blaza
description Interlaboratory studies in measurement science, including key comparisons, and meta-analyses in several fields, including medicine, serve to intercompare measurement results obtained independently, and typically produce a consensus value for the common measurand that blends the values measured by the participants. Since interlaboratory studies and meta-analyses reveal and quantify differences between measured values, regardless of the underlying causes for such differences, they also provide so-called 'top-down' evaluations of measurement uncertainty. Measured values are often substantially over-dispersed by comparison with their individual, stated uncertainties, thus suggesting the existence of yet unrecognized sources of uncertainty (dark uncertainty). We contrast two different approaches to take dark uncertainty into account both in the computation of consensus values and in the evaluation of the associated uncertainty, which have traditionally been preferred by different scientific communities. One inflates the stated uncertainties by a multiplicative factor. The other adds laboratory-specific 'effects' to the value of the measurand. After distinguishing what we call recipe-based and model-based approaches to data reductions in interlaboratory studies, we state six guiding principles that should inform such reductions. These principles favor model-based approaches that expose and facilitate the critical assessment of validating assumptions, and give preeminence to substantive criteria to determine which measurement results to include, and which to exclude, as opposed to purely statistical considerations, and also how to weigh them. Following an overview of maximum likelihood methods, three general purpose procedures for data reduction are described in detail, including explanations of how the consensus value and degrees of equivalence are computed, and the associated uncertainty evaluated: the DerSimonian-Laird procedure; a hierarchical Bayesian procedure; and the Linear Pool. These three procedures have been implemented and made widely accessible in a Web-based application (NIST Consensus Builder). We illustrate principles, statistical models, and data reduction procedures in four examples: (i) the measurement of the Newtonian constant of gravitation; (ii) the measurement of the half-lives of radioactive isotopes of caesium and strontium; (iii) the comparison of two alternative treatments for carotid artery stenosis; and (iv) a key comparison where th
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_1902096007</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1902096007</sourcerecordid><originalsourceid>FETCH-proquest_journals_19020960073</originalsourceid><addsrcrecordid>eNqNy70OgjAUQOHGaCL-vEMTV5q0IFRmonE27uYixRRLi73twNvL4AM4neU7C5KI8iSYLGSxJAnnWclEXh3XZIPYcy5kVsiE3GpnUVmMSJuoTavti3bOU22D8gYa5yE4P1EMsdUKU_pWE326YQSvcV5TCralgwrAwIKZUOOOrDowqPa_bsnhcr7XVzZ694kKw6N30c8YH6LiGa9KzmX-n_oCfOpB1A</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1902096007</pqid></control><display><type>article</type><title>Consensus building for interlaboratory studies, key comparisons, and meta-analysis</title><source>Institute of Physics</source><creator>Koepke, Amanda ; Lafarge, Thomas ; Possolo, Antonio ; Toman, Blaza</creator><creatorcontrib>Koepke, Amanda ; Lafarge, Thomas ; Possolo, Antonio ; Toman, Blaza</creatorcontrib><description>Interlaboratory studies in measurement science, including key comparisons, and meta-analyses in several fields, including medicine, serve to intercompare measurement results obtained independently, and typically produce a consensus value for the common measurand that blends the values measured by the participants. Since interlaboratory studies and meta-analyses reveal and quantify differences between measured values, regardless of the underlying causes for such differences, they also provide so-called 'top-down' evaluations of measurement uncertainty. Measured values are often substantially over-dispersed by comparison with their individual, stated uncertainties, thus suggesting the existence of yet unrecognized sources of uncertainty (dark uncertainty). We contrast two different approaches to take dark uncertainty into account both in the computation of consensus values and in the evaluation of the associated uncertainty, which have traditionally been preferred by different scientific communities. One inflates the stated uncertainties by a multiplicative factor. The other adds laboratory-specific 'effects' to the value of the measurand. After distinguishing what we call recipe-based and model-based approaches to data reductions in interlaboratory studies, we state six guiding principles that should inform such reductions. These principles favor model-based approaches that expose and facilitate the critical assessment of validating assumptions, and give preeminence to substantive criteria to determine which measurement results to include, and which to exclude, as opposed to purely statistical considerations, and also how to weigh them. Following an overview of maximum likelihood methods, three general purpose procedures for data reduction are described in detail, including explanations of how the consensus value and degrees of equivalence are computed, and the associated uncertainty evaluated: the DerSimonian-Laird procedure; a hierarchical Bayesian procedure; and the Linear Pool. These three procedures have been implemented and made widely accessible in a Web-based application (NIST Consensus Builder). We illustrate principles, statistical models, and data reduction procedures in four examples: (i) the measurement of the Newtonian constant of gravitation; (ii) the measurement of the half-lives of radioactive isotopes of caesium and strontium; (iii) the comparison of two alternative treatments for carotid artery stenosis; and (iv) a key comparison where the measurand was the calibration factor of a radio-frequency power sensor.</description><identifier>ISSN: 0026-1394</identifier><identifier>EISSN: 1681-7575</identifier><language>eng</language><publisher>Bristol: IOP Publishing</publisher><subject>Bayesian analysis ; Calibration ; Cesium ; Comparative analysis ; Data reduction ; Gravitation ; Isotopes ; Laboratories ; Mathematical models ; Meta-analysis ; Radioisotopes ; Statistical analysis ; Uncertainty</subject><ispartof>Metrologia, 2017-06, Vol.54 (3), p.3</ispartof><rights>Copyright IOP Publishing Jun 2017</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784</link.rule.ids></links><search><creatorcontrib>Koepke, Amanda</creatorcontrib><creatorcontrib>Lafarge, Thomas</creatorcontrib><creatorcontrib>Possolo, Antonio</creatorcontrib><creatorcontrib>Toman, Blaza</creatorcontrib><title>Consensus building for interlaboratory studies, key comparisons, and meta-analysis</title><title>Metrologia</title><description>Interlaboratory studies in measurement science, including key comparisons, and meta-analyses in several fields, including medicine, serve to intercompare measurement results obtained independently, and typically produce a consensus value for the common measurand that blends the values measured by the participants. Since interlaboratory studies and meta-analyses reveal and quantify differences between measured values, regardless of the underlying causes for such differences, they also provide so-called 'top-down' evaluations of measurement uncertainty. Measured values are often substantially over-dispersed by comparison with their individual, stated uncertainties, thus suggesting the existence of yet unrecognized sources of uncertainty (dark uncertainty). We contrast two different approaches to take dark uncertainty into account both in the computation of consensus values and in the evaluation of the associated uncertainty, which have traditionally been preferred by different scientific communities. One inflates the stated uncertainties by a multiplicative factor. The other adds laboratory-specific 'effects' to the value of the measurand. After distinguishing what we call recipe-based and model-based approaches to data reductions in interlaboratory studies, we state six guiding principles that should inform such reductions. These principles favor model-based approaches that expose and facilitate the critical assessment of validating assumptions, and give preeminence to substantive criteria to determine which measurement results to include, and which to exclude, as opposed to purely statistical considerations, and also how to weigh them. Following an overview of maximum likelihood methods, three general purpose procedures for data reduction are described in detail, including explanations of how the consensus value and degrees of equivalence are computed, and the associated uncertainty evaluated: the DerSimonian-Laird procedure; a hierarchical Bayesian procedure; and the Linear Pool. These three procedures have been implemented and made widely accessible in a Web-based application (NIST Consensus Builder). We illustrate principles, statistical models, and data reduction procedures in four examples: (i) the measurement of the Newtonian constant of gravitation; (ii) the measurement of the half-lives of radioactive isotopes of caesium and strontium; (iii) the comparison of two alternative treatments for carotid artery stenosis; and (iv) a key comparison where the measurand was the calibration factor of a radio-frequency power sensor.</description><subject>Bayesian analysis</subject><subject>Calibration</subject><subject>Cesium</subject><subject>Comparative analysis</subject><subject>Data reduction</subject><subject>Gravitation</subject><subject>Isotopes</subject><subject>Laboratories</subject><subject>Mathematical models</subject><subject>Meta-analysis</subject><subject>Radioisotopes</subject><subject>Statistical analysis</subject><subject>Uncertainty</subject><issn>0026-1394</issn><issn>1681-7575</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><recordid>eNqNy70OgjAUQOHGaCL-vEMTV5q0IFRmonE27uYixRRLi73twNvL4AM4neU7C5KI8iSYLGSxJAnnWclEXh3XZIPYcy5kVsiE3GpnUVmMSJuoTavti3bOU22D8gYa5yE4P1EMsdUKU_pWE326YQSvcV5TCralgwrAwIKZUOOOrDowqPa_bsnhcr7XVzZ694kKw6N30c8YH6LiGa9KzmX-n_oCfOpB1A</recordid><startdate>20170601</startdate><enddate>20170601</enddate><creator>Koepke, Amanda</creator><creator>Lafarge, Thomas</creator><creator>Possolo, Antonio</creator><creator>Toman, Blaza</creator><general>IOP Publishing</general><scope>7U5</scope><scope>8FD</scope><scope>L7M</scope></search><sort><creationdate>20170601</creationdate><title>Consensus building for interlaboratory studies, key comparisons, and meta-analysis</title><author>Koepke, Amanda ; Lafarge, Thomas ; Possolo, Antonio ; Toman, Blaza</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_19020960073</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Bayesian analysis</topic><topic>Calibration</topic><topic>Cesium</topic><topic>Comparative analysis</topic><topic>Data reduction</topic><topic>Gravitation</topic><topic>Isotopes</topic><topic>Laboratories</topic><topic>Mathematical models</topic><topic>Meta-analysis</topic><topic>Radioisotopes</topic><topic>Statistical analysis</topic><topic>Uncertainty</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Koepke, Amanda</creatorcontrib><creatorcontrib>Lafarge, Thomas</creatorcontrib><creatorcontrib>Possolo, Antonio</creatorcontrib><creatorcontrib>Toman, Blaza</creatorcontrib><collection>Solid State and Superconductivity Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>Metrologia</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Koepke, Amanda</au><au>Lafarge, Thomas</au><au>Possolo, Antonio</au><au>Toman, Blaza</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Consensus building for interlaboratory studies, key comparisons, and meta-analysis</atitle><jtitle>Metrologia</jtitle><date>2017-06-01</date><risdate>2017</risdate><volume>54</volume><issue>3</issue><spage>3</spage><pages>3-</pages><issn>0026-1394</issn><eissn>1681-7575</eissn><abstract>Interlaboratory studies in measurement science, including key comparisons, and meta-analyses in several fields, including medicine, serve to intercompare measurement results obtained independently, and typically produce a consensus value for the common measurand that blends the values measured by the participants. Since interlaboratory studies and meta-analyses reveal and quantify differences between measured values, regardless of the underlying causes for such differences, they also provide so-called 'top-down' evaluations of measurement uncertainty. Measured values are often substantially over-dispersed by comparison with their individual, stated uncertainties, thus suggesting the existence of yet unrecognized sources of uncertainty (dark uncertainty). We contrast two different approaches to take dark uncertainty into account both in the computation of consensus values and in the evaluation of the associated uncertainty, which have traditionally been preferred by different scientific communities. One inflates the stated uncertainties by a multiplicative factor. The other adds laboratory-specific 'effects' to the value of the measurand. After distinguishing what we call recipe-based and model-based approaches to data reductions in interlaboratory studies, we state six guiding principles that should inform such reductions. These principles favor model-based approaches that expose and facilitate the critical assessment of validating assumptions, and give preeminence to substantive criteria to determine which measurement results to include, and which to exclude, as opposed to purely statistical considerations, and also how to weigh them. Following an overview of maximum likelihood methods, three general purpose procedures for data reduction are described in detail, including explanations of how the consensus value and degrees of equivalence are computed, and the associated uncertainty evaluated: the DerSimonian-Laird procedure; a hierarchical Bayesian procedure; and the Linear Pool. These three procedures have been implemented and made widely accessible in a Web-based application (NIST Consensus Builder). We illustrate principles, statistical models, and data reduction procedures in four examples: (i) the measurement of the Newtonian constant of gravitation; (ii) the measurement of the half-lives of radioactive isotopes of caesium and strontium; (iii) the comparison of two alternative treatments for carotid artery stenosis; and (iv) a key comparison where the measurand was the calibration factor of a radio-frequency power sensor.</abstract><cop>Bristol</cop><pub>IOP Publishing</pub></addata></record>
fulltext fulltext
identifier ISSN: 0026-1394
ispartof Metrologia, 2017-06, Vol.54 (3), p.3
issn 0026-1394
1681-7575
language eng
recordid cdi_proquest_journals_1902096007
source Institute of Physics
subjects Bayesian analysis
Calibration
Cesium
Comparative analysis
Data reduction
Gravitation
Isotopes
Laboratories
Mathematical models
Meta-analysis
Radioisotopes
Statistical analysis
Uncertainty
title Consensus building for interlaboratory studies, key comparisons, and meta-analysis
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T20%3A21%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Consensus%20building%20for%20interlaboratory%20studies,%20key%20comparisons,%20and%20meta-analysis&rft.jtitle=Metrologia&rft.au=Koepke,%20Amanda&rft.date=2017-06-01&rft.volume=54&rft.issue=3&rft.spage=3&rft.pages=3-&rft.issn=0026-1394&rft.eissn=1681-7575&rft_id=info:doi/&rft_dat=%3Cproquest%3E1902096007%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_19020960073%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1902096007&rft_id=info:pmid/&rfr_iscdi=true