Loading…
Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks
Metal oxide (MOx) electro-chemical gas sensors are a sensible choice for many applications, due to their tunable sensitivity, their space-efficiency and their low price. Publicly available sensor datasets streamline the development and evaluation of novel algorithm and circuit designs, making them p...
Saved in:
Published in: | arXiv.org 2021-08 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Dennler, Nik Rastogi, Shavika Fonollosa, Jordi André van Schaik Schmuker, Michael |
description | Metal oxide (MOx) electro-chemical gas sensors are a sensible choice for many applications, due to their tunable sensitivity, their space-efficiency and their low price. Publicly available sensor datasets streamline the development and evaluation of novel algorithm and circuit designs, making them particularly valuable for the Artificial Olfaction / Mobile Robot Olfaction community. In 2013, Vergara et al. published a dataset comprising 16 months of recordings from a large MOx gas sensor array in a wind tunnel, which has since become a standard benchmark in the field. Here we report a previously undetected property of the dataset that limits its suitability for gas classification studies. The analysis of individual measurement timestamps reveals that gases were recorded in temporally clustered batches. The consequential correlation between the sensor response before gas exposure and the time of recording is often sufficient to predict the gas used in a given trial. Even if compensated by zero-offset-subtraction, residual short-term drift contains enough information for gas classification. We have identified a minimally drift-affected subset of the data, which is suitable for gas classification benchmarking after zero-offset-subtraction, although gas classification performance was substantially lower than for the full dataset. We conclude that previous studies conducted with this dataset very likely overestimate the accuracy of gas classification results. For the 17 potentially affected publications, we urge the authors to re-evaluate the results in light of our findings. Our observations emphasize the need to thoroughly document gas sensing datasets, and proper validation before using them for the development of algorithms. |
format | article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2564173559</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2564173559</sourcerecordid><originalsourceid>FETCH-proquest_journals_25641735593</originalsourceid><addsrcrecordid>eNqNi8sKwjAQRYMgWNR_GHAt1KTxsbX1sVAUdS9DnWK0TTSTip9vET_A1YF77mmJSCo1Gk4TKTuiz3yL41iOJ1JrFYk886YIYCwg7N2jLtHDlgKWsHubC8GRLDsPGQZkCnCgF2HJsDGVCRiMswxF41fIkJbIbAqTf3eYk82vFfo790S7aCLq_9gVg-XilK6HD--eNXE431ztbaPOUo-T0URpPVP_vT5sB0WR</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2564173559</pqid></control><display><type>article</type><title>Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks</title><source>Publicly Available Content Database</source><creator>Dennler, Nik ; Rastogi, Shavika ; Fonollosa, Jordi ; André van Schaik ; Schmuker, Michael</creator><creatorcontrib>Dennler, Nik ; Rastogi, Shavika ; Fonollosa, Jordi ; André van Schaik ; Schmuker, Michael</creatorcontrib><description>Metal oxide (MOx) electro-chemical gas sensors are a sensible choice for many applications, due to their tunable sensitivity, their space-efficiency and their low price. Publicly available sensor datasets streamline the development and evaluation of novel algorithm and circuit designs, making them particularly valuable for the Artificial Olfaction / Mobile Robot Olfaction community. In 2013, Vergara et al. published a dataset comprising 16 months of recordings from a large MOx gas sensor array in a wind tunnel, which has since become a standard benchmark in the field. Here we report a previously undetected property of the dataset that limits its suitability for gas classification studies. The analysis of individual measurement timestamps reveals that gases were recorded in temporally clustered batches. The consequential correlation between the sensor response before gas exposure and the time of recording is often sufficient to predict the gas used in a given trial. Even if compensated by zero-offset-subtraction, residual short-term drift contains enough information for gas classification. We have identified a minimally drift-affected subset of the data, which is suitable for gas classification benchmarking after zero-offset-subtraction, although gas classification performance was substantially lower than for the full dataset. We conclude that previous studies conducted with this dataset very likely overestimate the accuracy of gas classification results. For the 17 potentially affected publications, we urge the authors to re-evaluate the results in light of our findings. Our observations emphasize the need to thoroughly document gas sensing datasets, and proper validation before using them for the development of algorithms.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Benchmarks ; Chemical sensors ; Circuit design ; Classification ; Datasets ; Drift ; Gas sensors ; Gases ; Metal oxides ; Sensor arrays ; Sensors ; Subtraction ; Wind tunnels</subject><ispartof>arXiv.org, 2021-08</ispartof><rights>2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2564173559?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Dennler, Nik</creatorcontrib><creatorcontrib>Rastogi, Shavika</creatorcontrib><creatorcontrib>Fonollosa, Jordi</creatorcontrib><creatorcontrib>André van Schaik</creatorcontrib><creatorcontrib>Schmuker, Michael</creatorcontrib><title>Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks</title><title>arXiv.org</title><description>Metal oxide (MOx) electro-chemical gas sensors are a sensible choice for many applications, due to their tunable sensitivity, their space-efficiency and their low price. Publicly available sensor datasets streamline the development and evaluation of novel algorithm and circuit designs, making them particularly valuable for the Artificial Olfaction / Mobile Robot Olfaction community. In 2013, Vergara et al. published a dataset comprising 16 months of recordings from a large MOx gas sensor array in a wind tunnel, which has since become a standard benchmark in the field. Here we report a previously undetected property of the dataset that limits its suitability for gas classification studies. The analysis of individual measurement timestamps reveals that gases were recorded in temporally clustered batches. The consequential correlation between the sensor response before gas exposure and the time of recording is often sufficient to predict the gas used in a given trial. Even if compensated by zero-offset-subtraction, residual short-term drift contains enough information for gas classification. We have identified a minimally drift-affected subset of the data, which is suitable for gas classification benchmarking after zero-offset-subtraction, although gas classification performance was substantially lower than for the full dataset. We conclude that previous studies conducted with this dataset very likely overestimate the accuracy of gas classification results. For the 17 potentially affected publications, we urge the authors to re-evaluate the results in light of our findings. Our observations emphasize the need to thoroughly document gas sensing datasets, and proper validation before using them for the development of algorithms.</description><subject>Algorithms</subject><subject>Benchmarks</subject><subject>Chemical sensors</subject><subject>Circuit design</subject><subject>Classification</subject><subject>Datasets</subject><subject>Drift</subject><subject>Gas sensors</subject><subject>Gases</subject><subject>Metal oxides</subject><subject>Sensor arrays</subject><subject>Sensors</subject><subject>Subtraction</subject><subject>Wind tunnels</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNi8sKwjAQRYMgWNR_GHAt1KTxsbX1sVAUdS9DnWK0TTSTip9vET_A1YF77mmJSCo1Gk4TKTuiz3yL41iOJ1JrFYk886YIYCwg7N2jLtHDlgKWsHubC8GRLDsPGQZkCnCgF2HJsDGVCRiMswxF41fIkJbIbAqTf3eYk82vFfo790S7aCLq_9gVg-XilK6HD--eNXE431ztbaPOUo-T0URpPVP_vT5sB0WR</recordid><startdate>20210819</startdate><enddate>20210819</enddate><creator>Dennler, Nik</creator><creator>Rastogi, Shavika</creator><creator>Fonollosa, Jordi</creator><creator>André van Schaik</creator><creator>Schmuker, Michael</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20210819</creationdate><title>Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks</title><author>Dennler, Nik ; Rastogi, Shavika ; Fonollosa, Jordi ; André van Schaik ; Schmuker, Michael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_25641735593</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Benchmarks</topic><topic>Chemical sensors</topic><topic>Circuit design</topic><topic>Classification</topic><topic>Datasets</topic><topic>Drift</topic><topic>Gas sensors</topic><topic>Gases</topic><topic>Metal oxides</topic><topic>Sensor arrays</topic><topic>Sensors</topic><topic>Subtraction</topic><topic>Wind tunnels</topic><toplevel>online_resources</toplevel><creatorcontrib>Dennler, Nik</creatorcontrib><creatorcontrib>Rastogi, Shavika</creatorcontrib><creatorcontrib>Fonollosa, Jordi</creatorcontrib><creatorcontrib>André van Schaik</creatorcontrib><creatorcontrib>Schmuker, Michael</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dennler, Nik</au><au>Rastogi, Shavika</au><au>Fonollosa, Jordi</au><au>André van Schaik</au><au>Schmuker, Michael</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks</atitle><jtitle>arXiv.org</jtitle><date>2021-08-19</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>Metal oxide (MOx) electro-chemical gas sensors are a sensible choice for many applications, due to their tunable sensitivity, their space-efficiency and their low price. Publicly available sensor datasets streamline the development and evaluation of novel algorithm and circuit designs, making them particularly valuable for the Artificial Olfaction / Mobile Robot Olfaction community. In 2013, Vergara et al. published a dataset comprising 16 months of recordings from a large MOx gas sensor array in a wind tunnel, which has since become a standard benchmark in the field. Here we report a previously undetected property of the dataset that limits its suitability for gas classification studies. The analysis of individual measurement timestamps reveals that gases were recorded in temporally clustered batches. The consequential correlation between the sensor response before gas exposure and the time of recording is often sufficient to predict the gas used in a given trial. Even if compensated by zero-offset-subtraction, residual short-term drift contains enough information for gas classification. We have identified a minimally drift-affected subset of the data, which is suitable for gas classification benchmarking after zero-offset-subtraction, although gas classification performance was substantially lower than for the full dataset. We conclude that previous studies conducted with this dataset very likely overestimate the accuracy of gas classification results. For the 17 potentially affected publications, we urge the authors to re-evaluate the results in light of our findings. Our observations emphasize the need to thoroughly document gas sensing datasets, and proper validation before using them for the development of algorithms.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2021-08 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2564173559 |
source | Publicly Available Content Database |
subjects | Algorithms Benchmarks Chemical sensors Circuit design Classification Datasets Drift Gas sensors Gases Metal oxides Sensor arrays Sensors Subtraction Wind tunnels |
title | Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T04%3A39%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Drift%20in%20a%20Popular%20Metal%20Oxide%20Sensor%20Dataset%20Reveals%20Limitations%20for%20Gas%20Classification%20Benchmarks&rft.jtitle=arXiv.org&rft.au=Dennler,%20Nik&rft.date=2021-08-19&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2564173559%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_25641735593%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2564173559&rft_id=info:pmid/&rfr_iscdi=true |