Loading…

Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks

Metal oxide (MOx) electro-chemical gas sensors are a sensible choice for many applications, due to their tunable sensitivity, their space-efficiency and their low price. Publicly available sensor datasets streamline the development and evaluation of novel algorithm and circuit designs, making them p...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2021-08
Main Authors: Dennler, Nik, Rastogi, Shavika, Fonollosa, Jordi, André van Schaik, Schmuker, Michael
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Dennler, Nik
Rastogi, Shavika
Fonollosa, Jordi
André van Schaik
Schmuker, Michael
description Metal oxide (MOx) electro-chemical gas sensors are a sensible choice for many applications, due to their tunable sensitivity, their space-efficiency and their low price. Publicly available sensor datasets streamline the development and evaluation of novel algorithm and circuit designs, making them particularly valuable for the Artificial Olfaction / Mobile Robot Olfaction community. In 2013, Vergara et al. published a dataset comprising 16 months of recordings from a large MOx gas sensor array in a wind tunnel, which has since become a standard benchmark in the field. Here we report a previously undetected property of the dataset that limits its suitability for gas classification studies. The analysis of individual measurement timestamps reveals that gases were recorded in temporally clustered batches. The consequential correlation between the sensor response before gas exposure and the time of recording is often sufficient to predict the gas used in a given trial. Even if compensated by zero-offset-subtraction, residual short-term drift contains enough information for gas classification. We have identified a minimally drift-affected subset of the data, which is suitable for gas classification benchmarking after zero-offset-subtraction, although gas classification performance was substantially lower than for the full dataset. We conclude that previous studies conducted with this dataset very likely overestimate the accuracy of gas classification results. For the 17 potentially affected publications, we urge the authors to re-evaluate the results in light of our findings. Our observations emphasize the need to thoroughly document gas sensing datasets, and proper validation before using them for the development of algorithms.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2564173559</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2564173559</sourcerecordid><originalsourceid>FETCH-proquest_journals_25641735593</originalsourceid><addsrcrecordid>eNqNi8sKwjAQRYMgWNR_GHAt1KTxsbX1sVAUdS9DnWK0TTSTip9vET_A1YF77mmJSCo1Gk4TKTuiz3yL41iOJ1JrFYk886YIYCwg7N2jLtHDlgKWsHubC8GRLDsPGQZkCnCgF2HJsDGVCRiMswxF41fIkJbIbAqTf3eYk82vFfo790S7aCLq_9gVg-XilK6HD--eNXE431ztbaPOUo-T0URpPVP_vT5sB0WR</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2564173559</pqid></control><display><type>article</type><title>Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks</title><source>Publicly Available Content Database</source><creator>Dennler, Nik ; Rastogi, Shavika ; Fonollosa, Jordi ; André van Schaik ; Schmuker, Michael</creator><creatorcontrib>Dennler, Nik ; Rastogi, Shavika ; Fonollosa, Jordi ; André van Schaik ; Schmuker, Michael</creatorcontrib><description>Metal oxide (MOx) electro-chemical gas sensors are a sensible choice for many applications, due to their tunable sensitivity, their space-efficiency and their low price. Publicly available sensor datasets streamline the development and evaluation of novel algorithm and circuit designs, making them particularly valuable for the Artificial Olfaction / Mobile Robot Olfaction community. In 2013, Vergara et al. published a dataset comprising 16 months of recordings from a large MOx gas sensor array in a wind tunnel, which has since become a standard benchmark in the field. Here we report a previously undetected property of the dataset that limits its suitability for gas classification studies. The analysis of individual measurement timestamps reveals that gases were recorded in temporally clustered batches. The consequential correlation between the sensor response before gas exposure and the time of recording is often sufficient to predict the gas used in a given trial. Even if compensated by zero-offset-subtraction, residual short-term drift contains enough information for gas classification. We have identified a minimally drift-affected subset of the data, which is suitable for gas classification benchmarking after zero-offset-subtraction, although gas classification performance was substantially lower than for the full dataset. We conclude that previous studies conducted with this dataset very likely overestimate the accuracy of gas classification results. For the 17 potentially affected publications, we urge the authors to re-evaluate the results in light of our findings. Our observations emphasize the need to thoroughly document gas sensing datasets, and proper validation before using them for the development of algorithms.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Benchmarks ; Chemical sensors ; Circuit design ; Classification ; Datasets ; Drift ; Gas sensors ; Gases ; Metal oxides ; Sensor arrays ; Sensors ; Subtraction ; Wind tunnels</subject><ispartof>arXiv.org, 2021-08</ispartof><rights>2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2564173559?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Dennler, Nik</creatorcontrib><creatorcontrib>Rastogi, Shavika</creatorcontrib><creatorcontrib>Fonollosa, Jordi</creatorcontrib><creatorcontrib>André van Schaik</creatorcontrib><creatorcontrib>Schmuker, Michael</creatorcontrib><title>Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks</title><title>arXiv.org</title><description>Metal oxide (MOx) electro-chemical gas sensors are a sensible choice for many applications, due to their tunable sensitivity, their space-efficiency and their low price. Publicly available sensor datasets streamline the development and evaluation of novel algorithm and circuit designs, making them particularly valuable for the Artificial Olfaction / Mobile Robot Olfaction community. In 2013, Vergara et al. published a dataset comprising 16 months of recordings from a large MOx gas sensor array in a wind tunnel, which has since become a standard benchmark in the field. Here we report a previously undetected property of the dataset that limits its suitability for gas classification studies. The analysis of individual measurement timestamps reveals that gases were recorded in temporally clustered batches. The consequential correlation between the sensor response before gas exposure and the time of recording is often sufficient to predict the gas used in a given trial. Even if compensated by zero-offset-subtraction, residual short-term drift contains enough information for gas classification. We have identified a minimally drift-affected subset of the data, which is suitable for gas classification benchmarking after zero-offset-subtraction, although gas classification performance was substantially lower than for the full dataset. We conclude that previous studies conducted with this dataset very likely overestimate the accuracy of gas classification results. For the 17 potentially affected publications, we urge the authors to re-evaluate the results in light of our findings. Our observations emphasize the need to thoroughly document gas sensing datasets, and proper validation before using them for the development of algorithms.</description><subject>Algorithms</subject><subject>Benchmarks</subject><subject>Chemical sensors</subject><subject>Circuit design</subject><subject>Classification</subject><subject>Datasets</subject><subject>Drift</subject><subject>Gas sensors</subject><subject>Gases</subject><subject>Metal oxides</subject><subject>Sensor arrays</subject><subject>Sensors</subject><subject>Subtraction</subject><subject>Wind tunnels</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNi8sKwjAQRYMgWNR_GHAt1KTxsbX1sVAUdS9DnWK0TTSTip9vET_A1YF77mmJSCo1Gk4TKTuiz3yL41iOJ1JrFYk886YIYCwg7N2jLtHDlgKWsHubC8GRLDsPGQZkCnCgF2HJsDGVCRiMswxF41fIkJbIbAqTf3eYk82vFfo790S7aCLq_9gVg-XilK6HD--eNXE431ztbaPOUo-T0URpPVP_vT5sB0WR</recordid><startdate>20210819</startdate><enddate>20210819</enddate><creator>Dennler, Nik</creator><creator>Rastogi, Shavika</creator><creator>Fonollosa, Jordi</creator><creator>André van Schaik</creator><creator>Schmuker, Michael</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20210819</creationdate><title>Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks</title><author>Dennler, Nik ; Rastogi, Shavika ; Fonollosa, Jordi ; André van Schaik ; Schmuker, Michael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_25641735593</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Benchmarks</topic><topic>Chemical sensors</topic><topic>Circuit design</topic><topic>Classification</topic><topic>Datasets</topic><topic>Drift</topic><topic>Gas sensors</topic><topic>Gases</topic><topic>Metal oxides</topic><topic>Sensor arrays</topic><topic>Sensors</topic><topic>Subtraction</topic><topic>Wind tunnels</topic><toplevel>online_resources</toplevel><creatorcontrib>Dennler, Nik</creatorcontrib><creatorcontrib>Rastogi, Shavika</creatorcontrib><creatorcontrib>Fonollosa, Jordi</creatorcontrib><creatorcontrib>André van Schaik</creatorcontrib><creatorcontrib>Schmuker, Michael</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dennler, Nik</au><au>Rastogi, Shavika</au><au>Fonollosa, Jordi</au><au>André van Schaik</au><au>Schmuker, Michael</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks</atitle><jtitle>arXiv.org</jtitle><date>2021-08-19</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>Metal oxide (MOx) electro-chemical gas sensors are a sensible choice for many applications, due to their tunable sensitivity, their space-efficiency and their low price. Publicly available sensor datasets streamline the development and evaluation of novel algorithm and circuit designs, making them particularly valuable for the Artificial Olfaction / Mobile Robot Olfaction community. In 2013, Vergara et al. published a dataset comprising 16 months of recordings from a large MOx gas sensor array in a wind tunnel, which has since become a standard benchmark in the field. Here we report a previously undetected property of the dataset that limits its suitability for gas classification studies. The analysis of individual measurement timestamps reveals that gases were recorded in temporally clustered batches. The consequential correlation between the sensor response before gas exposure and the time of recording is often sufficient to predict the gas used in a given trial. Even if compensated by zero-offset-subtraction, residual short-term drift contains enough information for gas classification. We have identified a minimally drift-affected subset of the data, which is suitable for gas classification benchmarking after zero-offset-subtraction, although gas classification performance was substantially lower than for the full dataset. We conclude that previous studies conducted with this dataset very likely overestimate the accuracy of gas classification results. For the 17 potentially affected publications, we urge the authors to re-evaluate the results in light of our findings. Our observations emphasize the need to thoroughly document gas sensing datasets, and proper validation before using them for the development of algorithms.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2021-08
issn 2331-8422
language eng
recordid cdi_proquest_journals_2564173559
source Publicly Available Content Database
subjects Algorithms
Benchmarks
Chemical sensors
Circuit design
Classification
Datasets
Drift
Gas sensors
Gases
Metal oxides
Sensor arrays
Sensors
Subtraction
Wind tunnels
title Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T04%3A39%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Drift%20in%20a%20Popular%20Metal%20Oxide%20Sensor%20Dataset%20Reveals%20Limitations%20for%20Gas%20Classification%20Benchmarks&rft.jtitle=arXiv.org&rft.au=Dennler,%20Nik&rft.date=2021-08-19&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2564173559%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_25641735593%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2564173559&rft_id=info:pmid/&rfr_iscdi=true