Loading…

A novel multiclass classification based approach for playback attack detection in speaker verification systems

Spoofing detection in automatic speaker verification (ASV) systems is typically handled as a binary classification approach. In this paper, we propose a novel approach to address this problem using a multi-class classification approach. Each audio sample is tagged on the basis of the source of the s...

Full description

Saved in:
Bibliographic Details
Published in:Journal of ambient intelligence and humanized computing 2023-12, Vol.14 (12), p.16737-16748
Main Authors: Mankad, Sapan H., Garg, Sanjay, Patel, Vansh, Patwa, Nishi
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c1859-d1d6c7018169fc3311e10a2a5fc6db771a0757ea406a2f2efa70253afa2e803a3
container_end_page 16748
container_issue 12
container_start_page 16737
container_title Journal of ambient intelligence and humanized computing
container_volume 14
creator Mankad, Sapan H.
Garg, Sanjay
Patel, Vansh
Patwa, Nishi
description Spoofing detection in automatic speaker verification (ASV) systems is typically handled as a binary classification approach. In this paper, we propose a novel approach to address this problem using a multi-class classification approach. Each audio sample is tagged on the basis of the source of the signal. Spoof class samples are divided according to corresponding recording devices which were used during recording of the genuine speaker’s voice to be later used for implementing playback attack. Three different multiclass based approaches proposed in this work are evaluated on ASVspoof 2017 v2.0 dataset. The performance of these systems is tested on conventional and deep classifier systems using both handcrafted features and spectrographic representations of audio. Results suggest the potential of the proposed multiclass classification based approach in comparison to binary classification, specifically in deep learning scenario.
doi_str_mv 10.1007/s12652-023-04684-9
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2919536974</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2919536974</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1859-d1d6c7018169fc3311e10a2a5fc6db771a0757ea406a2f2efa70253afa2e803a3</originalsourceid><addsrcrecordid>eNp9UMtOwzAQtBBIVKU_wMkS54AfsZ0cq4qXVIkLnK2ts4a0aRLstFL_HrdB9MYedvYwMzsaQm45u-eMmYfIhVYiY0JmLNdFnpUXZMILXWSK5-ry75bmmsxiXLM0spSc8wlp57Tt9tjQ7a4ZatdAjPS0a187GOqupSuIWFHo-9CB-6K-C7Rv4LACt6EwDEeocEB3ItctjT3CBgPdYzibxEMccBtvyJWHJuLsF6fk4-nxffGSLd-eXxfzZeZ4ocqs4pV2hvGC69I7maIiZyBAeaerlTEcmFEGIWcahBfowTChJHgQWDAJckruRt8U-nuHcbDrbhfa9NKKkpdK6tLkiSVGlgtdjAG97UO9hXCwnNljtXas1qZq7alaWyaRHEUxkdtPDGfrf1Q_iWR9ng</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2919536974</pqid></control><display><type>article</type><title>A novel multiclass classification based approach for playback attack detection in speaker verification systems</title><source>Springer Nature</source><creator>Mankad, Sapan H. ; Garg, Sanjay ; Patel, Vansh ; Patwa, Nishi</creator><creatorcontrib>Mankad, Sapan H. ; Garg, Sanjay ; Patel, Vansh ; Patwa, Nishi</creatorcontrib><description>Spoofing detection in automatic speaker verification (ASV) systems is typically handled as a binary classification approach. In this paper, we propose a novel approach to address this problem using a multi-class classification approach. Each audio sample is tagged on the basis of the source of the signal. Spoof class samples are divided according to corresponding recording devices which were used during recording of the genuine speaker’s voice to be later used for implementing playback attack. Three different multiclass based approaches proposed in this work are evaluated on ASVspoof 2017 v2.0 dataset. The performance of these systems is tested on conventional and deep classifier systems using both handcrafted features and spectrographic representations of audio. Results suggest the potential of the proposed multiclass classification based approach in comparison to binary classification, specifically in deep learning scenario.</description><identifier>ISSN: 1868-5137</identifier><identifier>EISSN: 1868-5145</identifier><identifier>DOI: 10.1007/s12652-023-04684-9</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Artificial Intelligence ; Biometrics ; Classification ; Computational Intelligence ; Datasets ; Deep learning ; Engineering ; Machine learning ; Neural networks ; Original Research ; Recording ; Robotics and Automation ; Spoofing ; User Interfaces and Human Computer Interaction ; Verification</subject><ispartof>Journal of ambient intelligence and humanized computing, 2023-12, Vol.14 (12), p.16737-16748</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1859-d1d6c7018169fc3311e10a2a5fc6db771a0757ea406a2f2efa70253afa2e803a3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>Mankad, Sapan H.</creatorcontrib><creatorcontrib>Garg, Sanjay</creatorcontrib><creatorcontrib>Patel, Vansh</creatorcontrib><creatorcontrib>Patwa, Nishi</creatorcontrib><title>A novel multiclass classification based approach for playback attack detection in speaker verification systems</title><title>Journal of ambient intelligence and humanized computing</title><addtitle>J Ambient Intell Human Comput</addtitle><description>Spoofing detection in automatic speaker verification (ASV) systems is typically handled as a binary classification approach. In this paper, we propose a novel approach to address this problem using a multi-class classification approach. Each audio sample is tagged on the basis of the source of the signal. Spoof class samples are divided according to corresponding recording devices which were used during recording of the genuine speaker’s voice to be later used for implementing playback attack. Three different multiclass based approaches proposed in this work are evaluated on ASVspoof 2017 v2.0 dataset. The performance of these systems is tested on conventional and deep classifier systems using both handcrafted features and spectrographic representations of audio. Results suggest the potential of the proposed multiclass classification based approach in comparison to binary classification, specifically in deep learning scenario.</description><subject>Artificial Intelligence</subject><subject>Biometrics</subject><subject>Classification</subject><subject>Computational Intelligence</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Engineering</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Original Research</subject><subject>Recording</subject><subject>Robotics and Automation</subject><subject>Spoofing</subject><subject>User Interfaces and Human Computer Interaction</subject><subject>Verification</subject><issn>1868-5137</issn><issn>1868-5145</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9UMtOwzAQtBBIVKU_wMkS54AfsZ0cq4qXVIkLnK2ts4a0aRLstFL_HrdB9MYedvYwMzsaQm45u-eMmYfIhVYiY0JmLNdFnpUXZMILXWSK5-ry75bmmsxiXLM0spSc8wlp57Tt9tjQ7a4ZatdAjPS0a187GOqupSuIWFHo-9CB-6K-C7Rv4LACt6EwDEeocEB3ItctjT3CBgPdYzibxEMccBtvyJWHJuLsF6fk4-nxffGSLd-eXxfzZeZ4ocqs4pV2hvGC69I7maIiZyBAeaerlTEcmFEGIWcahBfowTChJHgQWDAJckruRt8U-nuHcbDrbhfa9NKKkpdK6tLkiSVGlgtdjAG97UO9hXCwnNljtXas1qZq7alaWyaRHEUxkdtPDGfrf1Q_iWR9ng</recordid><startdate>20231201</startdate><enddate>20231201</enddate><creator>Mankad, Sapan H.</creator><creator>Garg, Sanjay</creator><creator>Patel, Vansh</creator><creator>Patwa, Nishi</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope></search><sort><creationdate>20231201</creationdate><title>A novel multiclass classification based approach for playback attack detection in speaker verification systems</title><author>Mankad, Sapan H. ; Garg, Sanjay ; Patel, Vansh ; Patwa, Nishi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1859-d1d6c7018169fc3311e10a2a5fc6db771a0757ea406a2f2efa70253afa2e803a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial Intelligence</topic><topic>Biometrics</topic><topic>Classification</topic><topic>Computational Intelligence</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Engineering</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Original Research</topic><topic>Recording</topic><topic>Robotics and Automation</topic><topic>Spoofing</topic><topic>User Interfaces and Human Computer Interaction</topic><topic>Verification</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mankad, Sapan H.</creatorcontrib><creatorcontrib>Garg, Sanjay</creatorcontrib><creatorcontrib>Patel, Vansh</creatorcontrib><creatorcontrib>Patwa, Nishi</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Database‎ (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer science database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Journal of ambient intelligence and humanized computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mankad, Sapan H.</au><au>Garg, Sanjay</au><au>Patel, Vansh</au><au>Patwa, Nishi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A novel multiclass classification based approach for playback attack detection in speaker verification systems</atitle><jtitle>Journal of ambient intelligence and humanized computing</jtitle><stitle>J Ambient Intell Human Comput</stitle><date>2023-12-01</date><risdate>2023</risdate><volume>14</volume><issue>12</issue><spage>16737</spage><epage>16748</epage><pages>16737-16748</pages><issn>1868-5137</issn><eissn>1868-5145</eissn><abstract>Spoofing detection in automatic speaker verification (ASV) systems is typically handled as a binary classification approach. In this paper, we propose a novel approach to address this problem using a multi-class classification approach. Each audio sample is tagged on the basis of the source of the signal. Spoof class samples are divided according to corresponding recording devices which were used during recording of the genuine speaker’s voice to be later used for implementing playback attack. Three different multiclass based approaches proposed in this work are evaluated on ASVspoof 2017 v2.0 dataset. The performance of these systems is tested on conventional and deep classifier systems using both handcrafted features and spectrographic representations of audio. Results suggest the potential of the proposed multiclass classification based approach in comparison to binary classification, specifically in deep learning scenario.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s12652-023-04684-9</doi><tpages>12</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1868-5137
ispartof Journal of ambient intelligence and humanized computing, 2023-12, Vol.14 (12), p.16737-16748
issn 1868-5137
1868-5145
language eng
recordid cdi_proquest_journals_2919536974
source Springer Nature
subjects Artificial Intelligence
Biometrics
Classification
Computational Intelligence
Datasets
Deep learning
Engineering
Machine learning
Neural networks
Original Research
Recording
Robotics and Automation
Spoofing
User Interfaces and Human Computer Interaction
Verification
title A novel multiclass classification based approach for playback attack detection in speaker verification systems
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T17%3A10%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20novel%20multiclass%20classification%20based%20approach%20for%20playback%20attack%20detection%20in%20speaker%20verification%20systems&rft.jtitle=Journal%20of%20ambient%20intelligence%20and%20humanized%20computing&rft.au=Mankad,%20Sapan%20H.&rft.date=2023-12-01&rft.volume=14&rft.issue=12&rft.spage=16737&rft.epage=16748&rft.pages=16737-16748&rft.issn=1868-5137&rft.eissn=1868-5145&rft_id=info:doi/10.1007/s12652-023-04684-9&rft_dat=%3Cproquest_cross%3E2919536974%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1859-d1d6c7018169fc3311e10a2a5fc6db771a0757ea406a2f2efa70253afa2e803a3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2919536974&rft_id=info:pmid/&rfr_iscdi=true