A novel multiclass classification based approach for playback attack detection in speaker verification systems
Spoofing detection in automatic speaker verification (ASV) systems is typically handled as a binary classification approach. In this paper, we propose a novel approach to address this problem using a multi-class classification approach. Each audio sample is tagged on the basis of the source of the s...
Published in: | Journal of ambient intelligence and humanized computing 2023-12, Vol.14 (12), p.16737-16748 |
---|---|
Main Authors: | Mankad, Sapan H.; Garg, Sanjay; Patel, Vansh; Patwa, Nishi |
Format: | Article |
Language: | English |
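Subjects: | Artificial Intelligence; Biometrics; Classification; Computational Intelligence; Datasets; Deep learning; Engineering; Machine learning; Neural networks; Original Research; Recording; Robotics and Automation; Spoofing; User Interfaces and Human Computer Interaction; Verification |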
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c1859-d1d6c7018169fc3311e10a2a5fc6db771a0757ea406a2f2efa70253afa2e803a3 |
container_end_page | 16748 |
container_issue | 12 |
container_start_page | 16737 |
container_title | Journal of ambient intelligence and humanized computing |
container_volume | 14 |
creator | Mankad, Sapan H.; Garg, Sanjay; Patel, Vansh; Patwa, Nishi |
description | Spoofing detection in automatic speaker verification (ASV) systems is typically handled as a binary classification problem. In this paper, we propose a novel approach that addresses it as a multi-class classification problem. Each audio sample is tagged on the basis of the source of the signal. Spoof class samples are divided according to the recording devices that were used to record the genuine speaker’s voice for later use in a playback attack. Three different multiclass approaches proposed in this work are evaluated on the ASVspoof 2017 v2.0 dataset. The performance of these systems is tested on conventional and deep classifier systems using both handcrafted features and spectrographic representations of audio. Results suggest the potential of the proposed multiclass classification approach in comparison to binary classification, specifically in the deep learning scenario. |
doi_str_mv | 10.1007/s12652-023-04684-9 |
format | article |
publisher | Berlin/Heidelberg: Springer Berlin Heidelberg |
rights | The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. |
toplevel | peer_reviewed; online_resources |
tpages | 12 |
fulltext | fulltext |
identifier | ISSN: 1868-5137 |
ispartof | Journal of ambient intelligence and humanized computing, 2023-12, Vol.14 (12), p.16737-16748 |
issn | 1868-5137 1868-5145 |
language | eng |
recordid | cdi_proquest_journals_2919536974 |
source | Springer Nature |
subjects | Artificial Intelligence; Biometrics; Classification; Computational Intelligence; Datasets; Deep learning; Engineering; Machine learning; Neural networks; Original Research; Recording; Robotics and Automation; Spoofing; User Interfaces and Human Computer Interaction; Verification |
title | A novel multiclass classification based approach for playback attack detection in speaker verification systems |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T17%3A10%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20novel%20multiclass%20classification%20based%20approach%20for%20playback%20attack%20detection%20in%20speaker%20verification%20systems&rft.jtitle=Journal%20of%20ambient%20intelligence%20and%20humanized%20computing&rft.au=Mankad,%20Sapan%20H.&rft.date=2023-12-01&rft.volume=14&rft.issue=12&rft.spage=16737&rft.epage=16748&rft.pages=16737-16748&rft.issn=1868-5137&rft.eissn=1868-5145&rft_id=info:doi/10.1007/s12652-023-04684-9&rft_dat=%3Cproquest_cross%3E2919536974%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1859-d1d6c7018169fc3311e10a2a5fc6db771a0757ea406a2f2efa70253afa2e803a3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2919536974&rft_id=info:pmid/&rfr_iscdi=true |