The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus

Although a Thai dialect speech corpus for Automatic Speech Recognition (ASR) is publicly available [1], improving the efficiency of ASR models still requires more hours of Thai dialect speech data in various domains. We present LOTUS-TRD, a 180-hour corpus comprising the four Thai regional dialect...

Bibliographic Details
Main Authors: Thatphithakkul, Sumonmas; Thangthai, Kwanchiva; Chunwijitra, Vataya
Format: Conference Proceeding
Language: English
cited_by
cites
container_end_page 6
container_issue
container_start_page 1
container_title
container_volume
creator Thatphithakkul, Sumonmas
Thangthai, Kwanchiva
Chunwijitra, Vataya
description Although a Thai dialect speech corpus for Automatic Speech Recognition (ASR) is publicly available [1], improving the efficiency of ASR models still requires more hours of Thai dialect speech data in various domains. We present LOTUS-TRD, a 180-hour corpus comprising speech in the four Thai regional dialects, Northern Thai (NT), Central Thai (CT), Northeastern Thai or Isan (NET), and Southern Thai (ST), accompanied by Thai text transcriptions in each dialect's pronunciation style. The 560 native speakers of the four dialects, aged 19-50, were asked to read aloud a set of 100 sentences in their dialect, randomly selected from 2,500 sentences per dialect. The correspondence between the sentences and each speaker's speech was then verified and corrected in preparation for constructing the pronunciation dictionary. The number of word tokens is similar across the dialects, and the corpus contains 1,587,691 word tokens in total. LOTUS-TRD is released under an open license based on CC-BY-SA 4.0. In addition, we provide a baseline result with an average Word Error Rate (WER) of 17.10% on the test set.
doi_str_mv 10.1109/O-COCOSDA64382.2024.10800335
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_10800335</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10800335</ieee_id><sourcerecordid>10800335</sourcerecordid><originalsourceid>FETCH-ieee_primary_108003353</originalsourceid><addsrcrecordid>eNqFzrFuwjAUQFFTCQnU5g8Y3sDq9NlOnJgNOVQdKkVK3BlZ6NG4CkmUpJX4exaYme5wlsvYVmAsBJr3ktvSlnWx14nKZSxRJrHAHFGpdMEik5lcKZGiRiVf2FommeSZNumKRdP0i4hCi9Qkes2sawgK-qe2Hy7UzdCf4at03zV3VbGDPbjGB6joJ_Sdb6EIvqXTDPVAdGrA9uPwN72x5dm3E0X3vrLNx8HZTx6I6DiM4eLH6_Hxp57wDUMZPSg</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus</title><source>IEEE Xplore All Conference Series</source><creator>Thatphithakkul, Sumonmas ; Thangthai, Kwanchiva ; Chunwijitra, Vataya</creator><creatorcontrib>Thatphithakkul, Sumonmas ; Thangthai, Kwanchiva ; Chunwijitra, Vataya</creatorcontrib><description>Although, Thai dialect speech corpus for Automatic Speech Recognition (ASR) is publicly available [1], to accelerate the efficiency of ASR model is still need more speech hours of Thai dialect data in various domains. We present the 180 hours of the LOTUS-TRD comprised the four Thai regional dialects: Northern Thai (NT), Central Thai (CT), Northeastern Thai or Isan (NET) and Southern Thai (ST) speech accompanied with the Thai text transcription of each dialect pronunciation style. The 560 native speakers from 4 dialects, 19-50 years old, were requested to read a set of 100 sentences from their dialect for recording which were randomly selected from 2,500 sentences per dialect. After that the correspondence between sentences and speaker's speech were verified and corrected to prepare for the pronunciation dictionary construction. The number of word tokens occurring in each dialect is quite resemblance. 
All of word tokens in the corpus are 1,587,691 tokens. The LOTUS- TRD is released under an open license based on CC-BY-SA 4.0. In addition, we provide a baseline result that demonstrates an average Word Error Rate (WER) of 17.10% on the test set.</description><identifier>EISSN: 2472-7695</identifier><identifier>EISBN: 9798331506032</identifier><identifier>DOI: 10.1109/O-COCOSDA64382.2024.10800335</identifier><language>eng</language><publisher>IEEE</publisher><subject>Automatic speech recognition ; Data models ; dialect ASR ; Dictionaries ; Error analysis ; Licenses ; low resource language ; Recording ; speech corpus ; speech recognition ; Thai dialect ; Translation</subject><ispartof>International Conference on Speech Database and Assessments, 2024, p.1-6</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10800335$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,27916,54546,54923</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10800335$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Thatphithakkul, Sumonmas</creatorcontrib><creatorcontrib>Thangthai, Kwanchiva</creatorcontrib><creatorcontrib>Chunwijitra, Vataya</creatorcontrib><title>The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus</title><title>International Conference on Speech Database and Assessments</title><addtitle>O-COCOSDA</addtitle><description>Although, Thai dialect speech corpus for Automatic Speech Recognition (ASR) is publicly available [1], to accelerate the efficiency of ASR model is still need more speech hours of Thai dialect data in various domains. 
We present the 180 hours of the LOTUS-TRD comprised the four Thai regional dialects: Northern Thai (NT), Central Thai (CT), Northeastern Thai or Isan (NET) and Southern Thai (ST) speech accompanied with the Thai text transcription of each dialect pronunciation style. The 560 native speakers from 4 dialects, 19-50 years old, were requested to read a set of 100 sentences from their dialect for recording which were randomly selected from 2,500 sentences per dialect. After that the correspondence between sentences and speaker's speech were verified and corrected to prepare for the pronunciation dictionary construction. The number of word tokens occurring in each dialect is quite resemblance. All of word tokens in the corpus are 1,587,691 tokens. The LOTUS- TRD is released under an open license based on CC-BY-SA 4.0. In addition, we provide a baseline result that demonstrates an average Word Error Rate (WER) of 17.10% on the test set.</description><subject>Automatic speech recognition</subject><subject>Data models</subject><subject>dialect ASR</subject><subject>Dictionaries</subject><subject>Error analysis</subject><subject>Licenses</subject><subject>low resource language</subject><subject>Recording</subject><subject>speech corpus</subject><subject>speech recognition</subject><subject>Thai dialect</subject><subject>Translation</subject><issn>2472-7695</issn><isbn>9798331506032</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2024</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNqFzrFuwjAUQFFTCQnU5g8Y3sDq9NlOnJgNOVQdKkVK3BlZ6NG4CkmUpJX4exaYme5wlsvYVmAsBJr3ktvSlnWx14nKZSxRJrHAHFGpdMEik5lcKZGiRiVf2FommeSZNumKRdP0i4hCi9Qkes2sawgK-qe2Hy7UzdCf4at03zV3VbGDPbjGB6joJ_Sdb6EIvqXTDPVAdGrA9uPwN72x5dm3E0X3vrLNx8HZTx6I6DiM4eLH6_Hxp57wDUMZPSg</recordid><startdate>20241017</startdate><enddate>20241017</enddate><creator>Thatphithakkul, Sumonmas</creator><creator>Thangthai, 
Kwanchiva</creator><creator>Chunwijitra, Vataya</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>20241017</creationdate><title>The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus</title><author>Thatphithakkul, Sumonmas ; Thangthai, Kwanchiva ; Chunwijitra, Vataya</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-ieee_primary_108003353</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Automatic speech recognition</topic><topic>Data models</topic><topic>dialect ASR</topic><topic>Dictionaries</topic><topic>Error analysis</topic><topic>Licenses</topic><topic>low resource language</topic><topic>Recording</topic><topic>speech corpus</topic><topic>speech recognition</topic><topic>Thai dialect</topic><topic>Translation</topic><toplevel>online_resources</toplevel><creatorcontrib>Thatphithakkul, Sumonmas</creatorcontrib><creatorcontrib>Thangthai, Kwanchiva</creatorcontrib><creatorcontrib>Chunwijitra, Vataya</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore (IEEE/IET Electronic Library - IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Thatphithakkul, Sumonmas</au><au>Thangthai, Kwanchiva</au><au>Chunwijitra, Vataya</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus</atitle><btitle>International Conference on Speech Database and 
Assessments</btitle><stitle>O-COCOSDA</stitle><date>2024-10-17</date><risdate>2024</risdate><spage>1</spage><epage>6</epage><pages>1-6</pages><eissn>2472-7695</eissn><eisbn>9798331506032</eisbn><abstract>Although, Thai dialect speech corpus for Automatic Speech Recognition (ASR) is publicly available [1], to accelerate the efficiency of ASR model is still need more speech hours of Thai dialect data in various domains. We present the 180 hours of the LOTUS-TRD comprised the four Thai regional dialects: Northern Thai (NT), Central Thai (CT), Northeastern Thai or Isan (NET) and Southern Thai (ST) speech accompanied with the Thai text transcription of each dialect pronunciation style. The 560 native speakers from 4 dialects, 19-50 years old, were requested to read a set of 100 sentences from their dialect for recording which were randomly selected from 2,500 sentences per dialect. After that the correspondence between sentences and speaker's speech were verified and corrected to prepare for the pronunciation dictionary construction. The number of word tokens occurring in each dialect is quite resemblance. All of word tokens in the corpus are 1,587,691 tokens. The LOTUS- TRD is released under an open license based on CC-BY-SA 4.0. In addition, we provide a baseline result that demonstrates an average Word Error Rate (WER) of 17.10% on the test set.</abstract><pub>IEEE</pub><doi>10.1109/O-COCOSDA64382.2024.10800335</doi></addata></record>
fulltext fulltext_linktorsrc
identifier EISSN: 2472-7695
ispartof International Conference on Speech Database and Assessments, 2024, p.1-6
issn 2472-7695
language eng
recordid cdi_ieee_primary_10800335
source IEEE Xplore All Conference Series
subjects Automatic speech recognition
Data models
dialect ASR
Dictionaries
Error analysis
Licenses
low resource language
Recording
speech corpus
speech recognition
Thai dialect
Translation
title The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T04%3A59%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=The%20Development%20of%20LOTUS-TRD:%20A%20Thai%20Regional%20Dialect%20Speech%20Corpus&rft.btitle=International%20Conference%20on%20Speech%20Database%20and%20Assessments&rft.au=Thatphithakkul,%20Sumonmas&rft.date=2024-10-17&rft.spage=1&rft.epage=6&rft.pages=1-6&rft.eissn=2472-7695&rft_id=info:doi/10.1109/O-COCOSDA64382.2024.10800335&rft.eisbn=9798331506032&rft_dat=%3Cieee_CHZPO%3E10800335%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-ieee_primary_108003353%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10800335&rfr_iscdi=true