Loading…
The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus
Although, Thai dialect speech corpus for Automatic Speech Recognition (ASR) is publicly available [1], to accelerate the efficiency of ASR model is still need more speech hours of Thai dialect data in various domains. We present the 180 hours of the LOTUS-TRD comprised the four Thai regional dialect...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | 6 |
container_issue | |
container_start_page | 1 |
container_title | |
container_volume | |
creator | Thatphithakkul, Sumonmas Thangthai, Kwanchiva Chunwijitra, Vataya |
description | Although, Thai dialect speech corpus for Automatic Speech Recognition (ASR) is publicly available [1], to accelerate the efficiency of ASR model is still need more speech hours of Thai dialect data in various domains. We present the 180 hours of the LOTUS-TRD comprised the four Thai regional dialects: Northern Thai (NT), Central Thai (CT), Northeastern Thai or Isan (NET) and Southern Thai (ST) speech accompanied with the Thai text transcription of each dialect pronunciation style. The 560 native speakers from 4 dialects, 19-50 years old, were requested to read a set of 100 sentences from their dialect for recording which were randomly selected from 2,500 sentences per dialect. After that the correspondence between sentences and speaker's speech were verified and corrected to prepare for the pronunciation dictionary construction. The number of word tokens occurring in each dialect is quite resemblance. All of word tokens in the corpus are 1,587,691 tokens. The LOTUS- TRD is released under an open license based on CC-BY-SA 4.0. In addition, we provide a baseline result that demonstrates an average Word Error Rate (WER) of 17.10% on the test set. |
doi_str_mv | 10.1109/O-COCOSDA64382.2024.10800335 |
format | conference_proceeding |
fullrecord | <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_10800335</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10800335</ieee_id><sourcerecordid>10800335</sourcerecordid><originalsourceid>FETCH-ieee_primary_108003353</originalsourceid><addsrcrecordid>eNqFzrFuwjAUQFFTCQnU5g8Y3sDq9NlOnJgNOVQdKkVK3BlZ6NG4CkmUpJX4exaYme5wlsvYVmAsBJr3ktvSlnWx14nKZSxRJrHAHFGpdMEik5lcKZGiRiVf2FommeSZNumKRdP0i4hCi9Qkes2sawgK-qe2Hy7UzdCf4at03zV3VbGDPbjGB6joJ_Sdb6EIvqXTDPVAdGrA9uPwN72x5dm3E0X3vrLNx8HZTx6I6DiM4eLH6_Hxp57wDUMZPSg</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus</title><source>IEEE Xplore All Conference Series</source><creator>Thatphithakkul, Sumonmas ; Thangthai, Kwanchiva ; Chunwijitra, Vataya</creator><creatorcontrib>Thatphithakkul, Sumonmas ; Thangthai, Kwanchiva ; Chunwijitra, Vataya</creatorcontrib><description>Although, Thai dialect speech corpus for Automatic Speech Recognition (ASR) is publicly available [1], to accelerate the efficiency of ASR model is still need more speech hours of Thai dialect data in various domains. We present the 180 hours of the LOTUS-TRD comprised the four Thai regional dialects: Northern Thai (NT), Central Thai (CT), Northeastern Thai or Isan (NET) and Southern Thai (ST) speech accompanied with the Thai text transcription of each dialect pronunciation style. The 560 native speakers from 4 dialects, 19-50 years old, were requested to read a set of 100 sentences from their dialect for recording which were randomly selected from 2,500 sentences per dialect. After that the correspondence between sentences and speaker's speech were verified and corrected to prepare for the pronunciation dictionary construction. The number of word tokens occurring in each dialect is quite resemblance. All of word tokens in the corpus are 1,587,691 tokens. The LOTUS- TRD is released under an open license based on CC-BY-SA 4.0. In addition, we provide a baseline result that demonstrates an average Word Error Rate (WER) of 17.10% on the test set.</description><identifier>EISSN: 2472-7695</identifier><identifier>EISBN: 9798331506032</identifier><identifier>DOI: 10.1109/O-COCOSDA64382.2024.10800335</identifier><language>eng</language><publisher>IEEE</publisher><subject>Automatic speech recognition ; Data models ; dialect ASR ; Dictionaries ; Error analysis ; Licenses ; low resource language ; Recording ; speech corpus ; speech recognition ; Thai dialect ; Translation</subject><ispartof>International Conference on Speech Database and Assessments, 2024, p.1-6</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10800335$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,27916,54546,54923</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10800335$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Thatphithakkul, Sumonmas</creatorcontrib><creatorcontrib>Thangthai, Kwanchiva</creatorcontrib><creatorcontrib>Chunwijitra, Vataya</creatorcontrib><title>The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus</title><title>International Conference on Speech Database and Assessments</title><addtitle>O-COCOSDA</addtitle><description>Although, Thai dialect speech corpus for Automatic Speech Recognition (ASR) is publicly available [1], to accelerate the efficiency of ASR model is still need more speech hours of Thai dialect data in various domains. We present the 180 hours of the LOTUS-TRD comprised the four Thai regional dialects: Northern Thai (NT), Central Thai (CT), Northeastern Thai or Isan (NET) and Southern Thai (ST) speech accompanied with the Thai text transcription of each dialect pronunciation style. The 560 native speakers from 4 dialects, 19-50 years old, were requested to read a set of 100 sentences from their dialect for recording which were randomly selected from 2,500 sentences per dialect. After that the correspondence between sentences and speaker's speech were verified and corrected to prepare for the pronunciation dictionary construction. The number of word tokens occurring in each dialect is quite resemblance. All of word tokens in the corpus are 1,587,691 tokens. The LOTUS- TRD is released under an open license based on CC-BY-SA 4.0. In addition, we provide a baseline result that demonstrates an average Word Error Rate (WER) of 17.10% on the test set.</description><subject>Automatic speech recognition</subject><subject>Data models</subject><subject>dialect ASR</subject><subject>Dictionaries</subject><subject>Error analysis</subject><subject>Licenses</subject><subject>low resource language</subject><subject>Recording</subject><subject>speech corpus</subject><subject>speech recognition</subject><subject>Thai dialect</subject><subject>Translation</subject><issn>2472-7695</issn><isbn>9798331506032</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2024</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNqFzrFuwjAUQFFTCQnU5g8Y3sDq9NlOnJgNOVQdKkVK3BlZ6NG4CkmUpJX4exaYme5wlsvYVmAsBJr3ktvSlnWx14nKZSxRJrHAHFGpdMEik5lcKZGiRiVf2FommeSZNumKRdP0i4hCi9Qkes2sawgK-qe2Hy7UzdCf4at03zV3VbGDPbjGB6joJ_Sdb6EIvqXTDPVAdGrA9uPwN72x5dm3E0X3vrLNx8HZTx6I6DiM4eLH6_Hxp57wDUMZPSg</recordid><startdate>20241017</startdate><enddate>20241017</enddate><creator>Thatphithakkul, Sumonmas</creator><creator>Thangthai, Kwanchiva</creator><creator>Chunwijitra, Vataya</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>20241017</creationdate><title>The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus</title><author>Thatphithakkul, Sumonmas ; Thangthai, Kwanchiva ; Chunwijitra, Vataya</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-ieee_primary_108003353</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Automatic speech recognition</topic><topic>Data models</topic><topic>dialect ASR</topic><topic>Dictionaries</topic><topic>Error analysis</topic><topic>Licenses</topic><topic>low resource language</topic><topic>Recording</topic><topic>speech corpus</topic><topic>speech recognition</topic><topic>Thai dialect</topic><topic>Translation</topic><toplevel>online_resources</toplevel><creatorcontrib>Thatphithakkul, Sumonmas</creatorcontrib><creatorcontrib>Thangthai, Kwanchiva</creatorcontrib><creatorcontrib>Chunwijitra, Vataya</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore (IEEE/IET Electronic Library - IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Thatphithakkul, Sumonmas</au><au>Thangthai, Kwanchiva</au><au>Chunwijitra, Vataya</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus</atitle><btitle>International Conference on Speech Database and Assessments</btitle><stitle>O-COCOSDA</stitle><date>2024-10-17</date><risdate>2024</risdate><spage>1</spage><epage>6</epage><pages>1-6</pages><eissn>2472-7695</eissn><eisbn>9798331506032</eisbn><abstract>Although, Thai dialect speech corpus for Automatic Speech Recognition (ASR) is publicly available [1], to accelerate the efficiency of ASR model is still need more speech hours of Thai dialect data in various domains. We present the 180 hours of the LOTUS-TRD comprised the four Thai regional dialects: Northern Thai (NT), Central Thai (CT), Northeastern Thai or Isan (NET) and Southern Thai (ST) speech accompanied with the Thai text transcription of each dialect pronunciation style. The 560 native speakers from 4 dialects, 19-50 years old, were requested to read a set of 100 sentences from their dialect for recording which were randomly selected from 2,500 sentences per dialect. After that the correspondence between sentences and speaker's speech were verified and corrected to prepare for the pronunciation dictionary construction. The number of word tokens occurring in each dialect is quite resemblance. All of word tokens in the corpus are 1,587,691 tokens. The LOTUS- TRD is released under an open license based on CC-BY-SA 4.0. In addition, we provide a baseline result that demonstrates an average Word Error Rate (WER) of 17.10% on the test set.</abstract><pub>IEEE</pub><doi>10.1109/O-COCOSDA64382.2024.10800335</doi></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | EISSN: 2472-7695 |
ispartof | International Conference on Speech Database and Assessments, 2024, p.1-6 |
issn | 2472-7695 |
language | eng |
recordid | cdi_ieee_primary_10800335 |
source | IEEE Xplore All Conference Series |
subjects | Automatic speech recognition Data models dialect ASR Dictionaries Error analysis Licenses low resource language Recording speech corpus speech recognition Thai dialect Translation |
title | The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T04%3A59%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=The%20Development%20of%20LOTUS-TRD:%20A%20Thai%20Regional%20Dialect%20Speech%20Corpus&rft.btitle=International%20Conference%20on%20Speech%20Database%20and%20Assessments&rft.au=Thatphithakkul,%20Sumonmas&rft.date=2024-10-17&rft.spage=1&rft.epage=6&rft.pages=1-6&rft.eissn=2472-7695&rft_id=info:doi/10.1109/O-COCOSDA64382.2024.10800335&rft.eisbn=9798331506032&rft_dat=%3Cieee_CHZPO%3E10800335%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-ieee_primary_108003353%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10800335&rfr_iscdi=true |