Taiwanese Across Taiwan Corpus And Its Applications

The Taiwanese Across Taiwan (TAT) corpus is the first large-scale, publicly released Taiwanese speech corpus representing modern Taiwanese as spoken around Taiwan. This paper briefly reviews the TAT corpus and a corresponding parallel Chinese, Hàn-Lô-Tâi-bûn, Tai-Luo, and Péh-ōe-jī lexicon and demonstra...

Bibliographic Details
Main Authors: Liao, Yuan-Fu; Tsay, Jane S.; Kang, Peter; Khoo, Hui-Lu; Tan, Le-Kun; Chang, Li-Chen; Iunn, Un-Gian; Su, Huang-Lan; Thiann, Tsun-Guan; Tiun, Hak-Khiam; Liao, Su-Lian
Format: Conference Proceeding
Language: English
Subjects: automatic speech recognition; speech synthesis; Taiwanese speech corpus; voice conversion
container_end_page 5
container_start_page 1
creator Liao, Yuan-Fu
Tsay, Jane S.
Kang, Peter
Khoo, Hui-Lu
Tan, Le-Kun
Chang, Li-Chen
Iunn, Un-Gian
Su, Huang-Lan
Thiann, Tsun-Guan
Tiun, Hak-Khiam
Liao, Su-Lian
description The Taiwanese Across Taiwan (TAT) corpus is the first large-scale, publicly released Taiwanese speech corpus representing modern Taiwanese as spoken around Taiwan. This paper briefly reviews the TAT corpus and a corresponding parallel Chinese, Hàn-Lô-Tâi-bûn, Tai-Luo, and Péh-ōe-jī lexicon, and demonstrates some of their potential applications, including ASR, TTS, and voice conversion. The corresponding pretrained ASR and TTS models, sample model usage code, and training scripts will also be released. More information can be found on the Formosa Speech in the Wild website: https://sites.google.com/nycu.edu.tw/fsw.
doi_str_mv 10.1109/O-COCOSDA202257103.2022.9997977
format conference_proceeding
fulltext fulltext_linktorsrc
identifier EISSN: 2472-7695
ispartof 2022 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), 2022, p.1-5
issn 2472-7695
language eng
recordid cdi_ieee_primary_9997977
source IEEE Xplore All Conference Series
subjects automatic speech recognition
speech synthesis
Taiwanese speech corpus
voice conversion
title Taiwanese Across Taiwan Corpus And Its Applications
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T20%3A57%3A21IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Taiwanese%20Across%20Taiwan%20Corpus%20And%20Its%20Applications&rft.btitle=2022%2025th%20Conference%20of%20the%20Oriental%20COCOSDA%20International%20Committee%20for%20the%20Co-ordination%20and%20Standardisation%20of%20Speech%20Databases%20and%20Assessment%20Techniques%20(O-COCOSDA)&rft.au=Liao,%20Yuan-Fu&rft.date=2022-11&rft.spage=1&rft.epage=5&rft.pages=1-5&rft.eissn=2472-7695&rft_id=info:doi/10.1109/O-COCOSDA202257103.2022.9997977&rft.eisbn=9798350398564&rft_dat=%3Cieee_CHZPO%3E9997977%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i133t-69c67565e4bc7d07dfeb6cadf05c7ef505b2cc836aa132fe2cab8df73e66039d3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=9997977&rfr_iscdi=true