Loading…
A machine transliteration model based on correspondence between graphemes and phonemes
Machine transliteration is an automatic method for converting words in one language into phonetically equivalent ones in another language. There has been growing interest in the use of machine transliteration to assist machine translation and information retrieval. Three types of machine translitera...
Saved in:
Published in: | ACM transactions on Asian language information processing 2006-09, Vol.5 (3), p.185-208 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c214t-b19daa1549af1cc5b5614124bacd7cac964998831f616131609ec271d0bbf6fd3 |
---|---|
cites | cdi_FETCH-LOGICAL-c214t-b19daa1549af1cc5b5614124bacd7cac964998831f616131609ec271d0bbf6fd3 |
container_end_page | 208 |
container_issue | 3 |
container_start_page | 185 |
container_title | ACM transactions on Asian language information processing |
container_volume | 5 |
creator | Oh, Jong-Hoon Choi, Key-Sun Isahara, Hitoshi |
description | Machine transliteration is an automatic method for converting words in one language into phonetically equivalent ones in another language. There has been growing interest in the use of machine transliteration to assist machine translation and information retrieval. Three types of machine transliteration models---grapheme-based, phoneme-based, and hybrid---have been proposed. Surprisingly, there have been few reports of efforts to utilize the correspondence between source graphemes and source phonemes, although this correspondence plays an important role in machine transliteration. Furthermore, little work has been reported on ways to dynamically handle source graphemes and phonemes. In this paper, we propose a transliteration model that dynamically uses both graphemes and phonemes, particularly the correspondence between them. With this model, we have achieved better performance---improvements of about 15 to 41% in English-to-Korean transliteration and about 16 to 44% in English-to-Japanese transliteration---than has been reported for other models. |
doi_str_mv | 10.1145/1194936.1194938 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_28977438</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1199058331</sourcerecordid><originalsourceid>FETCH-LOGICAL-c214t-b19daa1549af1cc5b5614124bacd7cac964998831f616131609ec271d0bbf6fd3</originalsourceid><addsrcrecordid>eNpdkMFLwzAUxoMoOKdnr8GDt255SZo2xzF0CgMv6rWkyavraJOadIj_vR3bydPvffDj8fERcg9sASDzJYCWWqjFieUFmUGel5mQgl0eb8Eyxrm6Jjcp7RmDXAGfkc8V7Y3dtR7pGI1PXTtiNGMbPO2Dw47WJqGjU7QhRkxD8A69RVrj-IPo6Vc0ww57TNR4R4dd8MdwS64a0yW8O3NOPp6f3tcv2fZt87pebTPLQY5ZDdoZA7nUpgFr83oqJYHL2lhXWGO1klqXpYBGgQIBimm0vADH6rpRjRNz8nj6O8TwfcA0Vn2bLHad8RgOqeKlLgopykl8-CfuwyH6qVvFueAF40pM0vIk2RhSithUQ2x7E38rYNVx5Oo88pml-AOZMm8n</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>223270263</pqid></control><display><type>article</type><title>A machine transliteration model based on correspondence between graphemes and phonemes</title><source>Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list)</source><creator>Oh, Jong-Hoon ; Choi, Key-Sun ; Isahara, Hitoshi</creator><creatorcontrib>Oh, Jong-Hoon ; Choi, Key-Sun ; Isahara, Hitoshi</creatorcontrib><description>Machine transliteration is an automatic method for converting words in one language into phonetically equivalent ones in another language. There has been growing interest in the use of machine transliteration to assist machine translation and information retrieval. Three types of machine transliteration models---grapheme-based, phoneme-based, and hybrid---have been proposed. Surprisingly, there have been few reports of efforts to utilize the correspondence between source graphemes and source phonemes, although this correspondence plays an important role in machine transliteration. Furthermore, little work has been reported on ways to dynamically handle source graphemes and phonemes. In this paper, we propose a transliteration model that dynamically uses both graphemes and phonemes, particularly the correspondence between them. With this model, we have achieved better performance---improvements of about 15 to 41% in English-to-Korean transliteration and about 16 to 44% in English-to-Japanese transliteration---than has been reported for other models.</description><identifier>ISSN: 1530-0226</identifier><identifier>ISSN: 2375-4699</identifier><identifier>EISSN: 1558-3430</identifier><identifier>EISSN: 2375-4702</identifier><identifier>DOI: 10.1145/1194936.1194938</identifier><language>eng</language><publisher>New York: Association for Computing Machinery</publisher><subject>Automation ; Correspondence ; Information retrieval ; Machinery ; Proposals ; Studies ; Translations</subject><ispartof>ACM transactions on Asian language information processing, 2006-09, Vol.5 (3), p.185-208</ispartof><rights>Copyright Association for Computing Machinery Sep 2006</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c214t-b19daa1549af1cc5b5614124bacd7cac964998831f616131609ec271d0bbf6fd3</citedby><cites>FETCH-LOGICAL-c214t-b19daa1549af1cc5b5614124bacd7cac964998831f616131609ec271d0bbf6fd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Oh, Jong-Hoon</creatorcontrib><creatorcontrib>Choi, Key-Sun</creatorcontrib><creatorcontrib>Isahara, Hitoshi</creatorcontrib><title>A machine transliteration model based on correspondence between graphemes and phonemes</title><title>ACM transactions on Asian language information processing</title><description>Machine transliteration is an automatic method for converting words in one language into phonetically equivalent ones in another language. There has been growing interest in the use of machine transliteration to assist machine translation and information retrieval. Three types of machine transliteration models---grapheme-based, phoneme-based, and hybrid---have been proposed. Surprisingly, there have been few reports of efforts to utilize the correspondence between source graphemes and source phonemes, although this correspondence plays an important role in machine transliteration. Furthermore, little work has been reported on ways to dynamically handle source graphemes and phonemes. In this paper, we propose a transliteration model that dynamically uses both graphemes and phonemes, particularly the correspondence between them. With this model, we have achieved better performance---improvements of about 15 to 41% in English-to-Korean transliteration and about 16 to 44% in English-to-Japanese transliteration---than has been reported for other models.</description><subject>Automation</subject><subject>Correspondence</subject><subject>Information retrieval</subject><subject>Machinery</subject><subject>Proposals</subject><subject>Studies</subject><subject>Translations</subject><issn>1530-0226</issn><issn>2375-4699</issn><issn>1558-3430</issn><issn>2375-4702</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2006</creationdate><recordtype>article</recordtype><recordid>eNpdkMFLwzAUxoMoOKdnr8GDt255SZo2xzF0CgMv6rWkyavraJOadIj_vR3bydPvffDj8fERcg9sASDzJYCWWqjFieUFmUGel5mQgl0eb8Eyxrm6Jjcp7RmDXAGfkc8V7Y3dtR7pGI1PXTtiNGMbPO2Dw47WJqGjU7QhRkxD8A69RVrj-IPo6Vc0ww57TNR4R4dd8MdwS64a0yW8O3NOPp6f3tcv2fZt87pebTPLQY5ZDdoZA7nUpgFr83oqJYHL2lhXWGO1klqXpYBGgQIBimm0vADH6rpRjRNz8nj6O8TwfcA0Vn2bLHad8RgOqeKlLgopykl8-CfuwyH6qVvFueAF40pM0vIk2RhSithUQ2x7E38rYNVx5Oo88pml-AOZMm8n</recordid><startdate>20060901</startdate><enddate>20060901</enddate><creator>Oh, Jong-Hoon</creator><creator>Choi, Key-Sun</creator><creator>Isahara, Hitoshi</creator><general>Association for Computing Machinery</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><scope>7SC</scope><scope>8FD</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20060901</creationdate><title>A machine transliteration model based on correspondence between graphemes and phonemes</title><author>Oh, Jong-Hoon ; Choi, Key-Sun ; Isahara, Hitoshi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c214t-b19daa1549af1cc5b5614124bacd7cac964998831f616131609ec271d0bbf6fd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Automation</topic><topic>Correspondence</topic><topic>Information retrieval</topic><topic>Machinery</topic><topic>Proposals</topic><topic>Studies</topic><topic>Translations</topic><toplevel>online_resources</toplevel><creatorcontrib>Oh, Jong-Hoon</creatorcontrib><creatorcontrib>Choi, Key-Sun</creatorcontrib><creatorcontrib>Isahara, Hitoshi</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>ACM transactions on Asian language information processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Oh, Jong-Hoon</au><au>Choi, Key-Sun</au><au>Isahara, Hitoshi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A machine transliteration model based on correspondence between graphemes and phonemes</atitle><jtitle>ACM transactions on Asian language information processing</jtitle><date>2006-09-01</date><risdate>2006</risdate><volume>5</volume><issue>3</issue><spage>185</spage><epage>208</epage><pages>185-208</pages><issn>1530-0226</issn><issn>2375-4699</issn><eissn>1558-3430</eissn><eissn>2375-4702</eissn><abstract>Machine transliteration is an automatic method for converting words in one language into phonetically equivalent ones in another language. There has been growing interest in the use of machine transliteration to assist machine translation and information retrieval. Three types of machine transliteration models---grapheme-based, phoneme-based, and hybrid---have been proposed. Surprisingly, there have been few reports of efforts to utilize the correspondence between source graphemes and source phonemes, although this correspondence plays an important role in machine transliteration. Furthermore, little work has been reported on ways to dynamically handle source graphemes and phonemes. In this paper, we propose a transliteration model that dynamically uses both graphemes and phonemes, particularly the correspondence between them. With this model, we have achieved better performance---improvements of about 15 to 41% in English-to-Korean transliteration and about 16 to 44% in English-to-Japanese transliteration---than has been reported for other models.</abstract><cop>New York</cop><pub>Association for Computing Machinery</pub><doi>10.1145/1194936.1194938</doi><tpages>24</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1530-0226 |
ispartof | ACM transactions on Asian language information processing, 2006-09, Vol.5 (3), p.185-208 |
issn | 1530-0226 2375-4699 1558-3430 2375-4702 |
language | eng |
recordid | cdi_proquest_miscellaneous_28977438 |
source | Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list) |
subjects | Automation Correspondence Information retrieval Machinery Proposals Studies Translations |
title | A machine transliteration model based on correspondence between graphemes and phonemes |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T20%3A18%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20machine%20transliteration%20model%20based%20on%20correspondence%20between%20graphemes%20and%20phonemes&rft.jtitle=ACM%20transactions%20on%20Asian%20language%20information%20processing&rft.au=Oh,%20Jong-Hoon&rft.date=2006-09-01&rft.volume=5&rft.issue=3&rft.spage=185&rft.epage=208&rft.pages=185-208&rft.issn=1530-0226&rft.eissn=1558-3430&rft_id=info:doi/10.1145/1194936.1194938&rft_dat=%3Cproquest_cross%3E1199058331%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c214t-b19daa1549af1cc5b5614124bacd7cac964998831f616131609ec271d0bbf6fd3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=223270263&rft_id=info:pmid/&rfr_iscdi=true |