
Sino-Vietnamese Text Transcription using Word Embedding Approach

Bibliographic Details
Main Authors: Dao, Trung-Kien; Nguyen, Dinh-Van; Pham, Minh-Vu; Duong, Nhat-Nam
Format: Conference Proceeding
Language: English
Description
Summary: Sino-Vietnamese (also known as Han Viet) characters, derived from ancient Chinese script, were prominent in Vietnam for centuries until the advent of the modern writing system. Despite their historical significance, contemporary Vietnamese readers struggle to decipher these characters, which hinders access to valuable cultural artifacts. This paper explores the complexities of Sino-Vietnamese characters, their divergence from modern Chinese characters, and the intricate process of converting them into readable text. Notably, Sino-Vietnamese characters have eight tones compared with Mandarin's five, adding to their linguistic complexity. While remnants of these characters persist in artifacts and bibliographies, comprehending them requires linguistic expertise because a single character can have multiple possible readings. Addressing this challenge requires tools for automatic recognition and translation, which would enable broader access to Vietnam's cultural heritage. By facilitating the understanding of ancient texts, these efforts contribute to the preservation and appreciation of Vietnamese linguistic and cultural traditions.
ISSN: 2836-4392
DOI: 10.1109/ICCE62051.2024.10634601