Loading…

Developing the first balanced corpus for Bangla language

The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time...

Full description

Saved in:
Bibliographic Details
Main Authors: Salam, K. M. Anwarus, Yamada, S., Nishino, T.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time and medium. This paper also explains domain classifications and weight percentage for each domain. We also identify the prospective source of information for preparing the corpus.
DOI:10.1109/ICIEV.2012.6317356