Developing the first balanced corpus for Bangla language

The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time...

Bibliographic Details
Main Authors: Salam, K. M. Anwarus, Yamada, S., Nishino, T.
Format: Conference Proceeding
Language: English
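Subjects: Bangla Language Processing; Blogs; Business; corpus development; Electronic mail; Encyclopedias; History; Writing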
cited_by
cites
container_end_page 1084
container_issue
container_start_page 1081
container_title
container_volume
creator Salam, K. M. Anwarus
Yamada, S.
Nishino, T.
description The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time and medium. This paper also explains domain classifications and weight percentage for each domain. We also identify the prospective source of information for preparing the corpus.
doi_str_mv 10.1109/ICIEV.2012.6317356
format conference_proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6317356</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6317356</ieee_id><sourcerecordid>6317356</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-8b8761346942c3c490a5b7cd74cbff4b6bca70ce258df8648c9e4bd35c89cf033</originalsourceid><addsrcrecordid>eNpFj81KxDAUhSMiqOO8gG7yAq1Jb5qfpdZRCwNuBrdDcntTK7UtTUfw7R1wwLM5fBz44DB2K0UupXD3dVVv3vNCyCLXIA2U-oxdS6UNSFkW7vwfwFyydUqf4hgrLTh3xewTfVM_Tt3Q8uWDeOzmtPDgez8gNRzHeTokHseZP_qh7T0_Du3Bt3TDLqLvE61PvWK7582ues22by919bDNOieWzAZrtASlnSoQUDnhy2CwMQpDjCrogN4IpKK0TbRaWXSkQgMlWodRAKzY3Z-2I6L9NHdffv7Zn47CL2BKRxI</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Developing the first balanced corpus for Bangla language</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Salam, K. M. Anwarus ; Yamada, S. ; Nishino, T.</creator><creatorcontrib>Salam, K. M. Anwarus ; Yamada, S. ; Nishino, T.</creatorcontrib><description>The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time and medium. This paper also explains domain classifications and weight percentage for each domain. 
We also identify the prospective source of information for preparing the corpus.</description><identifier>ISBN: 1467311537</identifier><identifier>ISBN: 9781467311533</identifier><identifier>EISBN: 1467311529</identifier><identifier>EISBN: 1467311545</identifier><identifier>EISBN: 9781467311540</identifier><identifier>EISBN: 9781467311526</identifier><identifier>DOI: 10.1109/ICIEV.2012.6317356</identifier><language>eng</language><publisher>IEEE</publisher><subject>Bangla Language Processing ; Blogs ; Business ; corpus development ; Electronic mail ; Encyclopedias ; History ; Writing</subject><ispartof>2012 International Conference on Informatics, Electronics &amp; Vision (ICIEV), 2012, p.1081-1084</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6317356$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6317356$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Salam, K. M. Anwarus</creatorcontrib><creatorcontrib>Yamada, S.</creatorcontrib><creatorcontrib>Nishino, T.</creatorcontrib><title>Developing the first balanced corpus for Bangla language</title><title>2012 International Conference on Informatics, Electronics &amp; Vision (ICIEV)</title><addtitle>ICIEV</addtitle><description>The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time and medium. This paper also explains domain classifications and weight percentage for each domain. 
We also identify the prospective source of information for preparing the corpus.</description><subject>Bangla Language Processing</subject><subject>Blogs</subject><subject>Business</subject><subject>corpus development</subject><subject>Electronic mail</subject><subject>Encyclopedias</subject><subject>History</subject><subject>Writing</subject><isbn>1467311537</isbn><isbn>9781467311533</isbn><isbn>1467311529</isbn><isbn>1467311545</isbn><isbn>9781467311540</isbn><isbn>9781467311526</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2012</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNpFj81KxDAUhSMiqOO8gG7yAq1Jb5qfpdZRCwNuBrdDcntTK7UtTUfw7R1wwLM5fBz44DB2K0UupXD3dVVv3vNCyCLXIA2U-oxdS6UNSFkW7vwfwFyydUqf4hgrLTh3xewTfVM_Tt3Q8uWDeOzmtPDgez8gNRzHeTokHseZP_qh7T0_Du3Bt3TDLqLvE61PvWK7582ues22by919bDNOieWzAZrtASlnSoQUDnhy2CwMQpDjCrogN4IpKK0TbRaWXSkQgMlWodRAKzY3Z-2I6L9NHdffv7Zn47CL2BKRxI</recordid><startdate>201205</startdate><enddate>201205</enddate><creator>Salam, K. M. Anwarus</creator><creator>Yamada, S.</creator><creator>Nishino, T.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201205</creationdate><title>Developing the first balanced corpus for Bangla language</title><author>Salam, K. M. Anwarus ; Yamada, S. 
; Nishino, T.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-8b8761346942c3c490a5b7cd74cbff4b6bca70ce258df8648c9e4bd35c89cf033</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Bangla Language Processing</topic><topic>Blogs</topic><topic>Business</topic><topic>corpus development</topic><topic>Electronic mail</topic><topic>Encyclopedias</topic><topic>History</topic><topic>Writing</topic><toplevel>online_resources</toplevel><creatorcontrib>Salam, K. M. Anwarus</creatorcontrib><creatorcontrib>Yamada, S.</creatorcontrib><creatorcontrib>Nishino, T.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEL</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Salam, K. M. Anwarus</au><au>Yamada, S.</au><au>Nishino, T.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Developing the first balanced corpus for Bangla language</atitle><btitle>2012 International Conference on Informatics, Electronics &amp; Vision (ICIEV)</btitle><stitle>ICIEV</stitle><date>2012-05</date><risdate>2012</risdate><spage>1081</spage><epage>1084</epage><pages>1081-1084</pages><isbn>1467311537</isbn><isbn>9781467311533</isbn><eisbn>1467311529</eisbn><eisbn>1467311545</eisbn><eisbn>9781467311540</eisbn><eisbn>9781467311526</eisbn><abstract>The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. 
The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time and medium. This paper also explains domain classifications and weight percentage for each domain. We also identify the prospective source of information for preparing the corpus.</abstract><pub>IEEE</pub><doi>10.1109/ICIEV.2012.6317356</doi><tpages>4</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISBN: 1467311537
ispartof 2012 International Conference on Informatics, Electronics & Vision (ICIEV), 2012, p.1081-1084
issn
language eng
recordid cdi_ieee_primary_6317356
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Bangla Language Processing
Blogs
Business
corpus development
Electronic mail
Encyclopedias
History
Writing
title Developing the first balanced corpus for Bangla language
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T14%3A12%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Developing%20the%20first%20balanced%20corpus%20for%20Bangla%20language&rft.btitle=2012%20International%20Conference%20on%20Informatics,%20Electronics%20&%20Vision%20(ICIEV)&rft.au=Salam,%20K.%20M.%20Anwarus&rft.date=2012-05&rft.spage=1081&rft.epage=1084&rft.pages=1081-1084&rft.isbn=1467311537&rft.isbn_list=9781467311533&rft_id=info:doi/10.1109/ICIEV.2012.6317356&rft.eisbn=1467311529&rft.eisbn_list=1467311545&rft.eisbn_list=9781467311540&rft.eisbn_list=9781467311526&rft_dat=%3Cieee_6IE%3E6317356%3C/ieee_6IE%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i90t-8b8761346942c3c490a5b7cd74cbff4b6bca70ce258df8648c9e4bd35c89cf033%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6317356&rfr_iscdi=true