Developing the first balanced corpus for Bangla language
The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time...
Main Authors: | Salam, K. M. Anwarus; Yamada, S.; Nishino, T. |
---|---|
Format: | Conference Proceeding |
Language: | English |
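Subjects: | Bangla Language Processing; Blogs; Business; corpus development; Electronic mail; Encyclopedias; History; Writing |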
Online Access: | Request full text |
cited_by | |
---|---|
cites | |
container_end_page | 1084 |
container_issue | |
container_start_page | 1081 |
container_title | |
container_volume | |
creator | Salam, K. M. Anwarus ; Yamada, S. ; Nishino, T. |
description | The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time and medium. This paper also explains domain classifications and weight percentage for each domain. We also identify the prospective source of information for preparing the corpus. |
doi_str_mv | 10.1109/ICIEV.2012.6317356 |
format | conference_proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6317356</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6317356</ieee_id><sourcerecordid>6317356</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-8b8761346942c3c490a5b7cd74cbff4b6bca70ce258df8648c9e4bd35c89cf033</originalsourceid><addsrcrecordid>eNpFj81KxDAUhSMiqOO8gG7yAq1Jb5qfpdZRCwNuBrdDcntTK7UtTUfw7R1wwLM5fBz44DB2K0UupXD3dVVv3vNCyCLXIA2U-oxdS6UNSFkW7vwfwFyydUqf4hgrLTh3xewTfVM_Tt3Q8uWDeOzmtPDgez8gNRzHeTokHseZP_qh7T0_Du3Bt3TDLqLvE61PvWK7582ues22by919bDNOieWzAZrtASlnSoQUDnhy2CwMQpDjCrogN4IpKK0TbRaWXSkQgMlWodRAKzY3Z-2I6L9NHdffv7Zn47CL2BKRxI</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Developing the first balanced corpus for Bangla language</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Salam, K. M. Anwarus ; Yamada, S. ; Nishino, T.</creator><creatorcontrib>Salam, K. M. Anwarus ; Yamada, S. ; Nishino, T.</creatorcontrib><description>The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time and medium. This paper also explains domain classifications and weight percentage for each domain. 
We also identify the prospective source of information for preparing the corpus.</description><identifier>ISBN: 1467311537</identifier><identifier>ISBN: 9781467311533</identifier><identifier>EISBN: 1467311529</identifier><identifier>EISBN: 1467311545</identifier><identifier>EISBN: 9781467311540</identifier><identifier>EISBN: 9781467311526</identifier><identifier>DOI: 10.1109/ICIEV.2012.6317356</identifier><language>eng</language><publisher>IEEE</publisher><subject>Bangla Language Processing ; Blogs ; Business ; corpus development ; Electronic mail ; Encyclopedias ; History ; Writing</subject><ispartof>2012 International Conference on Informatics, Electronics & Vision (ICIEV), 2012, p.1081-1084</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6317356$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6317356$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Salam, K. M. Anwarus</creatorcontrib><creatorcontrib>Yamada, S.</creatorcontrib><creatorcontrib>Nishino, T.</creatorcontrib><title>Developing the first balanced corpus for Bangla language</title><title>2012 International Conference on Informatics, Electronics & Vision (ICIEV)</title><addtitle>ICIEV</addtitle><description>The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time and medium. This paper also explains domain classifications and weight percentage for each domain. 
We also identify the prospective source of information for preparing the corpus.</description><subject>Bangla Language Processing</subject><subject>Blogs</subject><subject>Business</subject><subject>corpus development</subject><subject>Electronic mail</subject><subject>Encyclopedias</subject><subject>History</subject><subject>Writing</subject><isbn>1467311537</isbn><isbn>9781467311533</isbn><isbn>1467311529</isbn><isbn>1467311545</isbn><isbn>9781467311540</isbn><isbn>9781467311526</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2012</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNpFj81KxDAUhSMiqOO8gG7yAq1Jb5qfpdZRCwNuBrdDcntTK7UtTUfw7R1wwLM5fBz44DB2K0UupXD3dVVv3vNCyCLXIA2U-oxdS6UNSFkW7vwfwFyydUqf4hgrLTh3xewTfVM_Tt3Q8uWDeOzmtPDgez8gNRzHeTokHseZP_qh7T0_Du3Bt3TDLqLvE61PvWK7582ues22by919bDNOieWzAZrtASlnSoQUDnhy2CwMQpDjCrogN4IpKK0TbRaWXSkQgMlWodRAKzY3Z-2I6L9NHdffv7Zn47CL2BKRxI</recordid><startdate>201205</startdate><enddate>201205</enddate><creator>Salam, K. M. Anwarus</creator><creator>Yamada, S.</creator><creator>Nishino, T.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201205</creationdate><title>Developing the first balanced corpus for Bangla language</title><author>Salam, K. M. Anwarus ; Yamada, S. 
; Nishino, T.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-8b8761346942c3c490a5b7cd74cbff4b6bca70ce258df8648c9e4bd35c89cf033</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Bangla Language Processing</topic><topic>Blogs</topic><topic>Business</topic><topic>corpus development</topic><topic>Electronic mail</topic><topic>Encyclopedias</topic><topic>History</topic><topic>Writing</topic><toplevel>online_resources</toplevel><creatorcontrib>Salam, K. M. Anwarus</creatorcontrib><creatorcontrib>Yamada, S.</creatorcontrib><creatorcontrib>Nishino, T.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEL</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Salam, K. M. Anwarus</au><au>Yamada, S.</au><au>Nishino, T.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Developing the first balanced corpus for Bangla language</atitle><btitle>2012 International Conference on Informatics, Electronics & Vision (ICIEV)</btitle><stitle>ICIEV</stitle><date>2012-05</date><risdate>2012</risdate><spage>1081</spage><epage>1084</epage><pages>1081-1084</pages><isbn>1467311537</isbn><isbn>9781467311533</isbn><eisbn>1467311529</eisbn><eisbn>1467311545</eisbn><eisbn>9781467311540</eisbn><eisbn>9781467311526</eisbn><abstract>The objective of this paper is to propose the development process of the first Bangladeshi National Corpus. 
The purpose of the study is to specify the domains to create a balanced Bangla corpus based on some selection criteria. This study focuses on three independent selection criteria: domain, time and medium. This paper also explains domain classifications and weight percentage for each domain. We also identify the prospective source of information for preparing the corpus.</abstract><pub>IEEE</pub><doi>10.1109/ICIEV.2012.6317356</doi><tpages>4</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISBN: 1467311537 |
ispartof | 2012 International Conference on Informatics, Electronics & Vision (ICIEV), 2012, p.1081-1084 |
issn | |
language | eng |
recordid | cdi_ieee_primary_6317356 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Bangla Language Processing ; Blogs ; Business ; corpus development ; Electronic mail ; Encyclopedias ; History ; Writing |
title | Developing the first balanced corpus for Bangla language |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T14%3A12%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Developing%20the%20first%20balanced%20corpus%20for%20Bangla%20language&rft.btitle=2012%20International%20Conference%20on%20Informatics,%20Electronics%20&%20Vision%20(ICIEV)&rft.au=Salam,%20K.%20M.%20Anwarus&rft.date=2012-05&rft.spage=1081&rft.epage=1084&rft.pages=1081-1084&rft.isbn=1467311537&rft.isbn_list=9781467311533&rft_id=info:doi/10.1109/ICIEV.2012.6317356&rft.eisbn=1467311529&rft.eisbn_list=1467311545&rft.eisbn_list=9781467311540&rft.eisbn_list=9781467311526&rft_dat=%3Cieee_6IE%3E6317356%3C/ieee_6IE%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i90t-8b8761346942c3c490a5b7cd74cbff4b6bca70ce258df8648c9e4bd35c89cf033%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6317356&rfr_iscdi=true |