Loading…

A new algorithm for construction specific field terms using co-occurrence words information

Readers can know the subject of many document fields by reading only some specific words called field association (FA) terms. It is very important to construct these FA terms to decide correctly the document fields from few words information in part of file. The field can be decided efficiency if th...

Full description

Saved in:
Bibliographic Details
Main Authors: Atlam, E.-S., Ghada, E., Fuketa, M., Aoe, J.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 993 Vol. 2
container_issue
container_start_page 990
container_title
container_volume 2
creator Atlam, E.-S.
Ghada, E.
Fuketa, M.
Aoe, J.
description Readers can know the subject of many document fields by reading only some specific words called field association (FA) terms. It is very important to construct these FA terms to decide correctly the document fields from few words information in part of file. The field can be decided efficiency if the number of these FA terms is many and the frequency rate is high. If the number of level I (words that direct connect to terminal fields) FA word is limited, old methods can not determine the documents tiled easily and fast, special when there is a small number of corpus documents. This paper proposes a new method for deciding FA terms using the weight of co-occurrence words and declinable words which related to a narrow association category with eliminating FA terms ambiguity. Moreover, efficient FA terms are difficult to be extracted only by the information of the frequency of them. This paper proposed a new efficient method using new cooccurrence words weight which makes precision and recall are higher than the case of degree of frequency.
doi_str_mv 10.1109/MWSCAS.2003.1562453
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_1562453</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1562453</ieee_id><sourcerecordid>1562453</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-c86d357cb2363598798e962cacd67c718cce03ec0b7fe95fb579d8aad1abdd1c3</originalsourceid><addsrcrecordid>eNotkMtOwzAURC0eEm3hC7rxDyTYcRzbyyjiJRWxKIgFi8q5vilGeVR2ooq_JxVdzeLMnMUQsuYs5ZyZ-9fPbVVu04wxkXJZZLkUF2TBpdSJ0MZckiVTmgmdmVxcnUA-A5UXN2QZ4w9jmVDcLMhXSXs8Utvuh-DH7442Q6Aw9HEME4x-6Gk8IPjGA208to6OGLpIp-j7_dxLBoApBOwB6XEILlLfz4bOnqa35LqxbcS7c67Ix-PDe_WcbN6eXqpyk3iu5JiALpyQCupMFEIarYxGU2RgwRUKFNcAyAQCq1WDRja1VMZpax23tXMcxIqs_70eEXeH4DsbfnfnU8Qff4dXOg</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>A new algorithm for construction specific field terms using co-occurrence words information</title><source>IEEE Xplore All Conference Series</source><creator>Atlam, E.-S. ; Ghada, E. ; Fuketa, M. ; Aoe, J.</creator><creatorcontrib>Atlam, E.-S. ; Ghada, E. ; Fuketa, M. ; Aoe, J.</creatorcontrib><description>Readers can know the subject of many document fields by reading only some specific words called field association (FA) terms. It is very important to construct these FA terms to decide correctly the document fields from few words information in part of file. The field can be decided efficiency if the number of these FA terms is many and the frequency rate is high. If the number of level I (words that direct connect to terminal fields) FA word is limited, old methods can not determine the documents tiled easily and fast, special when there is a small number of corpus documents. This paper proposes a new method for deciding FA terms using the weight of co-occurrence words and declinable words which related to a narrow association category with eliminating FA terms ambiguity. Moreover, efficient FA terms are difficult to be extracted only by the information of the frequency of them. This paper proposed a new efficient method using new cooccurrence words weight which makes precision and recall are higher than the case of degree of frequency.</description><identifier>ISSN: 1548-3746</identifier><identifier>ISBN: 0780382943</identifier><identifier>ISBN: 9780780382947</identifier><identifier>EISSN: 1558-3899</identifier><identifier>DOI: 10.1109/MWSCAS.2003.1562453</identifier><language>eng</language><publisher>IEEE</publisher><subject>Clustering algorithms ; Costs ; Data compression ; Data mining ; Frequency ; Information science ; Intelligent systems ; Natural language processing ; Partitioning algorithms ; Thesauri</subject><ispartof>2003 46th Midwest Symposium on Circuits and Systems, 2003, Vol.2, p.990-993 Vol. 2</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1562453$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,4050,4051,23930,23931,25140,27925,54555,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1562453$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Atlam, E.-S.</creatorcontrib><creatorcontrib>Ghada, E.</creatorcontrib><creatorcontrib>Fuketa, M.</creatorcontrib><creatorcontrib>Aoe, J.</creatorcontrib><title>A new algorithm for construction specific field terms using co-occurrence words information</title><title>2003 46th Midwest Symposium on Circuits and Systems</title><addtitle>MWSCAS</addtitle><description>Readers can know the subject of many document fields by reading only some specific words called field association (FA) terms. It is very important to construct these FA terms to decide correctly the document fields from few words information in part of file. The field can be decided efficiency if the number of these FA terms is many and the frequency rate is high. If the number of level I (words that direct connect to terminal fields) FA word is limited, old methods can not determine the documents tiled easily and fast, special when there is a small number of corpus documents. This paper proposes a new method for deciding FA terms using the weight of co-occurrence words and declinable words which related to a narrow association category with eliminating FA terms ambiguity. Moreover, efficient FA terms are difficult to be extracted only by the information of the frequency of them. This paper proposed a new efficient method using new cooccurrence words weight which makes precision and recall are higher than the case of degree of frequency.</description><subject>Clustering algorithms</subject><subject>Costs</subject><subject>Data compression</subject><subject>Data mining</subject><subject>Frequency</subject><subject>Information science</subject><subject>Intelligent systems</subject><subject>Natural language processing</subject><subject>Partitioning algorithms</subject><subject>Thesauri</subject><issn>1548-3746</issn><issn>1558-3899</issn><isbn>0780382943</isbn><isbn>9780780382947</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2003</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotkMtOwzAURC0eEm3hC7rxDyTYcRzbyyjiJRWxKIgFi8q5vilGeVR2ooq_JxVdzeLMnMUQsuYs5ZyZ-9fPbVVu04wxkXJZZLkUF2TBpdSJ0MZckiVTmgmdmVxcnUA-A5UXN2QZ4w9jmVDcLMhXSXs8Utvuh-DH7442Q6Aw9HEME4x-6Gk8IPjGA208to6OGLpIp-j7_dxLBoApBOwB6XEILlLfz4bOnqa35LqxbcS7c67Ix-PDe_WcbN6eXqpyk3iu5JiALpyQCupMFEIarYxGU2RgwRUKFNcAyAQCq1WDRja1VMZpax23tXMcxIqs_70eEXeH4DsbfnfnU8Qff4dXOg</recordid><startdate>2003</startdate><enddate>2003</enddate><creator>Atlam, E.-S.</creator><creator>Ghada, E.</creator><creator>Fuketa, M.</creator><creator>Aoe, J.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>2003</creationdate><title>A new algorithm for construction specific field terms using co-occurrence words information</title><author>Atlam, E.-S. ; Ghada, E. ; Fuketa, M. ; Aoe, J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-c86d357cb2363598798e962cacd67c718cce03ec0b7fe95fb579d8aad1abdd1c3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2003</creationdate><topic>Clustering algorithms</topic><topic>Costs</topic><topic>Data compression</topic><topic>Data mining</topic><topic>Frequency</topic><topic>Information science</topic><topic>Intelligent systems</topic><topic>Natural language processing</topic><topic>Partitioning algorithms</topic><topic>Thesauri</topic><toplevel>online_resources</toplevel><creatorcontrib>Atlam, E.-S.</creatorcontrib><creatorcontrib>Ghada, E.</creatorcontrib><creatorcontrib>Fuketa, M.</creatorcontrib><creatorcontrib>Aoe, J.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore (Online service)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Atlam, E.-S.</au><au>Ghada, E.</au><au>Fuketa, M.</au><au>Aoe, J.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A new algorithm for construction specific field terms using co-occurrence words information</atitle><btitle>2003 46th Midwest Symposium on Circuits and Systems</btitle><stitle>MWSCAS</stitle><date>2003</date><risdate>2003</risdate><volume>2</volume><spage>990</spage><epage>993 Vol. 2</epage><pages>990-993 Vol. 2</pages><issn>1548-3746</issn><eissn>1558-3899</eissn><isbn>0780382943</isbn><isbn>9780780382947</isbn><abstract>Readers can know the subject of many document fields by reading only some specific words called field association (FA) terms. It is very important to construct these FA terms to decide correctly the document fields from few words information in part of file. The field can be decided efficiency if the number of these FA terms is many and the frequency rate is high. If the number of level I (words that direct connect to terminal fields) FA word is limited, old methods can not determine the documents tiled easily and fast, special when there is a small number of corpus documents. This paper proposes a new method for deciding FA terms using the weight of co-occurrence words and declinable words which related to a narrow association category with eliminating FA terms ambiguity. Moreover, efficient FA terms are difficult to be extracted only by the information of the frequency of them. This paper proposed a new efficient method using new cooccurrence words weight which makes precision and recall are higher than the case of degree of frequency.</abstract><pub>IEEE</pub><doi>10.1109/MWSCAS.2003.1562453</doi></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1548-3746
ispartof 2003 46th Midwest Symposium on Circuits and Systems, 2003, Vol.2, p.990-993 Vol. 2
issn 1548-3746
1558-3899
language eng
recordid cdi_ieee_primary_1562453
source IEEE Xplore All Conference Series
subjects Clustering algorithms
Costs
Data compression
Data mining
Frequency
Information science
Intelligent systems
Natural language processing
Partitioning algorithms
Thesauri
title A new algorithm for construction specific field terms using co-occurrence words information
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T09%3A57%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20new%20algorithm%20for%20construction%20specific%20field%20terms%20using%20co-occurrence%20words%20information&rft.btitle=2003%2046th%20Midwest%20Symposium%20on%20Circuits%20and%20Systems&rft.au=Atlam,%20E.-S.&rft.date=2003&rft.volume=2&rft.spage=990&rft.epage=993%20Vol.%202&rft.pages=990-993%20Vol.%202&rft.issn=1548-3746&rft.eissn=1558-3899&rft.isbn=0780382943&rft.isbn_list=9780780382947&rft_id=info:doi/10.1109/MWSCAS.2003.1562453&rft_dat=%3Cieee_CHZPO%3E1562453%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i175t-c86d357cb2363598798e962cacd67c718cce03ec0b7fe95fb579d8aad1abdd1c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=1562453&rfr_iscdi=true