Knowledge-aware Named Entity Recognition with Alleviating Heterogeneity

Named Entity Recognition (NER) is a fundamental and important research topic for many downstream NLP tasks, aiming at detecting and classifying named entities (NEs) mentioned in unstructured text into pre-defined categories. Learning from labeled data only is far from enough when it comes to domain-...

Bibliographic Details
Main Authors:	Nie, Binling ; Ding, Ruixue ; Xie, Pengjun ; Huang, Fei ; Qian, Chen ; Si, Luo
Format:	Conference Proceeding
Language:	English

cited_by	cdi_FETCH-LOGICAL-c175t-64e79fd45d350df5c1d1c589907b44aadd7aa4874b42eeb58550c5bf1de5e4863
cites
container_end_page	13603
container_issue	15
container_start_page	13595
container_title
container_volume	35
creator	Nie, Binling ; Ding, Ruixue ; Xie, Pengjun ; Huang, Fei ; Qian, Chen ; Si, Luo
description	Named Entity Recognition (NER) is a fundamental research topic underlying many downstream NLP tasks; it aims to detect named entities (NEs) mentioned in unstructured text and classify them into pre-defined categories. Learning from labeled data alone is far from enough for domain-specific or temporally evolving entities (e.g., medical terminologies or restaurant names). Fortunately, open-source Knowledge Bases (KBs) such as Wikidata and Freebase contain NEs that are manually labeled with predefined types in different domains, which can help identify entity boundaries and recognize entity types more accurately. However, the type system of a domain-specific NER task is typically independent of that of current KBs and thus inevitably exhibits a heterogeneity issue. This makes matches between the original NER types and KB types less likely (e.g., Person in NER potentially matches President in KBs), or introduces unintended noise when domain-specific knowledge is not considered (e.g., Band in NER should be mapped to Out_of_Entity_Types in the restaurant-related task). To better incorporate and denoise the abundant knowledge in KBs, we propose a new KB-aware NER framework (KaNa), which utilizes type-heterogeneous knowledge to improve NER. Specifically, for an entity mention along with a set of candidate entities linked from KBs, KaNa first uses a type projection mechanism that maps the mention type and entity types into a shared space to homogenize the heterogeneous entity types. Then, based on the projected types, a noise detector filters out less-confident candidate entities in an unsupervised manner. Finally, the filtered mention-entity pairs are injected into a NER model as a graph to predict answers. The experimental results demonstrate KaNa's state-of-the-art performance on five public benchmark datasets from different domains.
doi_str_mv	10.1609/aaai.v35i15.17603
format	conference_proceeding
fulltext	fulltext
identifier	ISSN: 2159-5399
ispartof	Proceedings of the ... AAAI Conference on Artificial Intelligence, 2021, Vol.35 (15), p.13595-13603
issn	2159-5399 2374-3468
language	eng
recordid	cdi_crossref_primary_10_1609_aaai_v35i15_17603
source	Science Journals (Open access)
title	Knowledge-aware Named Entity Recognition with Alleviating Heterogeneity
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T04%3A37%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Knowledge-aware%20Named%20Entity%20Recognition%20with%20Alleviating%20Heterogeneity&rft.btitle=Proceedings%20of%20the%20...%20AAAI%20Conference%20on%20Artificial%20Intelligence&rft.au=Nie,%20Binling&rft.date=2021-05-18&rft.volume=35&rft.issue=15&rft.spage=13595&rft.epage=13603&rft.pages=13595-13603&rft.issn=2159-5399&rft.eissn=2374-3468&rft_id=info:doi/10.1609/aaai.v35i15.17603&rft_dat=%3Ccrossref%3E10_1609_aaai_v35i15_17603%3C/crossref%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c175t-64e79fd45d350df5c1d1c589907b44aadd7aa4874b42eeb58550c5bf1de5e4863%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true