Loading…

Bootstrapping Multilingual Relation Discovery Using English Wikipedia and Wikimedia-Induced Entity Extraction

Relation extraction has been a subject of significant study over the past decade. Most relation extractors have been developed by combining the training of complex computational systems on large volumes of annotations with extensive rule writing by language experts. Moreover, many relation extractor...

Full description

Saved in:
Bibliographic Details
Main Authors: Schone, P., Allison, T., Giannella, C., Pfeifer, C.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 951
container_issue
container_start_page 944
container_title
container_volume
creator Schone, P.
Allison, T.
Giannella, C.
Pfeifer, C.
description Relation extraction has been a subject of significant study over the past decade. Most relation extractors have been developed by combining the training of complex computational systems on large volumes of annotations with extensive rule writing by language experts. Moreover, many relation extractors are reliant on other non-trivial NLP technologies which themselves are developed through significant human efforts, such as entity tagging, parsing, etc. Due to the high cost of creating and assembling the required resources, relation extractors have typically been developed for only high-resourced languages. In this paper, we describe a near-zero-cost methodology to build relation extractors for significantly distinct non-English languages using only freely available Wikipedia and other web documents, and some knowledge of English. We apply our methodology and build alma-mater, birthplace, father, occupation, and spouse relation extractors in Greek, Spanish, Russian, and Chinese. We conduct evaluations of induced relations at the file level which are the most refined we have seen in the literature.
doi_str_mv 10.1109/ICTAI.2011.163
format conference_proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6103454</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6103454</ieee_id><sourcerecordid>6103454</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-e66b2d469e9977a16992b162f7dc2b748224063648e338a453125e0e4322c5753</originalsourceid><addsrcrecordid>eNotjEtPwkAURsdXIiBbN27mDxTvvfPqLBVRm2BMDER3ZGgHHC1t0ylG_r2grr7vJCeHsUuEESLY62w8u8lGBIgj1OKIDa1JwWirpLLaHLMeCaMSQGtOWB-lMoZAp2-nrIeQUiIk2HPWj_EDgECR6LHNbV13sWtd04RqzZ-2ZRfK_du6kr_40nWhrvhdiHn95dsdn8eDNanWZYjv_DV8hsYXwXFXFb-0OVCSVcU298Xe60K345PvfT8_lC7Y2cqV0Q__d8Dm95PZ-DGZPj9k45tpEtCoLvFaL6mQ2nprjXGoraUlalqZIqelkSmRBC20TL0QqZNKICkPXgqiXBklBuzqrxu894umDRvX7hYaQUglxQ956lzM</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Bootstrapping Multilingual Relation Discovery Using English Wikipedia and Wikimedia-Induced Entity Extraction</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Schone, P. ; Allison, T. ; Giannella, C. ; Pfeifer, C.</creator><creatorcontrib>Schone, P. ; Allison, T. ; Giannella, C. ; Pfeifer, C.</creatorcontrib><description>Relation extraction has been a subject of significant study over the past decade. Most relation extractors have been developed by combining the training of complex computational systems on large volumes of annotations with extensive rule writing by language experts. Moreover, many relation extractors are reliant on other non-trivial NLP technologies which themselves are developed through significant human efforts, such as entity tagging, parsing, etc. Due to the high cost of creating and assembling the required resources, relation extractors have typically been developed for only high-resourced languages. In this paper, we describe a near-zero-cost methodology to build relation extractors for significantly distinct non-English languages using only freely available Wikipedia and other web documents, and some knowledge of English. We apply our methodology and build alma-mater, birthplace, father, occupation, and spouse relation extractors in Greek, Spanish, Russian, and Chinese. We conduct evaluations of induced relations at the file level which are the most refined we have seen in the literature.</description><identifier>ISSN: 1082-3409</identifier><identifier>ISBN: 145772068X</identifier><identifier>ISBN: 9781457720680</identifier><identifier>EISSN: 2375-0197</identifier><identifier>EISBN: 9780769545967</identifier><identifier>EISBN: 0769545963</identifier><identifier>DOI: 10.1109/ICTAI.2011.163</identifier><language>eng</language><publisher>IEEE</publisher><subject>Artificial intelligence ; Conferences ; multilingual relation extraction ; Wikipedia</subject><ispartof>2011 IEEE 23rd International Conference on Tools with Artificial Intelligence, 2011, p.944-951</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6103454$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,777,781,786,787,2053,27907,54537,54902,54914</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6103454$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Schone, P.</creatorcontrib><creatorcontrib>Allison, T.</creatorcontrib><creatorcontrib>Giannella, C.</creatorcontrib><creatorcontrib>Pfeifer, C.</creatorcontrib><title>Bootstrapping Multilingual Relation Discovery Using English Wikipedia and Wikimedia-Induced Entity Extraction</title><title>2011 IEEE 23rd International Conference on Tools with Artificial Intelligence</title><addtitle>ictai</addtitle><description>Relation extraction has been a subject of significant study over the past decade. Most relation extractors have been developed by combining the training of complex computational systems on large volumes of annotations with extensive rule writing by language experts. Moreover, many relation extractors are reliant on other non-trivial NLP technologies which themselves are developed through significant human efforts, such as entity tagging, parsing, etc. Due to the high cost of creating and assembling the required resources, relation extractors have typically been developed for only high-resourced languages. In this paper, we describe a near-zero-cost methodology to build relation extractors for significantly distinct non-English languages using only freely available Wikipedia and other web documents, and some knowledge of English. We apply our methodology and build alma-mater, birthplace, father, occupation, and spouse relation extractors in Greek, Spanish, Russian, and Chinese. We conduct evaluations of induced relations at the file level which are the most refined we have seen in the literature.</description><subject>Artificial intelligence</subject><subject>Conferences</subject><subject>multilingual relation extraction</subject><subject>Wikipedia</subject><issn>1082-3409</issn><issn>2375-0197</issn><isbn>145772068X</isbn><isbn>9781457720680</isbn><isbn>9780769545967</isbn><isbn>0769545963</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2011</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotjEtPwkAURsdXIiBbN27mDxTvvfPqLBVRm2BMDER3ZGgHHC1t0ylG_r2grr7vJCeHsUuEESLY62w8u8lGBIgj1OKIDa1JwWirpLLaHLMeCaMSQGtOWB-lMoZAp2-nrIeQUiIk2HPWj_EDgECR6LHNbV13sWtd04RqzZ-2ZRfK_du6kr_40nWhrvhdiHn95dsdn8eDNanWZYjv_DV8hsYXwXFXFb-0OVCSVcU298Xe60K345PvfT8_lC7Y2cqV0Q__d8Dm95PZ-DGZPj9k45tpEtCoLvFaL6mQ2nprjXGoraUlalqZIqelkSmRBC20TL0QqZNKICkPXgqiXBklBuzqrxu894umDRvX7hYaQUglxQ956lzM</recordid><startdate>201111</startdate><enddate>201111</enddate><creator>Schone, P.</creator><creator>Allison, T.</creator><creator>Giannella, C.</creator><creator>Pfeifer, C.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201111</creationdate><title>Bootstrapping Multilingual Relation Discovery Using English Wikipedia and Wikimedia-Induced Entity Extraction</title><author>Schone, P. ; Allison, T. ; Giannella, C. ; Pfeifer, C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-e66b2d469e9977a16992b162f7dc2b748224063648e338a453125e0e4322c5753</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Artificial intelligence</topic><topic>Conferences</topic><topic>multilingual relation extraction</topic><topic>Wikipedia</topic><toplevel>online_resources</toplevel><creatorcontrib>Schone, P.</creatorcontrib><creatorcontrib>Allison, T.</creatorcontrib><creatorcontrib>Giannella, C.</creatorcontrib><creatorcontrib>Pfeifer, C.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Schone, P.</au><au>Allison, T.</au><au>Giannella, C.</au><au>Pfeifer, C.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Bootstrapping Multilingual Relation Discovery Using English Wikipedia and Wikimedia-Induced Entity Extraction</atitle><btitle>2011 IEEE 23rd International Conference on Tools with Artificial Intelligence</btitle><stitle>ictai</stitle><date>2011-11</date><risdate>2011</risdate><spage>944</spage><epage>951</epage><pages>944-951</pages><issn>1082-3409</issn><eissn>2375-0197</eissn><isbn>145772068X</isbn><isbn>9781457720680</isbn><eisbn>9780769545967</eisbn><eisbn>0769545963</eisbn><abstract>Relation extraction has been a subject of significant study over the past decade. Most relation extractors have been developed by combining the training of complex computational systems on large volumes of annotations with extensive rule writing by language experts. Moreover, many relation extractors are reliant on other non-trivial NLP technologies which themselves are developed through significant human efforts, such as entity tagging, parsing, etc. Due to the high cost of creating and assembling the required resources, relation extractors have typically been developed for only high-resourced languages. In this paper, we describe a near-zero-cost methodology to build relation extractors for significantly distinct non-English languages using only freely available Wikipedia and other web documents, and some knowledge of English. We apply our methodology and build alma-mater, birthplace, father, occupation, and spouse relation extractors in Greek, Spanish, Russian, and Chinese. We conduct evaluations of induced relations at the file level which are the most refined we have seen in the literature.</abstract><pub>IEEE</pub><doi>10.1109/ICTAI.2011.163</doi><tpages>8</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1082-3409
ispartof 2011 IEEE 23rd International Conference on Tools with Artificial Intelligence, 2011, p.944-951
issn 1082-3409
2375-0197
language eng
recordid cdi_ieee_primary_6103454
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Artificial intelligence
Conferences
multilingual relation extraction
Wikipedia
title Bootstrapping Multilingual Relation Discovery Using English Wikipedia and Wikimedia-Induced Entity Extraction
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T09%3A31%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Bootstrapping%20Multilingual%20Relation%20Discovery%20Using%20English%20Wikipedia%20and%20Wikimedia-Induced%20Entity%20Extraction&rft.btitle=2011%20IEEE%2023rd%20International%20Conference%20on%20Tools%20with%20Artificial%20Intelligence&rft.au=Schone,%20P.&rft.date=2011-11&rft.spage=944&rft.epage=951&rft.pages=944-951&rft.issn=1082-3409&rft.eissn=2375-0197&rft.isbn=145772068X&rft.isbn_list=9781457720680&rft_id=info:doi/10.1109/ICTAI.2011.163&rft.eisbn=9780769545967&rft.eisbn_list=0769545963&rft_dat=%3Cieee_6IE%3E6103454%3C/ieee_6IE%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i175t-e66b2d469e9977a16992b162f7dc2b748224063648e338a453125e0e4322c5753%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6103454&rfr_iscdi=true