Loading…
Computing Semantic Similarity of Concepts in Knowledge Graphs
This paper presents a method for measuring the semantic similarity between concepts in Knowledge Graphs (KGs) such as WordNet and DBpedia. Previous work on semantic similarity methods have focused on either the structure of the semantic network between concepts (e.g., path length and depth), or only...
Saved in:
Published in: | IEEE transactions on knowledge and data engineering 2017-01, Vol.29 (1), p.72-85 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133 |
---|---|
cites | cdi_FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133 |
container_end_page | 85 |
container_issue | 1 |
container_start_page | 72 |
container_title | IEEE transactions on knowledge and data engineering |
container_volume | 29 |
creator | Ganggao Zhu Iglesias, Carlos A. |
description | This paper presents a method for measuring the semantic similarity between concepts in Knowledge Graphs (KGs) such as WordNet and DBpedia. Previous work on semantic similarity methods have focused on either the structure of the semantic network between concepts (e.g., path length and depth), or only on the Information Content (IC) of concepts. We propose a semantic similarity method, namely wpath, to combine these two approaches, using IC to weight the shortest path length between concepts. Conventional corpus-based IC is computed from the distributions of concepts over textual corpus, which is required to prepare a domain corpus containing annotated concepts and has high computational cost. As instances are already extracted from textual corpus and annotated by concepts in KGs, graph-based IC is proposed to compute IC based on the distributions of concepts over instances. Through experiments performed on well known word similarity datasets, we show that the wpath semantic similarity method has produced a statistically significant improvement over other semantic similarity methods. Moreover, in a real category classification evaluation, the wpath method has shown the best performance in terms of accuracy and F score. |
doi_str_mv | 10.1109/TKDE.2016.2610428 |
format | article |
fullrecord | <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_ieee_primary_7572993</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7572993</ieee_id><sourcerecordid>1847635740</sourcerecordid><originalsourceid>FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133</originalsourceid><addsrcrecordid>eNo9kD1PwzAQhi0EEqXwAxBLJOYEn-3EzsCAQimolRhaZitxzsVV84GdCvXfk6oV093wvO-dHkLugSYANH9aL15nCaOQJSwDKpi6IBNIUxUzyOFy3KmAWHAhr8lNCFtKqZIKJuS56Jp-P7h2E62wKdvBmWjlGrcrvRsOUWejomsN9kOIXBst2u53h_UGo7kv--9wS65suQt4d55T8vU2Wxfv8fJz_lG8LGMjJBtirDNhaMUsqxClslgrJXOTKVYaZtIaa2tEJqsyo2llYHzZcMkxryzmigHnU_J46u1997PHMOhtt_fteFKDEjLjqRR0pOBEGd-F4NHq3rum9AcNVB8t6aMlfbSkz5bGzMMp4xDxn5epZHnO-R-wUWNL</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1847635740</pqid></control><display><type>article</type><title>Computing Semantic Similarity of Concepts in Knowledge Graphs</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Ganggao Zhu ; Iglesias, Carlos A.</creator><creatorcontrib>Ganggao Zhu ; Iglesias, Carlos A.</creatorcontrib><description>This paper presents a method for measuring the semantic similarity between concepts in Knowledge Graphs (KGs) such as WordNet and DBpedia. Previous work on semantic similarity methods have focused on either the structure of the semantic network between concepts (e.g., path length and depth), or only on the Information Content (IC) of concepts. We propose a semantic similarity method, namely wpath, to combine these two approaches, using IC to weight the shortest path length between concepts. Conventional corpus-based IC is computed from the distributions of concepts over textual corpus, which is required to prepare a domain corpus containing annotated concepts and has high computational cost. As instances are already extracted from textual corpus and annotated by concepts in KGs, graph-based IC is proposed to compute IC based on the distributions of concepts over instances. Through experiments performed on well known word similarity datasets, we show that the wpath semantic similarity method has produced a statistically significant improvement over other semantic similarity methods. Moreover, in a real category classification evaluation, the wpath method has shown the best performance in terms of accuracy and F score.</description><identifier>ISSN: 1041-4347</identifier><identifier>EISSN: 1558-2191</identifier><identifier>DOI: 10.1109/TKDE.2016.2610428</identifier><identifier>CODEN: ITKEEH</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>DBpedia ; Graphs ; information content ; Integrated circuits ; Knowledge based systems ; Knowledge engineering ; knowledge graph ; Knowledge representation ; Measurement ; Measurement methods ; Motion pictures ; semantic relatedness ; Semantic similarity ; Semantics ; Shortest-path problems ; Similarity ; Taxonomy ; WordNet</subject><ispartof>IEEE transactions on knowledge and data engineering, 2017-01, Vol.29 (1), p.72-85</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133</citedby><cites>FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7572993$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,27901,27902,54771</link.rule.ids></links><search><creatorcontrib>Ganggao Zhu</creatorcontrib><creatorcontrib>Iglesias, Carlos A.</creatorcontrib><title>Computing Semantic Similarity of Concepts in Knowledge Graphs</title><title>IEEE transactions on knowledge and data engineering</title><addtitle>TKDE</addtitle><description>This paper presents a method for measuring the semantic similarity between concepts in Knowledge Graphs (KGs) such as WordNet and DBpedia. Previous work on semantic similarity methods have focused on either the structure of the semantic network between concepts (e.g., path length and depth), or only on the Information Content (IC) of concepts. We propose a semantic similarity method, namely wpath, to combine these two approaches, using IC to weight the shortest path length between concepts. Conventional corpus-based IC is computed from the distributions of concepts over textual corpus, which is required to prepare a domain corpus containing annotated concepts and has high computational cost. As instances are already extracted from textual corpus and annotated by concepts in KGs, graph-based IC is proposed to compute IC based on the distributions of concepts over instances. Through experiments performed on well known word similarity datasets, we show that the wpath semantic similarity method has produced a statistically significant improvement over other semantic similarity methods. Moreover, in a real category classification evaluation, the wpath method has shown the best performance in terms of accuracy and F score.</description><subject>DBpedia</subject><subject>Graphs</subject><subject>information content</subject><subject>Integrated circuits</subject><subject>Knowledge based systems</subject><subject>Knowledge engineering</subject><subject>knowledge graph</subject><subject>Knowledge representation</subject><subject>Measurement</subject><subject>Measurement methods</subject><subject>Motion pictures</subject><subject>semantic relatedness</subject><subject>Semantic similarity</subject><subject>Semantics</subject><subject>Shortest-path problems</subject><subject>Similarity</subject><subject>Taxonomy</subject><subject>WordNet</subject><issn>1041-4347</issn><issn>1558-2191</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><recordid>eNo9kD1PwzAQhi0EEqXwAxBLJOYEn-3EzsCAQimolRhaZitxzsVV84GdCvXfk6oV093wvO-dHkLugSYANH9aL15nCaOQJSwDKpi6IBNIUxUzyOFy3KmAWHAhr8lNCFtKqZIKJuS56Jp-P7h2E62wKdvBmWjlGrcrvRsOUWejomsN9kOIXBst2u53h_UGo7kv--9wS65suQt4d55T8vU2Wxfv8fJz_lG8LGMjJBtirDNhaMUsqxClslgrJXOTKVYaZtIaa2tEJqsyo2llYHzZcMkxryzmigHnU_J46u1997PHMOhtt_fteFKDEjLjqRR0pOBEGd-F4NHq3rum9AcNVB8t6aMlfbSkz5bGzMMp4xDxn5epZHnO-R-wUWNL</recordid><startdate>20170101</startdate><enddate>20170101</enddate><creator>Ganggao Zhu</creator><creator>Iglesias, Carlos A.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20170101</creationdate><title>Computing Semantic Similarity of Concepts in Knowledge Graphs</title><author>Ganggao Zhu ; Iglesias, Carlos A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>DBpedia</topic><topic>Graphs</topic><topic>information content</topic><topic>Integrated circuits</topic><topic>Knowledge based systems</topic><topic>Knowledge engineering</topic><topic>knowledge graph</topic><topic>Knowledge representation</topic><topic>Measurement</topic><topic>Measurement methods</topic><topic>Motion pictures</topic><topic>semantic relatedness</topic><topic>Semantic similarity</topic><topic>Semantics</topic><topic>Shortest-path problems</topic><topic>Similarity</topic><topic>Taxonomy</topic><topic>WordNet</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ganggao Zhu</creatorcontrib><creatorcontrib>Iglesias, Carlos A.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on knowledge and data engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ganggao Zhu</au><au>Iglesias, Carlos A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Computing Semantic Similarity of Concepts in Knowledge Graphs</atitle><jtitle>IEEE transactions on knowledge and data engineering</jtitle><stitle>TKDE</stitle><date>2017-01-01</date><risdate>2017</risdate><volume>29</volume><issue>1</issue><spage>72</spage><epage>85</epage><pages>72-85</pages><issn>1041-4347</issn><eissn>1558-2191</eissn><coden>ITKEEH</coden><abstract>This paper presents a method for measuring the semantic similarity between concepts in Knowledge Graphs (KGs) such as WordNet and DBpedia. Previous work on semantic similarity methods have focused on either the structure of the semantic network between concepts (e.g., path length and depth), or only on the Information Content (IC) of concepts. We propose a semantic similarity method, namely wpath, to combine these two approaches, using IC to weight the shortest path length between concepts. Conventional corpus-based IC is computed from the distributions of concepts over textual corpus, which is required to prepare a domain corpus containing annotated concepts and has high computational cost. As instances are already extracted from textual corpus and annotated by concepts in KGs, graph-based IC is proposed to compute IC based on the distributions of concepts over instances. Through experiments performed on well known word similarity datasets, we show that the wpath semantic similarity method has produced a statistically significant improvement over other semantic similarity methods. Moreover, in a real category classification evaluation, the wpath method has shown the best performance in terms of accuracy and F score.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TKDE.2016.2610428</doi><tpages>14</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1041-4347 |
ispartof | IEEE transactions on knowledge and data engineering, 2017-01, Vol.29 (1), p.72-85 |
issn | 1041-4347 1558-2191 |
language | eng |
recordid | cdi_ieee_primary_7572993 |
source | IEEE Electronic Library (IEL) Journals |
subjects | DBpedia Graphs information content Integrated circuits Knowledge based systems Knowledge engineering knowledge graph Knowledge representation Measurement Measurement methods Motion pictures semantic relatedness Semantic similarity Semantics Shortest-path problems Similarity Taxonomy WordNet |
title | Computing Semantic Similarity of Concepts in Knowledge Graphs |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T16%3A13%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Computing%20Semantic%20Similarity%20of%20Concepts%20in%20Knowledge%20Graphs&rft.jtitle=IEEE%20transactions%20on%20knowledge%20and%20data%20engineering&rft.au=Ganggao%20Zhu&rft.date=2017-01-01&rft.volume=29&rft.issue=1&rft.spage=72&rft.epage=85&rft.pages=72-85&rft.issn=1041-4347&rft.eissn=1558-2191&rft.coden=ITKEEH&rft_id=info:doi/10.1109/TKDE.2016.2610428&rft_dat=%3Cproquest_ieee_%3E1847635740%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1847635740&rft_id=info:pmid/&rft_ieee_id=7572993&rfr_iscdi=true |