Loading…

Computing Semantic Similarity of Concepts in Knowledge Graphs

This paper presents a method for measuring the semantic similarity between concepts in Knowledge Graphs (KGs) such as WordNet and DBpedia. Previous work on semantic similarity methods have focused on either the structure of the semantic network between concepts (e.g., path length and depth), or only...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on knowledge and data engineering 2017-01, Vol.29 (1), p.72-85
Main Authors: Ganggao Zhu, Iglesias, Carlos A.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133
cites cdi_FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133
container_end_page 85
container_issue 1
container_start_page 72
container_title IEEE transactions on knowledge and data engineering
container_volume 29
creator Ganggao Zhu
Iglesias, Carlos A.
description This paper presents a method for measuring the semantic similarity between concepts in Knowledge Graphs (KGs) such as WordNet and DBpedia. Previous work on semantic similarity methods have focused on either the structure of the semantic network between concepts (e.g., path length and depth), or only on the Information Content (IC) of concepts. We propose a semantic similarity method, namely wpath, to combine these two approaches, using IC to weight the shortest path length between concepts. Conventional corpus-based IC is computed from the distributions of concepts over textual corpus, which is required to prepare a domain corpus containing annotated concepts and has high computational cost. As instances are already extracted from textual corpus and annotated by concepts in KGs, graph-based IC is proposed to compute IC based on the distributions of concepts over instances. Through experiments performed on well known word similarity datasets, we show that the wpath semantic similarity method has produced a statistically significant improvement over other semantic similarity methods. Moreover, in a real category classification evaluation, the wpath method has shown the best performance in terms of accuracy and F score.
doi_str_mv 10.1109/TKDE.2016.2610428
format article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_ieee_primary_7572993</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7572993</ieee_id><sourcerecordid>1847635740</sourcerecordid><originalsourceid>FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133</originalsourceid><addsrcrecordid>eNo9kD1PwzAQhi0EEqXwAxBLJOYEn-3EzsCAQimolRhaZitxzsVV84GdCvXfk6oV093wvO-dHkLugSYANH9aL15nCaOQJSwDKpi6IBNIUxUzyOFy3KmAWHAhr8lNCFtKqZIKJuS56Jp-P7h2E62wKdvBmWjlGrcrvRsOUWejomsN9kOIXBst2u53h_UGo7kv--9wS65suQt4d55T8vU2Wxfv8fJz_lG8LGMjJBtirDNhaMUsqxClslgrJXOTKVYaZtIaa2tEJqsyo2llYHzZcMkxryzmigHnU_J46u1997PHMOhtt_fteFKDEjLjqRR0pOBEGd-F4NHq3rum9AcNVB8t6aMlfbSkz5bGzMMp4xDxn5epZHnO-R-wUWNL</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1847635740</pqid></control><display><type>article</type><title>Computing Semantic Similarity of Concepts in Knowledge Graphs</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Ganggao Zhu ; Iglesias, Carlos A.</creator><creatorcontrib>Ganggao Zhu ; Iglesias, Carlos A.</creatorcontrib><description>This paper presents a method for measuring the semantic similarity between concepts in Knowledge Graphs (KGs) such as WordNet and DBpedia. Previous work on semantic similarity methods have focused on either the structure of the semantic network between concepts (e.g., path length and depth), or only on the Information Content (IC) of concepts. We propose a semantic similarity method, namely wpath, to combine these two approaches, using IC to weight the shortest path length between concepts. Conventional corpus-based IC is computed from the distributions of concepts over textual corpus, which is required to prepare a domain corpus containing annotated concepts and has high computational cost. As instances are already extracted from textual corpus and annotated by concepts in KGs, graph-based IC is proposed to compute IC based on the distributions of concepts over instances. Through experiments performed on well known word similarity datasets, we show that the wpath semantic similarity method has produced a statistically significant improvement over other semantic similarity methods. Moreover, in a real category classification evaluation, the wpath method has shown the best performance in terms of accuracy and F score.</description><identifier>ISSN: 1041-4347</identifier><identifier>EISSN: 1558-2191</identifier><identifier>DOI: 10.1109/TKDE.2016.2610428</identifier><identifier>CODEN: ITKEEH</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>DBpedia ; Graphs ; information content ; Integrated circuits ; Knowledge based systems ; Knowledge engineering ; knowledge graph ; Knowledge representation ; Measurement ; Measurement methods ; Motion pictures ; semantic relatedness ; Semantic similarity ; Semantics ; Shortest-path problems ; Similarity ; Taxonomy ; WordNet</subject><ispartof>IEEE transactions on knowledge and data engineering, 2017-01, Vol.29 (1), p.72-85</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133</citedby><cites>FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7572993$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,27901,27902,54771</link.rule.ids></links><search><creatorcontrib>Ganggao Zhu</creatorcontrib><creatorcontrib>Iglesias, Carlos A.</creatorcontrib><title>Computing Semantic Similarity of Concepts in Knowledge Graphs</title><title>IEEE transactions on knowledge and data engineering</title><addtitle>TKDE</addtitle><description>This paper presents a method for measuring the semantic similarity between concepts in Knowledge Graphs (KGs) such as WordNet and DBpedia. Previous work on semantic similarity methods have focused on either the structure of the semantic network between concepts (e.g., path length and depth), or only on the Information Content (IC) of concepts. We propose a semantic similarity method, namely wpath, to combine these two approaches, using IC to weight the shortest path length between concepts. Conventional corpus-based IC is computed from the distributions of concepts over textual corpus, which is required to prepare a domain corpus containing annotated concepts and has high computational cost. As instances are already extracted from textual corpus and annotated by concepts in KGs, graph-based IC is proposed to compute IC based on the distributions of concepts over instances. Through experiments performed on well known word similarity datasets, we show that the wpath semantic similarity method has produced a statistically significant improvement over other semantic similarity methods. Moreover, in a real category classification evaluation, the wpath method has shown the best performance in terms of accuracy and F score.</description><subject>DBpedia</subject><subject>Graphs</subject><subject>information content</subject><subject>Integrated circuits</subject><subject>Knowledge based systems</subject><subject>Knowledge engineering</subject><subject>knowledge graph</subject><subject>Knowledge representation</subject><subject>Measurement</subject><subject>Measurement methods</subject><subject>Motion pictures</subject><subject>semantic relatedness</subject><subject>Semantic similarity</subject><subject>Semantics</subject><subject>Shortest-path problems</subject><subject>Similarity</subject><subject>Taxonomy</subject><subject>WordNet</subject><issn>1041-4347</issn><issn>1558-2191</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><recordid>eNo9kD1PwzAQhi0EEqXwAxBLJOYEn-3EzsCAQimolRhaZitxzsVV84GdCvXfk6oV093wvO-dHkLugSYANH9aL15nCaOQJSwDKpi6IBNIUxUzyOFy3KmAWHAhr8lNCFtKqZIKJuS56Jp-P7h2E62wKdvBmWjlGrcrvRsOUWejomsN9kOIXBst2u53h_UGo7kv--9wS65suQt4d55T8vU2Wxfv8fJz_lG8LGMjJBtirDNhaMUsqxClslgrJXOTKVYaZtIaa2tEJqsyo2llYHzZcMkxryzmigHnU_J46u1997PHMOhtt_fteFKDEjLjqRR0pOBEGd-F4NHq3rum9AcNVB8t6aMlfbSkz5bGzMMp4xDxn5epZHnO-R-wUWNL</recordid><startdate>20170101</startdate><enddate>20170101</enddate><creator>Ganggao Zhu</creator><creator>Iglesias, Carlos A.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20170101</creationdate><title>Computing Semantic Similarity of Concepts in Knowledge Graphs</title><author>Ganggao Zhu ; Iglesias, Carlos A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>DBpedia</topic><topic>Graphs</topic><topic>information content</topic><topic>Integrated circuits</topic><topic>Knowledge based systems</topic><topic>Knowledge engineering</topic><topic>knowledge graph</topic><topic>Knowledge representation</topic><topic>Measurement</topic><topic>Measurement methods</topic><topic>Motion pictures</topic><topic>semantic relatedness</topic><topic>Semantic similarity</topic><topic>Semantics</topic><topic>Shortest-path problems</topic><topic>Similarity</topic><topic>Taxonomy</topic><topic>WordNet</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ganggao Zhu</creatorcontrib><creatorcontrib>Iglesias, Carlos A.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on knowledge and data engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ganggao Zhu</au><au>Iglesias, Carlos A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Computing Semantic Similarity of Concepts in Knowledge Graphs</atitle><jtitle>IEEE transactions on knowledge and data engineering</jtitle><stitle>TKDE</stitle><date>2017-01-01</date><risdate>2017</risdate><volume>29</volume><issue>1</issue><spage>72</spage><epage>85</epage><pages>72-85</pages><issn>1041-4347</issn><eissn>1558-2191</eissn><coden>ITKEEH</coden><abstract>This paper presents a method for measuring the semantic similarity between concepts in Knowledge Graphs (KGs) such as WordNet and DBpedia. Previous work on semantic similarity methods have focused on either the structure of the semantic network between concepts (e.g., path length and depth), or only on the Information Content (IC) of concepts. We propose a semantic similarity method, namely wpath, to combine these two approaches, using IC to weight the shortest path length between concepts. Conventional corpus-based IC is computed from the distributions of concepts over textual corpus, which is required to prepare a domain corpus containing annotated concepts and has high computational cost. As instances are already extracted from textual corpus and annotated by concepts in KGs, graph-based IC is proposed to compute IC based on the distributions of concepts over instances. Through experiments performed on well known word similarity datasets, we show that the wpath semantic similarity method has produced a statistically significant improvement over other semantic similarity methods. Moreover, in a real category classification evaluation, the wpath method has shown the best performance in terms of accuracy and F score.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TKDE.2016.2610428</doi><tpages>14</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1041-4347
ispartof IEEE transactions on knowledge and data engineering, 2017-01, Vol.29 (1), p.72-85
issn 1041-4347
1558-2191
language eng
recordid cdi_ieee_primary_7572993
source IEEE Electronic Library (IEL) Journals
subjects DBpedia
Graphs
information content
Integrated circuits
Knowledge based systems
Knowledge engineering
knowledge graph
Knowledge representation
Measurement
Measurement methods
Motion pictures
semantic relatedness
Semantic similarity
Semantics
Shortest-path problems
Similarity
Taxonomy
WordNet
title Computing Semantic Similarity of Concepts in Knowledge Graphs
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T16%3A13%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Computing%20Semantic%20Similarity%20of%20Concepts%20in%20Knowledge%20Graphs&rft.jtitle=IEEE%20transactions%20on%20knowledge%20and%20data%20engineering&rft.au=Ganggao%20Zhu&rft.date=2017-01-01&rft.volume=29&rft.issue=1&rft.spage=72&rft.epage=85&rft.pages=72-85&rft.issn=1041-4347&rft.eissn=1558-2191&rft.coden=ITKEEH&rft_id=info:doi/10.1109/TKDE.2016.2610428&rft_dat=%3Cproquest_ieee_%3E1847635740%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c472t-ed64c0b2f2bee78fed8879c682ac2c5dedfc467ba605bc1219c373e9bfe982133%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1847635740&rft_id=info:pmid/&rft_ieee_id=7572993&rfr_iscdi=true