Loading…

Fast similarity search in peer-to-peer networks

Peer-to-peer (P2P) systems show numerous advantages over centralized systems, such as load balancing, scalability, and fault tolerance, and they require certain functionality, such as search, repair, and message and data transfer. In particular, structured P2P networks perform an exact search in log...

Full description

Saved in:
Bibliographic Details
Main Authors: Bocek, T., Hunt, E., Hausheer, D., Stiller, B.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 247
container_issue
container_start_page 240
container_title
container_volume
creator Bocek, T.
Hunt, E.
Hausheer, D.
Stiller, B.
description Peer-to-peer (P2P) systems show numerous advantages over centralized systems, such as load balancing, scalability, and fault tolerance, and they require certain functionality, such as search, repair, and message and data transfer. In particular, structured P2P networks perform an exact search in logarithmic time proportional to the number of peers. However, keyword similarity search in a structured P2P network remains a challenge. Similarity search for service discovery can significantly improve service management in a distributed environment. As services are often described informally in text form, keyword similarity search can find the required services or data items more reliably. This paper presents a fast similarity search algorithm for structured P2P systems. The new algorithm, called P2P fast similarity search (P2PFastSS), finds similar keys in any distributed hash table (DHT) using the edit distance metric, and is independent of the underlying P2P routing algorithm. Performance analysis shows that P2PFastSS carries out a similarity search in time proportional to the logarithm of the number of peers. Simulations on PlanetLab confirm these results and show that a similarity search with 34,000 peers performs in less than three seconds on average. Thus, P2PFastSS is suitable for similarity search in large-scale network infrastructures, such as service description matching in service discovery or searching for similar terms in P2P storage networks.
doi_str_mv 10.1109/NOMS.2008.4575140
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_4575140</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4575140</ieee_id><sourcerecordid>4575140</sourcerecordid><originalsourceid>FETCH-LOGICAL-i284t-7276e6c93c7f20c902ca2751619de33969b3a16db8bc9d8d3dc0ea030fec1f263</originalsourceid><addsrcrecordid>eNpFj81KAzEURuMfOK0-gLiZF8j03ptMMllKsSpUu1DXJZO5g9H-kQxI396KBVdnceDjfELcIFSI4CYvi-fXigCaSte2Rg0nYoSatCYwBk5FQcpq6Sy4s39R07kosNYkkQAvxSjnTwBtQUEhJjOfhzLHdVz5FId9mdmn8FHGTbljTnLYyl-WGx6-t-krX4mL3q8yXx85Fu-z-7fpo5wvHp6md3MZqdGDtGQNm-BUsD1BcEDB06HYoOtYKWdcqzyarm3a4LqmU10A9oeingP2ZNRY3P7tRmZe7lJc-7RfHl-rH7tmRpQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Fast similarity search in peer-to-peer networks</title><source>IEEE Xplore All Conference Series</source><creator>Bocek, T. ; Hunt, E. ; Hausheer, D. ; Stiller, B.</creator><creatorcontrib>Bocek, T. ; Hunt, E. ; Hausheer, D. ; Stiller, B.</creatorcontrib><description>Peer-to-peer (P2P) systems show numerous advantages over centralized systems, such as load balancing, scalability, and fault tolerance, and they require certain functionality, such as search, repair, and message and data transfer. In particular, structured P2P networks perform an exact search in logarithmic time proportional to the number of peers. However, keyword similarity search in a structured P2P network remains a challenge. Similarity search for service discovery can significantly improve service management in a distributed environment. As services are often described informally in text form, keyword similarity search can find the required services or data items more reliably. This paper presents a fast similarity search algorithm for structured P2P systems. The new algorithm, called P2P fast similarity search (P2PFastSS), finds similar keys in any distributed hash table (DHT) using the edit distance metric, and is independent of the underlying P2P routing algorithm. Performance analysis shows that P2PFastSS carries out a similarity search in time proportional to the logarithm of the number of peers. Simulations on PlanetLab confirm these results and show that a similarity search with 34,000 peers performs in less than three seconds on average. Thus, P2PFastSS is suitable for similarity search in large-scale network infrastructures, such as service description matching in service discovery or searching for similar terms in P2P storage networks.</description><identifier>ISSN: 1542-1201</identifier><identifier>ISBN: 1424420652</identifier><identifier>ISBN: 9781424420650</identifier><identifier>EISSN: 2374-9709</identifier><identifier>EISBN: 1424420660</identifier><identifier>EISBN: 9781424420667</identifier><identifier>DOI: 10.1109/NOMS.2008.4575140</identifier><language>eng</language><publisher>IEEE</publisher><subject>Computer science ; Data engineering ; DHT ; Electronic mail ; Fault tolerant systems ; Informatics ; Laboratories ; Load management ; P2P ; Peer to peer computing ; Routing ; Scalability ; service discovery ; similarity search</subject><ispartof>NOMS 2008 - 2008 IEEE Network Operations and Management Symposium, 2008, p.240-247</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4575140$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54555,54920,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4575140$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Bocek, T.</creatorcontrib><creatorcontrib>Hunt, E.</creatorcontrib><creatorcontrib>Hausheer, D.</creatorcontrib><creatorcontrib>Stiller, B.</creatorcontrib><title>Fast similarity search in peer-to-peer networks</title><title>NOMS 2008 - 2008 IEEE Network Operations and Management Symposium</title><addtitle>NOMS</addtitle><description>Peer-to-peer (P2P) systems show numerous advantages over centralized systems, such as load balancing, scalability, and fault tolerance, and they require certain functionality, such as search, repair, and message and data transfer. In particular, structured P2P networks perform an exact search in logarithmic time proportional to the number of peers. However, keyword similarity search in a structured P2P network remains a challenge. Similarity search for service discovery can significantly improve service management in a distributed environment. As services are often described informally in text form, keyword similarity search can find the required services or data items more reliably. This paper presents a fast similarity search algorithm for structured P2P systems. The new algorithm, called P2P fast similarity search (P2PFastSS), finds similar keys in any distributed hash table (DHT) using the edit distance metric, and is independent of the underlying P2P routing algorithm. Performance analysis shows that P2PFastSS carries out a similarity search in time proportional to the logarithm of the number of peers. Simulations on PlanetLab confirm these results and show that a similarity search with 34,000 peers performs in less than three seconds on average. Thus, P2PFastSS is suitable for similarity search in large-scale network infrastructures, such as service description matching in service discovery or searching for similar terms in P2P storage networks.</description><subject>Computer science</subject><subject>Data engineering</subject><subject>DHT</subject><subject>Electronic mail</subject><subject>Fault tolerant systems</subject><subject>Informatics</subject><subject>Laboratories</subject><subject>Load management</subject><subject>P2P</subject><subject>Peer to peer computing</subject><subject>Routing</subject><subject>Scalability</subject><subject>service discovery</subject><subject>similarity search</subject><issn>1542-1201</issn><issn>2374-9709</issn><isbn>1424420652</isbn><isbn>9781424420650</isbn><isbn>1424420660</isbn><isbn>9781424420667</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2008</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNpFj81KAzEURuMfOK0-gLiZF8j03ptMMllKsSpUu1DXJZO5g9H-kQxI396KBVdnceDjfELcIFSI4CYvi-fXigCaSte2Rg0nYoSatCYwBk5FQcpq6Sy4s39R07kosNYkkQAvxSjnTwBtQUEhJjOfhzLHdVz5FId9mdmn8FHGTbljTnLYyl-WGx6-t-krX4mL3q8yXx85Fu-z-7fpo5wvHp6md3MZqdGDtGQNm-BUsD1BcEDB06HYoOtYKWdcqzyarm3a4LqmU10A9oeingP2ZNRY3P7tRmZe7lJc-7RfHl-rH7tmRpQ</recordid><startdate>20080101</startdate><enddate>20080101</enddate><creator>Bocek, T.</creator><creator>Hunt, E.</creator><creator>Hausheer, D.</creator><creator>Stiller, B.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20080101</creationdate><title>Fast similarity search in peer-to-peer networks</title><author>Bocek, T. ; Hunt, E. ; Hausheer, D. ; Stiller, B.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i284t-7276e6c93c7f20c902ca2751619de33969b3a16db8bc9d8d3dc0ea030fec1f263</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Computer science</topic><topic>Data engineering</topic><topic>DHT</topic><topic>Electronic mail</topic><topic>Fault tolerant systems</topic><topic>Informatics</topic><topic>Laboratories</topic><topic>Load management</topic><topic>P2P</topic><topic>Peer to peer computing</topic><topic>Routing</topic><topic>Scalability</topic><topic>service discovery</topic><topic>similarity search</topic><toplevel>online_resources</toplevel><creatorcontrib>Bocek, T.</creatorcontrib><creatorcontrib>Hunt, E.</creatorcontrib><creatorcontrib>Hausheer, D.</creatorcontrib><creatorcontrib>Stiller, B.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bocek, T.</au><au>Hunt, E.</au><au>Hausheer, D.</au><au>Stiller, B.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Fast similarity search in peer-to-peer networks</atitle><btitle>NOMS 2008 - 2008 IEEE Network Operations and Management Symposium</btitle><stitle>NOMS</stitle><date>2008-01-01</date><risdate>2008</risdate><spage>240</spage><epage>247</epage><pages>240-247</pages><issn>1542-1201</issn><eissn>2374-9709</eissn><isbn>1424420652</isbn><isbn>9781424420650</isbn><eisbn>1424420660</eisbn><eisbn>9781424420667</eisbn><abstract>Peer-to-peer (P2P) systems show numerous advantages over centralized systems, such as load balancing, scalability, and fault tolerance, and they require certain functionality, such as search, repair, and message and data transfer. In particular, structured P2P networks perform an exact search in logarithmic time proportional to the number of peers. However, keyword similarity search in a structured P2P network remains a challenge. Similarity search for service discovery can significantly improve service management in a distributed environment. As services are often described informally in text form, keyword similarity search can find the required services or data items more reliably. This paper presents a fast similarity search algorithm for structured P2P systems. The new algorithm, called P2P fast similarity search (P2PFastSS), finds similar keys in any distributed hash table (DHT) using the edit distance metric, and is independent of the underlying P2P routing algorithm. Performance analysis shows that P2PFastSS carries out a similarity search in time proportional to the logarithm of the number of peers. Simulations on PlanetLab confirm these results and show that a similarity search with 34,000 peers performs in less than three seconds on average. Thus, P2PFastSS is suitable for similarity search in large-scale network infrastructures, such as service description matching in service discovery or searching for similar terms in P2P storage networks.</abstract><pub>IEEE</pub><doi>10.1109/NOMS.2008.4575140</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1542-1201
ispartof NOMS 2008 - 2008 IEEE Network Operations and Management Symposium, 2008, p.240-247
issn 1542-1201
2374-9709
language eng
recordid cdi_ieee_primary_4575140
source IEEE Xplore All Conference Series
subjects Computer science
Data engineering
DHT
Electronic mail
Fault tolerant systems
Informatics
Laboratories
Load management
P2P
Peer to peer computing
Routing
Scalability
service discovery
similarity search
title Fast similarity search in peer-to-peer networks
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T18%3A23%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Fast%20similarity%20search%20in%20peer-to-peer%20networks&rft.btitle=NOMS%202008%20-%202008%20IEEE%20Network%20Operations%20and%20Management%20Symposium&rft.au=Bocek,%20T.&rft.date=2008-01-01&rft.spage=240&rft.epage=247&rft.pages=240-247&rft.issn=1542-1201&rft.eissn=2374-9709&rft.isbn=1424420652&rft.isbn_list=9781424420650&rft_id=info:doi/10.1109/NOMS.2008.4575140&rft.eisbn=1424420660&rft.eisbn_list=9781424420667&rft_dat=%3Cieee_CHZPO%3E4575140%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i284t-7276e6c93c7f20c902ca2751619de33969b3a16db8bc9d8d3dc0ea030fec1f263%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4575140&rfr_iscdi=true