Loading…

Probabilistic Skyline Computation on Vertically Distributed Uncertain Data

The skyline query is important in database community. Recently, owing to the inherent uncertainty of some applications, skyline query on uncertain data has been widelystudied using probabilistic model, e.g. p-skyline. In the scenario where uncertain data is vertically distributed among multiple serv...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Kaiqi, Wang, Jinbao, Wang, Muxian, Han, Xixian
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 163
container_issue
container_start_page 154
container_title
container_volume
creator Zhang, Kaiqi
Wang, Jinbao
Wang, Muxian
Han, Xixian
description The skyline query is important in database community. Recently, owing to the inherent uncertainty of some applications, skyline query on uncertain data has been widelystudied using probabilistic model, e.g. p-skyline. In the scenario where uncertain data is vertically distributed among multiple servers, the main purpose of p-skyline computation is to minimize the retrieved records from servers to the local client due to the dominance factor of expensive network communication. In this paper, we present three communication-efficient p-skyline algorithms ASR, IASR and FSLR on vertically distributed uncertain data. ASR alternates sorted and random accesses to retrieve the records at servers and performs retrieving-boundingchecking iteration until all the objects can be determined whether they are in the p-skyline result or not. The communication of the instances not retrieved can be saved. IASR is an improved version of ASR. By examining the net gain of retrieving-boundingchecking iteration, IASR early terminates the iteration to further reduce the cost of communication. Compared to ASR and IASR, FSLR performs random accesses only on demand. FSLR first conducts sorted accesses to get loose upper bounds of skyline probabilities of the instances. Then, FSLR uses random accesses to complement a part of retrieved instances to get tighter upper and lower bounds of skyline probabilities until the p-skyline result is computed. Our experimental results demonstrate that our algorithms ASR, IASR and FSLR significantly outperform the intuitive method for p-skyline computation on vertically distributed uncertain data.
doi_str_mv 10.1109/ICDCS.2019.00024
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_8885362</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8885362</ieee_id><sourcerecordid>8885362</sourcerecordid><originalsourceid>FETCH-LOGICAL-i203t-5eae680e3beee62103fbfd65482dd09c23c0e9e3a780a63c851afddc29c584ec3</originalsourceid><addsrcrecordid>eNotjF1LwzAUhqMgOOfuBW_yBzpPkiZNLqX1YzJQmPN2nCanEO3a0XYX_fcGFF54L56Hh7E7AWshwD1syqrcrSUItwYAmV-wlSusKKQVUgsHl2whdaEzmwtxzW7G8Ttp2hq1YG8fQ19jHds4TtHz3c_cxo542R9P5wmn2Hc87YuGRLFtZ14lcYj1eaLA951PAGPHK5zwll012I60-v8l2z8_fZav2fb9ZVM-brMoQU2ZJiRjgVRNREYKUE3dBKNzK0MA56XyQI4UFhbQKG-1wCYEL53XNievluz-rxtT4HAa4hGH-WCt1cpI9Qu4Jk5v</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Probabilistic Skyline Computation on Vertically Distributed Uncertain Data</title><source>IEEE Xplore All Conference Series</source><creator>Zhang, Kaiqi ; Wang, Jinbao ; Wang, Muxian ; Han, Xixian</creator><creatorcontrib>Zhang, Kaiqi ; Wang, Jinbao ; Wang, Muxian ; Han, Xixian</creatorcontrib><description>The skyline query is important in database community. Recently, owing to the inherent uncertainty of some applications, skyline query on uncertain data has been widelystudied using probabilistic model, e.g. p-skyline. In the scenario where uncertain data is vertically distributed among multiple servers, the main purpose of p-skyline computation is to minimize the retrieved records from servers to the local client due to the dominance factor of expensive network communication. In this paper, we present three communication-efficient p-skyline algorithms ASR, IASR and FSLR on vertically distributed uncertain data. ASR alternates sorted and random accesses to retrieve the records at servers and performs retrieving-boundingchecking iteration until all the objects can be determined whether they are in the p-skyline result or not. The communication of the instances not retrieved can be saved. IASR is an improved version of ASR. By examining the net gain of retrieving-boundingchecking iteration, IASR early terminates the iteration to further reduce the cost of communication. Compared to ASR and IASR, FSLR performs random accesses only on demand. FSLR first conducts sorted accesses to get loose upper bounds of skyline probabilities of the instances. Then, FSLR uses random accesses to complement a part of retrieved instances to get tighter upper and lower bounds of skyline probabilities until the p-skyline result is computed. Our experimental results demonstrate that our algorithms ASR, IASR and FSLR significantly outperform the intuitive method for p-skyline computation on vertically distributed uncertain data.</description><identifier>EISSN: 2575-8411</identifier><identifier>EISBN: 9781728125190</identifier><identifier>EISBN: 1728125197</identifier><identifier>DOI: 10.1109/ICDCS.2019.00024</identifier><identifier>CODEN: IEEPAD</identifier><language>eng</language><publisher>IEEE</publisher><subject>Companies ; Distributed databases ; Probabilistic logic ; probabilistic skyline ; Servers ; uncertain data ; Uncertainty ; Upper bound ; vertical distribution</subject><ispartof>2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS), 2019, p.154-163</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8885362$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,27902,54530,54907</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8885362$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Zhang, Kaiqi</creatorcontrib><creatorcontrib>Wang, Jinbao</creatorcontrib><creatorcontrib>Wang, Muxian</creatorcontrib><creatorcontrib>Han, Xixian</creatorcontrib><title>Probabilistic Skyline Computation on Vertically Distributed Uncertain Data</title><title>2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS)</title><addtitle>ICDSC</addtitle><description>The skyline query is important in database community. Recently, owing to the inherent uncertainty of some applications, skyline query on uncertain data has been widelystudied using probabilistic model, e.g. p-skyline. In the scenario where uncertain data is vertically distributed among multiple servers, the main purpose of p-skyline computation is to minimize the retrieved records from servers to the local client due to the dominance factor of expensive network communication. In this paper, we present three communication-efficient p-skyline algorithms ASR, IASR and FSLR on vertically distributed uncertain data. ASR alternates sorted and random accesses to retrieve the records at servers and performs retrieving-boundingchecking iteration until all the objects can be determined whether they are in the p-skyline result or not. The communication of the instances not retrieved can be saved. IASR is an improved version of ASR. By examining the net gain of retrieving-boundingchecking iteration, IASR early terminates the iteration to further reduce the cost of communication. Compared to ASR and IASR, FSLR performs random accesses only on demand. FSLR first conducts sorted accesses to get loose upper bounds of skyline probabilities of the instances. Then, FSLR uses random accesses to complement a part of retrieved instances to get tighter upper and lower bounds of skyline probabilities until the p-skyline result is computed. Our experimental results demonstrate that our algorithms ASR, IASR and FSLR significantly outperform the intuitive method for p-skyline computation on vertically distributed uncertain data.</description><subject>Companies</subject><subject>Distributed databases</subject><subject>Probabilistic logic</subject><subject>probabilistic skyline</subject><subject>Servers</subject><subject>uncertain data</subject><subject>Uncertainty</subject><subject>Upper bound</subject><subject>vertical distribution</subject><issn>2575-8411</issn><isbn>9781728125190</isbn><isbn>1728125197</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2019</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotjF1LwzAUhqMgOOfuBW_yBzpPkiZNLqX1YzJQmPN2nCanEO3a0XYX_fcGFF54L56Hh7E7AWshwD1syqrcrSUItwYAmV-wlSusKKQVUgsHl2whdaEzmwtxzW7G8Ttp2hq1YG8fQ19jHds4TtHz3c_cxo542R9P5wmn2Hc87YuGRLFtZ14lcYj1eaLA951PAGPHK5zwll012I60-v8l2z8_fZav2fb9ZVM-brMoQU2ZJiRjgVRNREYKUE3dBKNzK0MA56XyQI4UFhbQKG-1wCYEL53XNievluz-rxtT4HAa4hGH-WCt1cpI9Qu4Jk5v</recordid><startdate>201907</startdate><enddate>201907</enddate><creator>Zhang, Kaiqi</creator><creator>Wang, Jinbao</creator><creator>Wang, Muxian</creator><creator>Han, Xixian</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201907</creationdate><title>Probabilistic Skyline Computation on Vertically Distributed Uncertain Data</title><author>Zhang, Kaiqi ; Wang, Jinbao ; Wang, Muxian ; Han, Xixian</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i203t-5eae680e3beee62103fbfd65482dd09c23c0e9e3a780a63c851afddc29c584ec3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Companies</topic><topic>Distributed databases</topic><topic>Probabilistic logic</topic><topic>probabilistic skyline</topic><topic>Servers</topic><topic>uncertain data</topic><topic>Uncertainty</topic><topic>Upper bound</topic><topic>vertical distribution</topic><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Kaiqi</creatorcontrib><creatorcontrib>Wang, Jinbao</creatorcontrib><creatorcontrib>Wang, Muxian</creatorcontrib><creatorcontrib>Han, Xixian</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zhang, Kaiqi</au><au>Wang, Jinbao</au><au>Wang, Muxian</au><au>Han, Xixian</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Probabilistic Skyline Computation on Vertically Distributed Uncertain Data</atitle><btitle>2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS)</btitle><stitle>ICDSC</stitle><date>2019-07</date><risdate>2019</risdate><spage>154</spage><epage>163</epage><pages>154-163</pages><eissn>2575-8411</eissn><eisbn>9781728125190</eisbn><eisbn>1728125197</eisbn><coden>IEEPAD</coden><abstract>The skyline query is important in database community. Recently, owing to the inherent uncertainty of some applications, skyline query on uncertain data has been widelystudied using probabilistic model, e.g. p-skyline. In the scenario where uncertain data is vertically distributed among multiple servers, the main purpose of p-skyline computation is to minimize the retrieved records from servers to the local client due to the dominance factor of expensive network communication. In this paper, we present three communication-efficient p-skyline algorithms ASR, IASR and FSLR on vertically distributed uncertain data. ASR alternates sorted and random accesses to retrieve the records at servers and performs retrieving-boundingchecking iteration until all the objects can be determined whether they are in the p-skyline result or not. The communication of the instances not retrieved can be saved. IASR is an improved version of ASR. By examining the net gain of retrieving-boundingchecking iteration, IASR early terminates the iteration to further reduce the cost of communication. Compared to ASR and IASR, FSLR performs random accesses only on demand. FSLR first conducts sorted accesses to get loose upper bounds of skyline probabilities of the instances. Then, FSLR uses random accesses to complement a part of retrieved instances to get tighter upper and lower bounds of skyline probabilities until the p-skyline result is computed. Our experimental results demonstrate that our algorithms ASR, IASR and FSLR significantly outperform the intuitive method for p-skyline computation on vertically distributed uncertain data.</abstract><pub>IEEE</pub><doi>10.1109/ICDCS.2019.00024</doi><tpages>10</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier EISSN: 2575-8411
ispartof 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS), 2019, p.154-163
issn 2575-8411
language eng
recordid cdi_ieee_primary_8885362
source IEEE Xplore All Conference Series
subjects Companies
Distributed databases
Probabilistic logic
probabilistic skyline
Servers
uncertain data
Uncertainty
Upper bound
vertical distribution
title Probabilistic Skyline Computation on Vertically Distributed Uncertain Data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T02%3A22%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Probabilistic%20Skyline%20Computation%20on%20Vertically%20Distributed%20Uncertain%20Data&rft.btitle=2019%20IEEE%2039th%20International%20Conference%20on%20Distributed%20Computing%20Systems%20(ICDCS)&rft.au=Zhang,%20Kaiqi&rft.date=2019-07&rft.spage=154&rft.epage=163&rft.pages=154-163&rft.eissn=2575-8411&rft.coden=IEEPAD&rft_id=info:doi/10.1109/ICDCS.2019.00024&rft.eisbn=9781728125190&rft.eisbn_list=1728125197&rft_dat=%3Cieee_CHZPO%3E8885362%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i203t-5eae680e3beee62103fbfd65482dd09c23c0e9e3a780a63c851afddc29c584ec3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=8885362&rfr_iscdi=true