Loading…

Platform-Independent Robust Query Processing

To address the classical selectivity estimation problem for OLAP queries in relational databases, a radically different approach called PlanBouquet was recently proposed in [1] , wherein the estimation process is completely abandoned and replaced with a calibrated discovery mechanism. The beneficia...

Full description

Saved in:

Bibliographic Details
Published in:	IEEE transactions on knowledge and data engineering 2019-01, Vol.31 (1), p.17-31
Main Authors:	Karthik, Srinivas, Haritsa, Jayant R., Kenkre, Sreyash, Pandit, Vinayaka, Krishnan, Lohit
Format:	Article
Language:	English
Subjects:	Algorithms Alignment Benchmark testing Benchmarks Dependence Engines Estimation Inspection Linearity plan bouquets Queries Query processing Relational data bases robust query processing Robustness Selectivity Selectivity estimation Three-dimensional displays
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c293t-fa74e0c0ae15f609573dc5385599081eb4ea5e733acf8872d5292e5673cefe683
cites	cdi_FETCH-LOGICAL-c293t-fa74e0c0ae15f609573dc5385599081eb4ea5e733acf8872d5292e5673cefe683
container_end_page	31
container_issue	1
container_start_page	17
container_title	IEEE transactions on knowledge and data engineering
container_volume	31
creator	Karthik, Srinivas Haritsa, Jayant R. Kenkre, Sreyash Pandit, Vinayaka Krishnan, Lohit
description	To address the classical selectivity estimation problem for OLAP queries in relational databases, a radically different approach called PlanBouquet was recently proposed in [1] , wherein the estimation process is completely abandoned and replaced with a calibrated discovery mechanism. The beneficial outcome of this new construction is that provable guarantees on worst-case performance, measured as Maximum Sub-Optimality ( MSO ), are obtained thereby facilitating robust query processing. The PlanBouquet formulation suffers, however, from a systemic drawback-the MSO bound is a function of not only the query, but also the optimizer's behavioral profile over the underlying database platform. As a result, there are adverse consequences: (i) the bound value becomes highly variable, depending on the specifics of the current operating environment, and (ii) it becomes infeasible to compute the value without substantial investments in preprocessing overheads. In this paper, we first present SpillBound , a new query processing algorithm that retains the core strength of the PlanBouquet discovery process, but reduces the bound dependency to only the query. It does so by incorporating plan termination and selectivity monitoring mechanisms in the database engine. Specifically, SpillBound delivers a worst-case multiplicative bound, of D^2+3D , where D is simply the number of error-prone predicates in the user query. Consequently, the bound value becomes independent of the optimizer and the database platform, and the guarantee can be issued simply by query inspection. We go on to prove that SpillBound is within an O(D) factor of the best possible deterministic selectivity discovery algorithm in its class. We next devise techniques to bridge this quadratic-to-linear MSO gap by introducing the notion of contour alignment , a characterization of the nature of plan structures along the boundaries of the selectivity space. Specifically, we propose a variant of SpillBound , called AlignedBound , which exploits the alignment property and provides a guarantee in
doi_str_mv	10.1109/TKDE.2017.2664827
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TKDE_2017_2664827</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7843652</ieee_id><sourcerecordid>2151473098</sourcerecordid><originalsourceid>FETCH-LOGICAL-c293t-fa74e0c0ae15f609573dc5385599081eb4ea5e733acf8872d5292e5673cefe683</originalsourceid><addsrcrecordid>eNo9kE1LAzEQhoMoWKs_QLwUvLprJh-b5Ci1arFglXoOaXYiLe3umuwe-u_dpcXLvHN43hl4CLkFmgNQ87h6f57ljILKWVEIzdQZGYGUOmNg4LzfqYBMcKEuyVVKW0qpVhpG5GG5c22o4z6bVyU22I-qnXzV6y61k88O42GyjLXHlDbVzzW5CG6X8OaUY_L9MltN37LFx-t8-rTIPDO8zYJTAqmnDkGGghqpeOkl11IaQzXgWqCTqDh3PmitWCmZYSgLxT0GLDQfk_vj3SbWvx2m1m7rLlb9S8tAglCcmoGCI-VjnVLEYJu42bt4sEDtIMUOUuwgxZ6k9J27Y2eDiP-80oIXkvE_qCJcRg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2151473098</pqid></control><display><type>article</type><title>Platform-Independent Robust Query Processing</title><source>IEEE Xplore (Online service)</source><creator>Karthik, Srinivas ; Haritsa, Jayant R. ; Kenkre, Sreyash ; Pandit, Vinayaka ; Krishnan, Lohit</creator><creatorcontrib>Karthik, Srinivas ; Haritsa, Jayant R. ; Kenkre, Sreyash ; Pandit, Vinayaka ; Krishnan, Lohit</creatorcontrib><description><![CDATA[To address the classical selectivity estimation problem for OLAP queries in relational databases, a radically different approach called PlanBouquet was recently proposed in <xref ref-type="bibr" rid="ref1"> [1] , wherein the estimation process is completely abandoned and replaced with a calibrated discovery mechanism. The beneficial outcome of this new construction is that provable guarantees on worst-case performance, measured as Maximum Sub-Optimality ( MSO ), are obtained thereby facilitating robust query processing. The PlanBouquet formulation suffers, however, from a systemic drawback-the MSO bound is a function of not only the query, but also the optimizer's behavioral profile over the underlying database platform. As a result, there are adverse consequences: (i) the bound value becomes highly variable, depending on the specifics of the current operating environment, and (ii) it becomes infeasible to compute the value without substantial investments in preprocessing overheads. In this paper, we first present SpillBound , a new query processing algorithm that retains the core strength of the PlanBouquet discovery process, but reduces the bound dependency to only the query. It does so by incorporating plan termination and selectivity monitoring mechanisms in the database engine. Specifically, SpillBound delivers a worst-case multiplicative bound, of <inline-formula><tex-math notation="LaTeX">D^2+3D</tex-math> <inline-graphic xlink:href="venkatesh-ieq1-2664827.gif"/> </inline-formula>, where <inline-formula> <tex-math notation="LaTeX">D</tex-math> <inline-graphic xlink:href="venkatesh-ieq2-2664827.gif"/> </inline-formula> is simply the number of error-prone predicates in the user query. Consequently, the bound value becomes independent of the optimizer and the database platform, and the guarantee can be issued simply by query inspection. We go on to prove that SpillBound is within an <inline-formula> <tex-math notation="LaTeX">O(D)</tex-math> <inline-graphic xlink:href="venkatesh-ieq3-2664827.gif"/> </inline-formula> factor of the best possible deterministic selectivity discovery algorithm in its class. We next devise techniques to bridge this quadratic-to-linear MSO gap by introducing the notion of contour alignment , a characterization of the nature of plan structures along the boundaries of the selectivity space. Specifically, we propose a variant of SpillBound , called AlignedBound , which exploits the alignment property and provides a guarantee in the range <inline-formula><tex-math notation="LaTeX">\mathbf {[2D+2,D^2+3D]}</tex-math> <inline-graphic xlink:href="venkatesh-ieq4-2664827.gif"/> </inline-formula>. Finally, a detailed empirical evaluation over the standard decision-support benchmarks indicates that: (i) SpillBound provides markedly superior performance w.r.t. MSO as compared to PlanBouquet , and (ii) AlignedBound provides additional benefits for query instances that are challenging for SpillBound , often coming close to the ideal of MSO linearity in <inline-formula> <tex-math notation="LaTeX">D</tex-math> <inline-graphic xlink:href="venkatesh-ieq5-2664827.gif"/> </inline-formula>. From an absolute perspective, AlignedBound evaluates virtually all the benchmark queries considered in our study with MSO of around 10 or lesser. Therefore, in an overall sense, SpillBound and AlignedBound offer a substantive step forward in the long-standing quest for robust query processing.]]></description><identifier>ISSN: 1041-4347</identifier><identifier>EISSN: 1558-2191</identifier><identifier>DOI: 10.1109/TKDE.2017.2664827</identifier><identifier>CODEN: ITKEEH</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Algorithms ; Alignment ; Benchmark testing ; Benchmarks ; Dependence ; Engines ; Estimation ; Inspection ; Linearity ; plan bouquets ; Queries ; Query processing ; Relational data bases ; robust query processing ; Robustness ; Selectivity ; Selectivity estimation ; Three-dimensional displays</subject><ispartof>IEEE transactions on knowledge and data engineering, 2019-01, Vol.31 (1), p.17-31</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c293t-fa74e0c0ae15f609573dc5385599081eb4ea5e733acf8872d5292e5673cefe683</citedby><cites>FETCH-LOGICAL-c293t-fa74e0c0ae15f609573dc5385599081eb4ea5e733acf8872d5292e5673cefe683</cites><orcidid>0000-0003-2946-2316</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7843652$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,54796</link.rule.ids></links><search><creatorcontrib>Karthik, Srinivas</creatorcontrib><creatorcontrib>Haritsa, Jayant R.</creatorcontrib><creatorcontrib>Kenkre, Sreyash</creatorcontrib><creatorcontrib>Pandit, Vinayaka</creatorcontrib><creatorcontrib>Krishnan, Lohit</creatorcontrib><title>Platform-Independent Robust Query Processing</title><title>IEEE transactions on knowledge and data engineering</title><addtitle>TKDE</addtitle><description><![CDATA[To address the classical selectivity estimation problem for OLAP queries in relational databases, a radically different approach called PlanBouquet was recently proposed in <xref ref-type="bibr" rid="ref1"> [1] , wherein the estimation process is completely abandoned and replaced with a calibrated discovery mechanism. The beneficial outcome of this new construction is that provable guarantees on worst-case performance, measured as Maximum Sub-Optimality ( MSO ), are obtained thereby facilitating robust query processing. The PlanBouquet formulation suffers, however, from a systemic drawback-the MSO bound is a function of not only the query, but also the optimizer's behavioral profile over the underlying database platform. As a result, there are adverse consequences: (i) the bound value becomes highly variable, depending on the specifics of the current operating environment, and (ii) it becomes infeasible to compute the value without substantial investments in preprocessing overheads. In this paper, we first present SpillBound , a new query processing algorithm that retains the core strength of the PlanBouquet discovery process, but reduces the bound dependency to only the query. It does so by incorporating plan termination and selectivity monitoring mechanisms in the database engine. Specifically, SpillBound delivers a worst-case multiplicative bound, of <inline-formula><tex-math notation="LaTeX">D^2+3D</tex-math> <inline-graphic xlink:href="venkatesh-ieq1-2664827.gif"/> </inline-formula>, where <inline-formula> <tex-math notation="LaTeX">D</tex-math> <inline-graphic xlink:href="venkatesh-ieq2-2664827.gif"/> </inline-formula> is simply the number of error-prone predicates in the user query. Consequently, the bound value becomes independent of the optimizer and the database platform, and the guarantee can be issued simply by query inspection. We go on to prove that SpillBound is within an <inline-formula> <tex-math notation="LaTeX">O(D)</tex-math> <inline-graphic xlink:href="venkatesh-ieq3-2664827.gif"/> </inline-formula> factor of the best possible deterministic selectivity discovery algorithm in its class. We next devise techniques to bridge this quadratic-to-linear MSO gap by introducing the notion of contour alignment , a characterization of the nature of plan structures along the boundaries of the selectivity space. Specifically, we propose a variant of SpillBound , called AlignedBound , which exploits the alignment property and provides a guarantee in the range <inline-formula><tex-math notation="LaTeX">\mathbf {[2D+2,D^2+3D]}</tex-math> <inline-graphic xlink:href="venkatesh-ieq4-2664827.gif"/> </inline-formula>. Finally, a detailed empirical evaluation over the standard decision-support benchmarks indicates that: (i) SpillBound provides markedly superior performance w.r.t. MSO as compared to PlanBouquet , and (ii) AlignedBound provides additional benefits for query instances that are challenging for SpillBound , often coming close to the ideal of MSO linearity in <inline-formula> <tex-math notation="LaTeX">D</tex-math> <inline-graphic xlink:href="venkatesh-ieq5-2664827.gif"/> </inline-formula>. From an absolute perspective, AlignedBound evaluates virtually all the benchmark queries considered in our study with MSO of around 10 or lesser. Therefore, in an overall sense, SpillBound and AlignedBound offer a substantive step forward in the long-standing quest for robust query processing.]]></description><subject>Algorithms</subject><subject>Alignment</subject><subject>Benchmark testing</subject><subject>Benchmarks</subject><subject>Dependence</subject><subject>Engines</subject><subject>Estimation</subject><subject>Inspection</subject><subject>Linearity</subject><subject>plan bouquets</subject><subject>Queries</subject><subject>Query processing</subject><subject>Relational data bases</subject><subject>robust query processing</subject><subject>Robustness</subject><subject>Selectivity</subject><subject>Selectivity estimation</subject><subject>Three-dimensional displays</subject><issn>1041-4347</issn><issn>1558-2191</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNo9kE1LAzEQhoMoWKs_QLwUvLprJh-b5Ci1arFglXoOaXYiLe3umuwe-u_dpcXLvHN43hl4CLkFmgNQ87h6f57ljILKWVEIzdQZGYGUOmNg4LzfqYBMcKEuyVVKW0qpVhpG5GG5c22o4z6bVyU22I-qnXzV6y61k88O42GyjLXHlDbVzzW5CG6X8OaUY_L9MltN37LFx-t8-rTIPDO8zYJTAqmnDkGGghqpeOkl11IaQzXgWqCTqDh3PmitWCmZYSgLxT0GLDQfk_vj3SbWvx2m1m7rLlb9S8tAglCcmoGCI-VjnVLEYJu42bt4sEDtIMUOUuwgxZ6k9J27Y2eDiP-80oIXkvE_qCJcRg</recordid><startdate>20190101</startdate><enddate>20190101</enddate><creator>Karthik, Srinivas</creator><creator>Haritsa, Jayant R.</creator><creator>Kenkre, Sreyash</creator><creator>Pandit, Vinayaka</creator><creator>Krishnan, Lohit</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0003-2946-2316</orcidid></search><sort><creationdate>20190101</creationdate><title>Platform-Independent Robust Query Processing</title><author>Karthik, Srinivas ; Haritsa, Jayant R. ; Kenkre, Sreyash ; Pandit, Vinayaka ; Krishnan, Lohit</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c293t-fa74e0c0ae15f609573dc5385599081eb4ea5e733acf8872d5292e5673cefe683</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Alignment</topic><topic>Benchmark testing</topic><topic>Benchmarks</topic><topic>Dependence</topic><topic>Engines</topic><topic>Estimation</topic><topic>Inspection</topic><topic>Linearity</topic><topic>plan bouquets</topic><topic>Queries</topic><topic>Query processing</topic><topic>Relational data bases</topic><topic>robust query processing</topic><topic>Robustness</topic><topic>Selectivity</topic><topic>Selectivity estimation</topic><topic>Three-dimensional displays</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Karthik, Srinivas</creatorcontrib><creatorcontrib>Haritsa, Jayant R.</creatorcontrib><creatorcontrib>Kenkre, Sreyash</creatorcontrib><creatorcontrib>Pandit, Vinayaka</creatorcontrib><creatorcontrib>Krishnan, Lohit</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on knowledge and data engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Karthik, Srinivas</au><au>Haritsa, Jayant R.</au><au>Kenkre, Sreyash</au><au>Pandit, Vinayaka</au><au>Krishnan, Lohit</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Platform-Independent Robust Query Processing</atitle><jtitle>IEEE transactions on knowledge and data engineering</jtitle><stitle>TKDE</stitle><date>2019-01-01</date><risdate>2019</risdate><volume>31</volume><issue>1</issue><spage>17</spage><epage>31</epage><pages>17-31</pages><issn>1041-4347</issn><eissn>1558-2191</eissn><coden>ITKEEH</coden><abstract><![CDATA[To address the classical selectivity estimation problem for OLAP queries in relational databases, a radically different approach called PlanBouquet was recently proposed in <xref ref-type="bibr" rid="ref1"> [1] , wherein the estimation process is completely abandoned and replaced with a calibrated discovery mechanism. The beneficial outcome of this new construction is that provable guarantees on worst-case performance, measured as Maximum Sub-Optimality ( MSO ), are obtained thereby facilitating robust query processing. The PlanBouquet formulation suffers, however, from a systemic drawback-the MSO bound is a function of not only the query, but also the optimizer's behavioral profile over the underlying database platform. As a result, there are adverse consequences: (i) the bound value becomes highly variable, depending on the specifics of the current operating environment, and (ii) it becomes infeasible to compute the value without substantial investments in preprocessing overheads. In this paper, we first present SpillBound , a new query processing algorithm that retains the core strength of the PlanBouquet discovery process, but reduces the bound dependency to only the query. It does so by incorporating plan termination and selectivity monitoring mechanisms in the database engine. Specifically, SpillBound delivers a worst-case multiplicative bound, of <inline-formula><tex-math notation="LaTeX">D^2+3D</tex-math> <inline-graphic xlink:href="venkatesh-ieq1-2664827.gif"/> </inline-formula>, where <inline-formula> <tex-math notation="LaTeX">D</tex-math> <inline-graphic xlink:href="venkatesh-ieq2-2664827.gif"/> </inline-formula> is simply the number of error-prone predicates in the user query. Consequently, the bound value becomes independent of the optimizer and the database platform, and the guarantee can be issued simply by query inspection. We go on to prove that SpillBound is within an <inline-formula> <tex-math notation="LaTeX">O(D)</tex-math> <inline-graphic xlink:href="venkatesh-ieq3-2664827.gif"/> </inline-formula> factor of the best possible deterministic selectivity discovery algorithm in its class. We next devise techniques to bridge this quadratic-to-linear MSO gap by introducing the notion of contour alignment , a characterization of the nature of plan structures along the boundaries of the selectivity space. Specifically, we propose a variant of SpillBound , called AlignedBound , which exploits the alignment property and provides a guarantee in the range <inline-formula><tex-math notation="LaTeX">\mathbf {[2D+2,D^2+3D]}</tex-math> <inline-graphic xlink:href="venkatesh-ieq4-2664827.gif"/> </inline-formula>. Finally, a detailed empirical evaluation over the standard decision-support benchmarks indicates that: (i) SpillBound provides markedly superior performance w.r.t. MSO as compared to PlanBouquet , and (ii) AlignedBound provides additional benefits for query instances that are challenging for SpillBound , often coming close to the ideal of MSO linearity in <inline-formula> <tex-math notation="LaTeX">D</tex-math> <inline-graphic xlink:href="venkatesh-ieq5-2664827.gif"/> </inline-formula>. From an absolute perspective, AlignedBound evaluates virtually all the benchmark queries considered in our study with MSO of around 10 or lesser. Therefore, in an overall sense, SpillBound and AlignedBound offer a substantive step forward in the long-standing quest for robust query processing.]]></abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TKDE.2017.2664827</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0003-2946-2316</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 1041-4347
ispartof	IEEE transactions on knowledge and data engineering, 2019-01, Vol.31 (1), p.17-31
issn	1041-4347 1558-2191
language	eng
recordid	cdi_crossref_primary_10_1109_TKDE_2017_2664827
source	IEEE Xplore (Online service)
subjects	Algorithms Alignment Benchmark testing Benchmarks Dependence Engines Estimation Inspection Linearity plan bouquets Queries Query processing Relational data bases robust query processing Robustness Selectivity Selectivity estimation Three-dimensional displays
title	Platform-Independent Robust Query Processing
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T18%3A51%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Platform-Independent%20Robust%20Query%20Processing&rft.jtitle=IEEE%20transactions%20on%20knowledge%20and%20data%20engineering&rft.au=Karthik,%20Srinivas&rft.date=2019-01-01&rft.volume=31&rft.issue=1&rft.spage=17&rft.epage=31&rft.pages=17-31&rft.issn=1041-4347&rft.eissn=1558-2191&rft.coden=ITKEEH&rft_id=info:doi/10.1109/TKDE.2017.2664827&rft_dat=%3Cproquest_cross%3E2151473098%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c293t-fa74e0c0ae15f609573dc5385599081eb4ea5e733acf8872d5292e5673cefe683%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2151473098&rft_id=info:pmid/&rft_ieee_id=7843652&rfr_iscdi=true