Loading…
Efficient Diversification for Recommending Aggregate Data Visualizations
Visual data exploration is ubiquitous in nearly every industry and organization to support discovering data-driven actionable insights. However, unlocking those insights requires analysts to manually construct a prohibitively large number of aggregate queries and visually explore their results looki...
Saved in:
Published in: | IEEE access 2023-01, Vol.11, p.1-1 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c359t-9eefd612f786809d124ffc94e877073059c4a1dc9c566303432d910f754e2b223 |
container_end_page | 1 |
container_issue | |
container_start_page | 1 |
container_title | IEEE access |
container_volume | 11 |
creator | Sharaf, Mohamed A. Mafrur, Rischan Zuccon, Guido |
description | Visual data exploration is ubiquitous in nearly every industry and organization to support discovering data-driven actionable insights. However, unlocking those insights requires analysts to manually construct a prohibitively large number of aggregate queries and visually explore their results looking for those valuable and insightful visualizations. Such challenge naturally motivated the development of novel solutions that automate the visual explorations process, and recommend to analysts those particular queries that best visualize their data and reveal interesting actionable insights. In such automated solutions, there is a clear need for providing analysts with a diversified and concise set of recommended visualizations, which cover and represent a large combinatorial high-dimensional space of possible visualizations. However, directly incorporating existing diversification methods leads to a "process-first-diversify-next" approach, in which all possible data visualizations are generated first through executing a large number of aggregate queries. To address this challenge and minimize the incurred query processing costs, in this work we propose novel optimization techniques for the efficient diversification of recommended insightful visualizations. The key idea underlying our proposed techniques is to identify and eliminate the processing of a large number of low-utility insignificant visualizations. Meanwhile, for the potentially high-utility insightful visualizations, shared multi-query optimization techniques are proposed for further reduction in data processing cost. Our extensive experimental evaluation on real datasets demonstrates the performance gains provided by our proposed techniques, in terms of minimizing the query processing cost (i.e., efficiency), as well as maximizing the quality of recommendations (i.e., effectiveness). |
doi_str_mv | 10.1109/ACCESS.2023.3283457 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2023_3283457</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10144772</ieee_id><doaj_id>oai_doaj_org_article_2d098fa44e5f40eebd081427904f78b6</doaj_id><sourcerecordid>2828939916</sourcerecordid><originalsourceid>FETCH-LOGICAL-c359t-9eefd612f786809d124ffc94e877073059c4a1dc9c566303432d910f754e2b223</originalsourceid><addsrcrecordid>eNpNUV1LwzAULaLgmPsF-lDwuTNfbZrH0U03GAhOfQ1ZelMytmYmnaC_3mwdsvtyPzjn3Hs5SXKP0RhjJJ4mVTVbrcYEETqmpKQs51fJgOBCZDSnxfVFfZuMQtigGGUc5XyQzGfGWG2h7dKp_QYfbGxVZ12bGufTN9But4O2tm2TTprGQ6M6SKeqU-mnDQe1tb8ndLhLbozaBhid8zD5eJ69V_Ns-fqyqCbLTNNcdJkAMHWBieFlUSJRY8KM0YJByTniFOVCM4VrLXReFBRRRkktMDI8Z0DWhNBhsuh1a6c2cu_tTvkf6ZSVp4HzjVS-s3oLktRIlEYxBrlhCGBdx7cZ4QKxuH5dRK3HXmvv3dcBQic37uDbeL4kJSkFFQIfUbRHae9C8GD-t2Ikjw7I3gF5dECeHYish55lAeCCgRnjnNA_A32AqA</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2828939916</pqid></control><display><type>article</type><title>Efficient Diversification for Recommending Aggregate Data Visualizations</title><source>IEEE Xplore Open Access Journals</source><creator>Sharaf, Mohamed A. ; Mafrur, Rischan ; Zuccon, Guido</creator><creatorcontrib>Sharaf, Mohamed A. ; Mafrur, Rischan ; Zuccon, Guido</creatorcontrib><description>Visual data exploration is ubiquitous in nearly every industry and organization to support discovering data-driven actionable insights. However, unlocking those insights requires analysts to manually construct a prohibitively large number of aggregate queries and visually explore their results looking for those valuable and insightful visualizations. Such challenge naturally motivated the development of novel solutions that automate the visual explorations process, and recommend to analysts those particular queries that best visualize their data and reveal interesting actionable insights. In such automated solutions, there is a clear need for providing analysts with a diversified and concise set of recommended visualizations, which cover and represent a large combinatorial high-dimensional space of possible visualizations. However, directly incorporating existing diversification methods leads to a "process-first-diversify-next" approach, in which all possible data visualizations are generated first through executing a large number of aggregate queries. To address this challenge and minimize the incurred query processing costs, in this work we propose novel optimization techniques for the efficient diversification of recommended insightful visualizations. The key idea underlying our proposed techniques is to identify and eliminate the processing of a large number of low-utility insignificant visualizations. Meanwhile, for the potentially high-utility insightful visualizations, shared multi-query optimization techniques are proposed for further reduction in data processing cost. Our extensive experimental evaluation on real datasets demonstrates the performance gains provided by our proposed techniques, in terms of minimizing the query processing cost (i.e., efficiency), as well as maximizing the quality of recommendations (i.e., effectiveness).</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2023.3283457</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Aggregates ; Automation ; Combinatorial analysis ; Costs ; Data Exploration ; Data processing ; Data visualization ; Diabetes ; Diversification ; Hospitals ; Optimization ; Optimization techniques ; Queries ; Query processing ; Recommendation Diversification ; Visual Analytics</subject><ispartof>IEEE access, 2023-01, Vol.11, p.1-1</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c359t-9eefd612f786809d124ffc94e877073059c4a1dc9c566303432d910f754e2b223</cites><orcidid>0000-0002-3405-5224 ; 0000-0003-0271-5563 ; 0000-0003-4424-3736</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10144772$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,27610,27901,27902,54908</link.rule.ids></links><search><creatorcontrib>Sharaf, Mohamed A.</creatorcontrib><creatorcontrib>Mafrur, Rischan</creatorcontrib><creatorcontrib>Zuccon, Guido</creatorcontrib><title>Efficient Diversification for Recommending Aggregate Data Visualizations</title><title>IEEE access</title><addtitle>Access</addtitle><description>Visual data exploration is ubiquitous in nearly every industry and organization to support discovering data-driven actionable insights. However, unlocking those insights requires analysts to manually construct a prohibitively large number of aggregate queries and visually explore their results looking for those valuable and insightful visualizations. Such challenge naturally motivated the development of novel solutions that automate the visual explorations process, and recommend to analysts those particular queries that best visualize their data and reveal interesting actionable insights. In such automated solutions, there is a clear need for providing analysts with a diversified and concise set of recommended visualizations, which cover and represent a large combinatorial high-dimensional space of possible visualizations. However, directly incorporating existing diversification methods leads to a "process-first-diversify-next" approach, in which all possible data visualizations are generated first through executing a large number of aggregate queries. To address this challenge and minimize the incurred query processing costs, in this work we propose novel optimization techniques for the efficient diversification of recommended insightful visualizations. The key idea underlying our proposed techniques is to identify and eliminate the processing of a large number of low-utility insignificant visualizations. Meanwhile, for the potentially high-utility insightful visualizations, shared multi-query optimization techniques are proposed for further reduction in data processing cost. Our extensive experimental evaluation on real datasets demonstrates the performance gains provided by our proposed techniques, in terms of minimizing the query processing cost (i.e., efficiency), as well as maximizing the quality of recommendations (i.e., effectiveness).</description><subject>Aggregates</subject><subject>Automation</subject><subject>Combinatorial analysis</subject><subject>Costs</subject><subject>Data Exploration</subject><subject>Data processing</subject><subject>Data visualization</subject><subject>Diabetes</subject><subject>Diversification</subject><subject>Hospitals</subject><subject>Optimization</subject><subject>Optimization techniques</subject><subject>Queries</subject><subject>Query processing</subject><subject>Recommendation Diversification</subject><subject>Visual Analytics</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>DOA</sourceid><recordid>eNpNUV1LwzAULaLgmPsF-lDwuTNfbZrH0U03GAhOfQ1ZelMytmYmnaC_3mwdsvtyPzjn3Hs5SXKP0RhjJJ4mVTVbrcYEETqmpKQs51fJgOBCZDSnxfVFfZuMQtigGGUc5XyQzGfGWG2h7dKp_QYfbGxVZ12bGufTN9But4O2tm2TTprGQ6M6SKeqU-mnDQe1tb8ndLhLbozaBhid8zD5eJ69V_Ns-fqyqCbLTNNcdJkAMHWBieFlUSJRY8KM0YJByTniFOVCM4VrLXReFBRRRkktMDI8Z0DWhNBhsuh1a6c2cu_tTvkf6ZSVp4HzjVS-s3oLktRIlEYxBrlhCGBdx7cZ4QKxuH5dRK3HXmvv3dcBQic37uDbeL4kJSkFFQIfUbRHae9C8GD-t2Ikjw7I3gF5dECeHYish55lAeCCgRnjnNA_A32AqA</recordid><startdate>20230101</startdate><enddate>20230101</enddate><creator>Sharaf, Mohamed A.</creator><creator>Mafrur, Rischan</creator><creator>Zuccon, Guido</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-3405-5224</orcidid><orcidid>https://orcid.org/0000-0003-0271-5563</orcidid><orcidid>https://orcid.org/0000-0003-4424-3736</orcidid></search><sort><creationdate>20230101</creationdate><title>Efficient Diversification for Recommending Aggregate Data Visualizations</title><author>Sharaf, Mohamed A. ; Mafrur, Rischan ; Zuccon, Guido</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c359t-9eefd612f786809d124ffc94e877073059c4a1dc9c566303432d910f754e2b223</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Aggregates</topic><topic>Automation</topic><topic>Combinatorial analysis</topic><topic>Costs</topic><topic>Data Exploration</topic><topic>Data processing</topic><topic>Data visualization</topic><topic>Diabetes</topic><topic>Diversification</topic><topic>Hospitals</topic><topic>Optimization</topic><topic>Optimization techniques</topic><topic>Queries</topic><topic>Query processing</topic><topic>Recommendation Diversification</topic><topic>Visual Analytics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sharaf, Mohamed A.</creatorcontrib><creatorcontrib>Mafrur, Rischan</creatorcontrib><creatorcontrib>Zuccon, Guido</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Xplore Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sharaf, Mohamed A.</au><au>Mafrur, Rischan</au><au>Zuccon, Guido</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Efficient Diversification for Recommending Aggregate Data Visualizations</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2023-01-01</date><risdate>2023</risdate><volume>11</volume><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Visual data exploration is ubiquitous in nearly every industry and organization to support discovering data-driven actionable insights. However, unlocking those insights requires analysts to manually construct a prohibitively large number of aggregate queries and visually explore their results looking for those valuable and insightful visualizations. Such challenge naturally motivated the development of novel solutions that automate the visual explorations process, and recommend to analysts those particular queries that best visualize their data and reveal interesting actionable insights. In such automated solutions, there is a clear need for providing analysts with a diversified and concise set of recommended visualizations, which cover and represent a large combinatorial high-dimensional space of possible visualizations. However, directly incorporating existing diversification methods leads to a "process-first-diversify-next" approach, in which all possible data visualizations are generated first through executing a large number of aggregate queries. To address this challenge and minimize the incurred query processing costs, in this work we propose novel optimization techniques for the efficient diversification of recommended insightful visualizations. The key idea underlying our proposed techniques is to identify and eliminate the processing of a large number of low-utility insignificant visualizations. Meanwhile, for the potentially high-utility insightful visualizations, shared multi-query optimization techniques are proposed for further reduction in data processing cost. Our extensive experimental evaluation on real datasets demonstrates the performance gains provided by our proposed techniques, in terms of minimizing the query processing cost (i.e., efficiency), as well as maximizing the quality of recommendations (i.e., effectiveness).</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2023.3283457</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-3405-5224</orcidid><orcidid>https://orcid.org/0000-0003-0271-5563</orcidid><orcidid>https://orcid.org/0000-0003-4424-3736</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2169-3536 |
ispartof | IEEE access, 2023-01, Vol.11, p.1-1 |
issn | 2169-3536 2169-3536 |
language | eng |
recordid | cdi_crossref_primary_10_1109_ACCESS_2023_3283457 |
source | IEEE Xplore Open Access Journals |
subjects | Aggregates Automation Combinatorial analysis Costs Data Exploration Data processing Data visualization Diabetes Diversification Hospitals Optimization Optimization techniques Queries Query processing Recommendation Diversification Visual Analytics |
title | Efficient Diversification for Recommending Aggregate Data Visualizations |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T13%3A01%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Efficient%20Diversification%20for%20Recommending%20Aggregate%20Data%20Visualizations&rft.jtitle=IEEE%20access&rft.au=Sharaf,%20Mohamed%20A.&rft.date=2023-01-01&rft.volume=11&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2023.3283457&rft_dat=%3Cproquest_cross%3E2828939916%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c359t-9eefd612f786809d124ffc94e877073059c4a1dc9c566303432d910f754e2b223%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2828939916&rft_id=info:pmid/&rft_ieee_id=10144772&rfr_iscdi=true |