Loading…

Efficient Diversification for Recommending Aggregate Data Visualizations

Visual data exploration is ubiquitous in nearly every industry and organization to support discovering data-driven actionable insights. However, unlocking those insights requires analysts to manually construct a prohibitively large number of aggregate queries and visually explore their results looki...

Full description

Saved in:
Bibliographic Details
Published in:IEEE access 2023-01, Vol.11, p.1-1
Main Authors: Sharaf, Mohamed A., Mafrur, Rischan, Zuccon, Guido
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c359t-9eefd612f786809d124ffc94e877073059c4a1dc9c566303432d910f754e2b223
container_end_page 1
container_issue
container_start_page 1
container_title IEEE access
container_volume 11
creator Sharaf, Mohamed A.
Mafrur, Rischan
Zuccon, Guido
description Visual data exploration is ubiquitous in nearly every industry and organization to support discovering data-driven actionable insights. However, unlocking those insights requires analysts to manually construct a prohibitively large number of aggregate queries and visually explore their results looking for those valuable and insightful visualizations. Such challenge naturally motivated the development of novel solutions that automate the visual explorations process, and recommend to analysts those particular queries that best visualize their data and reveal interesting actionable insights. In such automated solutions, there is a clear need for providing analysts with a diversified and concise set of recommended visualizations, which cover and represent a large combinatorial high-dimensional space of possible visualizations. However, directly incorporating existing diversification methods leads to a "process-first-diversify-next" approach, in which all possible data visualizations are generated first through executing a large number of aggregate queries. To address this challenge and minimize the incurred query processing costs, in this work we propose novel optimization techniques for the efficient diversification of recommended insightful visualizations. The key idea underlying our proposed techniques is to identify and eliminate the processing of a large number of low-utility insignificant visualizations. Meanwhile, for the potentially high-utility insightful visualizations, shared multi-query optimization techniques are proposed for further reduction in data processing cost. Our extensive experimental evaluation on real datasets demonstrates the performance gains provided by our proposed techniques, in terms of minimizing the query processing cost (i.e., efficiency), as well as maximizing the quality of recommendations (i.e., effectiveness).
doi_str_mv 10.1109/ACCESS.2023.3283457
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2023_3283457</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10144772</ieee_id><doaj_id>oai_doaj_org_article_2d098fa44e5f40eebd081427904f78b6</doaj_id><sourcerecordid>2828939916</sourcerecordid><originalsourceid>FETCH-LOGICAL-c359t-9eefd612f786809d124ffc94e877073059c4a1dc9c566303432d910f754e2b223</originalsourceid><addsrcrecordid>eNpNUV1LwzAULaLgmPsF-lDwuTNfbZrH0U03GAhOfQ1ZelMytmYmnaC_3mwdsvtyPzjn3Hs5SXKP0RhjJJ4mVTVbrcYEETqmpKQs51fJgOBCZDSnxfVFfZuMQtigGGUc5XyQzGfGWG2h7dKp_QYfbGxVZ12bGufTN9But4O2tm2TTprGQ6M6SKeqU-mnDQe1tb8ndLhLbozaBhid8zD5eJ69V_Ns-fqyqCbLTNNcdJkAMHWBieFlUSJRY8KM0YJByTniFOVCM4VrLXReFBRRRkktMDI8Z0DWhNBhsuh1a6c2cu_tTvkf6ZSVp4HzjVS-s3oLktRIlEYxBrlhCGBdx7cZ4QKxuH5dRK3HXmvv3dcBQic37uDbeL4kJSkFFQIfUbRHae9C8GD-t2Ikjw7I3gF5dECeHYish55lAeCCgRnjnNA_A32AqA</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2828939916</pqid></control><display><type>article</type><title>Efficient Diversification for Recommending Aggregate Data Visualizations</title><source>IEEE Xplore Open Access Journals</source><creator>Sharaf, Mohamed A. ; Mafrur, Rischan ; Zuccon, Guido</creator><creatorcontrib>Sharaf, Mohamed A. ; Mafrur, Rischan ; Zuccon, Guido</creatorcontrib><description>Visual data exploration is ubiquitous in nearly every industry and organization to support discovering data-driven actionable insights. However, unlocking those insights requires analysts to manually construct a prohibitively large number of aggregate queries and visually explore their results looking for those valuable and insightful visualizations. Such challenge naturally motivated the development of novel solutions that automate the visual explorations process, and recommend to analysts those particular queries that best visualize their data and reveal interesting actionable insights. In such automated solutions, there is a clear need for providing analysts with a diversified and concise set of recommended visualizations, which cover and represent a large combinatorial high-dimensional space of possible visualizations. However, directly incorporating existing diversification methods leads to a "process-first-diversify-next" approach, in which all possible data visualizations are generated first through executing a large number of aggregate queries. To address this challenge and minimize the incurred query processing costs, in this work we propose novel optimization techniques for the efficient diversification of recommended insightful visualizations. The key idea underlying our proposed techniques is to identify and eliminate the processing of a large number of low-utility insignificant visualizations. Meanwhile, for the potentially high-utility insightful visualizations, shared multi-query optimization techniques are proposed for further reduction in data processing cost. Our extensive experimental evaluation on real datasets demonstrates the performance gains provided by our proposed techniques, in terms of minimizing the query processing cost (i.e., efficiency), as well as maximizing the quality of recommendations (i.e., effectiveness).</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2023.3283457</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Aggregates ; Automation ; Combinatorial analysis ; Costs ; Data Exploration ; Data processing ; Data visualization ; Diabetes ; Diversification ; Hospitals ; Optimization ; Optimization techniques ; Queries ; Query processing ; Recommendation Diversification ; Visual Analytics</subject><ispartof>IEEE access, 2023-01, Vol.11, p.1-1</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c359t-9eefd612f786809d124ffc94e877073059c4a1dc9c566303432d910f754e2b223</cites><orcidid>0000-0002-3405-5224 ; 0000-0003-0271-5563 ; 0000-0003-4424-3736</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10144772$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,27610,27901,27902,54908</link.rule.ids></links><search><creatorcontrib>Sharaf, Mohamed A.</creatorcontrib><creatorcontrib>Mafrur, Rischan</creatorcontrib><creatorcontrib>Zuccon, Guido</creatorcontrib><title>Efficient Diversification for Recommending Aggregate Data Visualizations</title><title>IEEE access</title><addtitle>Access</addtitle><description>Visual data exploration is ubiquitous in nearly every industry and organization to support discovering data-driven actionable insights. However, unlocking those insights requires analysts to manually construct a prohibitively large number of aggregate queries and visually explore their results looking for those valuable and insightful visualizations. Such challenge naturally motivated the development of novel solutions that automate the visual explorations process, and recommend to analysts those particular queries that best visualize their data and reveal interesting actionable insights. In such automated solutions, there is a clear need for providing analysts with a diversified and concise set of recommended visualizations, which cover and represent a large combinatorial high-dimensional space of possible visualizations. However, directly incorporating existing diversification methods leads to a "process-first-diversify-next" approach, in which all possible data visualizations are generated first through executing a large number of aggregate queries. To address this challenge and minimize the incurred query processing costs, in this work we propose novel optimization techniques for the efficient diversification of recommended insightful visualizations. The key idea underlying our proposed techniques is to identify and eliminate the processing of a large number of low-utility insignificant visualizations. Meanwhile, for the potentially high-utility insightful visualizations, shared multi-query optimization techniques are proposed for further reduction in data processing cost. Our extensive experimental evaluation on real datasets demonstrates the performance gains provided by our proposed techniques, in terms of minimizing the query processing cost (i.e., efficiency), as well as maximizing the quality of recommendations (i.e., effectiveness).</description><subject>Aggregates</subject><subject>Automation</subject><subject>Combinatorial analysis</subject><subject>Costs</subject><subject>Data Exploration</subject><subject>Data processing</subject><subject>Data visualization</subject><subject>Diabetes</subject><subject>Diversification</subject><subject>Hospitals</subject><subject>Optimization</subject><subject>Optimization techniques</subject><subject>Queries</subject><subject>Query processing</subject><subject>Recommendation Diversification</subject><subject>Visual Analytics</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>DOA</sourceid><recordid>eNpNUV1LwzAULaLgmPsF-lDwuTNfbZrH0U03GAhOfQ1ZelMytmYmnaC_3mwdsvtyPzjn3Hs5SXKP0RhjJJ4mVTVbrcYEETqmpKQs51fJgOBCZDSnxfVFfZuMQtigGGUc5XyQzGfGWG2h7dKp_QYfbGxVZ12bGufTN9But4O2tm2TTprGQ6M6SKeqU-mnDQe1tb8ndLhLbozaBhid8zD5eJ69V_Ns-fqyqCbLTNNcdJkAMHWBieFlUSJRY8KM0YJByTniFOVCM4VrLXReFBRRRkktMDI8Z0DWhNBhsuh1a6c2cu_tTvkf6ZSVp4HzjVS-s3oLktRIlEYxBrlhCGBdx7cZ4QKxuH5dRK3HXmvv3dcBQic37uDbeL4kJSkFFQIfUbRHae9C8GD-t2Ikjw7I3gF5dECeHYish55lAeCCgRnjnNA_A32AqA</recordid><startdate>20230101</startdate><enddate>20230101</enddate><creator>Sharaf, Mohamed A.</creator><creator>Mafrur, Rischan</creator><creator>Zuccon, Guido</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-3405-5224</orcidid><orcidid>https://orcid.org/0000-0003-0271-5563</orcidid><orcidid>https://orcid.org/0000-0003-4424-3736</orcidid></search><sort><creationdate>20230101</creationdate><title>Efficient Diversification for Recommending Aggregate Data Visualizations</title><author>Sharaf, Mohamed A. ; Mafrur, Rischan ; Zuccon, Guido</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c359t-9eefd612f786809d124ffc94e877073059c4a1dc9c566303432d910f754e2b223</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Aggregates</topic><topic>Automation</topic><topic>Combinatorial analysis</topic><topic>Costs</topic><topic>Data Exploration</topic><topic>Data processing</topic><topic>Data visualization</topic><topic>Diabetes</topic><topic>Diversification</topic><topic>Hospitals</topic><topic>Optimization</topic><topic>Optimization techniques</topic><topic>Queries</topic><topic>Query processing</topic><topic>Recommendation Diversification</topic><topic>Visual Analytics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sharaf, Mohamed A.</creatorcontrib><creatorcontrib>Mafrur, Rischan</creatorcontrib><creatorcontrib>Zuccon, Guido</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Xplore Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sharaf, Mohamed A.</au><au>Mafrur, Rischan</au><au>Zuccon, Guido</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Efficient Diversification for Recommending Aggregate Data Visualizations</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2023-01-01</date><risdate>2023</risdate><volume>11</volume><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Visual data exploration is ubiquitous in nearly every industry and organization to support discovering data-driven actionable insights. However, unlocking those insights requires analysts to manually construct a prohibitively large number of aggregate queries and visually explore their results looking for those valuable and insightful visualizations. Such challenge naturally motivated the development of novel solutions that automate the visual explorations process, and recommend to analysts those particular queries that best visualize their data and reveal interesting actionable insights. In such automated solutions, there is a clear need for providing analysts with a diversified and concise set of recommended visualizations, which cover and represent a large combinatorial high-dimensional space of possible visualizations. However, directly incorporating existing diversification methods leads to a "process-first-diversify-next" approach, in which all possible data visualizations are generated first through executing a large number of aggregate queries. To address this challenge and minimize the incurred query processing costs, in this work we propose novel optimization techniques for the efficient diversification of recommended insightful visualizations. The key idea underlying our proposed techniques is to identify and eliminate the processing of a large number of low-utility insignificant visualizations. Meanwhile, for the potentially high-utility insightful visualizations, shared multi-query optimization techniques are proposed for further reduction in data processing cost. Our extensive experimental evaluation on real datasets demonstrates the performance gains provided by our proposed techniques, in terms of minimizing the query processing cost (i.e., efficiency), as well as maximizing the quality of recommendations (i.e., effectiveness).</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2023.3283457</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-3405-5224</orcidid><orcidid>https://orcid.org/0000-0003-0271-5563</orcidid><orcidid>https://orcid.org/0000-0003-4424-3736</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2023-01, Vol.11, p.1-1
issn 2169-3536
2169-3536
language eng
recordid cdi_crossref_primary_10_1109_ACCESS_2023_3283457
source IEEE Xplore Open Access Journals
subjects Aggregates
Automation
Combinatorial analysis
Costs
Data Exploration
Data processing
Data visualization
Diabetes
Diversification
Hospitals
Optimization
Optimization techniques
Queries
Query processing
Recommendation Diversification
Visual Analytics
title Efficient Diversification for Recommending Aggregate Data Visualizations
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T13%3A01%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Efficient%20Diversification%20for%20Recommending%20Aggregate%20Data%20Visualizations&rft.jtitle=IEEE%20access&rft.au=Sharaf,%20Mohamed%20A.&rft.date=2023-01-01&rft.volume=11&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2023.3283457&rft_dat=%3Cproquest_cross%3E2828939916%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c359t-9eefd612f786809d124ffc94e877073059c4a1dc9c566303432d910f754e2b223%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2828939916&rft_id=info:pmid/&rft_ieee_id=10144772&rfr_iscdi=true