Loading…
Efficient mining high average-utility itemsets with effective pruning strategies and novel list structure
High utility itemset mining can mine all itemsets that meet the minimum utility threshold set by the decision maker, thus becomes a popular and prominent data-mining technique. High average utility itemset mining(HAUIM) can determine the desired pattern by considering the utility and length of items...
Saved in:
Published in: | Applied intelligence (Dordrecht, Netherlands) Netherlands), 2023-03, Vol.53 (5), p.6099-6118 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c249t-c45a956d8b7f4f25b1e2befc4e61cd9ce9a7ed652c86180555ca0607b6c755dd3 |
---|---|
cites | cdi_FETCH-LOGICAL-c249t-c45a956d8b7f4f25b1e2befc4e61cd9ce9a7ed652c86180555ca0607b6c755dd3 |
container_end_page | 6118 |
container_issue | 5 |
container_start_page | 6099 |
container_title | Applied intelligence (Dordrecht, Netherlands) |
container_volume | 53 |
creator | Li, Gufeng Shang, Tao Zhang, Yinling |
description | High utility itemset mining can mine all itemsets that meet the minimum utility threshold set by the decision maker, thus becomes a popular and prominent data-mining technique. High average utility itemset mining(HAUIM) can determine the desired pattern by considering the utility and length of itemset, which is a fairer alternative measurement. Recently, several algorithms use various utility-list structures and upper bounds to improve the pruning methods. However, these algorithms generated too many unpromising candidates, since they overestimated the average-utility of itemsets too much. This paper designs a novel utility list structure and presents an efficient upper-bound model for improving the performance of HAUIM methods. This list structure captures all key feature informations to estimate the tighter upper-bound of itemset average-utility, it also can be used to calculate the actual itemset average-utility. For avoiding the processing of unpromising candidates, this pruning strategy utilizes the tighter upper-bound. Thus it reduces the number of join operations greatly compared with the state-of-art HAUIM methods. Various experiments were performed by several benchmark datasets to measure the performances of proposed algorithm. The experimental results show that the proposed algorithm has runtime, memory consumption, number of join operations, and scalability performances superior to those of existing algorithms. |
doi_str_mv | 10.1007/s10489-022-03722-x |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2778133081</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2778133081</sourcerecordid><originalsourceid>FETCH-LOGICAL-c249t-c45a956d8b7f4f25b1e2befc4e61cd9ce9a7ed652c86180555ca0607b6c755dd3</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhC0EEqXwApwscTbYTmwnR1SVH6kSF5C4WY6zTl2labGd0r49aYPEjcvsYeeb1Q5Ct4zeM0rVQ2Q0L0pCOSc0U4Puz9CECZURlZfqHE1oyXMiZfl5ia5iXFFKs4yyCfJz57z10CW89p3vGrz0zRKbHQTTAOmTb306YJ9gHSFF_O3TEoNzYJPfAd6G_gTFFEyCxkPEpqtxt9lBi1sf03HT29QHuEYXzrQRbn7nFH08zd9nL2Tx9vw6e1wQy_MyEZsLUwpZF5VyueOiYsArcDYHyWxdWiiNgloKbgvJCiqEsIZKqipplRB1nU3R3Zi7DZuvHmLSq00fuuGk5koVbPh7kCnio8uGTYwBnN4GvzbhoBnVx0r1WKkeKtWnSvV-gLIRioO5ayD8Rf9D_QBH830E</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2778133081</pqid></control><display><type>article</type><title>Efficient mining high average-utility itemsets with effective pruning strategies and novel list structure</title><source>ABI/INFORM Collection</source><source>Springer Link</source><creator>Li, Gufeng ; Shang, Tao ; Zhang, Yinling</creator><creatorcontrib>Li, Gufeng ; Shang, Tao ; Zhang, Yinling</creatorcontrib><description>High utility itemset mining can mine all itemsets that meet the minimum utility threshold set by the decision maker, thus becomes a popular and prominent data-mining technique. High average utility itemset mining(HAUIM) can determine the desired pattern by considering the utility and length of itemset, which is a fairer alternative measurement. Recently, several algorithms use various utility-list structures and upper bounds to improve the pruning methods. However, these algorithms generated too many unpromising candidates, since they overestimated the average-utility of itemsets too much. This paper designs a novel utility list structure and presents an efficient upper-bound model for improving the performance of HAUIM methods. This list structure captures all key feature informations to estimate the tighter upper-bound of itemset average-utility, it also can be used to calculate the actual itemset average-utility. For avoiding the processing of unpromising candidates, this pruning strategy utilizes the tighter upper-bound. Thus it reduces the number of join operations greatly compared with the state-of-art HAUIM methods. Various experiments were performed by several benchmark datasets to measure the performances of proposed algorithm. The experimental results show that the proposed algorithm has runtime, memory consumption, number of join operations, and scalability performances superior to those of existing algorithms.</description><identifier>ISSN: 0924-669X</identifier><identifier>EISSN: 1573-7497</identifier><identifier>DOI: 10.1007/s10489-022-03722-x</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Artificial Intelligence ; Computer Science ; Data mining ; Decision making ; Machines ; Manufacturing ; Mechanical Engineering ; Mining ; Processes ; Upper bounds</subject><ispartof>Applied intelligence (Dordrecht, Netherlands), 2023-03, Vol.53 (5), p.6099-6118</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022</rights><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c249t-c45a956d8b7f4f25b1e2befc4e61cd9ce9a7ed652c86180555ca0607b6c755dd3</citedby><cites>FETCH-LOGICAL-c249t-c45a956d8b7f4f25b1e2befc4e61cd9ce9a7ed652c86180555ca0607b6c755dd3</cites><orcidid>0000-0003-4877-5975</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2778133081/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2778133081?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,11687,27923,27924,36059,44362,74666</link.rule.ids></links><search><creatorcontrib>Li, Gufeng</creatorcontrib><creatorcontrib>Shang, Tao</creatorcontrib><creatorcontrib>Zhang, Yinling</creatorcontrib><title>Efficient mining high average-utility itemsets with effective pruning strategies and novel list structure</title><title>Applied intelligence (Dordrecht, Netherlands)</title><addtitle>Appl Intell</addtitle><description>High utility itemset mining can mine all itemsets that meet the minimum utility threshold set by the decision maker, thus becomes a popular and prominent data-mining technique. High average utility itemset mining(HAUIM) can determine the desired pattern by considering the utility and length of itemset, which is a fairer alternative measurement. Recently, several algorithms use various utility-list structures and upper bounds to improve the pruning methods. However, these algorithms generated too many unpromising candidates, since they overestimated the average-utility of itemsets too much. This paper designs a novel utility list structure and presents an efficient upper-bound model for improving the performance of HAUIM methods. This list structure captures all key feature informations to estimate the tighter upper-bound of itemset average-utility, it also can be used to calculate the actual itemset average-utility. For avoiding the processing of unpromising candidates, this pruning strategy utilizes the tighter upper-bound. Thus it reduces the number of join operations greatly compared with the state-of-art HAUIM methods. Various experiments were performed by several benchmark datasets to measure the performances of proposed algorithm. The experimental results show that the proposed algorithm has runtime, memory consumption, number of join operations, and scalability performances superior to those of existing algorithms.</description><subject>Algorithms</subject><subject>Artificial Intelligence</subject><subject>Computer Science</subject><subject>Data mining</subject><subject>Decision making</subject><subject>Machines</subject><subject>Manufacturing</subject><subject>Mechanical Engineering</subject><subject>Mining</subject><subject>Processes</subject><subject>Upper bounds</subject><issn>0924-669X</issn><issn>1573-7497</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>M0C</sourceid><recordid>eNp9kM1OwzAQhC0EEqXwApwscTbYTmwnR1SVH6kSF5C4WY6zTl2labGd0r49aYPEjcvsYeeb1Q5Ct4zeM0rVQ2Q0L0pCOSc0U4Puz9CECZURlZfqHE1oyXMiZfl5ia5iXFFKs4yyCfJz57z10CW89p3vGrz0zRKbHQTTAOmTb306YJ9gHSFF_O3TEoNzYJPfAd6G_gTFFEyCxkPEpqtxt9lBi1sf03HT29QHuEYXzrQRbn7nFH08zd9nL2Tx9vw6e1wQy_MyEZsLUwpZF5VyueOiYsArcDYHyWxdWiiNgloKbgvJCiqEsIZKqipplRB1nU3R3Zi7DZuvHmLSq00fuuGk5koVbPh7kCnio8uGTYwBnN4GvzbhoBnVx0r1WKkeKtWnSvV-gLIRioO5ayD8Rf9D_QBH830E</recordid><startdate>20230301</startdate><enddate>20230301</enddate><creator>Li, Gufeng</creator><creator>Shang, Tao</creator><creator>Zhang, Yinling</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PSYQQ</scope><scope>PTHSS</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0003-4877-5975</orcidid></search><sort><creationdate>20230301</creationdate><title>Efficient mining high average-utility itemsets with effective pruning strategies and novel list structure</title><author>Li, Gufeng ; Shang, Tao ; Zhang, Yinling</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c249t-c45a956d8b7f4f25b1e2befc4e61cd9ce9a7ed652c86180555ca0607b6c755dd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Artificial Intelligence</topic><topic>Computer Science</topic><topic>Data mining</topic><topic>Decision making</topic><topic>Machines</topic><topic>Manufacturing</topic><topic>Mechanical Engineering</topic><topic>Mining</topic><topic>Processes</topic><topic>Upper bounds</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Gufeng</creatorcontrib><creatorcontrib>Shang, Tao</creatorcontrib><creatorcontrib>Zhang, Yinling</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database</collection><collection>Engineering Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest One Psychology</collection><collection>Engineering Collection</collection><collection>ProQuest Central Basic</collection><jtitle>Applied intelligence (Dordrecht, Netherlands)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Gufeng</au><au>Shang, Tao</au><au>Zhang, Yinling</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Efficient mining high average-utility itemsets with effective pruning strategies and novel list structure</atitle><jtitle>Applied intelligence (Dordrecht, Netherlands)</jtitle><stitle>Appl Intell</stitle><date>2023-03-01</date><risdate>2023</risdate><volume>53</volume><issue>5</issue><spage>6099</spage><epage>6118</epage><pages>6099-6118</pages><issn>0924-669X</issn><eissn>1573-7497</eissn><abstract>High utility itemset mining can mine all itemsets that meet the minimum utility threshold set by the decision maker, thus becomes a popular and prominent data-mining technique. High average utility itemset mining(HAUIM) can determine the desired pattern by considering the utility and length of itemset, which is a fairer alternative measurement. Recently, several algorithms use various utility-list structures and upper bounds to improve the pruning methods. However, these algorithms generated too many unpromising candidates, since they overestimated the average-utility of itemsets too much. This paper designs a novel utility list structure and presents an efficient upper-bound model for improving the performance of HAUIM methods. This list structure captures all key feature informations to estimate the tighter upper-bound of itemset average-utility, it also can be used to calculate the actual itemset average-utility. For avoiding the processing of unpromising candidates, this pruning strategy utilizes the tighter upper-bound. Thus it reduces the number of join operations greatly compared with the state-of-art HAUIM methods. Various experiments were performed by several benchmark datasets to measure the performances of proposed algorithm. The experimental results show that the proposed algorithm has runtime, memory consumption, number of join operations, and scalability performances superior to those of existing algorithms.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10489-022-03722-x</doi><tpages>20</tpages><orcidid>https://orcid.org/0000-0003-4877-5975</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0924-669X |
ispartof | Applied intelligence (Dordrecht, Netherlands), 2023-03, Vol.53 (5), p.6099-6118 |
issn | 0924-669X 1573-7497 |
language | eng |
recordid | cdi_proquest_journals_2778133081 |
source | ABI/INFORM Collection; Springer Link |
subjects | Algorithms Artificial Intelligence Computer Science Data mining Decision making Machines Manufacturing Mechanical Engineering Mining Processes Upper bounds |
title | Efficient mining high average-utility itemsets with effective pruning strategies and novel list structure |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T00%3A58%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Efficient%20mining%20high%20average-utility%20itemsets%20with%20effective%20pruning%20strategies%20and%20novel%20list%20structure&rft.jtitle=Applied%20intelligence%20(Dordrecht,%20Netherlands)&rft.au=Li,%20Gufeng&rft.date=2023-03-01&rft.volume=53&rft.issue=5&rft.spage=6099&rft.epage=6118&rft.pages=6099-6118&rft.issn=0924-669X&rft.eissn=1573-7497&rft_id=info:doi/10.1007/s10489-022-03722-x&rft_dat=%3Cproquest_cross%3E2778133081%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c249t-c45a956d8b7f4f25b1e2befc4e61cd9ce9a7ed652c86180555ca0607b6c755dd3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2778133081&rft_id=info:pmid/&rfr_iscdi=true |