Loading…
Advanced pruning strategies to speed up mining closed molecular fragments
In years, several algorithms for mining frequent subgraphs in graph databases have been proposed, with a major application area being the discovery of frequent substructures of biomolecules. Unfortunately, most of these algorithms still struggle with fairly long execution times if larger substructur...
Saved in:
Main Authors: | , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | 4570 vol.5 |
container_issue | |
container_start_page | 4565 |
container_title | |
container_volume | 5 |
creator | Borgelt, C. Meinl, T. Berthold, M.R. |
description | In years, several algorithms for mining frequent subgraphs in graph databases have been proposed, with a major application area being the discovery of frequent substructures of biomolecules. Unfortunately, most of these algorithms still struggle with fairly long execution times if larger substructures or molecular fragments are desired. We describe two advanced pruning strategies - equivalent sibling pruning and perfect extension pruning - that can be used to speed up the MoFa algorithm (introduced in C. Borgelt and M.R. Berthold, (2002)) in the search for closed molecular fragments, as we demonstrate with experiments on the NCI's HIV database. |
doi_str_mv | 10.1109/ICSMC.2004.1401251 |
format | conference_proceeding |
fullrecord | <record><control><sourceid>pascalfrancis_CHZPO</sourceid><recordid>TN_cdi_pascalfrancis_primary_17524164</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1401251</ieee_id><sourcerecordid>17524164</sourcerecordid><originalsourceid>FETCH-LOGICAL-i249t-1d3e7103548bbdba0077510095a665fd8b727e8b0d4e309aaeedb99356e17f1f3</originalsourceid><addsrcrecordid>eNpFkElLxEAUhBsXcGb0D-glF48Z3-s1fRyCy8CIBxW8DZ3kJbRkI50I_nuDETwV1FdUQTF2jbBFBHu3T1-f0y0HkFuUgFzhCVtxZUyMWqlTtgaTgEiU1uaMrRA0jy3nHxdsHcInAAeJyYrtd8WXa3Mqon6YWt9WURgHN1LlKURjF4WeZjb1UeN_aV53YTaarqZ8qt0QlYOrGmrHcMnOS1cHuvrTDXt_uH9Ln-LDy-M-3R1iz6UdYywEGQShZJJlReYAjFEIYJXTWpVFkhluKMmgkCTAOjfvZ9YKpQlNiaXYsNult3chd_W83-Y-HPvBN274PqJRXKKWc-5myXki-sfLVeIHcsRbfw</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Advanced pruning strategies to speed up mining closed molecular fragments</title><source>IEEE Xplore All Conference Series</source><creator>Borgelt, C. ; Meinl, T. ; Berthold, M.R.</creator><creatorcontrib>Borgelt, C. ; Meinl, T. ; Berthold, M.R.</creatorcontrib><description>In years, several algorithms for mining frequent subgraphs in graph databases have been proposed, with a major application area being the discovery of frequent substructures of biomolecules. Unfortunately, most of these algorithms still struggle with fairly long execution times if larger substructures or molecular fragments are desired. We describe two advanced pruning strategies - equivalent sibling pruning and perfect extension pruning - that can be used to speed up the MoFa algorithm (introduced in C. Borgelt and M.R. Berthold, (2002)) in the search for closed molecular fragments, as we demonstrate with experiments on the NCI's HIV database.</description><identifier>ISSN: 1062-922X</identifier><identifier>ISBN: 0780385667</identifier><identifier>ISBN: 9780780385665</identifier><identifier>EISSN: 2577-1655</identifier><identifier>DOI: 10.1109/ICSMC.2004.1401251</identifier><language>eng</language><publisher>Piscataway NJ: IEEE</publisher><subject>Applied sciences ; Biochemistry ; Computer science ; Computer science; control theory; systems ; Control theory. Systems ; Data mining ; Drugs ; Ear ; Exact sciences and technology ; Human immunodeficiency virus ; Hydrogen ; Information science ; Protection ; Tree graphs</subject><ispartof>2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583), 2004, Vol.5, p.4565-4570 vol.5</ispartof><rights>2006 INIST-CNRS</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1401251$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54555,54920,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1401251$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17524164$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Borgelt, C.</creatorcontrib><creatorcontrib>Meinl, T.</creatorcontrib><creatorcontrib>Berthold, M.R.</creatorcontrib><title>Advanced pruning strategies to speed up mining closed molecular fragments</title><title>2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583)</title><addtitle>ICSMC</addtitle><description>In years, several algorithms for mining frequent subgraphs in graph databases have been proposed, with a major application area being the discovery of frequent substructures of biomolecules. Unfortunately, most of these algorithms still struggle with fairly long execution times if larger substructures or molecular fragments are desired. We describe two advanced pruning strategies - equivalent sibling pruning and perfect extension pruning - that can be used to speed up the MoFa algorithm (introduced in C. Borgelt and M.R. Berthold, (2002)) in the search for closed molecular fragments, as we demonstrate with experiments on the NCI's HIV database.</description><subject>Applied sciences</subject><subject>Biochemistry</subject><subject>Computer science</subject><subject>Computer science; control theory; systems</subject><subject>Control theory. Systems</subject><subject>Data mining</subject><subject>Drugs</subject><subject>Ear</subject><subject>Exact sciences and technology</subject><subject>Human immunodeficiency virus</subject><subject>Hydrogen</subject><subject>Information science</subject><subject>Protection</subject><subject>Tree graphs</subject><issn>1062-922X</issn><issn>2577-1655</issn><isbn>0780385667</isbn><isbn>9780780385665</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2004</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNpFkElLxEAUhBsXcGb0D-glF48Z3-s1fRyCy8CIBxW8DZ3kJbRkI50I_nuDETwV1FdUQTF2jbBFBHu3T1-f0y0HkFuUgFzhCVtxZUyMWqlTtgaTgEiU1uaMrRA0jy3nHxdsHcInAAeJyYrtd8WXa3Mqon6YWt9WURgHN1LlKURjF4WeZjb1UeN_aV53YTaarqZ8qt0QlYOrGmrHcMnOS1cHuvrTDXt_uH9Ln-LDy-M-3R1iz6UdYywEGQShZJJlReYAjFEIYJXTWpVFkhluKMmgkCTAOjfvZ9YKpQlNiaXYsNult3chd_W83-Y-HPvBN274PqJRXKKWc-5myXki-sfLVeIHcsRbfw</recordid><startdate>2004</startdate><enddate>2004</enddate><creator>Borgelt, C.</creator><creator>Meinl, T.</creator><creator>Berthold, M.R.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope><scope>IQODW</scope></search><sort><creationdate>2004</creationdate><title>Advanced pruning strategies to speed up mining closed molecular fragments</title><author>Borgelt, C. ; Meinl, T. ; Berthold, M.R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i249t-1d3e7103548bbdba0077510095a665fd8b727e8b0d4e309aaeedb99356e17f1f3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Applied sciences</topic><topic>Biochemistry</topic><topic>Computer science</topic><topic>Computer science; control theory; systems</topic><topic>Control theory. Systems</topic><topic>Data mining</topic><topic>Drugs</topic><topic>Ear</topic><topic>Exact sciences and technology</topic><topic>Human immunodeficiency virus</topic><topic>Hydrogen</topic><topic>Information science</topic><topic>Protection</topic><topic>Tree graphs</topic><toplevel>online_resources</toplevel><creatorcontrib>Borgelt, C.</creatorcontrib><creatorcontrib>Meinl, T.</creatorcontrib><creatorcontrib>Berthold, M.R.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library Online</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection><collection>Pascal-Francis</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Borgelt, C.</au><au>Meinl, T.</au><au>Berthold, M.R.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Advanced pruning strategies to speed up mining closed molecular fragments</atitle><btitle>2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583)</btitle><stitle>ICSMC</stitle><date>2004</date><risdate>2004</risdate><volume>5</volume><spage>4565</spage><epage>4570 vol.5</epage><pages>4565-4570 vol.5</pages><issn>1062-922X</issn><eissn>2577-1655</eissn><isbn>0780385667</isbn><isbn>9780780385665</isbn><abstract>In years, several algorithms for mining frequent subgraphs in graph databases have been proposed, with a major application area being the discovery of frequent substructures of biomolecules. Unfortunately, most of these algorithms still struggle with fairly long execution times if larger substructures or molecular fragments are desired. We describe two advanced pruning strategies - equivalent sibling pruning and perfect extension pruning - that can be used to speed up the MoFa algorithm (introduced in C. Borgelt and M.R. Berthold, (2002)) in the search for closed molecular fragments, as we demonstrate with experiments on the NCI's HIV database.</abstract><cop>Piscataway NJ</cop><pub>IEEE</pub><doi>10.1109/ICSMC.2004.1401251</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1062-922X |
ispartof | 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583), 2004, Vol.5, p.4565-4570 vol.5 |
issn | 1062-922X 2577-1655 |
language | eng |
recordid | cdi_pascalfrancis_primary_17524164 |
source | IEEE Xplore All Conference Series |
subjects | Applied sciences Biochemistry Computer science Computer science control theory systems Control theory. Systems Data mining Drugs Ear Exact sciences and technology Human immunodeficiency virus Hydrogen Information science Protection Tree graphs |
title | Advanced pruning strategies to speed up mining closed molecular fragments |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T18%3A31%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Advanced%20pruning%20strategies%20to%20speed%20up%20mining%20closed%20molecular%20fragments&rft.btitle=2004%20IEEE%20International%20Conference%20on%20Systems,%20Man%20and%20Cybernetics%20(IEEE%20Cat.%20No.04CH37583)&rft.au=Borgelt,%20C.&rft.date=2004&rft.volume=5&rft.spage=4565&rft.epage=4570%20vol.5&rft.pages=4565-4570%20vol.5&rft.issn=1062-922X&rft.eissn=2577-1655&rft.isbn=0780385667&rft.isbn_list=9780780385665&rft_id=info:doi/10.1109/ICSMC.2004.1401251&rft_dat=%3Cpascalfrancis_CHZPO%3E17524164%3C/pascalfrancis_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i249t-1d3e7103548bbdba0077510095a665fd8b727e8b0d4e309aaeedb99356e17f1f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=1401251&rfr_iscdi=true |