Loading…
Algorithms for Discovery of Multiple Markov Boundaries
Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have...
Saved in:
Published in: | Journal of machine learning research 2013-02, Vol.14, p.499-566 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | 566 |
container_issue | |
container_start_page | 499 |
container_title | Journal of machine learning research |
container_volume | 14 |
creator | Statnikov, Alexander Lytkin, Nikita I Lemeire, Jan Aliferis, Constantin F |
description | Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have been proposed to identify a single Markov boundary of the response variable. Even though faithful distributions and, more broadly, distributions that satisfy the intersection property always have a single Markov boundary, other distributions/data sets may have multiple Markov boundaries of the response variable. The latter distributions/data sets are common in practical data-analytic applications, and there are several reasons why it is important to induce multiple Markov boundaries from such data. However, there are currently no sound and efficient algorithms that can accomplish this task. This paper describes a family of algorithms TIE* that can discover all Markov boundaries in a distribution. The broad applicability as well as efficiency of the new algorithmic family is demonstrated in an extensive benchmarking study that involved comparison with 26 state-of-the-art algorithms/variants in 15 data sets from a diversity of application domains. |
format | article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4184048</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1826603167</sourcerecordid><originalsourceid>FETCH-LOGICAL-p266t-5c243129b464d6230b4957dcf86f6b30e66ab99e82f86fe006e27edf3a1550ab3</originalsourceid><addsrcrecordid>eNpVkM1OwzAQhC0EoqXwCihHLpH8H-eCVAoFpFZc4Gw5yaY1OHGwk0p9ewIUBKcd7a6-md0jNCWCsTTLqTr-0jTlnIkJOovxFWOSCSpP0YQKqgQWdIrk3G18sP22iUntQ3JrY-l3EPaJr5P14HrbOUjWJrz5XXLjh7YywUI8Rye1cREuDnWGXpZ3z4uHdPV0_7iYr9KOStmnoqScEZoXXPJKUoYLnousKmsla1kwDFKaIs9B0c8OYCyBZlDVzBAhsCnYDF1_c7uhaKAqoe2DcboLtjFhr72x-v-ktVu98TvNieKYqxFwdQAE_z5A7HUzXgjOmRb8EDVRY1DMiMzG1cu_Xr8mP89iH5_WZ0k</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1826603167</pqid></control><display><type>article</type><title>Algorithms for Discovery of Multiple Markov Boundaries</title><source>Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list)</source><creator>Statnikov, Alexander ; Lytkin, Nikita I ; Lemeire, Jan ; Aliferis, Constantin F</creator><creatorcontrib>Statnikov, Alexander ; Lytkin, Nikita I ; Lemeire, Jan ; Aliferis, Constantin F</creatorcontrib><description>Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have been proposed to identify a single Markov boundary of the response variable. Even though faithful distributions and, more broadly, distributions that satisfy the intersection property always have a single Markov boundary, other distributions/data sets may have multiple Markov boundaries of the response variable. The latter distributions/data sets are common in practical data-analytic applications, and there are several reasons why it is important to induce multiple Markov boundaries from such data. However, there are currently no sound and efficient algorithms that can accomplish this task. This paper describes a family of algorithms TIE* that can discover all Markov boundaries in a distribution. The broad applicability as well as efficiency of the new algorithmic family is demonstrated in an extensive benchmarking study that involved comparison with 26 state-of-the-art algorithms/variants in 15 data sets from a diversity of application domains.</description><identifier>ISSN: 1532-4435</identifier><identifier>EISSN: 1533-7928</identifier><identifier>PMID: 25285052</identifier><language>eng</language><publisher>United States</publisher><ispartof>Journal of machine learning research, 2013-02, Vol.14, p.499-566</ispartof><rights>2013 Alexander Statnikov, Nikita I. Lytkin, Jan Lemeire and Constantin F. Aliferis 2013</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/25285052$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Statnikov, Alexander</creatorcontrib><creatorcontrib>Lytkin, Nikita I</creatorcontrib><creatorcontrib>Lemeire, Jan</creatorcontrib><creatorcontrib>Aliferis, Constantin F</creatorcontrib><title>Algorithms for Discovery of Multiple Markov Boundaries</title><title>Journal of machine learning research</title><addtitle>J Mach Learn Res</addtitle><description>Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have been proposed to identify a single Markov boundary of the response variable. Even though faithful distributions and, more broadly, distributions that satisfy the intersection property always have a single Markov boundary, other distributions/data sets may have multiple Markov boundaries of the response variable. The latter distributions/data sets are common in practical data-analytic applications, and there are several reasons why it is important to induce multiple Markov boundaries from such data. However, there are currently no sound and efficient algorithms that can accomplish this task. This paper describes a family of algorithms TIE* that can discover all Markov boundaries in a distribution. The broad applicability as well as efficiency of the new algorithmic family is demonstrated in an extensive benchmarking study that involved comparison with 26 state-of-the-art algorithms/variants in 15 data sets from a diversity of application domains.</description><issn>1532-4435</issn><issn>1533-7928</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><recordid>eNpVkM1OwzAQhC0EoqXwCihHLpH8H-eCVAoFpFZc4Gw5yaY1OHGwk0p9ewIUBKcd7a6-md0jNCWCsTTLqTr-0jTlnIkJOovxFWOSCSpP0YQKqgQWdIrk3G18sP22iUntQ3JrY-l3EPaJr5P14HrbOUjWJrz5XXLjh7YywUI8Rye1cREuDnWGXpZ3z4uHdPV0_7iYr9KOStmnoqScEZoXXPJKUoYLnousKmsla1kwDFKaIs9B0c8OYCyBZlDVzBAhsCnYDF1_c7uhaKAqoe2DcboLtjFhr72x-v-ktVu98TvNieKYqxFwdQAE_z5A7HUzXgjOmRb8EDVRY1DMiMzG1cu_Xr8mP89iH5_WZ0k</recordid><startdate>201302</startdate><enddate>201302</enddate><creator>Statnikov, Alexander</creator><creator>Lytkin, Nikita I</creator><creator>Lemeire, Jan</creator><creator>Aliferis, Constantin F</creator><scope>NPM</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>201302</creationdate><title>Algorithms for Discovery of Multiple Markov Boundaries</title><author>Statnikov, Alexander ; Lytkin, Nikita I ; Lemeire, Jan ; Aliferis, Constantin F</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p266t-5c243129b464d6230b4957dcf86f6b30e66ab99e82f86fe006e27edf3a1550ab3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Statnikov, Alexander</creatorcontrib><creatorcontrib>Lytkin, Nikita I</creatorcontrib><creatorcontrib>Lemeire, Jan</creatorcontrib><creatorcontrib>Aliferis, Constantin F</creatorcontrib><collection>PubMed</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of machine learning research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Statnikov, Alexander</au><au>Lytkin, Nikita I</au><au>Lemeire, Jan</au><au>Aliferis, Constantin F</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Algorithms for Discovery of Multiple Markov Boundaries</atitle><jtitle>Journal of machine learning research</jtitle><addtitle>J Mach Learn Res</addtitle><date>2013-02</date><risdate>2013</risdate><volume>14</volume><spage>499</spage><epage>566</epage><pages>499-566</pages><issn>1532-4435</issn><eissn>1533-7928</eissn><abstract>Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have been proposed to identify a single Markov boundary of the response variable. Even though faithful distributions and, more broadly, distributions that satisfy the intersection property always have a single Markov boundary, other distributions/data sets may have multiple Markov boundaries of the response variable. The latter distributions/data sets are common in practical data-analytic applications, and there are several reasons why it is important to induce multiple Markov boundaries from such data. However, there are currently no sound and efficient algorithms that can accomplish this task. This paper describes a family of algorithms TIE* that can discover all Markov boundaries in a distribution. The broad applicability as well as efficiency of the new algorithmic family is demonstrated in an extensive benchmarking study that involved comparison with 26 state-of-the-art algorithms/variants in 15 data sets from a diversity of application domains.</abstract><cop>United States</cop><pmid>25285052</pmid><tpages>68</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1532-4435 |
ispartof | Journal of machine learning research, 2013-02, Vol.14, p.499-566 |
issn | 1532-4435 1533-7928 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4184048 |
source | Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list) |
title | Algorithms for Discovery of Multiple Markov Boundaries |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T02%3A31%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Algorithms%20for%20Discovery%20of%20Multiple%20Markov%20Boundaries&rft.jtitle=Journal%20of%20machine%20learning%20research&rft.au=Statnikov,%20Alexander&rft.date=2013-02&rft.volume=14&rft.spage=499&rft.epage=566&rft.pages=499-566&rft.issn=1532-4435&rft.eissn=1533-7928&rft_id=info:doi/&rft_dat=%3Cproquest_pubme%3E1826603167%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-p266t-5c243129b464d6230b4957dcf86f6b30e66ab99e82f86fe006e27edf3a1550ab3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1826603167&rft_id=info:pmid/25285052&rfr_iscdi=true |