Loading…

Algorithms for Discovery of Multiple Markov Boundaries

Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have...

Full description

Saved in:
Bibliographic Details
Published in:Journal of machine learning research 2013-02, Vol.14, p.499-566
Main Authors: Statnikov, Alexander, Lytkin, Nikita I, Lemeire, Jan, Aliferis, Constantin F
Format: Article
Language:English
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 566
container_issue
container_start_page 499
container_title Journal of machine learning research
container_volume 14
creator Statnikov, Alexander
Lytkin, Nikita I
Lemeire, Jan
Aliferis, Constantin F
description Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have been proposed to identify a single Markov boundary of the response variable. Even though faithful distributions and, more broadly, distributions that satisfy the intersection property always have a single Markov boundary, other distributions/data sets may have multiple Markov boundaries of the response variable. The latter distributions/data sets are common in practical data-analytic applications, and there are several reasons why it is important to induce multiple Markov boundaries from such data. However, there are currently no sound and efficient algorithms that can accomplish this task. This paper describes a family of algorithms TIE* that can discover all Markov boundaries in a distribution. The broad applicability as well as efficiency of the new algorithmic family is demonstrated in an extensive benchmarking study that involved comparison with 26 state-of-the-art algorithms/variants in 15 data sets from a diversity of application domains.
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4184048</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1826603167</sourcerecordid><originalsourceid>FETCH-LOGICAL-p266t-5c243129b464d6230b4957dcf86f6b30e66ab99e82f86fe006e27edf3a1550ab3</originalsourceid><addsrcrecordid>eNpVkM1OwzAQhC0EoqXwCihHLpH8H-eCVAoFpFZc4Gw5yaY1OHGwk0p9ewIUBKcd7a6-md0jNCWCsTTLqTr-0jTlnIkJOovxFWOSCSpP0YQKqgQWdIrk3G18sP22iUntQ3JrY-l3EPaJr5P14HrbOUjWJrz5XXLjh7YywUI8Rye1cREuDnWGXpZ3z4uHdPV0_7iYr9KOStmnoqScEZoXXPJKUoYLnousKmsla1kwDFKaIs9B0c8OYCyBZlDVzBAhsCnYDF1_c7uhaKAqoe2DcboLtjFhr72x-v-ktVu98TvNieKYqxFwdQAE_z5A7HUzXgjOmRb8EDVRY1DMiMzG1cu_Xr8mP89iH5_WZ0k</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1826603167</pqid></control><display><type>article</type><title>Algorithms for Discovery of Multiple Markov Boundaries</title><source>Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list)</source><creator>Statnikov, Alexander ; Lytkin, Nikita I ; Lemeire, Jan ; Aliferis, Constantin F</creator><creatorcontrib>Statnikov, Alexander ; Lytkin, Nikita I ; Lemeire, Jan ; Aliferis, Constantin F</creatorcontrib><description>Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have been proposed to identify a single Markov boundary of the response variable. Even though faithful distributions and, more broadly, distributions that satisfy the intersection property always have a single Markov boundary, other distributions/data sets may have multiple Markov boundaries of the response variable. The latter distributions/data sets are common in practical data-analytic applications, and there are several reasons why it is important to induce multiple Markov boundaries from such data. However, there are currently no sound and efficient algorithms that can accomplish this task. This paper describes a family of algorithms TIE* that can discover all Markov boundaries in a distribution. The broad applicability as well as efficiency of the new algorithmic family is demonstrated in an extensive benchmarking study that involved comparison with 26 state-of-the-art algorithms/variants in 15 data sets from a diversity of application domains.</description><identifier>ISSN: 1532-4435</identifier><identifier>EISSN: 1533-7928</identifier><identifier>PMID: 25285052</identifier><language>eng</language><publisher>United States</publisher><ispartof>Journal of machine learning research, 2013-02, Vol.14, p.499-566</ispartof><rights>2013 Alexander Statnikov, Nikita I. Lytkin, Jan Lemeire and Constantin F. Aliferis 2013</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/25285052$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Statnikov, Alexander</creatorcontrib><creatorcontrib>Lytkin, Nikita I</creatorcontrib><creatorcontrib>Lemeire, Jan</creatorcontrib><creatorcontrib>Aliferis, Constantin F</creatorcontrib><title>Algorithms for Discovery of Multiple Markov Boundaries</title><title>Journal of machine learning research</title><addtitle>J Mach Learn Res</addtitle><description>Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have been proposed to identify a single Markov boundary of the response variable. Even though faithful distributions and, more broadly, distributions that satisfy the intersection property always have a single Markov boundary, other distributions/data sets may have multiple Markov boundaries of the response variable. The latter distributions/data sets are common in practical data-analytic applications, and there are several reasons why it is important to induce multiple Markov boundaries from such data. However, there are currently no sound and efficient algorithms that can accomplish this task. This paper describes a family of algorithms TIE* that can discover all Markov boundaries in a distribution. The broad applicability as well as efficiency of the new algorithmic family is demonstrated in an extensive benchmarking study that involved comparison with 26 state-of-the-art algorithms/variants in 15 data sets from a diversity of application domains.</description><issn>1532-4435</issn><issn>1533-7928</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><recordid>eNpVkM1OwzAQhC0EoqXwCihHLpH8H-eCVAoFpFZc4Gw5yaY1OHGwk0p9ewIUBKcd7a6-md0jNCWCsTTLqTr-0jTlnIkJOovxFWOSCSpP0YQKqgQWdIrk3G18sP22iUntQ3JrY-l3EPaJr5P14HrbOUjWJrz5XXLjh7YywUI8Rye1cREuDnWGXpZ3z4uHdPV0_7iYr9KOStmnoqScEZoXXPJKUoYLnousKmsla1kwDFKaIs9B0c8OYCyBZlDVzBAhsCnYDF1_c7uhaKAqoe2DcboLtjFhr72x-v-ktVu98TvNieKYqxFwdQAE_z5A7HUzXgjOmRb8EDVRY1DMiMzG1cu_Xr8mP89iH5_WZ0k</recordid><startdate>201302</startdate><enddate>201302</enddate><creator>Statnikov, Alexander</creator><creator>Lytkin, Nikita I</creator><creator>Lemeire, Jan</creator><creator>Aliferis, Constantin F</creator><scope>NPM</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>201302</creationdate><title>Algorithms for Discovery of Multiple Markov Boundaries</title><author>Statnikov, Alexander ; Lytkin, Nikita I ; Lemeire, Jan ; Aliferis, Constantin F</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p266t-5c243129b464d6230b4957dcf86f6b30e66ab99e82f86fe006e27edf3a1550ab3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Statnikov, Alexander</creatorcontrib><creatorcontrib>Lytkin, Nikita I</creatorcontrib><creatorcontrib>Lemeire, Jan</creatorcontrib><creatorcontrib>Aliferis, Constantin F</creatorcontrib><collection>PubMed</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of machine learning research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Statnikov, Alexander</au><au>Lytkin, Nikita I</au><au>Lemeire, Jan</au><au>Aliferis, Constantin F</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Algorithms for Discovery of Multiple Markov Boundaries</atitle><jtitle>Journal of machine learning research</jtitle><addtitle>J Mach Learn Res</addtitle><date>2013-02</date><risdate>2013</risdate><volume>14</volume><spage>499</spage><epage>566</epage><pages>499-566</pages><issn>1532-4435</issn><eissn>1533-7928</eissn><abstract>Algorithms for Markov boundary discovery from data constitute an important recent development in machine learning, primarily because they offer a principled solution to the variable/feature selection problem and give insight on local causal structure. Over the last decade many sound algorithms have been proposed to identify a single Markov boundary of the response variable. Even though faithful distributions and, more broadly, distributions that satisfy the intersection property always have a single Markov boundary, other distributions/data sets may have multiple Markov boundaries of the response variable. The latter distributions/data sets are common in practical data-analytic applications, and there are several reasons why it is important to induce multiple Markov boundaries from such data. However, there are currently no sound and efficient algorithms that can accomplish this task. This paper describes a family of algorithms TIE* that can discover all Markov boundaries in a distribution. The broad applicability as well as efficiency of the new algorithmic family is demonstrated in an extensive benchmarking study that involved comparison with 26 state-of-the-art algorithms/variants in 15 data sets from a diversity of application domains.</abstract><cop>United States</cop><pmid>25285052</pmid><tpages>68</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1532-4435
ispartof Journal of machine learning research, 2013-02, Vol.14, p.499-566
issn 1532-4435
1533-7928
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4184048
source Association for Computing Machinery:Jisc Collections:ACM OPEN Journals 2023-2025 (reading list)
title Algorithms for Discovery of Multiple Markov Boundaries
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T02%3A31%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Algorithms%20for%20Discovery%20of%20Multiple%20Markov%20Boundaries&rft.jtitle=Journal%20of%20machine%20learning%20research&rft.au=Statnikov,%20Alexander&rft.date=2013-02&rft.volume=14&rft.spage=499&rft.epage=566&rft.pages=499-566&rft.issn=1532-4435&rft.eissn=1533-7928&rft_id=info:doi/&rft_dat=%3Cproquest_pubme%3E1826603167%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-p266t-5c243129b464d6230b4957dcf86f6b30e66ab99e82f86fe006e27edf3a1550ab3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1826603167&rft_id=info:pmid/25285052&rfr_iscdi=true