Loading…

Classifying real-world data with the DDα-procedure

The D D α -classifier, a nonparametric fast and very robust procedure, is described and applied to fifty classification problems regarding a broad spectrum of real-world data. The procedure first transforms the data from their original property space into a depth space, which is a low-dimensional un...

Full description

Saved in:
Bibliographic Details
Published in:Advances in data analysis and classification 2015-09, Vol.9 (3), p.287-314
Main Authors: Mozharovskyi, Pavlo, Mosler, Karl, Lange, Tatjana
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 314
container_issue 3
container_start_page 287
container_title Advances in data analysis and classification
container_volume 9
creator Mozharovskyi, Pavlo
Mosler, Karl
Lange, Tatjana
description The D D α -classifier, a nonparametric fast and very robust procedure, is described and applied to fifty classification problems regarding a broad spectrum of real-world data. The procedure first transforms the data from their original property space into a depth space, which is a low-dimensional unit cube, and then separates them by a projective invariant procedure, called α -procedure. To each data point the transformation assigns its depth values with respect to the given classes. Several alternative depth notions (spatial depth, Mahalanobis depth, projection depth, and Tukey depth, the latter two being approximated by univariate projections) are used in the procedure, and compared regarding their average error rates. With the Tukey depth, which fits the distributions’ shape best and is most robust, ‘outsiders’, that is data points having zero depth in all classes, appear. They need an additional treatment for classification. Evidence is also given about the dimension of the extended feature space needed for linear separation. The D D α -procedure is available as an R-package.
doi_str_mv 10.1007/s11634-014-0180-8
format article
fullrecord <record><control><sourceid>springer</sourceid><recordid>TN_cdi_springer_journals_10_1007_s11634_014_0180_8</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1007_s11634_014_0180_8</sourcerecordid><originalsourceid>FETCH-LOGICAL-s198t-f774785134bb00c362240a1c45f069401ae580567386ffee0b2431e4cca816053</originalsourceid><addsrcrecordid>eNo9j0tOwzAYhC0EEqVwAHa-gOH__e4SpbykSmxgbTmOTVNFTWWnqnqsXoQzkaiIxWhmNTMfIfcIDwhgHguiFpIBTrLA7AWZodWcKaHU5X-W5prclLIB0CBBzYioOl9Km47t9pvm6Dt26HPX0MYPnh7aYU2HdaTL5c-J7XIfYrPP8ZZcJd-VePfnc_L18vxZvbHVx-t79bRiBRd2YMkYaaxCIesaIAjNuQSPQaoEeiEBfVQWlDbC6pRihJpLgVGG4C1qUGJO-Lm37PJ4L2a36fd5O046BDdhuzO2G7HdhO2s-AXoQUnI</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Classifying real-world data with the DDα-procedure</title><source>Springer Nature</source><creator>Mozharovskyi, Pavlo ; Mosler, Karl ; Lange, Tatjana</creator><creatorcontrib>Mozharovskyi, Pavlo ; Mosler, Karl ; Lange, Tatjana</creatorcontrib><description>The D D α -classifier, a nonparametric fast and very robust procedure, is described and applied to fifty classification problems regarding a broad spectrum of real-world data. The procedure first transforms the data from their original property space into a depth space, which is a low-dimensional unit cube, and then separates them by a projective invariant procedure, called α -procedure. To each data point the transformation assigns its depth values with respect to the given classes. Several alternative depth notions (spatial depth, Mahalanobis depth, projection depth, and Tukey depth, the latter two being approximated by univariate projections) are used in the procedure, and compared regarding their average error rates. With the Tukey depth, which fits the distributions’ shape best and is most robust, ‘outsiders’, that is data points having zero depth in all classes, appear. They need an additional treatment for classification. Evidence is also given about the dimension of the extended feature space needed for linear separation. The D D α -procedure is available as an R-package.</description><identifier>ISSN: 1862-5347</identifier><identifier>EISSN: 1862-5355</identifier><identifier>DOI: 10.1007/s11634-014-0180-8</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Chemistry and Earth Sciences ; Computer Science ; Data Mining and Knowledge Discovery ; Economics ; Finance ; Health Sciences ; Humanities ; Insurance ; Law ; Management ; Mathematics and Statistics ; Medicine ; Physics ; Regular Article ; Statistical Theory and Methods ; Statistics ; Statistics for Business ; Statistics for Engineering ; Statistics for Life Sciences ; Statistics for Social Sciences</subject><ispartof>Advances in data analysis and classification, 2015-09, Vol.9 (3), p.287-314</ispartof><rights>Springer-Verlag Berlin Heidelberg 2014</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Mozharovskyi, Pavlo</creatorcontrib><creatorcontrib>Mosler, Karl</creatorcontrib><creatorcontrib>Lange, Tatjana</creatorcontrib><title>Classifying real-world data with the DDα-procedure</title><title>Advances in data analysis and classification</title><addtitle>Adv Data Anal Classif</addtitle><description>The D D α -classifier, a nonparametric fast and very robust procedure, is described and applied to fifty classification problems regarding a broad spectrum of real-world data. The procedure first transforms the data from their original property space into a depth space, which is a low-dimensional unit cube, and then separates them by a projective invariant procedure, called α -procedure. To each data point the transformation assigns its depth values with respect to the given classes. Several alternative depth notions (spatial depth, Mahalanobis depth, projection depth, and Tukey depth, the latter two being approximated by univariate projections) are used in the procedure, and compared regarding their average error rates. With the Tukey depth, which fits the distributions’ shape best and is most robust, ‘outsiders’, that is data points having zero depth in all classes, appear. They need an additional treatment for classification. Evidence is also given about the dimension of the extended feature space needed for linear separation. The D D α -procedure is available as an R-package.</description><subject>Chemistry and Earth Sciences</subject><subject>Computer Science</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Economics</subject><subject>Finance</subject><subject>Health Sciences</subject><subject>Humanities</subject><subject>Insurance</subject><subject>Law</subject><subject>Management</subject><subject>Mathematics and Statistics</subject><subject>Medicine</subject><subject>Physics</subject><subject>Regular Article</subject><subject>Statistical Theory and Methods</subject><subject>Statistics</subject><subject>Statistics for Business</subject><subject>Statistics for Engineering</subject><subject>Statistics for Life Sciences</subject><subject>Statistics for Social Sciences</subject><issn>1862-5347</issn><issn>1862-5355</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid/><recordid>eNo9j0tOwzAYhC0EEqVwAHa-gOH__e4SpbykSmxgbTmOTVNFTWWnqnqsXoQzkaiIxWhmNTMfIfcIDwhgHguiFpIBTrLA7AWZodWcKaHU5X-W5prclLIB0CBBzYioOl9Km47t9pvm6Dt26HPX0MYPnh7aYU2HdaTL5c-J7XIfYrPP8ZZcJd-VePfnc_L18vxZvbHVx-t79bRiBRd2YMkYaaxCIesaIAjNuQSPQaoEeiEBfVQWlDbC6pRihJpLgVGG4C1qUGJO-Lm37PJ4L2a36fd5O046BDdhuzO2G7HdhO2s-AXoQUnI</recordid><startdate>20150902</startdate><enddate>20150902</enddate><creator>Mozharovskyi, Pavlo</creator><creator>Mosler, Karl</creator><creator>Lange, Tatjana</creator><general>Springer Berlin Heidelberg</general><scope/></search><sort><creationdate>20150902</creationdate><title>Classifying real-world data with the DDα-procedure</title><author>Mozharovskyi, Pavlo ; Mosler, Karl ; Lange, Tatjana</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-s198t-f774785134bb00c362240a1c45f069401ae580567386ffee0b2431e4cca816053</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Chemistry and Earth Sciences</topic><topic>Computer Science</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Economics</topic><topic>Finance</topic><topic>Health Sciences</topic><topic>Humanities</topic><topic>Insurance</topic><topic>Law</topic><topic>Management</topic><topic>Mathematics and Statistics</topic><topic>Medicine</topic><topic>Physics</topic><topic>Regular Article</topic><topic>Statistical Theory and Methods</topic><topic>Statistics</topic><topic>Statistics for Business</topic><topic>Statistics for Engineering</topic><topic>Statistics for Life Sciences</topic><topic>Statistics for Social Sciences</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mozharovskyi, Pavlo</creatorcontrib><creatorcontrib>Mosler, Karl</creatorcontrib><creatorcontrib>Lange, Tatjana</creatorcontrib><jtitle>Advances in data analysis and classification</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mozharovskyi, Pavlo</au><au>Mosler, Karl</au><au>Lange, Tatjana</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Classifying real-world data with the DDα-procedure</atitle><jtitle>Advances in data analysis and classification</jtitle><stitle>Adv Data Anal Classif</stitle><date>2015-09-02</date><risdate>2015</risdate><volume>9</volume><issue>3</issue><spage>287</spage><epage>314</epage><pages>287-314</pages><issn>1862-5347</issn><eissn>1862-5355</eissn><abstract>The D D α -classifier, a nonparametric fast and very robust procedure, is described and applied to fifty classification problems regarding a broad spectrum of real-world data. The procedure first transforms the data from their original property space into a depth space, which is a low-dimensional unit cube, and then separates them by a projective invariant procedure, called α -procedure. To each data point the transformation assigns its depth values with respect to the given classes. Several alternative depth notions (spatial depth, Mahalanobis depth, projection depth, and Tukey depth, the latter two being approximated by univariate projections) are used in the procedure, and compared regarding their average error rates. With the Tukey depth, which fits the distributions’ shape best and is most robust, ‘outsiders’, that is data points having zero depth in all classes, appear. They need an additional treatment for classification. Evidence is also given about the dimension of the extended feature space needed for linear separation. The D D α -procedure is available as an R-package.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s11634-014-0180-8</doi><tpages>28</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1862-5347
ispartof Advances in data analysis and classification, 2015-09, Vol.9 (3), p.287-314
issn 1862-5347
1862-5355
language eng
recordid cdi_springer_journals_10_1007_s11634_014_0180_8
source Springer Nature
subjects Chemistry and Earth Sciences
Computer Science
Data Mining and Knowledge Discovery
Economics
Finance
Health Sciences
Humanities
Insurance
Law
Management
Mathematics and Statistics
Medicine
Physics
Regular Article
Statistical Theory and Methods
Statistics
Statistics for Business
Statistics for Engineering
Statistics for Life Sciences
Statistics for Social Sciences
title Classifying real-world data with the DDα-procedure
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T03%3A32%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-springer&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Classifying%20real-world%20data%20with%20the%20DD%CE%B1-procedure&rft.jtitle=Advances%20in%20data%20analysis%20and%20classification&rft.au=Mozharovskyi,%20Pavlo&rft.date=2015-09-02&rft.volume=9&rft.issue=3&rft.spage=287&rft.epage=314&rft.pages=287-314&rft.issn=1862-5347&rft.eissn=1862-5355&rft_id=info:doi/10.1007/s11634-014-0180-8&rft_dat=%3Cspringer%3E10_1007_s11634_014_0180_8%3C/springer%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-s198t-f774785134bb00c362240a1c45f069401ae580567386ffee0b2431e4cca816053%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true