Loading…
On Mahalanobis distance in functional settings
Mahalanobis distance is a classical tool in multivariate analysis. We suggest here an extension of this concept to the case of functional data. More precisely, the proposed definition concerns those statistical problems where the sample data are real functions defined on a compact interval of the re...
Saved in:
Published in: | arXiv.org 2018-03 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Berrendero, José R Bueno-Larraz, Beatriz Cuevas, Antonio |
description | Mahalanobis distance is a classical tool in multivariate analysis. We suggest here an extension of this concept to the case of functional data. More precisely, the proposed definition concerns those statistical problems where the sample data are real functions defined on a compact interval of the real line. The obvious difficulty for such a functional extension is the non-invertibility of the covariance operator in infinite-dimensional cases. Unlike other recent proposals, our definition is suggested and motivated in terms of the Reproducing Kernel Hilbert Space (RKHS) associated with the stochastic process that generates the data. The proposed distance is a true metric; it depends on a unique real smoothing parameter which is fully motivated in RKHS terms. Moreover, it shares some properties of its finite dimensional counterpart: it is invariant under isometries, it can be consistently estimated from the data and its sampling distribution is known under Gaussian models. An empirical study for two statistical applications, outliers detection and binary classification, is included. The obtained results are quite competitive when compared to other recent proposals of the literature. |
format | article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2071825233</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2071825233</sourcerecordid><originalsourceid>FETCH-proquest_journals_20718252333</originalsourceid><addsrcrecordid>eNpjYuA0MjY21LUwMTLiYOAtLs4yMDAwMjM3MjU15mTQ889T8E3MSMxJzMtPyixWSMksLknMS05VyMxTSCvNSy7JzM9LzFEoTi0pycxLL-ZhYE1LzClO5YXS3AzKbq4hzh66BUX5haWpxSXxWfmlRUAdxfFGBuaGFkamQLuNiVMFAAANMnc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2071825233</pqid></control><display><type>article</type><title>On Mahalanobis distance in functional settings</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><creator>Berrendero, José R ; Bueno-Larraz, Beatriz ; Cuevas, Antonio</creator><creatorcontrib>Berrendero, José R ; Bueno-Larraz, Beatriz ; Cuevas, Antonio</creatorcontrib><description>Mahalanobis distance is a classical tool in multivariate analysis. We suggest here an extension of this concept to the case of functional data. More precisely, the proposed definition concerns those statistical problems where the sample data are real functions defined on a compact interval of the real line. The obvious difficulty for such a functional extension is the non-invertibility of the covariance operator in infinite-dimensional cases. Unlike other recent proposals, our definition is suggested and motivated in terms of the Reproducing Kernel Hilbert Space (RKHS) associated with the stochastic process that generates the data. The proposed distance is a true metric; it depends on a unique real smoothing parameter which is fully motivated in RKHS terms. Moreover, it shares some properties of its finite dimensional counterpart: it is invariant under isometries, it can be consistently estimated from the data and its sampling distribution is known under Gaussian models. An empirical study for two statistical applications, outliers detection and binary classification, is included. The obtained results are quite competitive when compared to other recent proposals of the literature.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Covariance ; Data analysis ; Empirical analysis ; Hilbert space ; Multivariate analysis ; Outliers (statistics) ; Proposals ; Stochastic processes</subject><ispartof>arXiv.org, 2018-03</ispartof><rights>2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2071825233?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,37012,44590</link.rule.ids></links><search><creatorcontrib>Berrendero, José R</creatorcontrib><creatorcontrib>Bueno-Larraz, Beatriz</creatorcontrib><creatorcontrib>Cuevas, Antonio</creatorcontrib><title>On Mahalanobis distance in functional settings</title><title>arXiv.org</title><description>Mahalanobis distance is a classical tool in multivariate analysis. We suggest here an extension of this concept to the case of functional data. More precisely, the proposed definition concerns those statistical problems where the sample data are real functions defined on a compact interval of the real line. The obvious difficulty for such a functional extension is the non-invertibility of the covariance operator in infinite-dimensional cases. Unlike other recent proposals, our definition is suggested and motivated in terms of the Reproducing Kernel Hilbert Space (RKHS) associated with the stochastic process that generates the data. The proposed distance is a true metric; it depends on a unique real smoothing parameter which is fully motivated in RKHS terms. Moreover, it shares some properties of its finite dimensional counterpart: it is invariant under isometries, it can be consistently estimated from the data and its sampling distribution is known under Gaussian models. An empirical study for two statistical applications, outliers detection and binary classification, is included. The obtained results are quite competitive when compared to other recent proposals of the literature.</description><subject>Covariance</subject><subject>Data analysis</subject><subject>Empirical analysis</subject><subject>Hilbert space</subject><subject>Multivariate analysis</subject><subject>Outliers (statistics)</subject><subject>Proposals</subject><subject>Stochastic processes</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNpjYuA0MjY21LUwMTLiYOAtLs4yMDAwMjM3MjU15mTQ889T8E3MSMxJzMtPyixWSMksLknMS05VyMxTSCvNSy7JzM9LzFEoTi0pycxLL-ZhYE1LzClO5YXS3AzKbq4hzh66BUX5haWpxSXxWfmlRUAdxfFGBuaGFkamQLuNiVMFAAANMnc</recordid><startdate>20180317</startdate><enddate>20180317</enddate><creator>Berrendero, José R</creator><creator>Bueno-Larraz, Beatriz</creator><creator>Cuevas, Antonio</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20180317</creationdate><title>On Mahalanobis distance in functional settings</title><author>Berrendero, José R ; Bueno-Larraz, Beatriz ; Cuevas, Antonio</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20718252333</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Covariance</topic><topic>Data analysis</topic><topic>Empirical analysis</topic><topic>Hilbert space</topic><topic>Multivariate analysis</topic><topic>Outliers (statistics)</topic><topic>Proposals</topic><topic>Stochastic processes</topic><toplevel>online_resources</toplevel><creatorcontrib>Berrendero, José R</creatorcontrib><creatorcontrib>Bueno-Larraz, Beatriz</creatorcontrib><creatorcontrib>Cuevas, Antonio</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Engineering Database</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Berrendero, José R</au><au>Bueno-Larraz, Beatriz</au><au>Cuevas, Antonio</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>On Mahalanobis distance in functional settings</atitle><jtitle>arXiv.org</jtitle><date>2018-03-17</date><risdate>2018</risdate><eissn>2331-8422</eissn><abstract>Mahalanobis distance is a classical tool in multivariate analysis. We suggest here an extension of this concept to the case of functional data. More precisely, the proposed definition concerns those statistical problems where the sample data are real functions defined on a compact interval of the real line. The obvious difficulty for such a functional extension is the non-invertibility of the covariance operator in infinite-dimensional cases. Unlike other recent proposals, our definition is suggested and motivated in terms of the Reproducing Kernel Hilbert Space (RKHS) associated with the stochastic process that generates the data. The proposed distance is a true metric; it depends on a unique real smoothing parameter which is fully motivated in RKHS terms. Moreover, it shares some properties of its finite dimensional counterpart: it is invariant under isometries, it can be consistently estimated from the data and its sampling distribution is known under Gaussian models. An empirical study for two statistical applications, outliers detection and binary classification, is included. The obtained results are quite competitive when compared to other recent proposals of the literature.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2018-03 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2071825233 |
source | Publicly Available Content Database (Proquest) (PQ_SDU_P3) |
subjects | Covariance Data analysis Empirical analysis Hilbert space Multivariate analysis Outliers (statistics) Proposals Stochastic processes |
title | On Mahalanobis distance in functional settings |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T20%3A30%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=On%20Mahalanobis%20distance%20in%20functional%20settings&rft.jtitle=arXiv.org&rft.au=Berrendero,%20Jos%C3%A9%20R&rft.date=2018-03-17&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2071825233%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_20718252333%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2071825233&rft_id=info:pmid/&rfr_iscdi=true |