Loading…
Principal component analysis constrained by layered simple structures
The paper proposes a procedure for principal component analysis called layered principal component analysis (LPCA) to produce a simple and interpretable loading matrix. The novelty of LPCA is that a loading matrix is constrained as a sum of matrices with simple structures called layers, and the resu...
Saved in:
Published in: | Advances in data analysis and classification 2023-06, Vol.17 (2), p.347-367 |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c270t-e4f42cc1550b69cfabc4b4e3dc67cb350ea31938c04a165673097d0fc6f7516c3 |
container_end_page | 367 |
container_issue | 2 |
container_start_page | 347 |
container_title | Advances in data analysis and classification |
container_volume | 17 |
creator | Yamashita, Naoto |
description | The paper proposes a procedure for principal component analysis called layered principal component analysis (LPCA) to produce a simple and interpretable loading matrix. The novelty of LPCA is that a loading matrix is constrained as a sum of matrices with simple structures called layers, and the resulting simplicity of the LPCA solution is controlled by how many layers are used. LPCA is a generalization of disjoint PCA proposed as reported by Ferrara (in: Giommi (ed) Topics in theoretical and applied statistics, Springer, Cham 2016). The number of layers controls the balance of simplicity and the fit to the data, and the user can choose the desired level of simplicity between the most restrictive but simplest case with a single layer or multiple layers with better fit to the data. The optimal number of layers is specified in terms of explained variance and two information criteria. Two simulation studies were conducted to evaluate how accurately the LPCA procedure recovers the true parameter values. The results showed that LPCA was effective for parameter recovery. The paper presents three examples of LPCA applied to real data, which show the potential of LPCA for producing simple and interpretable loading matrices. |
doi_str_mv | 10.1007/s11634-022-00503-9 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2814911470</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2814911470</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-e4f42cc1550b69cfabc4b4e3dc67cb350ea31938c04a165673097d0fc6f7516c3</originalsourceid><addsrcrecordid>eNp9kE9LxDAQxYMouK5-AU8Fz9HJ_-1RltUVFvSg55DOptKlm9akPfTbG63ozdM8Zt57DD9CrhncMgBzlxjTQlLgnAIoELQ8IQu20pwqodTpr5bmnFykdADQIEEtyOYlNgGb3rUFdse-Cz4MhQuunVKT8iqkIbom-H1RTUXrJh-zTM2xb32RTyMOY_TpkpzVrk3-6mcuydvD5nW9pbvnx6f1_Y4iNzBQL2vJEZlSUOkSa1ehrKQXe9QGK6HAO8FKsUKQjmmljYDS7KFGXRvFNIoluZl7-9h9jD4N9tCNMX-bLF8xWTImDWQXn10Yu5Sir20fm6OLk2Vgv3DZGZfNuOw3LlvmkJhDKZvDu49_1f-kPgGgVm3r</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2814911470</pqid></control><display><type>article</type><title>Principal component analysis constrained by layered simple structures</title><source>Springer Link</source><creator>Yamashita, Naoto</creator><creatorcontrib>Yamashita, Naoto</creatorcontrib><description>The paper proposes a procedure for principal component analysis called layered principal component analysis (LPCA) to produce a simple and interpretable loading matrix. The novelty of LPCA is that a loading matrix is constrained as a sum of matrices with simple structures called layers, and the resulting simplicity of the LPCA solution is controlled by how many layers are used. LPCA is a generalization of disjoint PCA proposed as reported by Ferrara (in: Giommi (ed) Topics in theoretical and applied statistics, Springer, Cham 2016). The number of layers controls the balance of simplicity and the fit to the data, and the user can choose the desired level of simplicity between the most restrictive but simplest case with a single layer or multiple layers with better fit to the data. The optimal number of layers is specified in terms of explained variance and two information criteria. Two simulation studies were conducted to evaluate how accurately the LPCA procedure recovers the true parameter values. The results showed that LPCA was effective for parameter recovery. The paper presents three examples of LPCA applied to real data, which show the potential of LPCA for producing simple and interpretable loading matrices.</description><identifier>ISSN: 1862-5347</identifier><identifier>EISSN: 1862-5355</identifier><identifier>DOI: 10.1007/s11634-022-00503-9</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Applied statistics ; Chemistry and Earth Sciences ; Computer Science ; Data Mining and Knowledge Discovery ; Economics ; Finance ; Health Sciences ; Humanities ; Insurance ; Law ; Management ; Mathematics and Statistics ; Medicine ; Parameters ; Physics ; Principal components analysis ; Regular Article ; Statistical Theory and Methods ; Statistics ; Statistics for Business ; Statistics for Engineering ; Statistics for Life Sciences ; Statistics for Social Sciences</subject><ispartof>Advances in data analysis and classification, 2023-06, Vol.17 (2), p.347-367</ispartof><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2022</rights><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2022.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c270t-e4f42cc1550b69cfabc4b4e3dc67cb350ea31938c04a165673097d0fc6f7516c3</cites><orcidid>0000-0002-8819-4262</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Yamashita, Naoto</creatorcontrib><title>Principal component analysis constrained by layered simple structures</title><title>Advances in data analysis and classification</title><addtitle>Adv Data Anal Classif</addtitle><description>The paper proposes a procedure for principal component analysis called layered principal component analysis (LPCA) to produce a simple and interpretable loading matrix. The novelty of LPCA is that a loading matrix is constrained as a sum of matrices with simple structures called layers, and the resulting simplicity of the LPCA solution is controlled by how many layers are used. LPCA is a generalization of disjoint PCA proposed as reported by Ferrara (in: Giommi (ed) Topics in theoretical and applied statistics, Springer, Cham 2016). The number of layers controls the balance of simplicity and the fit to the data, and the user can choose the desired level of simplicity between the most restrictive but simplest case with a single layer or multiple layers with better fit to the data. The optimal number of layers is specified in terms of explained variance and two information criteria. Two simulation studies were conducted to evaluate how accurately the LPCA procedure recovers the true parameter values. The results showed that LPCA was effective for parameter recovery. The paper presents three examples of LPCA applied to real data, which show the potential of LPCA for producing simple and interpretable loading matrices.</description><subject>Applied statistics</subject><subject>Chemistry and Earth Sciences</subject><subject>Computer Science</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Economics</subject><subject>Finance</subject><subject>Health Sciences</subject><subject>Humanities</subject><subject>Insurance</subject><subject>Law</subject><subject>Management</subject><subject>Mathematics and Statistics</subject><subject>Medicine</subject><subject>Parameters</subject><subject>Physics</subject><subject>Principal components analysis</subject><subject>Regular Article</subject><subject>Statistical Theory and Methods</subject><subject>Statistics</subject><subject>Statistics for Business</subject><subject>Statistics for Engineering</subject><subject>Statistics for Life Sciences</subject><subject>Statistics for Social Sciences</subject><issn>1862-5347</issn><issn>1862-5355</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LxDAQxYMouK5-AU8Fz9HJ_-1RltUVFvSg55DOptKlm9akPfTbG63ozdM8Zt57DD9CrhncMgBzlxjTQlLgnAIoELQ8IQu20pwqodTpr5bmnFykdADQIEEtyOYlNgGb3rUFdse-Cz4MhQuunVKT8iqkIbom-H1RTUXrJh-zTM2xb32RTyMOY_TpkpzVrk3-6mcuydvD5nW9pbvnx6f1_Y4iNzBQL2vJEZlSUOkSa1ehrKQXe9QGK6HAO8FKsUKQjmmljYDS7KFGXRvFNIoluZl7-9h9jD4N9tCNMX-bLF8xWTImDWQXn10Yu5Sir20fm6OLk2Vgv3DZGZfNuOw3LlvmkJhDKZvDu49_1f-kPgGgVm3r</recordid><startdate>20230601</startdate><enddate>20230601</enddate><creator>Yamashita, Naoto</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-8819-4262</orcidid></search><sort><creationdate>20230601</creationdate><title>Principal component analysis constrained by layered simple structures</title><author>Yamashita, Naoto</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-e4f42cc1550b69cfabc4b4e3dc67cb350ea31938c04a165673097d0fc6f7516c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Applied statistics</topic><topic>Chemistry and Earth Sciences</topic><topic>Computer Science</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Economics</topic><topic>Finance</topic><topic>Health Sciences</topic><topic>Humanities</topic><topic>Insurance</topic><topic>Law</topic><topic>Management</topic><topic>Mathematics and Statistics</topic><topic>Medicine</topic><topic>Parameters</topic><topic>Physics</topic><topic>Principal components analysis</topic><topic>Regular Article</topic><topic>Statistical Theory and Methods</topic><topic>Statistics</topic><topic>Statistics for Business</topic><topic>Statistics for Engineering</topic><topic>Statistics for Life Sciences</topic><topic>Statistics for Social Sciences</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yamashita, Naoto</creatorcontrib><collection>CrossRef</collection><jtitle>Advances in data analysis and classification</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yamashita, Naoto</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Principal component analysis constrained by layered simple structures</atitle><jtitle>Advances in data analysis and classification</jtitle><stitle>Adv Data Anal Classif</stitle><date>2023-06-01</date><risdate>2023</risdate><volume>17</volume><issue>2</issue><spage>347</spage><epage>367</epage><pages>347-367</pages><issn>1862-5347</issn><eissn>1862-5355</eissn><abstract>The paper proposes a procedure for principal component analysis called layered principal component analysis (LPCA) to produce a simple and interpretable loading matrix. The novelty of LPCA is that a loading matrix is constrained as a sum of matrices with simple structures called layers, and the resulting simplicity of the LPCA solution is controlled by how many layers are used. LPCA is a generalization of disjoint PCA proposed as reported by Ferrara (in: Giommi (ed) Topics in theoretical and applied statistics, Springer, Cham 2016). The number of layers controls the balance of simplicity and the fit to the data, and the user can choose the desired level of simplicity between the most restrictive but simplest case with a single layer or multiple layers with better fit to the data. The optimal number of layers is specified in terms of explained variance and two information criteria. Two simulation studies were conducted to evaluate how accurately the LPCA procedure recovers the true parameter values. The results showed that LPCA was effective for parameter recovery. The paper presents three examples of LPCA applied to real data, which show the potential of LPCA for producing simple and interpretable loading matrices.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s11634-022-00503-9</doi><tpages>21</tpages><orcidid>https://orcid.org/0000-0002-8819-4262</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1862-5347 |
ispartof | Advances in data analysis and classification, 2023-06, Vol.17 (2), p.347-367 |
issn | 1862-5347 1862-5355 |
language | eng |
recordid | cdi_proquest_journals_2814911470 |
source | Springer Link |
subjects | Applied statistics Chemistry and Earth Sciences Computer Science Data Mining and Knowledge Discovery Economics Finance Health Sciences Humanities Insurance Law Management Mathematics and Statistics Medicine Parameters Physics Principal components analysis Regular Article Statistical Theory and Methods Statistics Statistics for Business Statistics for Engineering Statistics for Life Sciences Statistics for Social Sciences |
title | Principal component analysis constrained by layered simple structures |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T21%3A20%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Principal%20component%20analysis%20constrained%20by%20layered%20simple%20structures&rft.jtitle=Advances%20in%20data%20analysis%20and%20classification&rft.au=Yamashita,%20Naoto&rft.date=2023-06-01&rft.volume=17&rft.issue=2&rft.spage=347&rft.epage=367&rft.pages=347-367&rft.issn=1862-5347&rft.eissn=1862-5355&rft_id=info:doi/10.1007/s11634-022-00503-9&rft_dat=%3Cproquest_cross%3E2814911470%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c270t-e4f42cc1550b69cfabc4b4e3dc67cb350ea31938c04a165673097d0fc6f7516c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2814911470&rft_id=info:pmid/&rfr_iscdi=true |