Loading…

Principal component analysis constrained by layered simple structures

The paper proposes a procedure for principal component analysis called layered principal component analysis (LPCA) to produce a simple and interpretable loading matrix. The novelty of LPCA is that a loading matrix is constrained as a sum of matrices with simple structures called layers, and the resu...

Full description

Saved in:
Bibliographic Details
Published in:Advances in data analysis and classification 2023-06, Vol.17 (2), p.347-367
Main Author: Yamashita, Naoto
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c270t-e4f42cc1550b69cfabc4b4e3dc67cb350ea31938c04a165673097d0fc6f7516c3
container_end_page 367
container_issue 2
container_start_page 347
container_title Advances in data analysis and classification
container_volume 17
creator Yamashita, Naoto
description The paper proposes a procedure for principal component analysis called layered principal component analysis (LPCA) to produce a simple and interpretable loading matrix. The novelty of LPCA is that a loading matrix is constrained as a sum of matrices with simple structures called layers, and the resulting simplicity of the LPCA solution is controlled by how many layers are used. LPCA is a generalization of disjoint PCA proposed as reported by Ferrara (in: Giommi (ed) Topics in theoretical and applied statistics, Springer, Cham 2016). The number of layers controls the balance of simplicity and the fit to the data, and the user can choose the desired level of simplicity between the most restrictive but simplest case with a single layer or multiple layers with better fit to the data. The optimal number of layers is specified in terms of explained variance and two information criteria. Two simulation studies were conducted to evaluate how accurately the LPCA procedure recovers the true parameter values. The results showed that LPCA was effective for parameter recovery. The paper presents three examples of LPCA applied to real data, which show the potential of LPCA for producing simple and interpretable loading matrices.
doi_str_mv 10.1007/s11634-022-00503-9
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2814911470</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2814911470</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-e4f42cc1550b69cfabc4b4e3dc67cb350ea31938c04a165673097d0fc6f7516c3</originalsourceid><addsrcrecordid>eNp9kE9LxDAQxYMouK5-AU8Fz9HJ_-1RltUVFvSg55DOptKlm9akPfTbG63ozdM8Zt57DD9CrhncMgBzlxjTQlLgnAIoELQ8IQu20pwqodTpr5bmnFykdADQIEEtyOYlNgGb3rUFdse-Cz4MhQuunVKT8iqkIbom-H1RTUXrJh-zTM2xb32RTyMOY_TpkpzVrk3-6mcuydvD5nW9pbvnx6f1_Y4iNzBQL2vJEZlSUOkSa1ehrKQXe9QGK6HAO8FKsUKQjmmljYDS7KFGXRvFNIoluZl7-9h9jD4N9tCNMX-bLF8xWTImDWQXn10Yu5Sir20fm6OLk2Vgv3DZGZfNuOw3LlvmkJhDKZvDu49_1f-kPgGgVm3r</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2814911470</pqid></control><display><type>article</type><title>Principal component analysis constrained by layered simple structures</title><source>Springer Link</source><creator>Yamashita, Naoto</creator><creatorcontrib>Yamashita, Naoto</creatorcontrib><description>The paper proposes a procedure for principal component analysis called layered principal component analysis (LPCA) to produce a simple and interpretable loading matrix. The novelty of LPCA is that a loading matrix is constrained as a sum of matrices with simple structures called layers, and the resulting simplicity of the LPCA solution is controlled by how many layers are used. LPCA is a generalization of disjoint PCA proposed as reported by Ferrara (in: Giommi (ed) Topics in theoretical and applied statistics, Springer, Cham 2016). The number of layers controls the balance of simplicity and the fit to the data, and the user can choose the desired level of simplicity between the most restrictive but simplest case with a single layer or multiple layers with better fit to the data. The optimal number of layers is specified in terms of explained variance and two information criteria. Two simulation studies were conducted to evaluate how accurately the LPCA procedure recovers the true parameter values. The results showed that LPCA was effective for parameter recovery. The paper presents three examples of LPCA applied to real data, which show the potential of LPCA for producing simple and interpretable loading matrices.</description><identifier>ISSN: 1862-5347</identifier><identifier>EISSN: 1862-5355</identifier><identifier>DOI: 10.1007/s11634-022-00503-9</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Applied statistics ; Chemistry and Earth Sciences ; Computer Science ; Data Mining and Knowledge Discovery ; Economics ; Finance ; Health Sciences ; Humanities ; Insurance ; Law ; Management ; Mathematics and Statistics ; Medicine ; Parameters ; Physics ; Principal components analysis ; Regular Article ; Statistical Theory and Methods ; Statistics ; Statistics for Business ; Statistics for Engineering ; Statistics for Life Sciences ; Statistics for Social Sciences</subject><ispartof>Advances in data analysis and classification, 2023-06, Vol.17 (2), p.347-367</ispartof><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2022</rights><rights>Springer-Verlag GmbH Germany, part of Springer Nature 2022.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c270t-e4f42cc1550b69cfabc4b4e3dc67cb350ea31938c04a165673097d0fc6f7516c3</cites><orcidid>0000-0002-8819-4262</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Yamashita, Naoto</creatorcontrib><title>Principal component analysis constrained by layered simple structures</title><title>Advances in data analysis and classification</title><addtitle>Adv Data Anal Classif</addtitle><description>The paper proposes a procedure for principal component analysis called layered principal component analysis (LPCA) to produce a simple and interpretable loading matrix. The novelty of LPCA is that a loading matrix is constrained as a sum of matrices with simple structures called layers, and the resulting simplicity of the LPCA solution is controlled by how many layers are used. LPCA is a generalization of disjoint PCA proposed as reported by Ferrara (in: Giommi (ed) Topics in theoretical and applied statistics, Springer, Cham 2016). The number of layers controls the balance of simplicity and the fit to the data, and the user can choose the desired level of simplicity between the most restrictive but simplest case with a single layer or multiple layers with better fit to the data. The optimal number of layers is specified in terms of explained variance and two information criteria. Two simulation studies were conducted to evaluate how accurately the LPCA procedure recovers the true parameter values. The results showed that LPCA was effective for parameter recovery. The paper presents three examples of LPCA applied to real data, which show the potential of LPCA for producing simple and interpretable loading matrices.</description><subject>Applied statistics</subject><subject>Chemistry and Earth Sciences</subject><subject>Computer Science</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Economics</subject><subject>Finance</subject><subject>Health Sciences</subject><subject>Humanities</subject><subject>Insurance</subject><subject>Law</subject><subject>Management</subject><subject>Mathematics and Statistics</subject><subject>Medicine</subject><subject>Parameters</subject><subject>Physics</subject><subject>Principal components analysis</subject><subject>Regular Article</subject><subject>Statistical Theory and Methods</subject><subject>Statistics</subject><subject>Statistics for Business</subject><subject>Statistics for Engineering</subject><subject>Statistics for Life Sciences</subject><subject>Statistics for Social Sciences</subject><issn>1862-5347</issn><issn>1862-5355</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LxDAQxYMouK5-AU8Fz9HJ_-1RltUVFvSg55DOptKlm9akPfTbG63ozdM8Zt57DD9CrhncMgBzlxjTQlLgnAIoELQ8IQu20pwqodTpr5bmnFykdADQIEEtyOYlNgGb3rUFdse-Cz4MhQuunVKT8iqkIbom-H1RTUXrJh-zTM2xb32RTyMOY_TpkpzVrk3-6mcuydvD5nW9pbvnx6f1_Y4iNzBQL2vJEZlSUOkSa1ehrKQXe9QGK6HAO8FKsUKQjmmljYDS7KFGXRvFNIoluZl7-9h9jD4N9tCNMX-bLF8xWTImDWQXn10Yu5Sir20fm6OLk2Vgv3DZGZfNuOw3LlvmkJhDKZvDu49_1f-kPgGgVm3r</recordid><startdate>20230601</startdate><enddate>20230601</enddate><creator>Yamashita, Naoto</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-8819-4262</orcidid></search><sort><creationdate>20230601</creationdate><title>Principal component analysis constrained by layered simple structures</title><author>Yamashita, Naoto</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-e4f42cc1550b69cfabc4b4e3dc67cb350ea31938c04a165673097d0fc6f7516c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Applied statistics</topic><topic>Chemistry and Earth Sciences</topic><topic>Computer Science</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Economics</topic><topic>Finance</topic><topic>Health Sciences</topic><topic>Humanities</topic><topic>Insurance</topic><topic>Law</topic><topic>Management</topic><topic>Mathematics and Statistics</topic><topic>Medicine</topic><topic>Parameters</topic><topic>Physics</topic><topic>Principal components analysis</topic><topic>Regular Article</topic><topic>Statistical Theory and Methods</topic><topic>Statistics</topic><topic>Statistics for Business</topic><topic>Statistics for Engineering</topic><topic>Statistics for Life Sciences</topic><topic>Statistics for Social Sciences</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yamashita, Naoto</creatorcontrib><collection>CrossRef</collection><jtitle>Advances in data analysis and classification</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yamashita, Naoto</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Principal component analysis constrained by layered simple structures</atitle><jtitle>Advances in data analysis and classification</jtitle><stitle>Adv Data Anal Classif</stitle><date>2023-06-01</date><risdate>2023</risdate><volume>17</volume><issue>2</issue><spage>347</spage><epage>367</epage><pages>347-367</pages><issn>1862-5347</issn><eissn>1862-5355</eissn><abstract>The paper proposes a procedure for principal component analysis called layered principal component analysis (LPCA) to produce a simple and interpretable loading matrix. The novelty of LPCA is that a loading matrix is constrained as a sum of matrices with simple structures called layers, and the resulting simplicity of the LPCA solution is controlled by how many layers are used. LPCA is a generalization of disjoint PCA proposed as reported by Ferrara (in: Giommi (ed) Topics in theoretical and applied statistics, Springer, Cham 2016). The number of layers controls the balance of simplicity and the fit to the data, and the user can choose the desired level of simplicity between the most restrictive but simplest case with a single layer or multiple layers with better fit to the data. The optimal number of layers is specified in terms of explained variance and two information criteria. Two simulation studies were conducted to evaluate how accurately the LPCA procedure recovers the true parameter values. The results showed that LPCA was effective for parameter recovery. The paper presents three examples of LPCA applied to real data, which show the potential of LPCA for producing simple and interpretable loading matrices.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s11634-022-00503-9</doi><tpages>21</tpages><orcidid>https://orcid.org/0000-0002-8819-4262</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1862-5347
ispartof Advances in data analysis and classification, 2023-06, Vol.17 (2), p.347-367
issn 1862-5347
1862-5355
language eng
recordid cdi_proquest_journals_2814911470
source Springer Link
subjects Applied statistics
Chemistry and Earth Sciences
Computer Science
Data Mining and Knowledge Discovery
Economics
Finance
Health Sciences
Humanities
Insurance
Law
Management
Mathematics and Statistics
Medicine
Parameters
Physics
Principal components analysis
Regular Article
Statistical Theory and Methods
Statistics
Statistics for Business
Statistics for Engineering
Statistics for Life Sciences
Statistics for Social Sciences
title Principal component analysis constrained by layered simple structures
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T21%3A20%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Principal%20component%20analysis%20constrained%20by%20layered%20simple%20structures&rft.jtitle=Advances%20in%20data%20analysis%20and%20classification&rft.au=Yamashita,%20Naoto&rft.date=2023-06-01&rft.volume=17&rft.issue=2&rft.spage=347&rft.epage=367&rft.pages=347-367&rft.issn=1862-5347&rft.eissn=1862-5355&rft_id=info:doi/10.1007/s11634-022-00503-9&rft_dat=%3Cproquest_cross%3E2814911470%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c270t-e4f42cc1550b69cfabc4b4e3dc67cb350ea31938c04a165673097d0fc6f7516c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2814911470&rft_id=info:pmid/&rfr_iscdi=true