Loading…
Weighted Multiview Possibilistic C-Means Clustering With L2 Regularization
Since social media, virtual communities and networks rapidly grow, multiview data become more popular. In general, multiview data always contain different feature components in different views. Although these data are extracted in different ways (views) from diverse settings and domains, they are us...
Saved in:
Published in: | IEEE transactions on fuzzy systems 2022-05, Vol.30 (5), p.1357-1370 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c295t-ca666be2db8e85af081484df22111a81f1ca792306342401d7d8923ad194f1ae3 |
---|---|
cites | cdi_FETCH-LOGICAL-c295t-ca666be2db8e85af081484df22111a81f1ca792306342401d7d8923ad194f1ae3 |
container_end_page | 1370 |
container_issue | 5 |
container_start_page | 1357 |
container_title | IEEE transactions on fuzzy systems |
container_volume | 30 |
creator | Benjamin, Josephine Bernadette M. Yang, Miin-Shen |
description | Since social media, virtual communities and networks rapidly grow, multiview data become more popular. In general, multiview data always contain different feature components in different views. Although these data are extracted in different ways (views) from diverse settings and domains, they are used to describe the same samples, which make them highly related. Hence, applying (single-view) clustering methods for multiview data poses difficulty in achieving desirable clustering results. Thus, multiview clustering methods should be developed that will utilize available multiview information. Most of multiview clustering techniques currently use k-means due to its conceptual simplicity, and use fuzzy c-means (FCM) that the datapoints can belong to more than one cluster based on their membership degrees from 0 to 1. However, the use of k-means or FCM may degrade its performance due to the presence of noise and outliers, especially on large or high-dimensional datasets. The constraint imposed on the membership degrees of k-means and FCM tends to assign a corresponding high membership value to an outlier or a noisy data point. To address these drawbacks, possibilistic c-means (PCM) relaxes the membership constraint of k-means and FCM so that outliers and noisy datapoints can be properly identified. On the other hand, there are various extensions of k-means and FCM for multiview data, but no extension of PCM for multiview data was made in the literature. Thus, we use PCM in our proposed multiview clustering model. In this article, we propose novel weighted multiview PCM algorithms designed for clustering multiview data as well as view and feature weights on PCM approaches, called W-MV-PCM and W-MV-PCM with L2 regularization (W-MV-PCM-L2). In multiview clustering, different views may vary with respect to its importance and each view may contain some irrelevant features. In the proposed algorithms, a learning scheme is constructed to compute for the view weights, and feature weights within each view. This scheme will be able to identify the importance of each view and, at the same time, it will also identify and select relevant features in each view. Comparisons of W-MV-PCM-L2 with existing multiview clustering algorithms are made on both synthetic and real datasets. The experimental results are evaluated using accuracy rate (AR) and external validity indexes, such as Rand index (RI) and normalized mutual information (NMI). The proposed W-MV-PCM-L2 algorithm with |
doi_str_mv | 10.1109/TFUZZ.2021.3058572 |
format | article |
fullrecord | <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_ieee_primary_9352509</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9352509</ieee_id><sourcerecordid>2659345609</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-ca666be2db8e85af081484df22111a81f1ca792306342401d7d8923ad194f1ae3</originalsourceid><addsrcrecordid>eNo9kFtLAzEQhYMoWKt_QF8WfN6ayW2zj7J4pUWRlkJfQrqbbVPW3ZpkFf31prb4NDNwzsycD6FLwCMAnN9M72eLxYhgAiOKueQZOUIDyBmkGFN2HHssaCoyLE7RmfcbjIFxkAP0PDd2tQ6mSiZ9E-ynNV_Ja-e9XdrG-mDLpEgnRrc-KZreB-Nsu0rmNqyTMUnezKpvtLM_OtiuPUcntW68uTjUIZrd302Lx3T88vBU3I7TkuQ8pKUWQiwNqZbSSK5rLIFJVtWEAICWUEOps5zQ-DAjDEOVVTKOuopxatCGDtH1fu_WdR-98UFtut618aQigueUcYHzqCJ7VeliHGdqtXX2XbtvBVjtmKk_ZmrHTB2YRdPV3mSNMf-GnHLC48pfAjpnnA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2659345609</pqid></control><display><type>article</type><title>Weighted Multiview Possibilistic C-Means Clustering With L2 Regularization</title><source>IEEE Xplore (Online service)</source><creator>Benjamin, Josephine Bernadette M. ; Yang, Miin-Shen</creator><creatorcontrib>Benjamin, Josephine Bernadette M. ; Yang, Miin-Shen</creatorcontrib><description>Since social media, virtual communities and networks rapidly grow, multiview data become more popular. In general, multiview data always contain different feature components in different views. Although these data are extracted in different ways (views) from diverse settings and domains, they are used to describe the same samples, which make them highly related. Hence, applying (single-view) clustering methods for multiview data poses difficulty in achieving desirable clustering results. Thus, multiview clustering methods should be developed that will utilize available multiview information. Most of multiview clustering techniques currently use k-means due to its conceptual simplicity, and use fuzzy c-means (FCM) that the datapoints can belong to more than one cluster based on their membership degrees from 0 to 1. However, the use of k-means or FCM may degrade its performance due to the presence of noise and outliers, especially on large or high-dimensional datasets. The constraint imposed on the membership degrees of k-means and FCM tends to assign a corresponding high membership value to an outlier or a noisy data point. To address these drawbacks, possibilistic c-means (PCM) relaxes the membership constraint of k-means and FCM so that outliers and noisy datapoints can be properly identified. On the other hand, there are various extensions of k-means and FCM for multiview data, but no extension of PCM for multiview data was made in the literature. Thus, we use PCM in our proposed multiview clustering model. In this article, we propose novel weighted multiview PCM algorithms designed for clustering multiview data as well as view and feature weights on PCM approaches, called W-MV-PCM and W-MV-PCM with L2 regularization (W-MV-PCM-L2). In multiview clustering, different views may vary with respect to its importance and each view may contain some irrelevant features. In the proposed algorithms, a learning scheme is constructed to compute for the view weights, and feature weights within each view. This scheme will be able to identify the importance of each view and, at the same time, it will also identify and select relevant features in each view. Comparisons of W-MV-PCM-L2 with existing multiview clustering algorithms are made on both synthetic and real datasets. The experimental results are evaluated using accuracy rate (AR) and external validity indexes, such as Rand index (RI) and normalized mutual information (NMI). The proposed W-MV-PCM-L2 algorithm with comparisons of existing algorithms under criteria of AR, RI, and NMI shows that it is a feasible and effective multiview clustering algorithm.</description><identifier>ISSN: 1063-6706</identifier><identifier>EISSN: 1941-0034</identifier><identifier>DOI: 10.1109/TFUZZ.2021.3058572</identifier><identifier>CODEN: IEFSEV</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Algorithms ; Clustering ; Clustering algorithms ; Data points ; Datasets ; Feature extraction ; fuzzy c-means (FCM) ; Indexes ; Linear programming ; Machine learning ; multiview clustering ; multiview data ; Outliers (statistics) ; Parameter estimation ; Partitioning algorithms ; Performance degradation ; Performance indices ; Phase change materials ; possibilistic c-means (PCM) ; Regularization ; Task analysis ; Virtual networks ; weighted ; weighted multiview PCM (W-MV-PCM)</subject><ispartof>IEEE transactions on fuzzy systems, 2022-05, Vol.30 (5), p.1357-1370</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2022</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-ca666be2db8e85af081484df22111a81f1ca792306342401d7d8923ad194f1ae3</citedby><cites>FETCH-LOGICAL-c295t-ca666be2db8e85af081484df22111a81f1ca792306342401d7d8923ad194f1ae3</cites><orcidid>0000-0002-4907-3548 ; 0000-0002-2995-7207</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9352509$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,54775</link.rule.ids></links><search><creatorcontrib>Benjamin, Josephine Bernadette M.</creatorcontrib><creatorcontrib>Yang, Miin-Shen</creatorcontrib><title>Weighted Multiview Possibilistic C-Means Clustering With L2 Regularization</title><title>IEEE transactions on fuzzy systems</title><addtitle>TFUZZ</addtitle><description>Since social media, virtual communities and networks rapidly grow, multiview data become more popular. In general, multiview data always contain different feature components in different views. Although these data are extracted in different ways (views) from diverse settings and domains, they are used to describe the same samples, which make them highly related. Hence, applying (single-view) clustering methods for multiview data poses difficulty in achieving desirable clustering results. Thus, multiview clustering methods should be developed that will utilize available multiview information. Most of multiview clustering techniques currently use k-means due to its conceptual simplicity, and use fuzzy c-means (FCM) that the datapoints can belong to more than one cluster based on their membership degrees from 0 to 1. However, the use of k-means or FCM may degrade its performance due to the presence of noise and outliers, especially on large or high-dimensional datasets. The constraint imposed on the membership degrees of k-means and FCM tends to assign a corresponding high membership value to an outlier or a noisy data point. To address these drawbacks, possibilistic c-means (PCM) relaxes the membership constraint of k-means and FCM so that outliers and noisy datapoints can be properly identified. On the other hand, there are various extensions of k-means and FCM for multiview data, but no extension of PCM for multiview data was made in the literature. Thus, we use PCM in our proposed multiview clustering model. In this article, we propose novel weighted multiview PCM algorithms designed for clustering multiview data as well as view and feature weights on PCM approaches, called W-MV-PCM and W-MV-PCM with L2 regularization (W-MV-PCM-L2). In multiview clustering, different views may vary with respect to its importance and each view may contain some irrelevant features. In the proposed algorithms, a learning scheme is constructed to compute for the view weights, and feature weights within each view. This scheme will be able to identify the importance of each view and, at the same time, it will also identify and select relevant features in each view. Comparisons of W-MV-PCM-L2 with existing multiview clustering algorithms are made on both synthetic and real datasets. The experimental results are evaluated using accuracy rate (AR) and external validity indexes, such as Rand index (RI) and normalized mutual information (NMI). The proposed W-MV-PCM-L2 algorithm with comparisons of existing algorithms under criteria of AR, RI, and NMI shows that it is a feasible and effective multiview clustering algorithm.</description><subject>Algorithms</subject><subject>Clustering</subject><subject>Clustering algorithms</subject><subject>Data points</subject><subject>Datasets</subject><subject>Feature extraction</subject><subject>fuzzy c-means (FCM)</subject><subject>Indexes</subject><subject>Linear programming</subject><subject>Machine learning</subject><subject>multiview clustering</subject><subject>multiview data</subject><subject>Outliers (statistics)</subject><subject>Parameter estimation</subject><subject>Partitioning algorithms</subject><subject>Performance degradation</subject><subject>Performance indices</subject><subject>Phase change materials</subject><subject>possibilistic c-means (PCM)</subject><subject>Regularization</subject><subject>Task analysis</subject><subject>Virtual networks</subject><subject>weighted</subject><subject>weighted multiview PCM (W-MV-PCM)</subject><issn>1063-6706</issn><issn>1941-0034</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNo9kFtLAzEQhYMoWKt_QF8WfN6ayW2zj7J4pUWRlkJfQrqbbVPW3ZpkFf31prb4NDNwzsycD6FLwCMAnN9M72eLxYhgAiOKueQZOUIDyBmkGFN2HHssaCoyLE7RmfcbjIFxkAP0PDd2tQ6mSiZ9E-ynNV_Ja-e9XdrG-mDLpEgnRrc-KZreB-Nsu0rmNqyTMUnezKpvtLM_OtiuPUcntW68uTjUIZrd302Lx3T88vBU3I7TkuQ8pKUWQiwNqZbSSK5rLIFJVtWEAICWUEOps5zQ-DAjDEOVVTKOuopxatCGDtH1fu_WdR-98UFtut618aQigueUcYHzqCJ7VeliHGdqtXX2XbtvBVjtmKk_ZmrHTB2YRdPV3mSNMf-GnHLC48pfAjpnnA</recordid><startdate>20220501</startdate><enddate>20220501</enddate><creator>Benjamin, Josephine Bernadette M.</creator><creator>Yang, Miin-Shen</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-4907-3548</orcidid><orcidid>https://orcid.org/0000-0002-2995-7207</orcidid></search><sort><creationdate>20220501</creationdate><title>Weighted Multiview Possibilistic C-Means Clustering With L2 Regularization</title><author>Benjamin, Josephine Bernadette M. ; Yang, Miin-Shen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-ca666be2db8e85af081484df22111a81f1ca792306342401d7d8923ad194f1ae3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Algorithms</topic><topic>Clustering</topic><topic>Clustering algorithms</topic><topic>Data points</topic><topic>Datasets</topic><topic>Feature extraction</topic><topic>fuzzy c-means (FCM)</topic><topic>Indexes</topic><topic>Linear programming</topic><topic>Machine learning</topic><topic>multiview clustering</topic><topic>multiview data</topic><topic>Outliers (statistics)</topic><topic>Parameter estimation</topic><topic>Partitioning algorithms</topic><topic>Performance degradation</topic><topic>Performance indices</topic><topic>Phase change materials</topic><topic>possibilistic c-means (PCM)</topic><topic>Regularization</topic><topic>Task analysis</topic><topic>Virtual networks</topic><topic>weighted</topic><topic>weighted multiview PCM (W-MV-PCM)</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Benjamin, Josephine Bernadette M.</creatorcontrib><creatorcontrib>Yang, Miin-Shen</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005–Present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998–Present</collection><collection>IEEE/IET Electronic Library</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on fuzzy systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Benjamin, Josephine Bernadette M.</au><au>Yang, Miin-Shen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Weighted Multiview Possibilistic C-Means Clustering With L2 Regularization</atitle><jtitle>IEEE transactions on fuzzy systems</jtitle><stitle>TFUZZ</stitle><date>2022-05-01</date><risdate>2022</risdate><volume>30</volume><issue>5</issue><spage>1357</spage><epage>1370</epage><pages>1357-1370</pages><issn>1063-6706</issn><eissn>1941-0034</eissn><coden>IEFSEV</coden><abstract>Since social media, virtual communities and networks rapidly grow, multiview data become more popular. In general, multiview data always contain different feature components in different views. Although these data are extracted in different ways (views) from diverse settings and domains, they are used to describe the same samples, which make them highly related. Hence, applying (single-view) clustering methods for multiview data poses difficulty in achieving desirable clustering results. Thus, multiview clustering methods should be developed that will utilize available multiview information. Most of multiview clustering techniques currently use k-means due to its conceptual simplicity, and use fuzzy c-means (FCM) that the datapoints can belong to more than one cluster based on their membership degrees from 0 to 1. However, the use of k-means or FCM may degrade its performance due to the presence of noise and outliers, especially on large or high-dimensional datasets. The constraint imposed on the membership degrees of k-means and FCM tends to assign a corresponding high membership value to an outlier or a noisy data point. To address these drawbacks, possibilistic c-means (PCM) relaxes the membership constraint of k-means and FCM so that outliers and noisy datapoints can be properly identified. On the other hand, there are various extensions of k-means and FCM for multiview data, but no extension of PCM for multiview data was made in the literature. Thus, we use PCM in our proposed multiview clustering model. In this article, we propose novel weighted multiview PCM algorithms designed for clustering multiview data as well as view and feature weights on PCM approaches, called W-MV-PCM and W-MV-PCM with L2 regularization (W-MV-PCM-L2). In multiview clustering, different views may vary with respect to its importance and each view may contain some irrelevant features. In the proposed algorithms, a learning scheme is constructed to compute for the view weights, and feature weights within each view. This scheme will be able to identify the importance of each view and, at the same time, it will also identify and select relevant features in each view. Comparisons of W-MV-PCM-L2 with existing multiview clustering algorithms are made on both synthetic and real datasets. The experimental results are evaluated using accuracy rate (AR) and external validity indexes, such as Rand index (RI) and normalized mutual information (NMI). The proposed W-MV-PCM-L2 algorithm with comparisons of existing algorithms under criteria of AR, RI, and NMI shows that it is a feasible and effective multiview clustering algorithm.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TFUZZ.2021.3058572</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0002-4907-3548</orcidid><orcidid>https://orcid.org/0000-0002-2995-7207</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1063-6706 |
ispartof | IEEE transactions on fuzzy systems, 2022-05, Vol.30 (5), p.1357-1370 |
issn | 1063-6706 1941-0034 |
language | eng |
recordid | cdi_ieee_primary_9352509 |
source | IEEE Xplore (Online service) |
subjects | Algorithms Clustering Clustering algorithms Data points Datasets Feature extraction fuzzy c-means (FCM) Indexes Linear programming Machine learning multiview clustering multiview data Outliers (statistics) Parameter estimation Partitioning algorithms Performance degradation Performance indices Phase change materials possibilistic c-means (PCM) Regularization Task analysis Virtual networks weighted weighted multiview PCM (W-MV-PCM) |
title | Weighted Multiview Possibilistic C-Means Clustering With L2 Regularization |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T00%3A36%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Weighted%20Multiview%20Possibilistic%20C-Means%20Clustering%20With%20L2%20Regularization&rft.jtitle=IEEE%20transactions%20on%20fuzzy%20systems&rft.au=Benjamin,%20Josephine%20Bernadette%20M.&rft.date=2022-05-01&rft.volume=30&rft.issue=5&rft.spage=1357&rft.epage=1370&rft.pages=1357-1370&rft.issn=1063-6706&rft.eissn=1941-0034&rft.coden=IEFSEV&rft_id=info:doi/10.1109/TFUZZ.2021.3058572&rft_dat=%3Cproquest_ieee_%3E2659345609%3C/proquest_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c295t-ca666be2db8e85af081484df22111a81f1ca792306342401d7d8923ad194f1ae3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2659345609&rft_id=info:pmid/&rft_ieee_id=9352509&rfr_iscdi=true |