Loading…

K-Means-Based Consensus Clustering: A Unified View

The objective of consensus clustering is to find a single partitioning which agrees as much as possible with existing basic partitionings. Consensus clustering emerges as a promising solution to find cluster structures from heterogeneous data. As an efficient approach for consensus clustering, the K...

Full description

Saved in:

Bibliographic Details
Published in:	IEEE transactions on knowledge and data engineering 2015-01, Vol.27 (1), p.155-169
Main Authors:	Wu, Junjie, Liu, Hongfu, Xiong, Hui, Cao, Jie, Chen, Jian
Format:	Article
Language:	English
Subjects:	Clustering algorithms Convex functions Educational institutions Linear programming Partitioning algorithms Robustness Vectors
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c265t-de06aef048e2801c68f55239ac9403ab009f59130ef73693034ad531c72bc3
cites	cdi_FETCH-LOGICAL-c265t-de06aef048e2801c68f55239ac9403ab009f59130ef73693034ad531c72bc3
container_end_page	169
container_issue	1
container_start_page	155
container_title	IEEE transactions on knowledge and data engineering
container_volume	27
creator	Wu, Junjie Liu, Hongfu Xiong, Hui Cao, Jie Chen, Jian
description	The objective of consensus clustering is to find a single partitioning which agrees as much as possible with existing basic partitionings. Consensus clustering emerges as a promising solution to find cluster structures from heterogeneous data. As an efficient approach for consensus clustering, the K-means based method has garnered attention in the literature, however the existing research efforts are still preliminary and fragmented. To that end, in this paper, we provide a systematic study of K-means-based consensus clustering (KCC). Specifically, we first reveal a necessary and sufficient condition for utility functions which work for KCC. This helps to establish a unified framework for KCC on both complete and incomplete data sets. Also, we investigate some important factors, such as the quality and diversity of basic partitionings, which may affect the performances of KCC. Experimental results on various realworld data sets demonstrate that KCC is highly efficient and is comparable to the state-of-the-art methods in terms of clustering quality. In addition, KCC shows high robustness to incomplete basic partitionings with many missing values.
doi_str_mv	10.1109/TKDE.2014.2316512
format	article
fullrecord	<record><control><sourceid>crossref_ieee_</sourceid><recordid>TN_cdi_ieee_primary_6786489</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6786489</ieee_id><sourcerecordid>10_1109_TKDE_2014_2316512</sourcerecordid><originalsourceid>FETCH-LOGICAL-c265t-de06aef048e2801c68f55239ac9403ab009f59130ef73693034ad531c72bc3</originalsourceid><addsrcrecordid>eNo9j9tKAzEURYMoWKsfIL7MD2Q8J7dJfKtjvdCKINXXkGZOZKROZdIi_r0dWnzaG_YFFmOXCCUiuOvF7G5aCkBVColGozhiI9TacoEOj3ceFHIlVXXKznL-BABbWRwxMePPFLrMb0OmpqjXXaYub3NRr7Z5Q33bfdwUk-Kta1O7y99b-jlnJymsMl0cdMxe76eL-pHPXx6e6smcR2H0hjcEJlACZUlYwGhs0lpIF6JTIMMSwCXtUAKlShonQarQaImxEssoxwz3p7Ff59xT8t99-xX6X4_gB2I_EPuB2B-Id5ur_aYlov--qaxR1sk_mTdQDA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>K-Means-Based Consensus Clustering: A Unified View</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Wu, Junjie ; Liu, Hongfu ; Xiong, Hui ; Cao, Jie ; Chen, Jian</creator><creatorcontrib>Wu, Junjie ; Liu, Hongfu ; Xiong, Hui ; Cao, Jie ; Chen, Jian</creatorcontrib><description>The objective of consensus clustering is to find a single partitioning which agrees as much as possible with existing basic partitionings. Consensus clustering emerges as a promising solution to find cluster structures from heterogeneous data. As an efficient approach for consensus clustering, the K-means based method has garnered attention in the literature, however the existing research efforts are still preliminary and fragmented. To that end, in this paper, we provide a systematic study of K-means-based consensus clustering (KCC). Specifically, we first reveal a necessary and sufficient condition for utility functions which work for KCC. This helps to establish a unified framework for KCC on both complete and incomplete data sets. Also, we investigate some important factors, such as the quality and diversity of basic partitionings, which may affect the performances of KCC. Experimental results on various realworld data sets demonstrate that KCC is highly efficient and is comparable to the state-of-the-art methods in terms of clustering quality. In addition, KCC shows high robustness to incomplete basic partitionings with many missing values.</description><identifier>ISSN: 1041-4347</identifier><identifier>EISSN: 1558-2191</identifier><identifier>DOI: 10.1109/TKDE.2014.2316512</identifier><identifier>CODEN: ITKEEH</identifier><language>eng</language><publisher>IEEE</publisher><subject>Clustering algorithms ; Convex functions ; Educational institutions ; Linear programming ; Partitioning algorithms ; Robustness ; Vectors</subject><ispartof>IEEE transactions on knowledge and data engineering, 2015-01, Vol.27 (1), p.155-169</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c265t-de06aef048e2801c68f55239ac9403ab009f59130ef73693034ad531c72bc3</citedby><cites>FETCH-LOGICAL-c265t-de06aef048e2801c68f55239ac9403ab009f59130ef73693034ad531c72bc3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6786489$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,54796</link.rule.ids></links><search><creatorcontrib>Wu, Junjie</creatorcontrib><creatorcontrib>Liu, Hongfu</creatorcontrib><creatorcontrib>Xiong, Hui</creatorcontrib><creatorcontrib>Cao, Jie</creatorcontrib><creatorcontrib>Chen, Jian</creatorcontrib><title>K-Means-Based Consensus Clustering: A Unified View</title><title>IEEE transactions on knowledge and data engineering</title><addtitle>TKDE</addtitle><description>The objective of consensus clustering is to find a single partitioning which agrees as much as possible with existing basic partitionings. Consensus clustering emerges as a promising solution to find cluster structures from heterogeneous data. As an efficient approach for consensus clustering, the K-means based method has garnered attention in the literature, however the existing research efforts are still preliminary and fragmented. To that end, in this paper, we provide a systematic study of K-means-based consensus clustering (KCC). Specifically, we first reveal a necessary and sufficient condition for utility functions which work for KCC. This helps to establish a unified framework for KCC on both complete and incomplete data sets. Also, we investigate some important factors, such as the quality and diversity of basic partitionings, which may affect the performances of KCC. Experimental results on various realworld data sets demonstrate that KCC is highly efficient and is comparable to the state-of-the-art methods in terms of clustering quality. In addition, KCC shows high robustness to incomplete basic partitionings with many missing values.</description><subject>Clustering algorithms</subject><subject>Convex functions</subject><subject>Educational institutions</subject><subject>Linear programming</subject><subject>Partitioning algorithms</subject><subject>Robustness</subject><subject>Vectors</subject><issn>1041-4347</issn><issn>1558-2191</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><recordid>eNo9j9tKAzEURYMoWKsfIL7MD2Q8J7dJfKtjvdCKINXXkGZOZKROZdIi_r0dWnzaG_YFFmOXCCUiuOvF7G5aCkBVColGozhiI9TacoEOj3ceFHIlVXXKznL-BABbWRwxMePPFLrMb0OmpqjXXaYub3NRr7Z5Q33bfdwUk-Kta1O7y99b-jlnJymsMl0cdMxe76eL-pHPXx6e6smcR2H0hjcEJlACZUlYwGhs0lpIF6JTIMMSwCXtUAKlShonQarQaImxEssoxwz3p7Ff59xT8t99-xX6X4_gB2I_EPuB2B-Id5ur_aYlov--qaxR1sk_mTdQDA</recordid><startdate>20150101</startdate><enddate>20150101</enddate><creator>Wu, Junjie</creator><creator>Liu, Hongfu</creator><creator>Xiong, Hui</creator><creator>Cao, Jie</creator><creator>Chen, Jian</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20150101</creationdate><title>K-Means-Based Consensus Clustering: A Unified View</title><author>Wu, Junjie ; Liu, Hongfu ; Xiong, Hui ; Cao, Jie ; Chen, Jian</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c265t-de06aef048e2801c68f55239ac9403ab009f59130ef73693034ad531c72bc3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Clustering algorithms</topic><topic>Convex functions</topic><topic>Educational institutions</topic><topic>Linear programming</topic><topic>Partitioning algorithms</topic><topic>Robustness</topic><topic>Vectors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wu, Junjie</creatorcontrib><creatorcontrib>Liu, Hongfu</creatorcontrib><creatorcontrib>Xiong, Hui</creatorcontrib><creatorcontrib>Cao, Jie</creatorcontrib><creatorcontrib>Chen, Jian</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005–Present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE/IET Electronic Library</collection><collection>CrossRef</collection><jtitle>IEEE transactions on knowledge and data engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wu, Junjie</au><au>Liu, Hongfu</au><au>Xiong, Hui</au><au>Cao, Jie</au><au>Chen, Jian</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>K-Means-Based Consensus Clustering: A Unified View</atitle><jtitle>IEEE transactions on knowledge and data engineering</jtitle><stitle>TKDE</stitle><date>2015-01-01</date><risdate>2015</risdate><volume>27</volume><issue>1</issue><spage>155</spage><epage>169</epage><pages>155-169</pages><issn>1041-4347</issn><eissn>1558-2191</eissn><coden>ITKEEH</coden><abstract>The objective of consensus clustering is to find a single partitioning which agrees as much as possible with existing basic partitionings. Consensus clustering emerges as a promising solution to find cluster structures from heterogeneous data. As an efficient approach for consensus clustering, the K-means based method has garnered attention in the literature, however the existing research efforts are still preliminary and fragmented. To that end, in this paper, we provide a systematic study of K-means-based consensus clustering (KCC). Specifically, we first reveal a necessary and sufficient condition for utility functions which work for KCC. This helps to establish a unified framework for KCC on both complete and incomplete data sets. Also, we investigate some important factors, such as the quality and diversity of basic partitionings, which may affect the performances of KCC. Experimental results on various realworld data sets demonstrate that KCC is highly efficient and is comparable to the state-of-the-art methods in terms of clustering quality. In addition, KCC shows high robustness to incomplete basic partitionings with many missing values.</abstract><pub>IEEE</pub><doi>10.1109/TKDE.2014.2316512</doi><tpages>15</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1041-4347
ispartof	IEEE transactions on knowledge and data engineering, 2015-01, Vol.27 (1), p.155-169
issn	1041-4347 1558-2191
language	eng
recordid	cdi_ieee_primary_6786489
source	IEEE Electronic Library (IEL) Journals
subjects	Clustering algorithms Convex functions Educational institutions Linear programming Partitioning algorithms Robustness Vectors
title	K-Means-Based Consensus Clustering: A Unified View
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T22%3A05%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=K-Means-Based%20Consensus%20Clustering:%20A%20Unified%20View&rft.jtitle=IEEE%20transactions%20on%20knowledge%20and%20data%20engineering&rft.au=Wu,%20Junjie&rft.date=2015-01-01&rft.volume=27&rft.issue=1&rft.spage=155&rft.epage=169&rft.pages=155-169&rft.issn=1041-4347&rft.eissn=1558-2191&rft.coden=ITKEEH&rft_id=info:doi/10.1109/TKDE.2014.2316512&rft_dat=%3Ccrossref_ieee_%3E10_1109_TKDE_2014_2316512%3C/crossref_ieee_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c265t-de06aef048e2801c68f55239ac9403ab009f59130ef73693034ad531c72bc3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6786489&rfr_iscdi=true