Loading…

A method for searching for a globally optimal k-partition of higher-dimensional datasets

The problem of finding a globally optimal k -partition of a set  A is a very intricate optimization problem for which in general, except in the case of one-dimensional data, i.e., for data with one feature ( A ⊂ R ), there is no method to solve. Only in the one-dimensional case, there are efficient...

Full description

Saved in:
Bibliographic Details
Published in:Journal of global optimization 2024-07, Vol.89 (3), p.633-653
Main Authors: Sabo, Kristian, Scitovski, Rudolf, Ungar, Šime, Tomljanović, Zoran
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The problem of finding a globally optimal k -partition of a set  A is a very intricate optimization problem for which in general, except in the case of one-dimensional data, i.e., for data with one feature ( A ⊂ R ), there is no method to solve. Only in the one-dimensional case, there are efficient methods based on the fact that the search for a globally optimal k -partition is equivalent to solving a global optimization problem for a symmetric Lipschitz-continuous function using the global optimization algorithm DIRECT. In the present paper, we propose a method for finding a globally optimal k -partition in the general case ( A ⊂ R n , n ≥ 1 ), generalizing an idea for solving the Lipschitz global optimization for symmetric functions. To do this, we propose a method that combines a global optimization algorithm with linear constraints and the k -means algorithm. The first of these two algorithms is used only to find a good initial approximation for the k -means algorithm. The method was tested on a number of artificial datasets and on several examples from the UCI Machine Learning Repository, and an application in spectral clustering for linearly non-separable datasets is also demonstrated. Our proposed method proved to be very efficient.
ISSN:0925-5001
1573-2916
DOI:10.1007/s10898-024-01372-6