Loading…

Localized algorithms for multiple kernel learning

Instead of selecting a single kernel, multiple kernel learning (MKL) uses a weighted sum of kernels where the weight of each kernel is optimized during training. Such methods assign the same weight to a kernel over the whole input space, and we discuss localized multiple kernel learning (LMKL) that...

Full description

Saved in:
Bibliographic Details
Published in:Pattern recognition 2013-03, Vol.46 (3), p.795-807
Main Authors: Goenen, Mehmet, Alpaydin, Ethem
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Instead of selecting a single kernel, multiple kernel learning (MKL) uses a weighted sum of kernels where the weight of each kernel is optimized during training. Such methods assign the same weight to a kernel over the whole input space, and we discuss localized multiple kernel learning (LMKL) that is composed of a kernel-based learning algorithm and a parametric gating model to assign local weights to kernel functions. These two components are trained in a coupled manner using a two-step alternating optimization algorithm. Empirical results on benchmark classification and regression data sets validate the applicability of our approach. We see that LMKL achieves higher accuracy compared with canonical MKL on classification problems with different feature representations. LMKL can also identify the relevant parts of images using the gating model as a saliency detector in image recognition problems. In regression tasks, LMKL improves the performance significantly or reduces the model complexity by storing significantly fewer support vectors. ► Introduces a localized multiple kernel learning framework for kernel-based algorithms. ► Generalizes the model for different gating models, kernel functions, and applications. ► Reports the results of extensive simulations on multiple real-world data sets. ► Identifies the relevant parts of images acting as a saliency detector. ► Has inherent regularization to avoid overfitting using required number of kernels.
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2012.09.002