Loading…

On a Zero-Finding Problem Involving the Matrix Exponential

An important step in the solution of a matrix nearness problem that arises in certain machine learning applications is finding the zero of $f(\alpha) = \bm{z}^T \exp(\log X + \alpha\bm{z}\bm{z}^T)\bm{z} - b$. The matrix valued exponential and logarithm in $f(\alpha)$ arises from the use of the von N...

Full description

Saved in:

Bibliographic Details
Published in:	SIAM journal on matrix analysis and applications 2012-01, Vol.33 (4), p.1237-1249
Main Authors:	Sustik, Mátyás A., Dhillon, Inderjit S.
Format:	Article
Language:	English
Subjects:	Algorithms Machine learning
Citations:	Items that this one cites
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	An important step in the solution of a matrix nearness problem that arises in certain machine learning applications is finding the zero of $f(\alpha) = \bm{z}^T \exp(\log X + \alpha\bm{z}\bm{z}^T)\bm{z} - b$. The matrix valued exponential and logarithm in $f(\alpha)$ arises from the use of the von Neumann matrix divergence $\operatorname{tr}(X \log X - X \log Y - X + Y)$ to measure the nearness between the positive definite matrices $X$ and $Y$. A key step of an iterative algorithm used to solve the underlying matrix nearness problem requires the zero of $f(\alpha)$ to be repeatedly computed. In this paper we propose zero-finding algorithms that gain their advantage by exploiting the special structure of the objective function. We show how to efficiently compute the derivative of $f$, thereby allowing the use of Newton-type methods. In numerical experiments we establish the advantage of our algorithms. [PUBLICATION ABSTRACT]
ISSN:	0895-4798 1095-7162
DOI:	10.1137/11082498X