Loading…

Cumulative attribute space regression for head pose estimation and color constancy

•We show that joint prediction of multiple targets can exploit the target correlation structure and lead into improved accuracy.•We propose an approximate method for cases with more than two outputs. This prevents the combinatorial explosion.•We study the performance for two multi-target regression...

Full description

Saved in:
Bibliographic Details
Published in:Pattern recognition 2019-03, Vol.87, p.29-37
Main Authors: Chen, Ke, Jia, Kui, Huttunen, Heikki, Matas, Jiri, Kämäräinen, Joni-Kristian
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•We show that joint prediction of multiple targets can exploit the target correlation structure and lead into improved accuracy.•We propose an approximate method for cases with more than two outputs. This prevents the combinatorial explosion.•We study the performance for two multi-target regression cases: (a) 2D face pose estimation, (b) 3D face pose estimation and (c) three-output (RGB) illuminant estimation for color constancy. Two-stage Cumulative Attribute (CA) regression has been found effective in regression problems of computer vision such as facial age and crowd density estimation. The first stage regression maps input features to cumulative attributes that encode correlations between target values. The previous works have dealt with single output regression. In this work, we propose cumulative attribute spaces for 2- and 3-output (multivariate) regression. We show how the original CA space can be generalized to multiple output by the Cartesian product (CartCA). However, for target spaces with more than two outputs the CartCA becomes computationally infeasible and therefore we propose an approximate solution - multi-view CA (MvCA) - where CartCA is applied to output pairs. We experimentally verify improved performance of the CartCA and MvCA spaces in 2D and 3D face pose estimation and three-output (RGB) illuminant estimation for color constancy.
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2018.10.015