Loading…

Adaptive facial point detection and emotion recognition for a humanoid robot

•We propose a robust landmark detector to deal with pose variation and occlusions.•SVRs and NNs are respectively used to estimate intensities of 18 selected AUs.•Fuzzy c-means clustering is employed to detect seven basic and compound emotions.•Our unsupervised facial point detector outperforms other...

Full description

Saved in:
Bibliographic Details
Published in:Computer vision and image understanding 2015-11, Vol.140, p.93-114
Main Authors: Zhang, Li, Mistry, Kamlesh, Jiang, Ming, Chin Neoh, Siew, Hossain, Mohammed Alamgir
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•We propose a robust landmark detector to deal with pose variation and occlusions.•SVRs and NNs are respectively used to estimate intensities of 18 selected AUs.•Fuzzy c-means clustering is employed to detect seven basic and compound emotions.•Our unsupervised facial point detector outperforms other supervised models.•The overall development is integrated with a modern humanoid robot platform. Automatic perception of facial expressions with scaling differences, pose variations and occlusions would greatly enhance natural human robot interaction. This research proposes unsupervised automatic facial point detection integrated with regression-based intensity estimation for facial action units (AUs) and emotion clustering to deal with such challenges. The proposed facial point detector is able to detect 54 facial points in images of faces with occlusions, pose variations and scaling differences using Gabor filtering, BRISK (Binary Robust Invariant Scalable Keypoints), an Iterative Closest Point (ICP) algorithm and fuzzy c-means (FCM) clustering. Especially, in order to effectively deal with images with occlusions, ICP is first applied to generate neutral landmarks for the occluded facial elements. Then FCM is used to further reason the shape of the occluded facial region by taking the prior knowledge of the non-occluded facial elements into account. Post landmark correlation processing is subsequently applied to derive the best fitting geometry for the occluded facial element to further adjust the neutral landmarks generated by ICP and reconstruct the occluded facial region. We then conduct AU intensity estimation respectively using support vector regression and neural networks for 18 selected AUs. FCM is also subsequently employed to recognize seven basic emotions as well as neutral expressions. It also shows great potential to deal with compound and newly arrived novel emotion class detection. The overall system is integrated with a humanoid robot and enables it to deal with challenging real-life facial emotion recognition tasks.
ISSN:1077-3142
1090-235X
DOI:10.1016/j.cviu.2015.07.007