Loading…

Face Alignment in Full Pose Range: A 3D Total Solution

Face alignment, which fits a face model to an image and extracts the semantic meanings of facial pixels, has been an important topic in the computer vision community. However, most algorithms are designed for faces in small to medium poses (yaw angle is smaller than 45 degree), which lack the abilit...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on pattern analysis and machine intelligence 2019-01, Vol.41 (1), p.78-92
Main Authors: Zhu, Xiangyu, Liu, Xiaoming, Lei, Zhen, Li, Stan Z.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c351t-239b77209112936d3f3fbf946b4e9012626890017ab102f3dfa629b8b70990f3
cites cdi_FETCH-LOGICAL-c351t-239b77209112936d3f3fbf946b4e9012626890017ab102f3dfa629b8b70990f3
container_end_page 92
container_issue 1
container_start_page 78
container_title IEEE transactions on pattern analysis and machine intelligence
container_volume 41
creator Zhu, Xiangyu
Liu, Xiaoming
Lei, Zhen
Li, Stan Z.
description Face alignment, which fits a face model to an image and extracts the semantic meanings of facial pixels, has been an important topic in the computer vision community. However, most algorithms are designed for faces in small to medium poses (yaw angle is smaller than 45 degree), which lack the ability to align faces in large poses up to 90 degree. The challenges are three-fold. First, the commonly used landmark face model assumes that all the landmarks are visible and is therefore not suitable for large poses. Second, the face appearance varies more drastically across large poses, from the frontal view to the profile view. Third, labelling landmarks in large poses is extremely challenging since the invisible landmarks have to be guessed. In this paper, we propose to tackle these three challenges in an new alignment framework termed 3D Dense Face Alignment (3DDFA), in which a dense 3D Morphable Model (3DMM) is fitted to the image via Cascaded Convolutional Neural Networks. We also utilize 3D information to synthesize face images in profile views to provide abundant samples for training. Experiments on the challenging AFLW database show that the proposed approach achieves significant improvements over the state-of-the-art methods.
doi_str_mv 10.1109/TPAMI.2017.2778152
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TPAMI_2017_2778152</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8122025</ieee_id><sourcerecordid>2068346122</sourcerecordid><originalsourceid>FETCH-LOGICAL-c351t-239b77209112936d3f3fbf946b4e9012626890017ab102f3dfa629b8b70990f3</originalsourceid><addsrcrecordid>eNpdkEFPg0AQhTdGY2v1D2hiNvHihTozW5Zdb6RabVJjo9w3QJeGhkJl4eC_F2ztwdMc5nsvLx9j1whjRNAP0TJ8m48JMBhTECj06YQNUQvtCV_oUzYElOQpRWrALpzbAODEB3HOBqS1BvDVkMlZnFoeFvm63Nqy4XnJZ21R8GXlLP-Iy7V95CEXTzyqmrjgn1XRNnlVXrKzLC6cvTrcEYtmz9H01Vu8v8yn4cJLhY-NR0InQUCgEUkLuRKZyJJMT2QysRqQJEnVDcEgThAoE6sslqQTlQTQDczEiN3va3d19dVa15ht7lJbFHFpq9YZAqnERCJRh979QzdVW5fdOEPoYw9BT9GeSuvKudpmZlfn27j-Ngiml2p-pZpeqjlI7UK3h-o22drVMfJnsQNu9kBurT2-VTcLyBc_TzN2Lw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2151461202</pqid></control><display><type>article</type><title>Face Alignment in Full Pose Range: A 3D Total Solution</title><source>IEEE Electronic Library (IEL) Journals</source><creator>Zhu, Xiangyu ; Liu, Xiaoming ; Lei, Zhen ; Li, Stan Z.</creator><creatorcontrib>Zhu, Xiangyu ; Liu, Xiaoming ; Lei, Zhen ; Li, Stan Z.</creatorcontrib><description>Face alignment, which fits a face model to an image and extracts the semantic meanings of facial pixels, has been an important topic in the computer vision community. However, most algorithms are designed for faces in small to medium poses (yaw angle is smaller than 45 degree), which lack the ability to align faces in large poses up to 90 degree. The challenges are three-fold. First, the commonly used landmark face model assumes that all the landmarks are visible and is therefore not suitable for large poses. Second, the face appearance varies more drastically across large poses, from the frontal view to the profile view. Third, labelling landmarks in large poses is extremely challenging since the invisible landmarks have to be guessed. In this paper, we propose to tackle these three challenges in an new alignment framework termed 3D Dense Face Alignment (3DDFA), in which a dense 3D Morphable Model (3DMM) is fitted to the image via Cascaded Convolutional Neural Networks. We also utilize 3D information to synthesize face images in profile views to provide abundant samples for training. Experiments on the challenging AFLW database show that the proposed approach achieves significant improvements over the state-of-the-art methods.</description><identifier>ISSN: 0162-8828</identifier><identifier>EISSN: 1939-3539</identifier><identifier>EISSN: 2160-9292</identifier><identifier>DOI: 10.1109/TPAMI.2017.2778152</identifier><identifier>PMID: 29990058</identifier><identifier>CODEN: ITPIDJ</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>3D morphable model ; Alignment ; Artificial neural networks ; cascaded regression ; Computer vision ; convolutional neural network ; Face ; Face alignment ; Landmarks ; Shape ; Solid modeling ; State of the art ; Three dimensional models ; Three-dimensional displays ; Training ; Two dimensional displays ; Yaw</subject><ispartof>IEEE transactions on pattern analysis and machine intelligence, 2019-01, Vol.41 (1), p.78-92</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c351t-239b77209112936d3f3fbf946b4e9012626890017ab102f3dfa629b8b70990f3</citedby><cites>FETCH-LOGICAL-c351t-239b77209112936d3f3fbf946b4e9012626890017ab102f3dfa629b8b70990f3</cites><orcidid>0000-0002-0791-189X ; 0000-0003-2756-401X ; 0000-0003-3215-8753</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8122025$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,54771</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29990058$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zhu, Xiangyu</creatorcontrib><creatorcontrib>Liu, Xiaoming</creatorcontrib><creatorcontrib>Lei, Zhen</creatorcontrib><creatorcontrib>Li, Stan Z.</creatorcontrib><title>Face Alignment in Full Pose Range: A 3D Total Solution</title><title>IEEE transactions on pattern analysis and machine intelligence</title><addtitle>TPAMI</addtitle><addtitle>IEEE Trans Pattern Anal Mach Intell</addtitle><description>Face alignment, which fits a face model to an image and extracts the semantic meanings of facial pixels, has been an important topic in the computer vision community. However, most algorithms are designed for faces in small to medium poses (yaw angle is smaller than 45 degree), which lack the ability to align faces in large poses up to 90 degree. The challenges are three-fold. First, the commonly used landmark face model assumes that all the landmarks are visible and is therefore not suitable for large poses. Second, the face appearance varies more drastically across large poses, from the frontal view to the profile view. Third, labelling landmarks in large poses is extremely challenging since the invisible landmarks have to be guessed. In this paper, we propose to tackle these three challenges in an new alignment framework termed 3D Dense Face Alignment (3DDFA), in which a dense 3D Morphable Model (3DMM) is fitted to the image via Cascaded Convolutional Neural Networks. We also utilize 3D information to synthesize face images in profile views to provide abundant samples for training. Experiments on the challenging AFLW database show that the proposed approach achieves significant improvements over the state-of-the-art methods.</description><subject>3D morphable model</subject><subject>Alignment</subject><subject>Artificial neural networks</subject><subject>cascaded regression</subject><subject>Computer vision</subject><subject>convolutional neural network</subject><subject>Face</subject><subject>Face alignment</subject><subject>Landmarks</subject><subject>Shape</subject><subject>Solid modeling</subject><subject>State of the art</subject><subject>Three dimensional models</subject><subject>Three-dimensional displays</subject><subject>Training</subject><subject>Two dimensional displays</subject><subject>Yaw</subject><issn>0162-8828</issn><issn>1939-3539</issn><issn>2160-9292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNpdkEFPg0AQhTdGY2v1D2hiNvHihTozW5Zdb6RabVJjo9w3QJeGhkJl4eC_F2ztwdMc5nsvLx9j1whjRNAP0TJ8m48JMBhTECj06YQNUQvtCV_oUzYElOQpRWrALpzbAODEB3HOBqS1BvDVkMlZnFoeFvm63Nqy4XnJZ21R8GXlLP-Iy7V95CEXTzyqmrjgn1XRNnlVXrKzLC6cvTrcEYtmz9H01Vu8v8yn4cJLhY-NR0InQUCgEUkLuRKZyJJMT2QysRqQJEnVDcEgThAoE6sslqQTlQTQDczEiN3va3d19dVa15ht7lJbFHFpq9YZAqnERCJRh979QzdVW5fdOEPoYw9BT9GeSuvKudpmZlfn27j-Ngiml2p-pZpeqjlI7UK3h-o22drVMfJnsQNu9kBurT2-VTcLyBc_TzN2Lw</recordid><startdate>20190101</startdate><enddate>20190101</enddate><creator>Zhu, Xiangyu</creator><creator>Liu, Xiaoming</creator><creator>Lei, Zhen</creator><creator>Li, Stan Z.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-0791-189X</orcidid><orcidid>https://orcid.org/0000-0003-2756-401X</orcidid><orcidid>https://orcid.org/0000-0003-3215-8753</orcidid></search><sort><creationdate>20190101</creationdate><title>Face Alignment in Full Pose Range: A 3D Total Solution</title><author>Zhu, Xiangyu ; Liu, Xiaoming ; Lei, Zhen ; Li, Stan Z.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c351t-239b77209112936d3f3fbf946b4e9012626890017ab102f3dfa629b8b70990f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>3D morphable model</topic><topic>Alignment</topic><topic>Artificial neural networks</topic><topic>cascaded regression</topic><topic>Computer vision</topic><topic>convolutional neural network</topic><topic>Face</topic><topic>Face alignment</topic><topic>Landmarks</topic><topic>Shape</topic><topic>Solid modeling</topic><topic>State of the art</topic><topic>Three dimensional models</topic><topic>Three-dimensional displays</topic><topic>Training</topic><topic>Two dimensional displays</topic><topic>Yaw</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhu, Xiangyu</creatorcontrib><creatorcontrib>Liu, Xiaoming</creatorcontrib><creatorcontrib>Lei, Zhen</creatorcontrib><creatorcontrib>Li, Stan Z.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE/IET Electronic Library</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on pattern analysis and machine intelligence</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhu, Xiangyu</au><au>Liu, Xiaoming</au><au>Lei, Zhen</au><au>Li, Stan Z.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Face Alignment in Full Pose Range: A 3D Total Solution</atitle><jtitle>IEEE transactions on pattern analysis and machine intelligence</jtitle><stitle>TPAMI</stitle><addtitle>IEEE Trans Pattern Anal Mach Intell</addtitle><date>2019-01-01</date><risdate>2019</risdate><volume>41</volume><issue>1</issue><spage>78</spage><epage>92</epage><pages>78-92</pages><issn>0162-8828</issn><eissn>1939-3539</eissn><eissn>2160-9292</eissn><coden>ITPIDJ</coden><abstract>Face alignment, which fits a face model to an image and extracts the semantic meanings of facial pixels, has been an important topic in the computer vision community. However, most algorithms are designed for faces in small to medium poses (yaw angle is smaller than 45 degree), which lack the ability to align faces in large poses up to 90 degree. The challenges are three-fold. First, the commonly used landmark face model assumes that all the landmarks are visible and is therefore not suitable for large poses. Second, the face appearance varies more drastically across large poses, from the frontal view to the profile view. Third, labelling landmarks in large poses is extremely challenging since the invisible landmarks have to be guessed. In this paper, we propose to tackle these three challenges in an new alignment framework termed 3D Dense Face Alignment (3DDFA), in which a dense 3D Morphable Model (3DMM) is fitted to the image via Cascaded Convolutional Neural Networks. We also utilize 3D information to synthesize face images in profile views to provide abundant samples for training. Experiments on the challenging AFLW database show that the proposed approach achieves significant improvements over the state-of-the-art methods.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>29990058</pmid><doi>10.1109/TPAMI.2017.2778152</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0002-0791-189X</orcidid><orcidid>https://orcid.org/0000-0003-2756-401X</orcidid><orcidid>https://orcid.org/0000-0003-3215-8753</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0162-8828
ispartof IEEE transactions on pattern analysis and machine intelligence, 2019-01, Vol.41 (1), p.78-92
issn 0162-8828
1939-3539
2160-9292
language eng
recordid cdi_crossref_primary_10_1109_TPAMI_2017_2778152
source IEEE Electronic Library (IEL) Journals
subjects 3D morphable model
Alignment
Artificial neural networks
cascaded regression
Computer vision
convolutional neural network
Face
Face alignment
Landmarks
Shape
Solid modeling
State of the art
Three dimensional models
Three-dimensional displays
Training
Two dimensional displays
Yaw
title Face Alignment in Full Pose Range: A 3D Total Solution
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T21%3A58%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Face%20Alignment%20in%20Full%20Pose%20Range:%20A%203D%20Total%20Solution&rft.jtitle=IEEE%20transactions%20on%20pattern%20analysis%20and%20machine%20intelligence&rft.au=Zhu,%20Xiangyu&rft.date=2019-01-01&rft.volume=41&rft.issue=1&rft.spage=78&rft.epage=92&rft.pages=78-92&rft.issn=0162-8828&rft.eissn=1939-3539&rft.coden=ITPIDJ&rft_id=info:doi/10.1109/TPAMI.2017.2778152&rft_dat=%3Cproquest_cross%3E2068346122%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c351t-239b77209112936d3f3fbf946b4e9012626890017ab102f3dfa629b8b70990f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2151461202&rft_id=info:pmid/29990058&rft_ieee_id=8122025&rfr_iscdi=true