Action unit intensity regression for facial MoCap aimed towards digital humans.

Saved in:
Bibliographic Details
Title: Action unit intensity regression for facial MoCap aimed towards digital humans.
Authors: Vilchis, Carlos1,2 (AUTHOR) carlos.vilchis@tec.mx, Mendez-Ruiz, Mauricio2 (AUTHOR) mauricio@eugenia.tech, Perez-Guerrero, Carmina2 (AUTHOR) carmina@eugenia.tech, Gonzalez-Mendoza, Miguel1 (AUTHOR) mgonza@tec.mx
Source: Multimedia Tools & Applications. Apr2025, Vol. 84 Issue 13, p11775-11794. 20p.
Subjects: Motion capture (Cinematography), Artificial intelligence, Image processing, Support vector machines, Machine learning, Avatars (Virtual reality)
Abstract: Due to the increasing demand for virtual avatars, there has been a recent growth in the research and development of frameworks for realistic digital humans, which create a demand for realistic and adaptable facial motion capture systems. Most frameworks belong to private companies or represent high investments, which is why the creation of democratized solutions is relevant for the growth of digital human content creation. This research work proposes a facial motion capture framework for digital humans with the use of machine learning for facial codification intensity regression. The main focus is to use coded face movement intensities to generate realistic expressions on a digital human. The ablation studies performed on the regression models show that Neural Networks, using Histogram of Oriented Gradients as features, and with person-specific normalization, present overall better performance against other methods in the literature. With an RMSE of 0.052, the proposed framework offers reliable results that can be rendered in the face of a MetaHuman. [ABSTRACT FROM AUTHOR]
Copyright of Multimedia Tools & Applications is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
Full text is not displayed to guests.
Description
Abstract:Due to the increasing demand for virtual avatars, there has been a recent growth in the research and development of frameworks for realistic digital humans, which create a demand for realistic and adaptable facial motion capture systems. Most frameworks belong to private companies or represent high investments, which is why the creation of democratized solutions is relevant for the growth of digital human content creation. This research work proposes a facial motion capture framework for digital humans with the use of machine learning for facial codification intensity regression. The main focus is to use coded face movement intensities to generate realistic expressions on a digital human. The ablation studies performed on the regression models show that Neural Networks, using Histogram of Oriented Gradients as features, and with person-specific normalization, present overall better performance against other methods in the literature. With an RMSE of 0.052, the proposed framework offers reliable results that can be rendered in the face of a MetaHuman. [ABSTRACT FROM AUTHOR]
ISSN:13807501
DOI:10.1007/s11042-024-19400-8