Author
Listed:
- Zhang Shanshan
- Sha Yanlin
- Loy Chee Luen
Abstract
Emotion recognition faces significant challenges in complex real-world environments, particularly under facial occlusion conditions that severely impact traditional deep learning approaches. This research proposes ChildEmoNet, a novel cascaded emotion recognition framework that strategically integrates Detection Transformer (DETR) for robust multi-person detection with ResNet50 for discriminative feature extraction. The primary contributions include the development of a cascaded DETR-ResNet50 architecture that addresses both detection and classification challenges simultaneously, enhanced robustness mechanisms specifically designed for facial occlusion scenarios, and comprehensive evaluation across both categorical and dimensional emotion recognition tasks. Extensive experiments on the OMG Emotion Dataset demonstrate the effectiveness of this integration: the proposed model achieves an AUC of 0.93 in standard emotion classification tasks, maintains 79% recognition accuracy under 30% facial occlusion conditions, and attains concordance correlation coefficients (CCC) of 0.52 and 0.46 for valence and arousal prediction, respectively. The experimental validation confirms the crucial role of the DETR module in processing multi-person scenarios and the effectiveness of ResNet50 in feature extraction, demonstrating superior performance across complex environmental conditions including varying lighting, face orientations, and partial occlusions. Compared with traditional methods, this cascaded architecture shows remarkable robustness under challenging real-world conditions. This research advances emotion computing technology by providing a robust solution for emotion recognition applications in complex environments where conventional approaches exhibit significant performance degradation.
Suggested Citation
Zhang Shanshan & Sha Yanlin & Loy Chee Luen, 2025.
"Robust emotion recognition for complex environments: ChildEmoNet model based on DETR-ResNet50 cascaded architecture,"
PLOS ONE, Public Library of Science, vol. 20(9), pages 1-32, September.
Handle:
RePEc:plo:pone00:0332130
DOI: 10.1371/journal.pone.0332130
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0332130. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.