Abstract
Recent advances in big data and artificial intelligence have led to active research in emotion recognition based on multimodal transformer models. Although these multimodal transformer models demonstrate high performance, their applications into real-time services are challenging due to their heavy computational requirements. Therefore, this study proposes a Multi-Still method, which transfers the multimodal knowledge of a teacher model to a student model using the knowledge distillation method supporting edge to cloud continuum environment. Multi-Still trained by text and voice data from Korean multimodal emotional datasets (KEMDy19, KEMDy20) both teacher and student. As a result, the student model transferred knowledge from the teacher model showed a 21% increase in number of inferences per second compared to the teacher model, 70.31% reduction in network size, and 65% reduction in the number of parameters. Nevertheless, it shows similar accuracy to the teacher model. We provide real-time emotion recognition services for the lightweight resources in edge continuum by efficiently learning multimodal data through knowledge distillation.
Original language | English |
---|---|
Title of host publication | Proceedings - 16th International Conference on Advanced Technologies for Communications, ATC 2023 |
Editors | Tran The Son |
Publisher | IEEE Computer Society |
Pages | 296-300 |
Number of pages | 5 |
ISBN (Electronic) | 9798350301328 |
DOIs | |
Publication status | Published - 2023 |
Event | 16th International Conference on Advanced Technologies for Communications, ATC 2023 - Da Nang, Viet Nam Duration: 19 Oct 2023 → 21 Oct 2023 |
Publication series
Name | International Conference on Advanced Technologies for Communications |
---|---|
ISSN (Print) | 2162-1039 |
ISSN (Electronic) | 2162-1020 |
Conference
Conference | 16th International Conference on Advanced Technologies for Communications, ATC 2023 |
---|---|
Country/Territory | Viet Nam |
City | Da Nang |
Period | 19/10/23 → 21/10/23 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
Keywords
- emotion recognition
- knowledge distillation
- lightweight model
- multimodal