Multi-Still: A lightweight Multi-modal Cross Attention Knowledge Distillation method for the Real-Time Emotion Recognition Service in Edge-to-Cloud Continuum

Hyeon Ki Jo, Yuri Seo, Choong Seon Hong, Eui Nam Huh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recent advances in big data and artificial intelligence have led to active research in emotion recognition based on multimodal transformer models. Although these multimodal transformer models demonstrate high performance, their applications into real-time services are challenging due to their heavy computational requirements. Therefore, this study proposes a Multi-Still method, which transfers the multimodal knowledge of a teacher model to a student model using the knowledge distillation method supporting edge to cloud continuum environment. Multi-Still trained by text and voice data from Korean multimodal emotional datasets (KEMDy19, KEMDy20) both teacher and student. As a result, the student model transferred knowledge from the teacher model showed a 21% increase in number of inferences per second compared to the teacher model, 70.31% reduction in network size, and 65% reduction in the number of parameters. Nevertheless, it shows similar accuracy to the teacher model. We provide real-time emotion recognition services for the lightweight resources in edge continuum by efficiently learning multimodal data through knowledge distillation.

Original languageEnglish
Title of host publicationProceedings - 16th International Conference on Advanced Technologies for Communications, ATC 2023
EditorsTran The Son
PublisherIEEE Computer Society
Pages296-300
Number of pages5
ISBN (Electronic)9798350301328
DOIs
Publication statusPublished - 2023
Event16th International Conference on Advanced Technologies for Communications, ATC 2023 - Da Nang, Viet Nam
Duration: 19 Oct 202321 Oct 2023

Publication series

NameInternational Conference on Advanced Technologies for Communications
ISSN (Print)2162-1039
ISSN (Electronic)2162-1020

Conference

Conference16th International Conference on Advanced Technologies for Communications, ATC 2023
Country/TerritoryViet Nam
CityDa Nang
Period19/10/2321/10/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • emotion recognition
  • knowledge distillation
  • lightweight model
  • multimodal

Fingerprint

Dive into the research topics of 'Multi-Still: A lightweight Multi-modal Cross Attention Knowledge Distillation method for the Real-Time Emotion Recognition Service in Edge-to-Cloud Continuum'. Together they form a unique fingerprint.

Cite this