← Seminars

Emotion Intensity in Multimodal Dialogue Datasets

Yutong Hu

Abstract

This presentation explores emotion intensity analysis in multimodal dialogue systems, moving beyond categorical emotion detection to quantify emotional strength. A multimodal approach combining text and audio is explored because emotional intensity is conveyed through both linguistic content and acoustic features. A survey of around 20 dialogue datasets reveals that only IEMOCAP and MEISD simultaneously provide multimodal data, emotion intensity annotations, and dialogue structure. The primary focus is IEMOCAP, containing 151 dyadic dialogues with 10,086 utterances annotated for Valence-Arousal-Dominance (VAD) dimensions. Two research directions are proposed: detecting emotion intensity for current utterances given dialogue context, or predicting emotional intensity for future responses. The long-term goal is enabling emotion intensity-conditioned speech generation for more naturalistic voice agents.

Term
Fall 2025
Date
October 10, 2025
Time
3:00 - 4:00 PM
Location
White Hall 100