Revealing Temporal Label Noise in Multimodal Hateful Video Classification
By: Shuonan Yang , Tailin Chen , Rahul Singh and more
Potential Business Impact:
Finds hate speech hidden inside videos.
The rapid proliferation of online multimedia content has intensified the spread of hate speech, presenting critical societal and regulatory challenges. While recent work has advanced multimodal hateful video detection, most approaches rely on coarse, video-level annotations that overlook the temporal granularity of hateful content. This introduces substantial label noise, as videos annotated as hateful often contain long non-hateful segments. In this paper, we investigate the impact of such label ambiguity through a fine-grained approach. Specifically, we trim hateful videos from the HateMM and MultiHateClip English datasets using annotated timestamps to isolate explicitly hateful segments. We then conduct an exploratory analysis of these trimmed segments to examine the distribution and characteristics of both hateful and non-hateful content. This analysis highlights the degree of semantic overlap and the confusion introduced by coarse, video-level annotations. Finally, controlled experiments demonstrated that time-stamp noise fundamentally alters model decision boundaries and weakens classification confidence, highlighting the inherent context dependency and temporal continuity of hate speech expression. Our findings provide new insights into the temporal dynamics of multimodal hateful videos and highlight the need for temporally aware models and benchmarks for improved robustness and interpretability. Code and data are available at https://github.com/Multimodal-Intelligence-Lab-MIL/HatefulVideoLabelNoise.
Similar Papers
MultiHateLoc: Towards Temporal Localisation of Multimodal Hate Content in Online Videos
CV and Pattern Recognition
Finds hate speech hidden in videos.
HateClipSeg: A Segment-Level Annotated Dataset for Fine-Grained Hate Video Detection
CV and Pattern Recognition
Finds mean words in videos faster.
Multimodal Hate Detection Using Dual-Stream Graph Neural Networks
CV and Pattern Recognition
Finds hate in videos by focusing on bad parts.