BTW: A Non-Parametric Variance Stabilization Framework for Multimodal Model Integration
By: Jun Hou, Le Wang, Xuan Wang
Potential Business Impact:
Helps AI decide how much to trust each type of input, so noisy extra data doesn't hurt learning.
Mixture-of-Experts (MoE) models have become increasingly powerful in multimodal learning by enabling modular specialization across modalities. However, their effectiveness remains unclear when additional modalities introduce more noise than complementary information. Existing approaches, such as Partial Information Decomposition, struggle to scale beyond two modalities and lack the resolution needed for instance-level control. We propose Beyond Two-modality Weighting (BTW), a bi-level, non-parametric weighting framework that combines instance-level Kullback-Leibler (KL) divergence and modality-level mutual information (MI) to dynamically adjust modality importance during training. Our method requires no additional parameters and can be applied to an arbitrary number of modalities. Specifically, BTW computes per-example KL weights by measuring the divergence between each unimodal prediction and the current multimodal prediction, and modality-wide MI weights by estimating global alignment between unimodal and multimodal outputs. Extensive experiments on sentiment regression and clinical classification demonstrate that our method significantly improves regression performance and multiclass classification accuracy.
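Since the abstract only names the two weighting ingredients, the sketch below illustrates one plausible instantiation in Python: a per-example KL divergence between a unimodal predictive distribution and the current multimodal one, and a plug-in mutual-information estimate at the modality level. The function names, the exp(-KL) mapping from divergence to weight, the batch normalization, and the histogram-based MI estimator are illustrative assumptions, not the paper's exact formulation.

```python
# Minimal sketch of the bi-level weighting described in the abstract.
# Only the KL/MI ingredients come from the paper; everything else
# (weight mapping, normalization, MI estimator) is an assumption.
import numpy as np
from scipy.special import rel_entr  # elementwise p * log(p / q)

def instance_kl_weights(unimodal_probs, multimodal_probs, eps=1e-8):
    """Per-example KL divergence between one modality's predictions and
    the current multimodal predictions. Shapes: (batch, num_classes)."""
    p = np.clip(unimodal_probs, eps, 1.0)
    q = np.clip(multimodal_probs, eps, 1.0)
    kl = rel_entr(p, q).sum(axis=1)  # KL(p || q) per example, shape (batch,)
    # Assumption: lower divergence -> the modality agrees with the fused
    # prediction -> larger weight; weights normalized over the batch.
    w = np.exp(-kl)
    return w / w.sum()

def modality_mi_weight(unimodal_labels, multimodal_labels, num_classes):
    """Modality-level weight: plug-in mutual information between unimodal
    and multimodal hard predictions, from their joint label histogram."""
    joint = np.zeros((num_classes, num_classes))
    for u, m in zip(unimodal_labels, multimodal_labels):
        joint[u, m] += 1
    joint /= joint.sum()
    pu, pm = joint.sum(axis=1), joint.sum(axis=0)  # marginals
    nz = joint > 0  # skip empty cells to avoid log(0)
    return float((joint[nz] * np.log(joint[nz] / np.outer(pu, pm)[nz])).sum())

if __name__ == "__main__":
    # Toy usage: one modality's predictions vs. the fused predictions
    # over 3 classes for a batch of 8 examples (random for illustration).
    rng = np.random.default_rng(0)
    uni = rng.dirichlet(np.ones(3), size=8)
    fused = rng.dirichlet(np.ones(3), size=8)
    print(instance_kl_weights(uni, fused))
    print(modality_mi_weight(uni.argmax(1), fused.argmax(1), num_classes=3))
```

In a training loop these two quantities would presumably be combined, e.g. scaling each modality's loss term by its MI weight and each example's contribution by its KL weight; how BTW fuses the two levels is not specified in the abstract.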
Similar Papers
Quantifying Multimodal Imbalance: A GMM-Guided Adaptive Loss for Audio-Visual Learning
Machine Learning (CS)
Helps computers balance audio and visual signals when one is noisier than the other.
Probabilistic combination forecasts based on particle filtering: predictive prior
Methodology
Improves forecasts by combining multiple models with particle filtering.