
Ensemble-Guided Distillation for Compact and Robust Acoustic Scene Classification on Edge Devices

Published: December 15, 2025 | arXiv ID: 2512.13905v1

By: Hossein Sharify, Behnam Raoufi, Mahdy Ramezani and more

We present a compact, quantization-ready acoustic scene classification (ASC) framework that couples an efficient student network with a learned teacher ensemble and knowledge distillation. The student backbone stacks depthwise-separable "expand-depthwise-project" blocks with global response normalization to stabilize training and improve robustness to device and noise variability, and a global pooling head produces class logits for efficient edge inference. To inject richer inductive bias, we assemble a diverse set of teacher models and learn two complementary fusion heads: z1, which predicts per-teacher mixture weights using a student-style backbone, and z2, a lightweight MLP that performs per-class logit fusion. The student is distilled from the ensemble using temperature-scaled soft targets combined with hard labels, so that a single compact model approximates the ensemble's decision geometry. On the TAU Urban Acoustic Scenes 2022 Mobile benchmark, our approach achieves state-of-the-art (SOTA) results under matched edge-deployment constraints, demonstrating strong performance and practicality for mobile ASC.
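The distillation objective described in the abstract (temperature-scaled soft targets combined with hard labels) can be illustrated with a minimal sketch. The abstract does not give the exact loss, so the details here are assumptions: the weighting factor `alpha`, the temperature value, the `T^2` scaling of the soft term (a common convention in knowledge distillation), and the helper `fuse_teachers`, which stands in for a z1-style head by applying fixed, hand-chosen per-teacher weights to teacher logits instead of weights predicted by a network.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of logits / temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    lse = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - lse for s in scaled]


def fuse_teachers(teacher_logits, mixture_weights):
    """Hypothetical z1-style fusion: one scalar weight per teacher,
    applied to that teacher's logits and summed per class.
    (Assumption: the paper's z1 head predicts these weights; here they are given.)"""
    num_classes = len(teacher_logits[0])
    return [
        sum(w * logits[c] for w, logits in zip(mixture_weights, teacher_logits))
        for c in range(num_classes)
    ]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    """alpha * T^2 * KL(teacher_T || student_T) + (1 - alpha) * CE(student, label).
    alpha, temperature, and the T^2 factor are assumed, not taken from the paper."""
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)
    # Soft-target term: KL divergence between temperature-softened distributions.
    kl = sum(math.exp(lt) * (lt - ls)
             for lt, ls in zip(log_p_teacher, log_p_student))
    # Hard-label term: standard cross-entropy at temperature 1.
    ce = -log_softmax(student_logits)[label]
    return alpha * temperature ** 2 * kl + (1 - alpha) * ce


# Example: two hypothetical teachers, three classes, true class index 0.
ensemble_logits = fuse_teachers([[2.0, 0.5, -1.0], [1.5, 1.0, -0.5]], [0.6, 0.4])
loss = distillation_loss([1.0, 0.2, -0.3], ensemble_logits, label=0)
```

With `alpha=0` the loss reduces to plain cross-entropy on the hard label, and with `alpha=1` it depends only on how closely the student matches the softened ensemble distribution.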

Adaptive Knowledge Distillation using a Device-Aware Teacher for Low-Complexity Acoustic Scene Classification

Sound

Makes computers hear sounds from different devices.

11 Sep 2025 0

90%

Lightweight and Generalizable Acoustic Scene Representations via Contrastive Fine-Tuning and Distillation

Sound

Helps sound machines learn new sounds without retraining.

4 Oct 2025 0

89%

Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification

Sound

Makes small computer programs learn big program skills.

14 Mar 2025 0
