Score: 0

C-DIRA: Computationally Efficient Dynamic ROI Routing and Domain-Invariant Adversarial Learning for Lightweight Driver Behavior Recognition

Published: December 9, 2025 | arXiv ID: 2512.08647v1

By: Keito Inoshita

Driver distraction behavior recognition using in-vehicle cameras demands real-time inference on edge devices. However, lightweight models often fail to capture fine-grained behavioral cues, resulting in reduced performance on unseen drivers or under varying conditions. ROI-based methods also increase computational cost, making it difficult to balance efficiency and accuracy. This work addresses the need for a lightweight architecture that overcomes these constraints. We propose Computationally efficient Dynamic region of Interest Routing and domain-invariant Adversarial learning for lightweight driver behavior recognition (C-DIRA). The framework combines saliency-driven Top-K ROI pooling and fused classification for local feature extraction and integration. Dynamic ROI routing enables selective computation by applying ROI inference only to high difficulty data samples. Moreover, pseudo-domain labeling and adversarial learning are used to learn domain-invariant features robust to driver and background variation. Experiments on the State Farm Distracted Driver Detection Dataset show that C-DIRA maintains high accuracy with significantly fewer FLOPs and lower latency than prior lightweight models. It also demonstrates robustness under visual degradation such as blur and low-light, and stable performance across unseen domains. These results confirm C-DIRA's effectiveness in achieving compactness, efficiency, and generalization.

Cross-View Cross-Modal Unsupervised Domain Adaptation for Driver Monitoring System

CV and Pattern Recognition

Helps cars see if drivers are looking away.

15 Nov 2025 1

87%

Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL

Robotics

Teaches self-driving cars to drive in bad weather.

16 Nov 2025 1

87%

VISTA: Vision-Language Imitation of Situational Thinking and Attention for Human-Like Driver Focus in Dynamic Environments

CV and Pattern Recognition

Predicts where drivers look using words.

7 Aug 2025 0

View PDF Login to Bookmark

C-DIRA: Computationally Efficient Dynamic ROI Routing and Domain-Invariant Adversarial Learning for Lightweight Driver Behavior Recognition

Technical Abstract

Cross-View Cross-Modal Unsupervised Domain Adaptation for Driver Monitoring System

Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL

VISTA: Vision-Language Imitation of Situational Thinking and Attention for Human-Like Driver Focus in Dynamic Environments