Aerial View River Landform Video segmentation: A Weakly Supervised Context-aware Temporal Consistency Distillation Approach
By: Chi-Han Chen , Chieh-Ming Chen , Wen-Huang Cheng and more
Potential Business Impact:
Teaches drones to see rivers better with less data.
The study of terrain and landform classification through UAV remote sensing diverges significantly from ground vehicle patrol tasks. Besides grappling with the complexity of data annotation and ensuring temporal consistency, it also confronts the scarcity of relevant data and the limitations imposed by the effective range of many technologies. This research substantiates that, in aerial positioning tasks, both the mean Intersection over Union (mIoU) and temporal consistency (TC) metrics are of paramount importance. It is demonstrated that fully labeled data is not the optimal choice, as selecting only key data lacks the enhancement in TC, leading to failures. Hence, a teacher-student architecture, coupled with key frame selection and key frame updating algorithms, is proposed. This framework successfully performs weakly supervised learning and TC knowledge distillation, overcoming the deficiencies of traditional TC training in aerial tasks. The experimental results reveal that our method utilizing merely 30\% of labeled data, concurrently elevates mIoU and temporal consistency ensuring stable localization of terrain objects. Result demo : https://gitlab.com/prophet.ai.inc/drone-based-riverbed-inspection
Similar Papers
Efficient On-Board Processing of Oblique UAV Video for Rapid Flood Extent Mapping
CV and Pattern Recognition
Makes drones understand disaster scenes faster.
Remote Sensing Change Detection via Weak Temporal Supervision
CV and Pattern Recognition
Find changes in Earth pictures without new labels.
Temporally Consistent Unsupervised Segmentation for Mobile Robot Perception
CV and Pattern Recognition
Helps robots see and understand new places.