A dynamic memory assignment strategy for dilation-based ICP algorithm on embedded GPUs
By: Qiong Chang , Weimin Wang , Junpei Zhong and more
Potential Business Impact:
Makes big computer vision tasks run on small devices.
This paper proposes a memory-efficient optimization strategy for the high-performance point cloud registration algorithm VANICP, enabling lightweight execution on embedded GPUs with constrained hardware resources. VANICP is a recently published acceleration framework that significantly improves the computational efficiency of point-cloud-based applications. By transforming the global nearest neighbor search into a localized process through a dilation-based information propagation mechanism, VANICP greatly reduces the computational complexity of the NNS. However, its original implementation demands a considerable amount of memory, which restricts its deployment in resource-constrained environments such as embedded systems. To address this issue, we propose a GPU-oriented dynamic memory assignment strategy that optimizes the memory usage of the dilation operation. Furthermore, based on this strategy, we construct an enhanced version of the VANICP framework that achieves over 97% reduction in memory consumption while preserving the original performance. Source code is published on: https://github.com/changqiong/VANICP4Em.git.
Similar Papers
Dynamic-ICP: Doppler-Aware Iterative Closest Point Registration for Dynamic Scenes
Robotics
Helps self-driving cars see moving things better.
PointCNN++: Performant Convolution on Native Points
CV and Pattern Recognition
Makes 3D shape understanding faster and more accurate.
Gaussian Primitive Optimized Deformable Retinal Image Registration
CV and Pattern Recognition
Makes eye scans match perfectly for better health.