Multimodal Trajectory Representation Learning for Travel Time Estimation
By: Zhi Liu , Xuyuan Hu , Xiao Han and more
Potential Business Impact:
Predicts travel times more accurately for cars.
Accurate travel time estimation (TTE) plays a crucial role in intelligent transportation systems. However, it remains challenging due to heterogeneous data sources and complex traffic dynamics. Moreover, conventional approaches typically convert trajectories into fixed-length representations, neglecting the inherent variability of real-world trajectories, which often leads to information loss or feature redundancy. To address these challenges, this paper introduces the Multimodal Dynamic Trajectory Integration (MDTI) framework--a novel multimodal trajectory representation learning approach that integrates GPS sequences, grid trajectories, and road network constraints to enhance TTE accuracy. MDTI employs modality-specific encoders and a cross-modal interaction module to capture complementary spatial, temporal, and topological semantics, while a dynamic trajectory modeling mechanism adaptively regulates information density for trajectories of varying lengths. Two self-supervised pretraining objectives, named contrastive alignment and masked language modeling, further strengthen multimodal consistency and contextual understanding. Extensive experiments on three real-world datasets demonstrate that MDTI consistently outperforms state-of-the-art baselines, confirming its robustness and strong generalization abilities. The code is publicly available at: https://github.com/freshhxy/MDTI/
Similar Papers
MixTTE: Multi-Level Mixture-of-Experts for Scalable and Adaptive Travel Time Estimation
Machine Learning (CS)
Makes ride-hailing apps predict travel time better.
Bridging Traffic State and Trajectory for Dynamic Road Network and Trajectory Representation Learning
Machine Learning (CS)
Helps traffic lights predict jams better.
Towards An Efficient and Effective En Route Travel Time Estimation Framework
Artificial Intelligence
Predicts travel time faster, only when needed.