Score: 0

GeoPE:A Unified Geometric Positional Embedding for Structured Tensors

Published: December 4, 2025 | arXiv ID: 2512.04963v1

By: Yupu Yao, Bowen Yang

Potential Business Impact:

Helps computers see shapes better in pictures.

Business Areas:

Indoor Positioning Navigation and Mapping

Standard Vision Transformers flatten 2D images into 1D sequences, disrupting the natural spatial topology. While Rotary Positional Embedding (RoPE) excels in 1D, it inherits this limitation, often treating spatially distant patches (e.g., at row edges) as sequence neighbors. Existing 2D approaches typically treat spatial axes independently, failing to decouple this false sequential proximity from true spatial distance. To restore the 2D spatial manifold, we introduce Geometric Positional Embedding (GeoPE), a framework that extends rotations to 3D Euclidean space using quaternions. To overcome non-commutativity and ensure symmetry, GeoPE constructs a unified rotational operator by computing the geometric mean in the Lie algebra. This creates a geometrically coupled encoding that effectively separates spatial dimensions. Extensive experiments on image classification, object detection, and 3D semantic segmentation demonstrate that GeoPE consistently outperforms existing 2D RoPE variants and significantly enhances shape bias, confirming its ability to capture true geometric structure.

Rethinking RoPE: A Mathematical Blueprint for N-dimensional Positional Embedding

Machine Learning (CS)

Makes AI understand images and text better.

7 Apr 2025 0

91%

DoPE: Denoising Rotary Position Embedding

Computation and Language

Makes AI understand longer texts better.

12 Nov 2025 0

91%

HoPE: Hyperbolic Rotary Positional Encoding for Stable Long-Range Dependency Modeling in Large Language Models

Computation and Language

Makes AI understand long sentences better.

5 Sep 2025 1

View PDF Login to Bookmark

Page Count

21 pages

GeoPE:A Unified Geometric Positional Embedding for Structured Tensors

Helps computers see shapes better in pictures.

Technical Abstract

Rethinking RoPE: A Mathematical Blueprint for N-dimensional Positional Embedding

DoPE: Denoising Rotary Position Embedding

HoPE: Hyperbolic Rotary Positional Encoding for Stable Long-Range Dependency Modeling in Large Language Models