Score: 0

JEPA for RL: Investigating Joint-Embedding Predictive Architectures for Reinforcement Learning

Published: April 23, 2025 | arXiv ID: 2504.16591v1

By: Tristan Kenneweg, Philip Kenneweg, Barbara Hammer

Potential Business Impact:

Teaches robots to learn from watching.

Business Areas:
Image Recognition Data and Analytics, Software

Joint-Embedding Predictive Architectures (JEPA) have recently become popular as promising architectures for self-supervised learning. Vision transformers have been trained using JEPA to produce embeddings from images and videos, which have been shown to be highly suitable for downstream tasks like classification and segmentation. In this paper, we show how to adapt the JEPA architecture to reinforcement learning from images. We discuss model collapse, show how to prevent it, and provide exemplary data on the classical Cart Pole task.

Page Count
6 pages

Category
Computer Science:
CV and Pattern Recognition