Score: 1

From Classical Data to Quantum Advantage -- Quantum Policy Evaluation on Quantum Hardware

Published: September 9, 2025 | arXiv ID: 2509.07614v1

By: Daniel Hein , Simon Wiedemann , Markus Baumann and more

BigTech Affiliations: Siemens

Potential Business Impact:

Computers learn faster by using quantum tricks.

Business Areas:

Quantum Computing Science and Engineering

Quantum policy evaluation (QPE) is a reinforcement learning (RL) algorithm which is quadratically more efficient than an analogous classical Monte Carlo estimation. It makes use of a direct quantum mechanical realization of a finite Markov decision process, in which the agent and the environment are modeled by unitary operators and exchange states, actions, and rewards in superposition. Previously, the quantum environment has been implemented and parametrized manually for an illustrative benchmark using a quantum simulator. In this paper, we demonstrate how these environment parameters can be learned from a batch of classical observational data through quantum machine learning (QML) on quantum hardware. The learned quantum environment is then applied in QPE to also compute policy evaluations on quantum hardware. Our experiments reveal that, despite challenges such as noise and short coherence times, the integration of QML and QPE shows promising potential for achieving quantum advantage in RL.

Q-Policy: Quantum-Enhanced Policy Evaluation for Scalable Reinforcement Learning

Machine Learning (CS)

Makes computers learn faster using quantum power.

17 May 2025 0

90%

Dissecting Quantum Reinforcement Learning: A Systematic Evaluation of Key Components

Quantum Physics

Makes AI learn faster and more reliably.

21 Nov 2025 0

89%

A Primer on Quantum Machine Learning

Quantum Physics

Quantum computers learn faster than regular ones.

20 Nov 2025 0

View PDF Login to Bookmark

Country of Origin

🇩🇪 Germany

Page Count

6 pages

From Classical Data to Quantum Advantage -- Quantum Policy Evaluation on Quantum Hardware

Computers learn faster by using quantum tricks.

Technical Abstract

Q-Policy: Quantum-Enhanced Policy Evaluation for Scalable Reinforcement Learning

Dissecting Quantum Reinforcement Learning: A Systematic Evaluation of Key Components

A Primer on Quantum Machine Learning