
LLM-Enhanced Self-Evolving Reinforcement Learning for Multi-Step E-Commerce Payment Fraud Risk Detection

Published: September 23, 2025 | arXiv ID: 2509.18719v1

By: Bo Qu, Zhurong Wang, Daisuke Yagi and more

Potential Business Impact:

Finds fake online payments better using smart AI.

Business Areas:

Fraud Detection, Financial Services, Payments, Privacy and Security

This paper presents a novel approach to e-commerce payment fraud detection by integrating reinforcement learning (RL) with Large Language Models (LLMs). By framing transaction risk as a multi-step Markov Decision Process (MDP), RL optimizes risk detection across multiple payment stages. Crafting effective reward functions, essential for RL model success, typically requires significant human expertise due to the complexity and variability in design. LLMs, with their advanced reasoning and coding capabilities, are well-suited to refine these functions, offering improvements over traditional methods. Our approach leverages LLMs to iteratively enhance reward functions, achieving better fraud detection accuracy and demonstrating zero-shot capability. Experiments with real-world data confirm the effectiveness, robustness, and resilience of our LLM-enhanced RL framework through long-term evaluations, underscoring the potential of LLMs in advancing industrial RL applications.
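The loop described in the abstract, a multi-step MDP over payment stages whose reward function is refined iteratively, can be illustrated with a minimal sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: the three-stage episode, the bucketed risk signal, the synthetic transactions, the tabular Q-learning agent, and the three reward parameters. Most importantly, `propose_candidates` is a hypothetical stand-in that randomly perturbs reward parameters; in the paper this role is played by an LLM that rewrites the reward function.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

APPROVE, BLOCK = 0, 1
N_STAGES = 3   # assumed number of payment stages
N_BUCKETS = 5  # assumed discretisation of a per-stage risk signal

@dataclass(frozen=True)
class Transaction:
    signals: Tuple[int, ...]  # one synthetic risk bucket per stage
    is_fraud: bool

RewardFn = Callable[[int, bool, bool], float]
Params = Tuple[float, float, float]

def make_reward(catch_bonus: float, false_block_cost: float, miss_cost: float) -> RewardFn:
    """Build a reward function from three illustrative parameters."""
    def reward(action: int, is_fraud: bool, is_last: bool) -> float:
        if action == BLOCK:  # blocking ends the episode
            return catch_bonus if is_fraud else -false_block_cost
        if is_last:          # approved at the final stage
            return -miss_cost if is_fraud else 1.0
        return 0.0           # passed on to the next stage
    return reward

def make_data(n: int, rng: random.Random) -> List[Transaction]:
    """Synthetic data: fraud tends to fall into higher risk buckets."""
    data = []
    for _ in range(n):
        fraud = rng.random() < 0.2
        lo, hi = (2, N_BUCKETS - 1) if fraud else (0, 2)
        signals = tuple(rng.randint(lo, hi) for _ in range(N_STAGES))
        data.append(Transaction(signals, fraud))
    return data

def train_policy(reward_fn: RewardFn, data: List[Transaction],
                 epochs: int = 5, seed: int = 0) -> Dict[Tuple[int, int, int], float]:
    """Tabular Q-learning over states (stage, bucket), undiscounted."""
    rng = random.Random(seed)
    actions = (APPROVE, BLOCK)
    q = {(s, b, a): 0.0 for s in range(N_STAGES)
         for b in range(N_BUCKETS) for a in actions}
    alpha, eps = 0.1, 0.2
    for _ in range(epochs):
        for tx in data:
            for stage, bucket in enumerate(tx.signals):
                if rng.random() < eps:
                    action = rng.choice(actions)
                else:
                    action = max(actions, key=lambda a: q[(stage, bucket, a)])
                is_last = stage == N_STAGES - 1
                done = action == BLOCK or is_last
                target = reward_fn(action, tx.is_fraud, is_last)
                if not done:
                    nxt = tx.signals[stage + 1]
                    target += max(q[(stage + 1, nxt, a)] for a in actions)
                key = (stage, bucket, action)
                q[key] += alpha * (target - q[key])
                if done:
                    break
    return q

def f1_score(q: Dict[Tuple[int, int, int], float], data: List[Transaction]) -> float:
    """F1 of the greedy policy, treating 'blocked' as the positive class."""
    tp = fp = fn = 0
    for tx in data:
        blocked = any(q[(s, b, BLOCK)] > q[(s, b, APPROVE)]
                      for s, b in enumerate(tx.signals))
        tp += blocked and tx.is_fraud
        fp += blocked and not tx.is_fraud
        fn += (not blocked) and tx.is_fraud
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def propose_candidates(best: Params, rng: random.Random, k: int = 4) -> List[Params]:
    """Hypothetical stand-in for the LLM step: perturb the best parameters."""
    return [tuple(max(0.1, p * rng.uniform(0.5, 1.5)) for p in best)
            for _ in range(k)]

def evolve(train: List[Transaction], valid: List[Transaction],
           rounds: int = 3, seed: int = 0) -> Tuple[Params, List[float]]:
    """Keep the reward parameters whose trained policy scores best on validation."""
    rng = random.Random(seed)
    best: Params = (1.0, 1.0, 1.0)
    best_score = f1_score(train_policy(make_reward(*best), train), valid)
    history = [best_score]
    for _ in range(rounds):
        for params in propose_candidates(best, rng):
            score = f1_score(train_policy(make_reward(*params), train), valid)
            if score > best_score:
                best, best_score = params, score
        history.append(best_score)
    return best, history

data_rng = random.Random(42)
train_set, valid_set = make_data(400, data_rng), make_data(200, data_rng)
best_params, history = evolve(train_set, valid_set)
print(best_params, history)
```

Because a candidate replaces the incumbent only when its validation F1 is strictly higher, the recorded `history` never decreases from round to round. That selection rule is a design choice of this sketch, not something the abstract specifies.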

Reinforcement Learning of Large Language Models for Interpretable Credit Card Fraud Detection

Artificial Intelligence

AI learns to spot online shopping scams.

9 Jan 2026

89%

Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping

Computation and Language

Helps online stores act like you.

8 Oct 2025

89%

Scaling Autonomous Agents via Automatic Reward Modeling And Planning

Artificial Intelligence

Teaches computers to make better choices.

17 Feb 2025


Page Count

12 pages
