Planning with Diffusion Models for Target-Oriented Dialogue Systems
By: Hanwen Du, Bo Peng, Xia Ning
Potential Business Impact:
Helps chatbots plan conversations better to reach goals.
Target-Oriented Dialogue (TOD) remains a significant challenge in the LLM era, where strategic dialogue planning is crucial for directing conversations toward specific targets. However, existing dialogue planning methods generate dialogue plans in a step-by-step sequential manner, and may suffer from compounding errors and myopic actions. To address these limitations, we introduce a novel dialogue planning framework, DiffTOD, which leverages diffusion models to enable non-sequential dialogue planning. DiffTOD formulates dialogue planning as a trajectory generation problem with conditional guidance, and leverages a diffusion language model to estimate the likelihood of the dialogue trajectory. To optimize the dialogue action strategies, DiffTOD introduces three tailored guidance mechanisms for different target types, offering flexible guidance toward diverse TOD targets at test time. Extensive experiments across three diverse TOD settings show that DiffTOD can effectively perform non-myopic lookahead exploration and optimize action strategies over a long horizon through non-sequential dialogue planning, and demonstrates strong flexibility across complex and diverse dialogue scenarios. Our code and data are accessible through https://github.com/ninglab/DiffTOD.
Similar Papers
PyTOD: Programmable Task-Oriented Dialogue with Execution Feedback
Computation and Language
Helps computers understand conversations better.
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues
Computation and Language
Builds talking robots with less work.
Spec-TOD: A Specialized Instruction-Tuned LLM Framework for Efficient Task-Oriented Dialogue Systems
Computation and Language
Teaches computers to talk with less training.