SAD: A Large-Scale Strategic Argumentative Dialogue Dataset
By: Yongkang Liu , Jiayang Yu , Mingyang Wang and more
Argumentation generation has attracted substantial research interest due to its central role in human reasoning and decision-making. However, most existing argumentative corpora focus on non-interactive, single-turn settings, either generating arguments from a given topic or refuting an existing argument. In practice, however, argumentation is often realized as multi-turn dialogue, where speakers defend their stances and employ diverse argumentative strategies to strengthen persuasiveness. To support deeper modeling of argumentation dialogue, we present the first large-scale \textbf{S}trategic \textbf{A}rgumentative \textbf{D}ialogue dataset, SAD, consisting of 392,822 examples. Grounded in argumentation theories, we annotate each utterance with five strategy types, allowing multiple strategies per utterance. Unlike prior datasets, SAD requires models to generate contextually appropriate arguments conditioned on the dialogue history, a specified stance on the topic, and targeted argumentation strategies. We further benchmark a range of pretrained generative models on SAD and present in-depth analysis of strategy usage patterns in argumentation.
Similar Papers
MADS: Multi-Agent Dialogue Simulation for Diverse Persuasion Data Generation
Computation and Language
Makes AI better at convincing people to buy things.
MADS: Multi-Agent Dialogue Simulation for Diverse Persuasion Data Generation
Computation and Language
Makes AI better at convincing people to buy things.
Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization
Computation and Language
Helps computers understand feelings in spoken words.