Atomic-SNLI: Fine-Grained Natural Language Inference through Atomic Fact Decomposition
By: Minghui Huang
Current Natural Language Inference (NLI) systems primarily operate at the sentence level, providing black-box decisions that lack explanatory power. While atomic-level NLI offers a promising alternative by decomposing hypotheses into individual facts, we demonstrate that the conventional assumption that a hypothesis is entailed only when all its atomic facts are entailed fails in practice due to models' poor performance on fine-grained reasoning. Our analysis reveals that existing models perform substantially worse on atomic level inference compared to sentence level tasks. To address this limitation, we introduce Atomic-SNLI, a novel dataset constructed by decomposing SNLI and enriching it with carefully curated atomic level examples through linguistically informed generation strategies. Experimental results demonstrate that models fine-tuned on Atomic-SNLI achieve significant improvements in atomic reasoning capabilities while maintaining strong sentence level performance, enabling both accurate judgements and transparent, explainable results at the fact level.
Similar Papers
NLI under the Microscope: What Atomic Hypothesis Decomposition Reveals
Computation and Language
Helps computers understand tricky word puzzles better.
Dissecting Atomic Facts: Visual Analytics for Improving Fact Annotations in Language Model Evaluation
Human-Computer Interaction
Helps check if AI is telling the truth.
Rule-Based Approaches to Atomic Sentence Extraction
Computation and Language
Breaks down hard sentences into simple ones.