Reason from Future: Reverse Thought Chain Enhances LLM Reasoning
By: Yinlong Xu, Yanzhao Zheng, Shuoshuo Sun, and more
Potential Business Impact:
Helps computers solve hard problems by thinking backward.
It has been demonstrated that carefully designed reasoning paradigms, like Chain-of-Thought (CoT) and Tree-of-Thought (ToT), can enhance the reasoning capabilities of small language models through detailed thinking and extensive thought searching, although unbounded branching factors in the search space incur prohibitive reasoning cost. However, these methods fall into the trap of local-optimum reasoning, meaning the model lacks a global perspective while solving problems. We propose a novel reasoning paradigm called Reason from Future (RFF), which generates reasoning paths via bidirectional reasoning that combines top-down planning with bottom-up reasoning accumulation. The essence of RFF lies in its reverse reasoning mechanism, which prioritizes core logical relationships and imposes goal-oriented constraints on intermediate steps, thereby reducing the search space and mitigating the error accumulation inherent in sequential forward reasoning. Empirical evaluations across diverse experiments demonstrate that RFF outperforms conventional paradigms, solving complex tasks with higher accuracy while exploring a smaller search space.
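The abstract describes RFF only at a high level; the sketch below is one plausible reading of it in Python, not the authors' implementation. The `llm` helper, prompt wording, and step budget are all illustrative assumptions. Backward planning first derives a chain of subgoals from the final goal; forward reasoning then establishes each subgoal in turn, so every forward step is constrained by a known target rather than searched freely.

```python
def llm(prompt: str) -> str:
    """Placeholder for any chat-completion call (OpenAI, a local model, etc.)."""
    raise NotImplementedError

def reason_from_future(question: str, goal: str, max_steps: int = 8) -> list[str]:
    # Top-down planning: walk backward from the goal, asking what must
    # hold immediately before each subgoal, until we reach the premises.
    subgoals = [goal]
    while len(subgoals) < max_steps:
        prev = llm(
            f"Problem: {question}\n"
            f"To establish '{subgoals[-1]}', what intermediate fact must be "
            f"established just before it? Answer NONE if it follows directly "
            f"from the problem statement."
        )
        if prev.strip().upper() == "NONE":
            break
        subgoals.append(prev)
    subgoals.reverse()  # order from nearest-to-premises up to the final goal

    # Bottom-up accumulation: derive each subgoal from the steps so far,
    # so every forward step has a goal-oriented constraint, narrowing the
    # search space relative to unconstrained forward CoT.
    steps: list[str] = []
    for target in subgoals:
        steps.append(llm(
            f"Problem: {question}\nSteps so far: {steps}\n"
            f"Write the next reasoning step that establishes: {target}"
        ))
    return steps
```

Under this reading, the backward pass bounds the branching factor (each forward step has exactly one target), which is one way the reduced search cost reported in the abstract could arise.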
Similar Papers
Think When You Need: Self-Adaptive Chain-of-Thought Learning
Computation and Language
Teaches computers to think only when needed.
Reverse Thinking Enhances Missing Information Detection in Large Language Models
Artificial Intelligence
Helps computers find missing puzzle pieces.
Efficient Reasoning via Thought-Training and Thought-Free Inference
Computation and Language
Computers learn to think without showing their work.