Revisiting the Necessity of Lengthy Chain-of-Thought in Vision-centric Reasoning Generalization Paper • 2511.22586 • Published Nov 27, 2025 • 7
VIPER: Process-aware Evaluation for Generative Video Reasoning Paper • 2512.24952 • Published Dec 31, 2025
Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework Paper • 2509.05007 • Published Sep 5, 2025
Improving Vision-language Models with Perception-centric Process Reward Models Paper • 2604.24583 • Published 6 days ago • 2
SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Paper • 2602.03411 • Published Feb 3 • 39
SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published Feb 3 • 41
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning Paper • 2505.17005 • Published May 22, 2025 • 5
Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models Paper • 2503.21380 • Published Mar 27, 2025 • 38