arxiv:2511.07003
Tong Xiao
neupupil
AI & ML interests
NLP & ML & LLM
Recent Activity
upvoted a paper 2 days ago
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling upvoted a paper 27 days ago
GRAM: A Generative Foundation Reward Model for Reward Generalization liked a Space 2 months ago
EfficientReasoning/efficient_reasoning_online_judgement