✨ free demo spaces • Collection • HF Spaces for demoing chat completion models (no ZeroGPU, WebGPU, or BYOK included). Thank you so much to these devs! • 15 items • Updated 9 days ago • 2
FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling • Paper • 2603.06199 • Published 5 days ago • 9
H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs • Paper • 2512.01797 • Published Dec 1, 2025 • 9
TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents • Paper • 2602.19633 • Published 16 days ago • 7
PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency • Paper • 2602.16745 • Published 21 days ago • 8
Benchmark Test-Time Scaling of General LLM Agents • Paper • 2602.18998 • Published 17 days ago • 8
QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models • Paper • 2602.20309 • Published 15 days ago • 16
Test-Time Training with KV Binding Is Secretly Linear Attention • Paper • 2602.21204 • Published 14 days ago • 30
Query-focused and Memory-aware Reranker for Long Context Processing • Paper • 2602.12192 • Published 26 days ago • 56
Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language • Paper • 2602.18964 • Published 17 days ago • 1
GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL • Paper • 2602.22190 • Published 13 days ago • 15
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning • Paper • 2602.21534 • Published 14 days ago • 23
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference • Paper • 2602.21548 • Published 14 days ago • 42
Solaris: Building a Multiplayer Video World Model in Minecraft • Paper • 2602.22208 • Published 13 days ago • 27
MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models • Paper • 2602.17602 • Published 19 days ago • 55
Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models • Paper • 2602.20981 • Published 14 days ago • 1
What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance • Paper • 2602.20300 • Published 15 days ago • 3