Urro's picture

In a Training Loop 🔄

Urro

urroxyz

·

https://urro.xyz/

urroxyz

AI & ML interests

i like research on empowering small LMs to do better 😮 i DISLIKE video & image generation (esp. ai "art") 🤢

Recent Activity

updated a collection about 21 hours ago

HUMAN-WRITTEN & LEGALLY-SOURCED*

liked a dataset about 21 hours ago

ronantakizawa/github-top-code

liked a dataset about 21 hours ago

ronantakizawa/github-codereview

View all activity

Organizations

upvoted a collection 1 day ago

✨ free demo spaces

HF Spaces for demoing chat completion models—no ZeroGPU, WebGPU, or BYOK included. Thank you so much to these devs! • 15 items • Updated 9 days ago • 2

upvoted a paper 1 day ago

FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling

Paper • 2603.06199 • Published 5 days ago • 9

upvoted a paper 7 days ago

H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs

Paper • 2512.01797 • Published Dec 1, 2025 • 9

upvoted 8 papers 8 days ago

TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents

Paper • 2602.19633 • Published 16 days ago • 7

PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency

Paper • 2602.16745 • Published 21 days ago • 8

Benchmark Test-Time Scaling of General LLM Agents

Paper • 2602.18998 • Published 17 days ago • 8

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

Paper • 2602.20309 • Published 15 days ago • 16

Multi-Vector Index Compression in Any Modality

Paper • 2602.21202 • Published 14 days ago • 22

Test-Time Training with KV Binding Is Secretly Linear Attention

Paper • 2602.21204 • Published 14 days ago • 30

Query-focused and Memory-aware Reranker for Long Context Processing

Paper • 2602.12192 • Published 26 days ago • 56

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published 13 days ago • 124

upvoted a collection 8 days ago

Qwen3.5

21 items • Updated 1 day ago • 1.09k

upvoted 8 papers 9 days ago

Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language

Paper • 2602.18964 • Published 17 days ago • 1

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Paper • 2602.22190 • Published 13 days ago • 15

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Paper • 2602.21534 • Published 14 days ago • 23

DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference

Paper • 2602.21548 • Published 14 days ago • 42

Solaris: Building a Multiplayer Video World Model in Minecraft

Paper • 2602.22208 • Published 13 days ago • 27

MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models

Paper • 2602.17602 • Published 19 days ago • 55

Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models

Paper • 2602.20981 • Published 14 days ago • 1

What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance

Paper • 2602.20300 • Published 15 days ago • 3