VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice Paper • 2601.05175 • Published Jan 8 • 36
Interpret Vision Transformers as ConvNets with Dynamic Convolutions Paper • 2309.10713 • Published Sep 19, 2023 • 1
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM Paper • 2312.06660 • Published Dec 11, 2023 • 1
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively Paper • 2401.02955 • Published Jan 5, 2024 • 23