Official Tempo-6B collection: A query-aware framework solving the mismatch between massive video streams and bounded LLM context windows.
visioncairgroup
Vision-CAIR
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 hours ago
Small Vision-Language Models are Smart Compressors for Long Video Understanding updated a model about 18 hours ago
Vision-CAIR/Tempo-6B updated a Space about 18 hours ago
Vision-CAIR/Tempo