Official Tempo-6B collection: A query-aware framework solving the mismatch between massive video streams and bounded LLM context windows.
AI & ML interests
None defined yet.
Recent Activity
models 20
Vision-CAIR/Tempo-6B
Video-Text-to-Text • Updated • 144 • 2
Vision-CAIR/Tempo-6B-Stage2
Video-Text-to-Text • Updated • 48
Vision-CAIR/Tempo-6B-Stage1
Video-Text-to-Text • Updated • 32
Vision-CAIR/Tempo-6B-Stage0
Video-Text-to-Text • Updated • 39
Vision-CAIR/BFPO-Mistral-7b-v0.1
Text Generation • 7B • Updated • 12 • 1
Vision-CAIR/LongVU_Llama3_2_1B
Video-Text-to-Text • Updated • 30 • 12
Vision-CAIR/LongVU_Llama3_2_3B_img
Updated • 4 • 6
Vision-CAIR/LongVU_Qwen2_7B_img
Updated • 8 • 5
Vision-CAIR/LongVU_Llama3_2_3B
Video-Text-to-Text • Updated • 24 • 8
Vision-CAIR/LongVU_Qwen2_7B
Video-Text-to-Text • 8B • Updated • 186 • 76