FastVLM: Efficient Vision Encoding for Vision Language Models
Paper • arXiv:2412.13303
Real-time video captioning powered by FastVLM
Note MLX checkpoint