Sleeping FoodExtract-Vision Fine-tuned VLM Structued Data Extractor ๐ Extract food and drink items from any image as structured JSON
ninjals/FoodExtract-Vision-SmolVLM2-500M-fine-tune-v1-VIDEO Image-Text-to-Text โข 0.5B โข Updated 7 days ago โข 103