SmolVLM - small yet mighty Vision Language Model

From Simon Willison's Weblog
I've been having fun playing with this new vision model from the Hugging Face team behind SmolLM. They describe it as:

"[...] a 2B VLM, SOTA for its memory footprint. SmolVLM is small, fast, memory-efficient, and fully open-source. All model checkpoints, VLM datasets, training recipes and tools are..."