Fine-tune multimodal vision models.
Run LLMs locally to keep data private and cut costs.