NemoClaw Knowledge Wiki

❯

❯

LLaVA

Jul 12, 20261 min read

vision-language-models
multimodal-ai
open-source-ml
llama-backbone
computer-vision

LLaVA

An open vision-language model family built around LLaMA-style backbones for multimodal image-and-text tasks.

Ecosystem

llama
ollama

Related Notes

Backfilled from lab-note mentions and entity refresh.

Graph View

LLaVA
Ecosystem
Related Notes

Backlinks

INDEX
cross-attention

Created with Quartz v4.5.2 © 2026

GitHub
Discord Community