VideoRefer Suite

VideoRefer Suite is an open-source video model developed by Alibaba (Apache 2 license) that enhances large-language-models with spatial-temporal object understanding. It enables fine-grained tracking and reasoning about specific objects throughout video content.

Key Features

2026 04 14 Fahd Mirza Videorefer model running locally

Source Notes