Multimodal understanding

The ability of AI systems to process and integrate information across multiple modalities (text, code, images, and spatial data) into coherent representations that support complex tasks.