World Models

A computational framework where an agent learns to predict the future states of its environment by modeling underlying dynamics, enabling planning and reasoning without direct interaction.

Core Principles

  • Predictive Dynamics: Modeling the causal structure and physics of an environment.
  • Latent Representation: Operating on abstract, high-level features rather than raw, noisy sensory input (e.g., pixels).
  • State Estimation: Maintaining an internal belief of the environment’s current state to anticipate future transitions.
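
The three principles above can be sketched as a tiny latent world model: an encoder maps raw observations to an abstract state, and a dynamics function rolls that state forward to "imagine" futures without environment interaction. This is a minimal toy, assuming random linear maps in place of trained networks; the dimensions and function names are illustrative only.

```python
import numpy as np

rng = np.random.default_rng(0)

# Placeholder parameters -- random stand-ins, not a trained model.
W_enc = rng.standard_normal((4, 16))   # encoder: 16-dim observation -> 4-dim latent
W_dyn = rng.standard_normal((4, 4))    # latent dynamics: z_t -> z_{t+1}
W_dyn /= np.linalg.norm(W_dyn, 2)      # bound the spectral norm so rollouts stay stable

def encode(obs):
    """Latent representation: compress raw sensory input to abstract features."""
    return np.tanh(W_enc @ obs)

def predict_next(z):
    """Predictive dynamics: anticipate the next latent state transition."""
    return np.tanh(W_dyn @ z)

def rollout(obs, horizon):
    """Plan by imagining future latent states, never touching the environment."""
    z = encode(obs)                    # state estimation: current internal belief
    trajectory = [z]
    for _ in range(horizon):
        z = predict_next(z)
        trajectory.append(z)
    return trajectory

traj = rollout(rng.standard_normal(16), horizon=5)
print(len(traj))  # 6: the initial encoding plus 5 predicted latent states
```

The key structural point is that `rollout` only ever calls the model's own functions; planning happens entirely in the 4-dim latent space.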

Key Architectures & Approaches

  • LLMs: Autoregressive prediction of discrete linguistic tokens; as world models they are limited by the scope and grounding of text-only training data.
  • V-JEPA: A recent vision-centric approach to AGI from Meta FAIR and Yann LeCun.
    • Core Thesis: “Language is not intelligence”; moves the focus from generative text to visual/sensory world modeling.
    • Departure from Generative AI: Aims to move beyond the limitations of ChatGPT-style, purely generative paradigms.
    • Mechanism: Utilizes Joint-Embedding Predictive Architecture to predict information in a latent space, avoiding the computational overhead of pixel-by-pixel generation.
    • Objective: Establishing non-LLM reasoning architectures through visual predictive modeling.
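
The latent-space prediction objective described above can be sketched as follows. This is a hedged toy, assuming linear maps for the context encoder, target encoder, and predictor (the real architecture uses deep networks, masking, and gradient-based training); what it shows is that the loss compares embeddings, not pixels.

```python
import numpy as np

rng = np.random.default_rng(1)

# Linear stand-ins for the encoders and predictor (illustrative only).
ctx_enc = rng.standard_normal((8, 32)) * 0.1   # context encoder (trainable)
tgt_enc = ctx_enc.copy()                        # target encoder (EMA copy)
predictor = np.eye(8)                           # predictor in latent space

def jepa_loss(context_view, target_view):
    """Mean-squared error between predicted and actual target embeddings.

    Both sides of the loss live in the 8-dim latent space -- no
    pixel-by-pixel reconstruction, which is the claimed efficiency win.
    """
    z_pred = predictor @ (ctx_enc @ context_view)  # predict target's latent
    z_tgt = tgt_enc @ target_view                  # treated as constant (stop-grad)
    return float(np.mean((z_pred - z_tgt) ** 2))

def ema_update(tau=0.99):
    """Target encoder slowly tracks the context encoder (a common JEPA choice)."""
    global tgt_enc
    tgt_enc = tau * tgt_enc + (1 - tau) * ctx_enc

x = rng.standard_normal(32)
loss_same = jepa_loss(x, x)
print(loss_same)  # 0.0: identity predictor, encoders start as exact copies
```

The stop-gradient on the target branch plus the EMA target encoder is one standard way joint-embedding methods avoid the trivial collapse where both encoders map everything to the same point.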

Backlink: 2026 04 14 New paper for a vision approach to AGI not LLM

Source Notes

  • 2026-04-14: How to get TACK SHARP photos with any camera!