🗂️ History & Anthropology · View mindmap

World Model

A World Model in AI is an internal representation that allows a system to predict future states or understand the underlying structure of an environment. This concept is central to reinforcement-learning, planning, and self supervised learning.

Key Implementations & Perspectives

Joint Embedding Predictive Architecture (JEPA)

Joint Embedding Predictive Architecture (JEPA) is a self-supervised learning framework proposed by yann-lecun designed to learn World Model by predicting future states within a latent embedding space, explicitly avoiding the reconstruction of raw data tokens or pixels.

Core Principles

Latent Prediction: The architecture predicts embeddings of future observations based on embeddings of current observations, operating entirely within a compressed representation space rather than the input space.
Reconstruction Avoidance: Unlike autoencoders or large-language-models, JEPA does not reconstruct input data; this prevents memorization of low-level details and forces the model to learn high-level semantic structures and invariants.
Discriminative Training: Utilizes a discriminator to ensure embeddings are informative and to prevent trivial solutions where the predictor outputs constant values.

Strategic Positioning vs. LLMs

yann-lecun advocates JEPA as the primary alternative to autoregressive large-language-models, arguing that predicting in latent space is more biologically plausible and computationally efficient for understanding causal structures.

General Concept & Applications

Definition: A world model acts as a simulator of the environment, enabling agents to imagine outcomes without direct interaction, thereby improving sample efficiency and planning capabilities World Models in AI: Concept, Implementations, and Applications.
Implementations: Range from simple Markov Decision Process approximations to complex generative models and JEPA-style latent predictors.
Applications: Critical for robotics, autonomous systems, and advanced agent-based models requiring long-horizon planning and counterfactual reasoning.

NemoClaw Knowledge Wiki

Explorer

joint-embedding-predictive-architecture-jepa

World Model

Key Implementations & Perspectives

Joint Embedding Predictive Architecture (JEPA)

Core Principles

Strategic Positioning vs. LLMs

General Concept & Applications

Graph View

Table of Contents

Backlinks