LLM training
Training large language models (LLMs) demands substantial compute, and recent cost estimates remain extreme. Key developments include:
- Cost: Stanford's AI Index (2024) put GPT-4's estimated training compute cost at ~$78M (Altman claimed over $100M) and Gemini Ultra's (2023) at ~$191M; 2025 estimates continue to reflect prohibitive costs.
- 4-bit training: a shift towards 4-bit floating-point (FP4) training to reduce memory and compute demands, as detailed in [[2026 04 14 How does 4bit quantisation work|How does 4bit quantisation work]]; a toy sketch follows this list.
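As a rough intuition pump, here is a minimal NumPy sketch of block-wise FP4 (E2M1) quantise/dequantise. The E2M1 value grid and per-block absmax scaling are standard; the block size, function name, and the quantise-then-dequantise framing are illustrative assumptions, not the specific recipe from the linked note.

```python
import numpy as np

# Positive values representable in FP4 (E2M1): 1 sign, 2 exponent, 1 mantissa bit.
FP4_GRID = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])

def fp4_quantize_dequantize(x: np.ndarray, block_size: int = 64) -> np.ndarray:
    """Round each block of x to the nearest FP4 value after absmax scaling.

    Simulates FP4 storage error only; a real trainer would keep the 4-bit
    codes plus one scale per block rather than materialising the result.
    (Hardware typically uses round-to-nearest-even; nearest-grid-point is
    a simplification here.)
    """
    flat = x.ravel()
    pad = (-flat.size) % block_size
    flat = np.pad(flat, (0, pad))
    blocks = flat.reshape(-1, block_size)

    # One absmax scale per block maps the largest magnitude onto 6.0,
    # the top of the FP4 grid.
    scales = np.abs(blocks).max(axis=1, keepdims=True) / FP4_GRID[-1]
    scales[scales == 0] = 1.0  # avoid dividing an all-zero block

    scaled = blocks / scales
    # Snap magnitudes to the nearest grid point, keeping the sign.
    idx = np.abs(np.abs(scaled)[..., None] - FP4_GRID).argmin(axis=-1)
    deq = np.sign(scaled) * FP4_GRID[idx] * scales

    return deq.ravel()[: x.size].reshape(x.shape)

w = np.random.randn(4, 8).astype(np.float32)
w_q = fp4_quantize_dequantize(w, block_size=8)
print("max abs error:", np.abs(w - w_q).max())
```

The per-block scale is why FP4 can survive training-scale dynamic ranges: the 8-value grid only ever has to cover one block's magnitudes at a time.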
Related concepts:
- large-language-model
- Quantisation
- Model training
Source Notes
- 2026-04-14: [[2026 04 14 How does 4bit quantisation work|How does 4bit quantisation work]]
- 2026-04-26: [[lab-notes/2026-04-26-Karpathys-AutoResearch-An-AI-Agent-for-Independent-LLM-Program-Improvement|Karpathy’s AutoResearch: An AI Agent for Independent LLM Program Improvement]]