group: platforms-runtimes-environments
tags:
  - concept
  - gpu-clusters
  - large-language-models
  - quantisation
  - llm-quantisation
aliases:
  - GPU clusters
summary: "The page provides an overview of the implementation and necessity of quantization for large language models."
updated: 2026-04-14
backlinks:
  - 2026 04 14 Adam Lucek quantisation of LLM
GPU Clusters
Source Notes
- 2026-04-14: Adam Lucek - quantisation of LLM https://www.youtube.com/watch?v=3EDI4akymhA This video provides a detailed overview of quantization in the context of large language models (LLMs): what it is, why it is necessary, and how it is implemented. 1. The Challenge of Large Language Models: LLMs like NVIDIA's Llama 3.1 Nemotron 70B (70.6 billion parameters) are massive, often requiring gigabytes of storage (e.g., weights split across 30+ files).
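To make the storage claim above concrete, here is a back-of-the-envelope sketch of the weights-only memory footprint at common precisions, using the 70.6-billion-parameter count cited in the note. The byte widths per format are standard; the exact numbers are illustrative arithmetic, not figures from the video:

```python
# Rough weights-only footprint of an LLM at different precisions.
# 70.6e9 is the parameter count of Llama 3.1 Nemotron 70B cited above.
PARAMS = 70.6e9

BYTES_PER_PARAM = {
    "fp32": 4.0,   # full precision
    "fp16": 2.0,   # half precision, a common release format
    "int8": 1.0,   # 8-bit quantized
    "int4": 0.5,   # 4-bit quantized
}

def weight_gb(params: float, bytes_per_param: float) -> float:
    """Weights-only footprint in gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9

for fmt, width in BYTES_PER_PARAM.items():
    print(f"{fmt}: {weight_gb(PARAMS, width):.1f} GB")
```

This ignores activations, KV cache, and optimizer state, so real serving footprints are larger; but it shows why quantizing from fp16 to int4 cuts the raw weight storage by roughly 4x.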