Chunking Strategies

Chunking strategies are methods for dividing text into smaller, manageable segments (chunks) to improve information retrieval and processing in applications like retrieval-augmented-generation-rag.

Key Concepts

  • Fixed-size chunking: Divides text into chunks of equal length.
  • Semantic chunking: Considers meaning and context to create chunks.
  • Overlapping chunks: Chunks that share some content to preserve context.
  • Sliding window: A technique where chunks are created by moving a fixed-size window across the text.

Applications

  • Adam Lucek - optimal RAG chunking with ChromaDB
    • Video: Optimal RAG Chunking with ChromaDB
    • Explores various text chunking strategies for RAG.
    • Presents insights from a ChromaDB technical report titled “Evaluating Chunking Strategies for Retrieval.”
    • Details different methods, implementations, and performance findings.
  • 2026 04 14 Adam Lucek optimal RAG chunking with ChromaDB

Source Notes