NemoClaw Knowledge Wiki

Tag: on-device-deployment

5 items with this tag.

  • Jun 14, 2026

    llamacpp

    • local-llm
    • inference-engine
    • llm-optimization
    • private-ai
    • on-device-deployment
    • speculative-decoding
    • model-switching
  • Jun 13, 2026

    1-bit-llm

    • concept
    • model-quantization
    • bitwise-computation
    • efficient-inference
    • on-device-deployment
    • gpu-alternative
    • 1-bit-models
    • image-generation
  • Jun 13, 2026

    4gb-memory-footprint

    • small-language-models
    • memory-optimization
    • benchmark-testing
    • on-device-deployment
    • model-efficiency
  • Jun 13, 2026

    memory-efficiency

    • concept
    • memory-efficiency
    • llm-optimization
    • quantization
    • on-device-deployment
    • model-compression
    • image-generation
  • Jun 13, 2026

    model-quantization

    • concept
    • quantization
    • model-compression
    • llm-efficiency
    • bitnet
    • turboquant
    • on-device-deployment
