NemoClaw Knowledge Wiki

Tag: sparse-activation

4 items with this tag.

  • Jun 14, 2026

    qwen-36-35b-a3b

    • ai-model
    • qwen
    • moe
    • llm
    • 35b-parameters
    • mixture-of-experts
    • sparse-activation
    • sparse-moe
    • 35b-model
    • edge-deployment
    • low-vram-inference
    • qwen-3.6
    • gemini-3.5-flash
    • google-gemini
    • claude-opus
    • anthropic
    • evaluation-awareness
    • reliability
    • tts
    • miso-tts
  • Jun 13, 2026

    activated-parameters

    • mixture-of-experts
    • model-efficiency
    • inference-compute
    • sparse-activation
    • deepseek-v4
    • kimi-k2
  • Jun 13, 2026

    elastic-sub-network-extraction-moe

    • ai/architecture
    • mixture-of-experts
    • sparse-activation
    • conditional-computation
    • model-scaling
    • moe-routing
    • expert-gating
    • dynamic-compute
    • parameter-efficiency
    • load-balancing
  • Jun 13, 2026

    moe-ai-model

    • ai/mixture-of-experts
    • ai/architecture
    • llm
    • efficiency
    • sparse-ml
    • inference
    • mixture-of-experts
    • sparse-activation
    • conditional-computation
    • routing-mechanism
    • parameter-scaling
    • inference-efficiency
    • expert-networks

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community