NPU support

The capability of software and frameworks to run AI model inference efficiently on specialized Neural Processing Units (NPUs).
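In practice, this capability often comes down to a runtime choosing an accelerator when one is present and falling back otherwise. The sketch below illustrates that idea only; `Backend`, `pick_backend`, and the backend labels are hypothetical names made up for this note, not the API of any real framework.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Backend:
    """Hypothetical description of one inference backend on a device."""
    name: str        # illustrative label such as "npu", "gpu", or "cpu"
    available: bool  # whether the runtime detected usable hardware/drivers


def pick_backend(
    backends: Sequence[Backend],
    preferred: Sequence[str] = ("npu", "gpu", "cpu"),
) -> str:
    """Return the first available backend name in preference order."""
    available = {b.name for b in backends if b.available}
    for name in preferred:
        if name in available:
            return name
    raise RuntimeError("no usable inference backend found")


# Example: the NPU is not detected, so selection falls back to the GPU.
detected = [Backend("npu", False), Backend("gpu", True), Backend("cpu", True)]
print(pick_backend(detected))  # prints "gpu"
```

Real frameworks expose this choice through their own mechanisms (for example delegates or execution providers), so the preference order here is an assumption rather than a rule.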

Key Implementations

Edge Models


Backlinks:

  • 2026 04 22 Google Gemma 4 Efficient 2.3B Parameter Multimodal Edge AI
  • 2026 04 14 Nexa AI run models locally

Source Notes