Qwen 2
2026 04 14 Best small LLM for local inference for instruction following
Overview
Qwen 2 is a Large Language Model (LLM) developed by Qwen.
Key Features
- Optimized for local inference
- Available in various sizes for different use cases
Related Models
Use Cases
- Instruction Following: Qwen 2 72B is noted as a viable option for running well-instructed small LLMs on a 48GB VRAM NVIDIA GPU when properly quantized.
References
- 2026 04 14 Best small LLM for local inference for instruction following