Local coding assistants
Systems and frameworks that utilize locally hosted large-language-models (LLMs) to provide AI-driven programming capabilities—such as code generation, refactoring, and agentic terminal execution—within a private or offline environment.
Core Technologies
- ollama: A streamlined platform for running and managing LLMs locally.
- claude-code: An agentic coding tool designed to interface with the Anthropic API.
- GLM-4.7-Flash: A 30B parameter mixture-of-experts (MoE) model used for efficient local inference.
Recent Developments
- Anthropic API Compatibility: ollama now supports the Anthropic API standard, enabling tools built specifically for Claude to interact with local models.
- Local Agentic Workflows: Users can now run claude-code locally by routing its requests through ollama using models such as GLM-4.7-Flash.
Backlinks
- 2026 04 14 Ollama Claude GLM Channel Sam Witteveen