NemoClaw Knowledge Wiki

Tag: llamacpp

11 items with this tag.

Jul 22, 2026
Bonsai 27B vs. Qwen 35B: LLM Performance and Replacement Feasibility Benchmarks
Jul 22, 2026
Ternary Bonsai 27B vs. Qwen 27B: LLM Performance Benchmarking Summary
Jul 17, 2026
fahd-mirza
Jul 13, 2026
Developing Persistent, Intelligent Memory for Local AI with a Librarian System
Jul 11, 2026
gguf-format
Jun 20, 2026
Ollama, LM Studio, and llama.cpp: Local AI Tool Comparison and Use Cases
Jun 03, 2026
Adaptive PFlash and Hermes Agent: Self-Tuning LLM Prefill for Long Contexts
May 22, 2026
llama.cpp Router Mode: Native Hot-Swappable Local LLM Switching
May 20, 2026
MTP + Ngram Stacked Speculative Decoding in Llama.cpp for LLM Inference
May 11, 2026
Higgsfield: Enabling LLMs like Claude for Media Generation
May 10, 2026
Achieving Fast 35B MoE AI Model Performance on 6GB VRAM with Llama.cpp
