OpenCode · ArchWorks

A self-hosted multi-agent LLM stack

7 Jun 2026 · 8,165 words · 39 min read

The full writeup: a GPU host running llama.cpp + llama-swap behind a gateway, the OpenCode agent runtime on top, the single-slot constraint and the agents built around it, the five subagent rules, three-layer skills, a …