RAG over your notes gives an AI a pile of text. A knowledge graph gives it entities and relationships. Here is how I build one from my own words alone: collect, extract triples, dedupe, normalise, and read and write it …
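
As a rough illustration of the dedupe and normalise steps named above, here is a minimal sketch. It assumes triples are plain (subject, predicate, object) string tuples and that normalising means lowercasing and collapsing whitespace. It also normalises before deduping so that surface variants collapse into one entry, which may not be the order the full post uses. The function names and the sample triples are hypothetical, made up for the example.

```python
from typing import Iterable

# Assumption: a triple is a plain (subject, predicate, object) tuple of strings.
Triple = tuple[str, str, str]


def normalise(term: str) -> str:
    """Lowercase and collapse runs of whitespace so surface variants compare equal."""
    return " ".join(term.lower().split())


def dedupe_triples(triples: Iterable[Triple]) -> list[Triple]:
    """Return normalised triples with duplicates removed, keeping first-seen order."""
    seen: set[Triple] = set()
    unique: list[Triple] = []
    for subj, pred, obj in triples:
        key = (normalise(subj), normalise(pred), normalise(obj))
        if key not in seen:
            seen.add(key)
            unique.append(key)
    return unique


# Hypothetical sample data: the first two rows differ only in case and spacing.
raw = [
    ("Alice", "works at", "Acme"),
    ("alice ", "Works  at", "ACME"),
    ("Alice", "knows", "Bob"),
]
graph = dedupe_triples(raw)
```

Real notes would need more than this, for example merging aliases of the same entity, but the shape of the step stays the same: map each triple to a canonical key, then keep one triple per key.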
The full writeup: a GPU host running llama.cpp + llama-swap behind a gateway, the OpenCode agent runtime on top, the single-slot constraint and the agents built around it, the five subagent rules, three-layer skills, a …
Five layers on top of a local model: a single-slot model swap, model tiering, narrow agents with isolated context, skills that wrap an API as one tool call, and a memory layer that learns from its own runs. The …