
Building an LLM orchestration system that actually works
There's a pattern I keep seeing with teams that adopt LLMs: the first version works, the second version is worse, and by the third version nobody knows which prompts are in production. The problem isn't the models. It's that nobody's treating prompt systems like engineering artifacts.
Full post coming soon.