AI Systems
AI Systems
Designing and operating AI systems in production: contracts, state, model routing, human control.
A production AI system is not a model with a prompt. It is a system with input/output contracts, observable state, model-selection rules and fault-tolerance layers. We design AI as engineering infrastructure, not as a capabilities demo.
The difference shows not in the demo but a month into operation. A demo answers pre-selected questions over fresh data. Production meets stale sources, ambiguous requests, tool failures and cost that grows faster than load. A system designed as infrastructure handles these by rule; a system assembled as a demo breaks on them silently.
How it works
Every system is built around an explicit loop: context sources → retrieve and supply the right fragment → orchestrate steps → tools and agents → hand off to a human on hard cases → observability. Workflow state is materialized and traceable: for any answer you can reconstruct which context was supplied and why that decision was made. Without this the system can neither be debugged nor proven to behave predictably.
Model choice is a step decision
Not every step needs a strong model. Classification, field extraction and short rewrites are handled by a cheap fast model; the strong one is needed where the cost of error is high. The model is chosen per step — by risk, required quality and acceptable latency — not assigned globally “the same everywhere”. This is a quality and a cost question at once: routine steps on a cheap model remove a significant share of spend with no quality loss where quality wasn’t required.
Human control
Systems are designed as AI-assisted operations under human oversight: data-access rules, decision auditing, configurable thresholds for handing hard or risky cases to people. Autonomy is added by necessity, not by default: the right question is not “what else to trust the model with” but “what minimum authority is sufficient for this step”.
Where it applies
Customer interaction, operational automation, access to enterprise knowledge, monitoring — wherever repetitive workflows can move to a governed AI layer with predictable economics and preserved human control. The line is simple: AI takes the volume and the routine, the human stays in the loop where an error is expensive.