Built by Berry — Operational AI
Menu

Navigation

Built by Berry is the operational AI firm — we ship the systems, the agents, and the training to run them.

Start a Project
Expert in the Loop · Part 6 of 6 Explainer Engineering and operations leads shipping agent workflows to production

Where control points become software

5 min read Published 2026-04-10 Updated Jun 14, 2026

You can run the four-question test in a workshop. You can label generate, recommend, decide, and act on a whiteboard. You can name the domain expert and draw the judgment map. And then Monday arrives, volume spikes, and control points that lived only in meetings evaporate — because there is no queue, no routing rule, no record of who decided what. This piece closes the Expert in the Loop series by making control points operational: the minimum viable stack, the Friday afternoon test as capstone, and where to go next for production implementation.

#Why meeting-room control points fail

Implicit control points depend on presence. The expert is in the loop because they are online, they saw the Slack message, they remember the client tier rule from last quarter. That does not scale. It does not survive handoffs. It does not answer the question leadership asks after the first visible failure.

Software control points persist:

  • Exception routing — rules that send cases to human review before act steps execute.
  • Named queue owner — a person accountable for clearing the review queue, not a channel.
  • Input snapshot — frozen inputs at generation time so approval is not on stale data.
  • Action log — who decided, what changed, what shipped downstream.

These are not exotic requirements. They are the same infrastructure you would build for any multi-step operational workflow. The AI call is one step. The control point is another. Both are production software.

#Minimum viable control-point stack

You do not need a full platform on day one. You need four capabilities wired together on the first workflow you ship.

Capability What it does Failure without it
Exception routing Sends cases outside boundary to review Wrong automatic actions ship
Named queue owner Accountability for clearing review Cases age, SLAs slip
Input snapshot Freezes context at run time Approve stale output
Action log Records human decisions Friday debug becomes engineering archaeology

Mirror the architecture cues from production agent work: state in your database, queues for model runs and human review, configuration for business rules — thresholds, tiers, keyword matches — not buried in prompts only engineering can edit.

flowchart LR case[Case arrives] --> rules{Routing rules} rules -->|Inside boundary| auto[Automated path] rules -->|Outside boundary| queue[Review queue] queue --> owner[Named queue owner] owner --> log[Action log] auto --> log

This is the same shape as the reviewer console pattern — described in full in The reviewer console is where humans belong. This series explains why that pattern exists. Production AI shows how to build it.

#Friday afternoon test — series capstone

Someone will need to debug a wrong answer. The question is whether that someone is an ops lead with a screen or an engineer with log access.

Open the last case that produced a bad outcome. In under five minutes, reconstruct: what inputs did the model see, what did it produce, what routing rule sent it to review, what did the human do, what shipped downstream.

If you cannot pass that test, the control point is not production-ready — regardless of demo quality on happy paths. Audit is how you answer "what happened, and how do we prevent it?" without shutting the workflow down for a week.

Run this test on the workflow you sketched in Where AI contributes and where judgment takes over before you fund the next model upgrade.

#Series exit: from thinking to shipping

The Expert in the Loop arc moves from frame to diagnosis to visibility to accountability to workflow design to software.

Article You should be able to…
Thought partner, not shortcut Name shortcut vs alignment intent
Bad thinking scales Run the four-question test
Make thinking visible Produce desk-sized alignment artifacts
Expert as control point Name roles and avoid approver theater
Generate / decide split Label control points on one workflow
This article Ship minimum routing, queue, snapshot, log

If you are shipping agents next: read The reviewer console is where humans belong — exception routing, audit, and what to build before the next model upgrade.

If you are evaluating whether the operation can carry automation: start with AI readiness is an operations question and the Readiness series.

If you are building the full production stack: continue with the Production AI series.

If you are sharpening individual expert habits: use the Thinking With AI guide alongside this team-facing structure.

#Build the queue before the model upgrade

Pick the workflow where you labeled decide and act steps. Ship the review queue and action log first. Connect exception routing from day one. Train the named queue owner on the screen — not on the prompt.

Then upgrade the model. You will be able to tell whether accuracy improved, because you have the audit trail to measure it. Control points that live in software survive the week you hoped meetings would handle.

Edit this article on GitHub