Built by Berry — Operational AI
Menu

Navigation

Built by Berry is the operational AI firm — we ship the systems, the agents, and the training to run them.

Start a Project
Explainer Operations-heavy companies (~50–120 people) where manual coordination is the bottleneck

Your team is doing work the software should be doing

8 min read Published 2026-05-14 Updated Jun 14, 2026

Your CRM has a billing integration. Your onboarding tracker exists. You bought an AI copilot last year. And every Monday, ops still rebuilds new accounts by hand across three systems because nothing agrees on what a customer record should look like.

That is not a people problem. It is not a prompt problem. It is a systems problem — the software never took ownership of the multi-step job your team runs every day. Humans became the integration layer: copying fields, chasing approvals in Slack, reconciling spreadsheets because the tools do not talk to each other in the order the work actually happens.

#The job your team is doing instead of the software

Operational work at scale is mostly coordination: move data across boundaries, apply rules, route exceptions, leave an audit trail someone can trust next month.

Software is supposed to own that loop. When it does not, the team fills the gap with discipline and overtime.

A recognizable pattern from a ~50-person B2B services firm: sales closes a deal in the CRM. Ops opens billing, the onboarding tracker, and a shared drive folder. They retype the same company name, contract terms, and contact list three times because each system was bought for a department, not for the handoff between them. Roughly three hours per account. Errors surface two weeks later as billing disputes and missed SLAs. Leadership asks for a dashboard; finance exports CSVs because the numbers in CRM and billing do not match.

Nobody is lazy. The job was never modeled as one workflow. Each tool owns a slice. The team owns the glue.

The same pattern shows up outside onboarding:

  • Support triages tickets by reading the inbox, opening CRM, checking billing, and deciding who should see it — steps a workflow system should run with a human only on exceptions.
  • Finance closes the month by reconciling exports because the general ledger and the operational systems were never wired for a single source of truth.
  • Rev ops maintains a spreadsheet of "what we actually sold" because the CRM pipeline and the fulfillment record diverged six quarters ago.

In every case, skilled people do repetitive coordination because no system was built for the path work actually travels.

#Diagnostic signs — before you add more AI

More AI does not fix coordination debt. It automates a broken path faster.

If these signs are present, your bottleneck is systems — not model selection:

Retyping is the workflow. Ask the person who runs the job: how many systems do you touch, and where do you copy data by hand? If the answer is more than one manual copy per run, you have a Tools-layer gap before you have an AI opportunity. The Readiness Stack puts Data and Process ahead of Tools for a reason: agents cannot reconcile records that humans are still rebuilding from scratch.

Slack is your approval system. Decisions that should be logged, routed, and auditable live in threads. Nobody can answer "who approved this exception?" on a Friday afternoon without scrolling. That is a Process and Decision gap — not something you solve with a better chatbot.

Leadership does not trust the numbers. When ops and finance argue about headcount on new accounts or dispute volume, the issue is not reporting. It is that no system owns reconciliation. Dashboards built on drifted data accelerate bad decisions.

The AI you bought never left the early adopters. A copilot in the browser helps individuals write faster. It does not run the onboarding workflow, route billing exceptions, or give leadership one record per customer. Adoption stuck inside three power users is a signal the tool was never embedded in the job — see when operational AI stops being a pilot for the production threshold.

Engineering is the escalation path for ops. When routine operational exceptions become Jira tickets, ops is running a manual exception router because the product stack has no reviewer surface. Humans belong in a reviewer console, not in a Slack ping to engineering.

If three or more of these are true, pause the AI pilot. Fix the workflow shell first.

Sign What it means Layer to fix first
Retyping across systems Humans are the integration layer Tools / Data
Slack approvals No logged, auditable routing Process / Decision
Leadership distrusts numbers No reconciliation owner Data / Tools
Copilot stuck in 3 power users AI not embedded in the job Process / People
Ops escalates to engineering No reviewer surface Tools / People

#Why "more AI" makes coordination debt worse

The vendor pitch is seductive: connect your data, add an agent, watch the manual work disappear. In practice, without a system that owns state, handoffs, and audit, you get:

Faster wrong answers. An agent that drafts from CRM data that ops already knows is stale sends confident emails about the wrong contract tier. The team spends more time correcting AI output than they spent on copy-paste.

Unaudited automation. Scripts and copilots run in individual sessions. When something breaks, there is no job log, no replay, no owner. You cannot debug a Monday-morning failure if the steps lived in someone's chat history.

Pilot purgatory. A demo that triages ten tickets beautifully never ships because nobody defined what happens on ticket eleven — the edge case that needs a human, the billing flag that needs finance, the SLA breach that needs a manager. Pilots optimize the happy path. Operations lives in exceptions.

Budget split across layers. You pay for a model API, a integration tool, a consulting workshop, and still no single workflow that runs end-to-end in production. AI readiness is an operations question: readiness means the operation can host automation, not that someone attended a prompt-engineering webinar.

AI earns its place after the workflow exists — queued steps, explicit state, reconciliation built in, exceptions routed to a named human. Agents then handle the routine path inside that shell. Without the shell, agents are a faster version of the same manual glue.

#The systems-first path

Build the smallest end-to-end version that handles the common path in production. Real data. Real users. Side by side with whatever it replaces. Not a slide deck. Not a sandbox.

flowchart LR w1[Week 1 — Metric and scope] --> w2[Weeks 2-4 — Common path live] w2 --> w3[Weeks 5-8 — Exceptions and console] w3 --> w4[Train and exit]

Week 1 — Name the bottleneck and the metric. Sit with the team doing the work. Pick one workflow whose fix moves a number leadership already cares about: hours per onboarding, disputes per week, days to close. One page: metric, common path, v1 boundaries. Everything else waits.

Weeks 2–4 — Ship the routine path. Model the job as states and handoffs, not as a list of tool features. Ingest from the systems you have. Write back where the team writes today so adoption does not require a big-bang cutover. Log every transition. The goal is that a new account (or ticket, or close task) runs through the system without someone retyping the same field three times.

Weeks 5–8 — Cover the exceptions. Add the edge cases, alerting, and reviewer console. Route the 5% that needs judgment to the right human with context attached — not to a generic inbox. Watch the metric move. Tighten what is not moving fast enough.

Training on the way out. Department workshops on the workflows you shipped, written runbooks, a handoff the team can run without the vendor. Adoption is a deliverable, not a webinar scheduled after go-live.

For the B2B services pattern above, the outcome looked like this: per-account ops time on the routine path dropped by roughly eighty-five percent — from hours of cross-tool entry to under thirty minutes. Billing disputes traceable to onboarding errors went to near zero. Leadership got one record per new account instead of three conflicting ones. First system live in about six weeks. No new hires — the same team, less glue work.

That is operational AI in the right order: system first, agents on the routine steps inside it, humans on exceptions, training so the team owns it.

#What to avoid

Buying another departmental tool. A fourth SaaS product adds another login and another export. It does not own the handoff unless you build the workflow around it.

Automating the spreadsheet. Moving formulas into a script preserves the manual mental model. Build toward systems of record, not faster shadows.

Perfecting edge cases before the common path ships. The 70% that runs every day pays for the project. Edge cases matter in weeks five through eight, not before anyone uses v1.

Delegating scope to people who do not run the job. Executives know the pain exists. Operators know where the copy-paste lives. Scope without operators is how you fund six weeks of discovery theater.

Skipping the metric. If you cannot name what moves, you will ship something impressive that nobody measures. Tie the build to hours, days, counts — something finance and ops both recognize.

#What to do next

Tomorrow, shadow the workflow that costs the most hours every week. Count how many systems get touched, how many fields get retyped, and how many decisions live in Slack instead of a log.

Write the baseline — even a rough one. Then write one sentence: the smallest system that could run the common path end-to-end in production within sixty days.

If that sentence is hard to write, you are not ready for agents. You are ready to fix the coordination layer first — and the right next step is a scope conversation with the operator in the room, not another copilot license.

Edit this article on GitHub