How to use AI

Planted 02026-05-29

A new design material for building more productive organizations

Technology is neither necessary nor sufficient for productivity improvement.

The durable source of productivity is better organization design; a change in how work is divided and coordinated, how behavior is governed and controlled, and how information is processed into decisions and learning.

The new organization is an organization where some work is delegated to agents, some work is supervised by humans, and some work is restructured entirely because agents change the cost of coordination, analysis, documentation, monitoring, and reuse.

  • Who does what?
  • Who decides?
  • Who coordinates with whom?
  • What information is needed?
  • How is work controlled?

With AI agents, the same questions remain, but the answer changes as agents can become part of the organization’s coordination and information-processing architecture.

Coordination: fewer handoffs, smaller queues, faster routing

In many organizations, productivity is lost because work sits in queues, gets handed off too often, waits for clarification, or requires excessive meetings.

AI agents can improve coordination by acting as routing, preparation, follow-up, and escalation layers.

Instead of this:

Person A asks Person B for information.
Person B asks Person C.
Person C sends a spreadsheet.
Person A makes a deck.
A meeting is scheduled.
The meeting identifies missing information.
The cycle repeats.

You get this:

An agent gathers the needed information, checks prior work, identifies gaps, drafts a recommendation, routes exceptions to the right owner, and prepares the decision meeting only if needed.

Move routine coordination work out of meetings, email, chat, and status updates into agent-mediated workflows.

Coordination problemAI-agent redesign
Too many status meetingsAgents generate project status, blockers, risks, and decisions needed
Work gets stuck between teamsAgents monitor queues and escalate stalled items
Repeated clarification loopsAgents check requests for completeness before routing
Slow onboarding to projectsAgents generate project briefs, history, open issues, stakeholders, and relevant artifacts

Control: from supervision of activity to governance of outcomes and exceptions

The benefit is shifting management attention away from monitoring low-value activity and toward exceptions, judgment, and system improvement.

In a traditional organization, control often happens through:

  • hierarchy
  • approvals
  • meetings
  • reports
  • budget reviews
  • performance metrics
  • process compliance
  • managerial supervision

With agents, some of this control can become continuous, embedded, and exception-based.

Instead of managers asking:

“What did everyone do this week?”

Agents can answer:

“Here are the deliverables completed, decisions pending, risks emerging, commitments missed, customer issues unresolved, and places where human intervention is needed.”

Managers stop supervising activity and start governing decision quality, exceptions, priorities, and learning.

Control problemAI-agent redesign
Managers spend time chasing updatesAgents compile updates automatically
Metrics lag realityAgents produce near-real-time operating signals
Approvals are too slowAgents approve low-risk cases within policy and escalate exceptions
Accountability is vagueAgents maintain decision logs, owners, commitments, and due dates
Compliance is manualAgents check work against policy before submission

An improved control system could have three layers:

  1. Layer 1: Agent-executed controls
    • Agents handle routine checks:
      • Is the request complete?
      • Is it within policy?
      • Does it match prior precedent?
      • Is there a known risk?
      • Is the data internally consistent?
      • Does this require legal, finance, security, or executive review?
  2. Layer 2: Human exception review
    • Humans intervene where there is:
      • ambiguity
      • high risk
      • ethical judgment
      • customer sensitivity
      • strategic tradeoff
      • irreversible commitment
      • reputational exposure
  3. Layer 3: Governance of the agent system itself
    • The organization also needs to govern the agents:
      • What can agents decide?
      • What must they escalate?
      • What data can they access?
      • Who owns their outputs?
      • How are errors detected?
      • How are models, prompts, tools, and permissions audited?
      • How is performance measured?

Without clear decision rights, accountability, and governance, agents can amplify confusion.

From individual expertise to institutionalized intelligence

Much knowledge work today is wasteful because organizations repeatedly recreate context:

  • writing similar memos
  • rebuilding analyses
  • rediscovering prior decisions
  • asking the same experts the same questions
  • searching across scattered documents
  • making slides instead of decisions
  • producing analysis that is not reused

Instead of knowledge living mainly in people’s heads, old decks, Slack threads, emails, and shared drives, agents can help create an active organizational memory.

The future knowledge-work organization may have agents that:

  • retrieve prior work before new work begins
  • generate first drafts using company context
  • compare new problems to past cases
  • maintain decision logs
  • convert meetings into actions and reusable knowledge
  • identify contradictions across documents
  • summarize expert judgment
  • update playbooks after projects
  • create templates from repeated work
  • make best practices easier to follow

Knowledge management moves from passive repositories to active agents embedded in work.

The new division of labor

What work should humans do, what work should agents do, and what work should be redesigned away entirely?

Work typeBest owner
Repetitive, rules-based, low-risk workAgent
Information gathering and synthesisAgent with human review
Drafting, formatting, summarizingAgent
Monitoring, reminders, queue managementAgent
Pattern detection and anomaly detectionAgent
Ambiguous judgmentHuman
Ethical tradeoffsHuman
Strategic prioritizationHuman
Relationship managementHuman
Final accountabilityHuman
Cross-functional conflict resolutionHuman, supported by agents

Agents should not merely be added to existing jobs. Jobs should be decomposed.

An “analyst” role may become:

ComponentFuture design
Data gatheringAgent-led
Cleaning and reconciliationAgent-assisted, system-verified
Initial analysisAgent-generated
InterpretationHuman-led
RecommendationHuman-agent collaboration
Decision memoAgent-drafted, human-edited
Stakeholder alignmentHuman-led
Follow-up trackingAgent-led

Decision architectures

What decisions are made in this process, what information do they require, who has authority, and which parts can be prepared or executed by agents?

For each workflow, you can map:

Design elementQuestion
TriggerWhat starts the work?
InputsWhat information is required?
Agent tasksWhat can agents gather, draft, check, or decide?
Human tasksWhere is judgment required?
Decision rightsWho can approve, reject, escalate, or override?
ControlsWhat policies, metrics, and risk checks apply?
EscalationWhen does the agent stop and ask a human?
LearningWhat gets captured for future reuse?

The productivity gain comes from:

  • fewer handoffs
  • faster triage
  • clearer ownership
  • better reuse of prior cases
  • less meeting time
  • faster escalation
  • better organizational memory
  • improved learning loop

New roles

RolePurpose
Agent ownerAccountable for a specific agent’s performance and risk
Workflow architectRedesigns work around human-agent collaboration
Decision-rights designerDefines what agents, employees, and managers can decide
Knowledge curatorEnsures agents use accurate, current organizational knowledge
Agent evaluatorTests outputs, monitors quality, identifies failure modes
AI governance leadManages policy, risk, permissions, audits, and compliance
Human escalation ownerHandles exceptions agents cannot resolve
Process-learning ownerEnsures completed work improves future work

Before and after

Organizations often rely on:

  • meetings for coordination
  • managers for status tracking
  • analysts for information synthesis
  • employees for repetitive documentation
  • shared drives for knowledge storage
  • approvals for control
  • committees for cross-functional decisions
  • memory residing in individuals

The AI-agent-enabled organization relies more on:

  • agents for routine coordination
  • exception-based management
  • human judgment for high-value decisions
  • active organizational memory
  • clearer decision rights
  • automated monitoring and escalation
  • reusable knowledge assets
  • smaller teams with more leverage
  • continuous learning loops

The goal is not fewer people doing the same work. The goal is less organizational friction per unit of output.

A vendor can improve model alignment, but you still own deployment alignment

Fake productivity looks like:

  • faster reports with subtle inaccuracies,
  • fewer staff but more unresolved exceptions,
  • higher utilization but lower resilience,
  • lower service cost but worse customer trust,
  • faster maintenance planning but more safety risk,
  • smoother dashboards but less ground truth,
  • apparent compliance but weaker accountability.

AI is not a vendor promise, it is an operating discipline.

Define autonomy levels

LevelAgent authorityExample
0 — Read onlyCan observe and summarizeSummarize daily port delays.
1 — RecommendCan suggest actionsRecommend replenishment orders.
2 — DraftCan prepare actionsDraft customer replies or work orders.
3 — Act with approvalCan execute after human approvalSubmit purchase order after manager approval.
4 — Bounded autonomyCan act within strict limitsReorder consumables under $500 from approved vendors.
5 — High autonomyCan execute complex actionsGenerally avoid except in low-risk, reversible domains.

Use least-privilege tool access

CapabilityTreat separately
Read dataCan inspect records, logs, files
Write dataCan update records
CommunicateCan email, text, notify, call APIs
Spend moneyCan create or approve purchases
Change operationsCan alter schedules, routes, work orders, settings
Access sensitive dataCan view employee, customer, financial, safety, or proprietary data
Execute codeCan run scripts or modify systems

Use approval gates

TriggerRequire human approval
Dollar amountPurchases, refunds, credits, discounts above limit
Safety relevanceAnything affecting personnel safety or hazardous operations
Legal/regulatory relevanceCompliance filings, claims, regulated communications
Customer impactAccount closure, denial, escalation, major promise
Operational disruptionSchedule change, shutdown, reroute, shipment hold
Low confidenceAgent cannot cite sufficient evidence
Novel caseNo similar precedent or outside normal operating range
ConflictAgent receives contradictory data or policies
Sensitive dataEmployee, medical, financial, confidential, or trade-secret data

Test against failure modes before deployment

Test typeWhat you are checking
Normal casesDoes it perform routine work correctly?
Edge casesDoes it behave well outside common patterns?
Adversarial casesCan it be tricked into violating policy?
Conflict casesWhat happens when data sources disagree?
Missing-data casesDoes it admit uncertainty?
Incentive-gaming casesDoes it optimize the metric while harming the real goal?
Role-boundary casesDoes it refuse actions outside authority?
Escalation casesDoes it correctly call for human review?
Regression casesDoes behavior degrade after prompt/model/tool changes?

For an inventory agent, test: stockout, bad vendor data, duplicate SKU, promotion spike, weather event, supplier delay, return surge.

For a maintenance agent, test: noisy sensor, conflicting readings, stale data, safety-critical anomaly, known failure pattern, unknown pattern.

For a customer agent, test: angry customer, refund abuse, legal threat, discrimination complaint, safety issue, ambiguous policy.

Use separation of duties

RoleActor
ProposerAgent A recommends action
CheckerAgent B or deterministic rule checks policy and evidence
ApproverHuman approves material actions
ExecutorSystem performs approved action
AuditorSeparate process samples and reviews outcomes

Monitor behavior after deployment

Metric categoryExamples
Task performanceAccuracy, completion time, throughput
Human override rateHow often humans reject or edit recommendations
Escalation qualityDoes it escalate the right cases?
False positivesUnnecessary alerts, bad recommendations
False negativesMissed issues, missed risks
Policy violationsUnauthorized actions or attempted actions
Evidence qualityMissing citations, stale data, weak reasoning
DriftBehavior changes over time
User trustOverreliance or underuse
Business outcomeDowntime, refunds, quality defects, customer satisfaction, safety incidents

Install tripwires and kill switches

TripwireResponse
Attempts unauthorized tool useBlock and alert
Repeated low-confidence actionsPause workflow
Unusual volume of actionsRate-limit and review
Policy conflict detectedEscalate
Sensitive-data exposureStop and log incident
Unexpected spendFreeze purchases
Operational anomalyRequire human approval
User complaint spikeRoll back version

Keep humans meaningfully in the loop

Bad human-in-the-loop design:

Agent makes 500 recommendations. Human clicks approve because there is no time.

Good design:

Agent handles routine cases automatically within narrow bounds; ambiguous or high-impact cases are escalated with evidence and alternatives.

Own the data context

Context issueRisk
Stale policiesAgent follows outdated rules
Bad master dataAgent makes wrong operational decisions
Unclear source priorityAgent cannot resolve conflicting information
Excess contextAgent sees sensitive or irrelevant data
Missing contextAgent makes confident but wrong recommendations
Unstructured tribal knowledgeAgent misses real operating constraints

Stage deployment

StageDescription
Shadow modeAgent makes recommendations but no one acts on them yet. Compare to human decisions.
Advisory modeAgent recommendations are visible; humans decide.
Draft modeAgent prepares work products; humans edit and approve.
Bounded actionAgent acts only in low-risk, reversible cases.
Expanded actionAutonomy grows only after measured reliability.