How to use AI
Planted 02026-05-29
A new design material for building more productive organizations
Technology is neither necessary nor sufficient for productivity improvement.
The durable source of productivity is better organization design; a change in how work is divided and coordinated, how behavior is governed and controlled, and how information is processed into decisions and learning.
The new organization is an organization where some work is delegated to agents, some work is supervised by humans, and some work is restructured entirely because agents change the cost of coordination, analysis, documentation, monitoring, and reuse.
- Who does what?
- Who decides?
- Who coordinates with whom?
- What information is needed?
- How is work controlled?
With AI agents, the same questions remain, but the answer changes as agents can become part of the organization’s coordination and information-processing architecture.
Coordination: fewer handoffs, smaller queues, faster routing
In many organizations, productivity is lost because work sits in queues, gets handed off too often, waits for clarification, or requires excessive meetings.
AI agents can improve coordination by acting as routing, preparation, follow-up, and escalation layers.
Instead of this:
Person A asks Person B for information.
Person B asks Person C.
Person C sends a spreadsheet.
Person A makes a deck.
A meeting is scheduled.
The meeting identifies missing information.
The cycle repeats.
You get this:
An agent gathers the needed information, checks prior work, identifies gaps, drafts a recommendation, routes exceptions to the right owner, and prepares the decision meeting only if needed.
Move routine coordination work out of meetings, email, chat, and status updates into agent-mediated workflows.
| Coordination problem | AI-agent redesign |
|---|---|
| Too many status meetings | Agents generate project status, blockers, risks, and decisions needed |
| Work gets stuck between teams | Agents monitor queues and escalate stalled items |
| Repeated clarification loops | Agents check requests for completeness before routing |
| Slow onboarding to projects | Agents generate project briefs, history, open issues, stakeholders, and relevant artifacts |
Control: from supervision of activity to governance of outcomes and exceptions
The benefit is shifting management attention away from monitoring low-value activity and toward exceptions, judgment, and system improvement.
In a traditional organization, control often happens through:
- hierarchy
- approvals
- meetings
- reports
- budget reviews
- performance metrics
- process compliance
- managerial supervision
With agents, some of this control can become continuous, embedded, and exception-based.
Instead of managers asking:
“What did everyone do this week?”
Agents can answer:
“Here are the deliverables completed, decisions pending, risks emerging, commitments missed, customer issues unresolved, and places where human intervention is needed.”
Managers stop supervising activity and start governing decision quality, exceptions, priorities, and learning.
| Control problem | AI-agent redesign |
|---|---|
| Managers spend time chasing updates | Agents compile updates automatically |
| Metrics lag reality | Agents produce near-real-time operating signals |
| Approvals are too slow | Agents approve low-risk cases within policy and escalate exceptions |
| Accountability is vague | Agents maintain decision logs, owners, commitments, and due dates |
| Compliance is manual | Agents check work against policy before submission |
An improved control system could have three layers:
- Layer 1: Agent-executed controls
- Agents handle routine checks:
- Is the request complete?
- Is it within policy?
- Does it match prior precedent?
- Is there a known risk?
- Is the data internally consistent?
- Does this require legal, finance, security, or executive review?
- Agents handle routine checks:
- Layer 2: Human exception review
- Humans intervene where there is:
- ambiguity
- high risk
- ethical judgment
- customer sensitivity
- strategic tradeoff
- irreversible commitment
- reputational exposure
- Humans intervene where there is:
- Layer 3: Governance of the agent system itself
- The organization also needs to govern the agents:
- What can agents decide?
- What must they escalate?
- What data can they access?
- Who owns their outputs?
- How are errors detected?
- How are models, prompts, tools, and permissions audited?
- How is performance measured?
- The organization also needs to govern the agents:
Without clear decision rights, accountability, and governance, agents can amplify confusion.
From individual expertise to institutionalized intelligence
Much knowledge work today is wasteful because organizations repeatedly recreate context:
- writing similar memos
- rebuilding analyses
- rediscovering prior decisions
- asking the same experts the same questions
- searching across scattered documents
- making slides instead of decisions
- producing analysis that is not reused
Instead of knowledge living mainly in people’s heads, old decks, Slack threads, emails, and shared drives, agents can help create an active organizational memory.
The future knowledge-work organization may have agents that:
- retrieve prior work before new work begins
- generate first drafts using company context
- compare new problems to past cases
- maintain decision logs
- convert meetings into actions and reusable knowledge
- identify contradictions across documents
- summarize expert judgment
- update playbooks after projects
- create templates from repeated work
- make best practices easier to follow
Knowledge management moves from passive repositories to active agents embedded in work.
The new division of labor
What work should humans do, what work should agents do, and what work should be redesigned away entirely?
| Work type | Best owner |
|---|---|
| Repetitive, rules-based, low-risk work | Agent |
| Information gathering and synthesis | Agent with human review |
| Drafting, formatting, summarizing | Agent |
| Monitoring, reminders, queue management | Agent |
| Pattern detection and anomaly detection | Agent |
| Ambiguous judgment | Human |
| Ethical tradeoffs | Human |
| Strategic prioritization | Human |
| Relationship management | Human |
| Final accountability | Human |
| Cross-functional conflict resolution | Human, supported by agents |
Agents should not merely be added to existing jobs. Jobs should be decomposed.
An “analyst” role may become:
| Component | Future design |
|---|---|
| Data gathering | Agent-led |
| Cleaning and reconciliation | Agent-assisted, system-verified |
| Initial analysis | Agent-generated |
| Interpretation | Human-led |
| Recommendation | Human-agent collaboration |
| Decision memo | Agent-drafted, human-edited |
| Stakeholder alignment | Human-led |
| Follow-up tracking | Agent-led |
Decision architectures
What decisions are made in this process, what information do they require, who has authority, and which parts can be prepared or executed by agents?
For each workflow, you can map:
| Design element | Question |
|---|---|
| Trigger | What starts the work? |
| Inputs | What information is required? |
| Agent tasks | What can agents gather, draft, check, or decide? |
| Human tasks | Where is judgment required? |
| Decision rights | Who can approve, reject, escalate, or override? |
| Controls | What policies, metrics, and risk checks apply? |
| Escalation | When does the agent stop and ask a human? |
| Learning | What gets captured for future reuse? |
The productivity gain comes from:
- fewer handoffs
- faster triage
- clearer ownership
- better reuse of prior cases
- less meeting time
- faster escalation
- better organizational memory
- improved learning loop
New roles
| Role | Purpose |
|---|---|
| Agent owner | Accountable for a specific agent’s performance and risk |
| Workflow architect | Redesigns work around human-agent collaboration |
| Decision-rights designer | Defines what agents, employees, and managers can decide |
| Knowledge curator | Ensures agents use accurate, current organizational knowledge |
| Agent evaluator | Tests outputs, monitors quality, identifies failure modes |
| AI governance lead | Manages policy, risk, permissions, audits, and compliance |
| Human escalation owner | Handles exceptions agents cannot resolve |
| Process-learning owner | Ensures completed work improves future work |
Before and after
Organizations often rely on:
- meetings for coordination
- managers for status tracking
- analysts for information synthesis
- employees for repetitive documentation
- shared drives for knowledge storage
- approvals for control
- committees for cross-functional decisions
- memory residing in individuals
The AI-agent-enabled organization relies more on:
- agents for routine coordination
- exception-based management
- human judgment for high-value decisions
- active organizational memory
- clearer decision rights
- automated monitoring and escalation
- reusable knowledge assets
- smaller teams with more leverage
- continuous learning loops
The goal is not fewer people doing the same work. The goal is less organizational friction per unit of output.
A vendor can improve model alignment, but you still own deployment alignment
Fake productivity looks like:
- faster reports with subtle inaccuracies,
- fewer staff but more unresolved exceptions,
- higher utilization but lower resilience,
- lower service cost but worse customer trust,
- faster maintenance planning but more safety risk,
- smoother dashboards but less ground truth,
- apparent compliance but weaker accountability.
AI is not a vendor promise, it is an operating discipline.
Define autonomy levels
| Level | Agent authority | Example |
|---|---|---|
| 0 — Read only | Can observe and summarize | Summarize daily port delays. |
| 1 — Recommend | Can suggest actions | Recommend replenishment orders. |
| 2 — Draft | Can prepare actions | Draft customer replies or work orders. |
| 3 — Act with approval | Can execute after human approval | Submit purchase order after manager approval. |
| 4 — Bounded autonomy | Can act within strict limits | Reorder consumables under $500 from approved vendors. |
| 5 — High autonomy | Can execute complex actions | Generally avoid except in low-risk, reversible domains. |
Use least-privilege tool access
| Capability | Treat separately |
|---|---|
| Read data | Can inspect records, logs, files |
| Write data | Can update records |
| Communicate | Can email, text, notify, call APIs |
| Spend money | Can create or approve purchases |
| Change operations | Can alter schedules, routes, work orders, settings |
| Access sensitive data | Can view employee, customer, financial, safety, or proprietary data |
| Execute code | Can run scripts or modify systems |
Use approval gates
| Trigger | Require human approval |
|---|---|
| Dollar amount | Purchases, refunds, credits, discounts above limit |
| Safety relevance | Anything affecting personnel safety or hazardous operations |
| Legal/regulatory relevance | Compliance filings, claims, regulated communications |
| Customer impact | Account closure, denial, escalation, major promise |
| Operational disruption | Schedule change, shutdown, reroute, shipment hold |
| Low confidence | Agent cannot cite sufficient evidence |
| Novel case | No similar precedent or outside normal operating range |
| Conflict | Agent receives contradictory data or policies |
| Sensitive data | Employee, medical, financial, confidential, or trade-secret data |
Test against failure modes before deployment
| Test type | What you are checking |
|---|---|
| Normal cases | Does it perform routine work correctly? |
| Edge cases | Does it behave well outside common patterns? |
| Adversarial cases | Can it be tricked into violating policy? |
| Conflict cases | What happens when data sources disagree? |
| Missing-data cases | Does it admit uncertainty? |
| Incentive-gaming cases | Does it optimize the metric while harming the real goal? |
| Role-boundary cases | Does it refuse actions outside authority? |
| Escalation cases | Does it correctly call for human review? |
| Regression cases | Does behavior degrade after prompt/model/tool changes? |
For an inventory agent, test: stockout, bad vendor data, duplicate SKU, promotion spike, weather event, supplier delay, return surge.
For a maintenance agent, test: noisy sensor, conflicting readings, stale data, safety-critical anomaly, known failure pattern, unknown pattern.
For a customer agent, test: angry customer, refund abuse, legal threat, discrimination complaint, safety issue, ambiguous policy.
Use separation of duties
| Role | Actor |
|---|---|
| Proposer | Agent A recommends action |
| Checker | Agent B or deterministic rule checks policy and evidence |
| Approver | Human approves material actions |
| Executor | System performs approved action |
| Auditor | Separate process samples and reviews outcomes |
Monitor behavior after deployment
| Metric category | Examples |
|---|---|
| Task performance | Accuracy, completion time, throughput |
| Human override rate | How often humans reject or edit recommendations |
| Escalation quality | Does it escalate the right cases? |
| False positives | Unnecessary alerts, bad recommendations |
| False negatives | Missed issues, missed risks |
| Policy violations | Unauthorized actions or attempted actions |
| Evidence quality | Missing citations, stale data, weak reasoning |
| Drift | Behavior changes over time |
| User trust | Overreliance or underuse |
| Business outcome | Downtime, refunds, quality defects, customer satisfaction, safety incidents |
Install tripwires and kill switches
| Tripwire | Response |
|---|---|
| Attempts unauthorized tool use | Block and alert |
| Repeated low-confidence actions | Pause workflow |
| Unusual volume of actions | Rate-limit and review |
| Policy conflict detected | Escalate |
| Sensitive-data exposure | Stop and log incident |
| Unexpected spend | Freeze purchases |
| Operational anomaly | Require human approval |
| User complaint spike | Roll back version |
Keep humans meaningfully in the loop
Bad human-in-the-loop design:
Agent makes 500 recommendations. Human clicks approve because there is no time.
Good design:
Agent handles routine cases automatically within narrow bounds; ambiguous or high-impact cases are escalated with evidence and alternatives.
Own the data context
| Context issue | Risk |
|---|---|
| Stale policies | Agent follows outdated rules |
| Bad master data | Agent makes wrong operational decisions |
| Unclear source priority | Agent cannot resolve conflicting information |
| Excess context | Agent sees sensitive or irrelevant data |
| Missing context | Agent makes confident but wrong recommendations |
| Unstructured tribal knowledge | Agent misses real operating constraints |
Stage deployment
| Stage | Description |
|---|---|
| Shadow mode | Agent makes recommendations but no one acts on them yet. Compare to human decisions. |
| Advisory mode | Agent recommendations are visible; humans decide. |
| Draft mode | Agent prepares work products; humans edit and approve. |
| Bounded action | Agent acts only in low-risk, reversible cases. |
| Expanded action | Autonomy grows only after measured reliability. |