RICE-A Test   ADLC Role Assignment Framework

Q: What is the RICE-A Test in the ADLC?

The RICE-A Test is a five-factor decision framework used in the Agentic Development Lifecycle to determine whether a task should be delegated to an AI agent or retained by a human. RICE-A stands for Reversible, Inspectable, Constrained, Low Error Cost, and Autonomy-appropriate.

Q: What happens if a task fails the RICE-A Test?

If any of the five RICE-A criteria fail, the human owns the task. The test is a gate all five must pass for the agent to be assigned the work. A single failure means human ownership.

Q: What does each letter in RICE-A stand for?

R Reversible: can the task be undone cheaply if wrong? I Inspectable: can the output be verified by test or review? C Constrained: do clear and testable criteria exist? E Low Error Cost: is the cost low if things fail? A Autonomy: what level of autonomy is appropriate for this task?

Prashant Dhingra

The five criteria

What each letter tests

Each criterion targets a different failure mode of over-delegating to an agent. Together they form a complete safety screen.

Reversible

Can it be undone cheaply if wrong?

Reversibility is the single most important safety property of agent-assigned work. If an agent makes an error on a reversible task, the fix is local and bounded. If the task is irreversible think database schema migration, public API change, or a financial transaction a mistake propagates forward and compounds. Tasks that cannot be easily undone stay with humans.

✓ Write a function ✓ Refactor a module ✗ Rename public API endpoint ✗ Delete production data

Inspectable

Can the output be verified by test or review?

If you cannot check whether the agent did the task correctly, you cannot safely trust the output. Inspectability requires either automated verification (tests, linters, type-checkers) or a feasible human review path. Tasks whose correctness can only be assessed by running in production with real users are not inspectable enough to be agent-owned.

✓ Generate test suite ✓ Document an API ✗ UX copy for emotional resonance ✗ Business strategy framing

Constrained

Do clear and testable criteria exist?

An agent without constraints is an agent without a success condition. Constrained tasks have explicit acceptance criteria that can be checked either by the agent itself or by a gate. Vague tasks ("make this better", "improve performance") cannot be reliably assessed and so cannot be safely delegated. Specification quality directly determines whether this criterion passes.

✓ Sort list ascending, all tests pass ✓ Response time < 200ms under load ✗ Make the UI feel more intuitive ✗ Improve developer experience

Low Error Cost

Is the cost low if things fail?

Even reversible, inspectable, constrained tasks can carry high error cost if they are on a critical path, involve sensitive data, or block many downstream teams. Error cost is not just about reverting a change it includes detection time, blast radius, customer impact, and reputational exposure. High-error-cost tasks require human oversight even when other criteria pass.

✓ Draft internal documentation ✓ Generate mock data for dev env ✗ Modify authentication logic ✗ Deploy to production on Friday

Autonomy

What level of autonomy applies?

The first four criteria determine whether to delegate. Autonomy determines how much. A task may pass R, I, C, and E but still require human checkpoints at each sub-step (Level 2) rather than full autonomous execution (Level 5). Autonomy level is set by the human at Stage C (Planning) based on demonstrated agent reliability at that specific task type not optimism.

L1–L2 Human-led or co-piloted L3 Supervised autonomy L4–L5 Agent-led or fully autonomous

The gate

Apply the test at every stage gate

The RICE-A test is not applied once at the start of a project. It is re-applied at each ADLC stage gate as context changes a task that passes early may fail later as its blast radius grows, or pass later as agent reliability in that area is established.

Can the output be reverted without costly side effects?

Can correctness be confirmed by test, lint, or review?

Are acceptance criteria explicit and machine-checkable?

Is the blast radius of failure bounded and tolerable?

Has the right autonomy level been assigned and approved?

Outcomes

🤖

Agent can do it

All 5 criteria → PASS

The task is safely within the agent's operating envelope. Assign at the determined autonomy level, set guardrails, and proceed. The human reviews the output at the stage gate not every micro-step.

👤

Human owns it

Any 1 criterion → FAIL

The task carries risk that the agent cannot safely manage alone. The human performs or oversees the work. This is not a failure of the agent it is the framework working correctly. The agent may still assist, but accountability stays human.

In practice

Worked examples

Applying the RICE-A test to common software development tasks showing how the criteria interact to determine ownership.

Task	R	I	C	E	A	Owner
Write unit tests for a utility function	✓	✓	✓	✓	✓	Agent
Refactor a module to reduce duplication	✓	✓	✓	✓	✓	Agent
Generate inline code documentation	✓	✓	✓	✓	✓	Agent
Choose microservice architecture boundaries	✗	✗	✗	✗	✗	Human
Modify authentication token validation logic	✓	✓	✓	✗	✗	Human
Rename a public REST API endpoint	✗	✓	✓	✗	✗	Human
Fix a failing CI pipeline step	✓	✓	✓	✓	✓	Agent
Approve production release to 100% of users	✗	✓	✓	✗	✗	Human