feat: adaptive replanning when task results deviate from plan #5392
Ricardo-M-L wants to merge 2 commits into crewAIInc:main
Conversation
When `planning=True`, the plan is currently static and never updated during execution, causing compounding errors when early tasks return unexpected results. This adds an optional `replan_on_failure` flag that enables adaptive re-planning: after each task, a lightweight `ReplanningEvaluator` checks whether the result deviates from the plan's assumptions and, if so, triggers `CrewPlanner.replan()` to generate revised plans for the remaining tasks.

New API (fully backwards compatible):

- `replan_on_failure=True` on `Crew` enables the feature
- `max_replans=N` prevents infinite replanning loops
- `replanning_evaluator` allows plugging in a custom evaluator
- `evaluation_criteria` configures quality threshold, completeness, relevance

Fixes crewAIInc#4983

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- Move `_replan_count` increment after successful replanning so failed attempts don't consume the replan budget
- Remove unused `ReplanDecision` import

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Cursor Bugbot has reviewed your changes and found 1 potential issue.
Reviewed by Cursor Bugbot for commit 782f96a.
```python
self._logger.log(
    "warning",
    f"Replanning failed: {e}. Continuing with original plan.",
)
```
Replan count not incremented on replanning failure
High Severity
`_replan_count += 1` is only reached on the success path inside the `try` block (after `planner.replan()` returns). When `planner.replan()` raises an exception, the increment is skipped, and the `except` block doesn't increment it either. This means persistent replanning failures never consume the `max_replans` budget, so `_should_evaluate_for_replan()` keeps returning `True` and the system retries on every subsequent task, defeating the runaway-loop protection. The corresponding test also asserts `_replan_count == 1` after a failure, which will fail against this code.
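One way to close the loophole described in this finding is to charge the budget per *attempt* rather than per *success*. The sketch below is illustrative only (`ReplanBudget` and `attempt_replan` are made-up names, not the PR's actual code); it shows that incrementing the counter before calling the replanner guarantees the retry loop terminates even when every replan raises:

```python
class ReplanBudget:
    """Illustrative stand-in for the crew's replan bookkeeping (not the PR's code)."""

    def __init__(self, max_replans: int) -> None:
        self.max_replans = max_replans
        self._replan_count = 0

    def should_evaluate(self) -> bool:
        return self._replan_count < self.max_replans

    def attempt_replan(self, replan_fn):
        # Increment up front so a raising replan_fn still consumes budget,
        # closing the runaway-retry loophole described in the review comment.
        self._replan_count += 1
        try:
            return replan_fn()
        except Exception as e:
            print(f"Replanning failed: {e}. Continuing with original plan.")
            return None


def always_fails():
    raise RuntimeError("LLM unavailable")


budget = ReplanBudget(max_replans=2)
while budget.should_evaluate():
    budget.attempt_replan(always_fails)
print(budget._replan_count)  # 2 -- the loop stops after exactly two failed attempts
```

The tradeoff is the one the second commit's message hints at: counting failures means transient errors eat into the budget, while counting only successes (the PR's current behavior) risks the unbounded retry loop flagged here.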
Closing — branch has diverged significantly from upstream, and large features should be discussed first. Will resubmit properly if needed.


Summary
- `planning=True` crews encounter task results that deviate from the original plan's assumptions
- A `ReplanningEvaluator` that runs a lightweight LLM check after each task to detect deviations (missing data, unexpected results, infeasible approaches)
- `CrewPlanner.replan()` generates revised plans for remaining tasks only, using actual results as context
- `replan_on_failure` defaults to `False`, so existing crews are completely unaffected

New API
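A hedged sketch of the new surface: the option names (`planning`, `replan_on_failure`, `max_replans`) come from this PR, but `FakeCrew` below is a self-contained stand-in so the gating logic can run without crewai installed — it is not crewai's actual implementation:

```python
from dataclasses import dataclass


@dataclass
class FakeCrew:
    """Stand-in mirroring the new Crew options from this PR.

    Field names follow the PR description; the gating behavior is
    sketched from it, not copied from crewai's source.
    """
    planning: bool = False
    replan_on_failure: bool = False  # new: opt-in adaptive replanning
    max_replans: int = 3             # new: cap on total replans
    _replan_count: int = 0

    def _should_evaluate_for_replan(self) -> bool:
        # Replanning only runs when planning is on, the flag is set,
        # and the replan budget has not been exhausted.
        return (
            self.planning
            and self.replan_on_failure
            and self._replan_count < self.max_replans
        )


crew = FakeCrew(planning=True, replan_on_failure=True, max_replans=2)
print(crew._should_evaluate_for_replan())  # True

crew._replan_count = 2  # budget spent
print(crew._should_evaluate_for_replan())  # False
```

Because every new option defaults to off, a crew constructed without these arguments behaves exactly as before, which is the backwards-compatibility claim above.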
Files Changed
- `lib/crewai/src/crewai/utilities/replanning_evaluator.py`: new `ReplanningEvaluator`, `ReplanDecision`, `EvaluationCriteria`
- `lib/crewai/src/crewai/utilities/planning_handler.py`: adds `replan()` method to `CrewPlanner`
- `lib/crewai/src/crewai/crew.py`: `_evaluate_and_replan()` hook in `_execute_tasks()`
- `lib/crewai/tests/utilities/test_replanning_evaluator.py`: new test suite

How It Works
1. After each task, `_should_evaluate_for_replan()` checks if replanning is enabled and budget remains
2. `ReplanningEvaluator.evaluate()` makes a structured LLM call: "Does this result deviate significantly from what the plan assumed?"
3. If `ReplanDecision.should_replan=True`, `CrewPlanner.replan()` generates revised plans for remaining tasks using completed results as context
4. Revised plans are injected into remaining task descriptions as `[REVISED PLAN]` sections (old revisions are replaced, not stacked)
5. `_replan_count` prevents runaway loops (capped at `max_replans`)

Test Plan
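The replace-not-stack injection described above can be sketched as a small string transform. `inject_revised_plan` is an illustrative helper, not the PR's actual function; the point is that splitting on the marker before appending guarantees at most one `[REVISED PLAN]` section survives repeated replans:

```python
REVISED_MARKER = "[REVISED PLAN]"


def inject_revised_plan(description: str, revised_plan: str) -> str:
    """Append a [REVISED PLAN] section, dropping any earlier revision so
    repeated replans replace rather than stack sections."""
    base = description.split(REVISED_MARKER, 1)[0].rstrip()
    return f"{base}\n\n{REVISED_MARKER}\n{revised_plan}"


desc = "Research competitor pricing."
desc = inject_revised_plan(desc, "Use cached data; API is down.")
desc = inject_revised_plan(desc, "API restored; fetch live prices.")
print(desc.count(REVISED_MARKER))  # 1 -- the second revision replaced the first
```

Without the replace step, each replan would append another section and the task description would grow (and confuse the agent) on every iteration.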
- `ReplanDecision` model validation (bounds, defaults, all fields)
- `EvaluationCriteria` model validation (bounds, custom criteria)
- `ReplanningEvaluator` (init, no-remaining-tasks, replan/no-replan decisions, parse-failure fallback, criteria text building)
- `CrewPlanner.replan()` (returns revised plans, raises on failure, remaining-tasks summary)
- Existing `test_planning_handler.py` tests still pass (12/12)

Fixes #4983
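The parse-failure fallback exercised by the tests above can be illustrated with a minimal decision parser (a hypothetical helper, not the evaluator's real parsing code): when the LLM's reply can't be parsed, the safe default is to keep the original plan rather than raise or replan.

```python
import json


def parse_replan_decision(raw: str) -> bool:
    """Illustrative fallback: malformed LLM output means 'do not replan',
    so a flaky model can never trigger an unwanted replan mid-run."""
    try:
        data = json.loads(raw)
        return bool(data.get("should_replan", False))
    except (json.JSONDecodeError, AttributeError):
        # AttributeError covers valid JSON that isn't an object (e.g. a list).
        return False


print(parse_replan_decision('{"should_replan": true}'))  # True
print(parse_replan_decision("Sorry, I cannot comply."))  # False
```

Defaulting to "no replan" on parse failure is the conservative choice here, since a spurious replan would rewrite downstream task descriptions.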
🤖 Generated with Claude Code
Note
Medium Risk
Adds new LLM-driven evaluation and dynamic replanning into the core sequential execution loop, which can change task descriptions mid-run and affect determinism/cost. Guardrails exist (`replan_on_failure` defaults to off, `max_replans` cap), but the behavior is complex and touches planning/execution paths.

Overview
Introduces adaptive replanning for `planning=True` crews via new `Crew` options (`replan_on_failure`, `max_replans`, `evaluation_criteria`, and a pluggable `replanning_evaluator`). After each synchronous task, an LLM-based `ReplanningEvaluator` can decide whether outputs deviate from the original plan and, if so, `CrewPlanner.replan()` regenerates plans for remaining tasks and injects them into task descriptions as `[REVISED PLAN]`.

Adds `CrewPlanner.replan()` to produce revised plans using completed `TaskOutput`s as context, plus a new `utilities/replanning_evaluator.py` module with structured decision/criteria models and robust fallback behavior. Includes a comprehensive new test suite covering evaluator behavior, replanning generation, and `Crew` integration/backwards compatibility.

Reviewed by Cursor Bugbot for commit 782f96a.