feat: centralized single-writer TaskEngine with full CRUD API by Aureliolo · Pull Request #328 · Aureliolo/synthorg

Aureliolo · 2026-03-12T19:54:17Z

Summary

Centralized single-writer TaskEngine using actor pattern with asyncio.Queue — all task state mutations flow through a single background processing loop, eliminating race conditions
Full CRUD API via Litestar controller (/tasks) with create, read, update, delete, transition, and cancel endpoints
Optimistic concurrency via VersionTracker with expected_version support on mutations
Snapshot publishing — TaskStateChanged events published to message bus after each mutation (configurable via TaskEngineConfig.publish_snapshots)
Typed error hierarchy — TaskNotFoundError, TaskVersionConflictError, TaskInternalError mapped to proper HTTP status codes (404, 409, 503)
Graceful shutdown with drain timeout — pending mutations processed during stop(), in-flight futures resolved on timeout
Security hardening — sanitized 5xx error messages, audit logging for created_by mismatch, safety cap on list_tasks results
Pre-reviewed by 10 agents — code-reviewer, python-reviewer, pr-test-analyzer, silent-failure-hunter, comment-analyzer, type-design-analyzer, logging-audit, resilience-audit, security-reviewer, docs-consistency — 34 findings implemented
Fix code scanning alert Implement agent-to-agent messaging with channels and topics #10 — removed explicit PR head SHA checkout in pages-preview.yml (untrusted checkout)

Test plan

Review coverage

Pre-reviewed by 10 specialized agents, 34 findings addressed including:

1 CRITICAL (drain timeout in-flight future resolution)
8 MAJOR (error type fixes, security hardening, type design)
15 MEDIUM (logging, docstrings, test coverage gaps)
10 MINOR (naming, imports, code organization)

🤖 Generated with Claude Code

Implement actor-like TaskEngine that owns all task state mutations via asyncio.Queue. A single background task processes mutations sequentially using model_copy(update=...), persists results, and publishes snapshots to the message bus. Reads bypass the queue for direct persistence access. - Add TaskEngine core with start/stop lifecycle, submit/convenience methods - Add 5 mutation types (create, update, transition, delete, cancel) - Add TaskMutationResult response and TaskStateChanged event models - Add TaskEngineConfig (queue size, drain timeout, snapshot toggle) - Add 4 error types (TaskEngineError, NotRunning, Mutation, VersionConflict) - Wire into API controllers, AppState, app lifecycle, and config - Add optional AgentEngine report-back for terminal task status - Add 57 unit tests covering all mutations, ordering, versioning, drain Closes #204

Pre-reviewed by 10 agents, 37 findings addressed: - Add exhaustive match default + typed error hierarchy (TaskNotFoundError, TaskEngineQueueFullError, TaskVersionConflictError) - Sanitize internal exception details from API responses - Add immutable field rejection validators on UpdateTaskMutation and TransitionTaskMutation - Thread previous_status through TaskMutationResult and snapshots - Add consistency model_validator to TaskMutationResult - Guard _processing_loop against unhandled exceptions - Fix startup cleanup to handle task engine failures - Replace assert with proper error handling in convenience methods - Add _fail_remaining_futures for drain timeout cleanup - Add comprehensive logging coverage (creation, conflicts, loop errors) - Add _not_found_result helper to reduce duplication - Extract _TERMINAL_STATUSES module constant - Use Self return type in model validators - Split broad except in _report_to_task_engine (TaskMutationError vs Exception) - Update docs: tech-stack Adopted, CLAUDE.md engine description, engine.md TaskEngine architecture subsection - Add tests: AppState.task_engine, _report_to_task_engine, app lifecycle, version conflicts, cancel not-found, previous_status, immutable fields, typed errors

- Add error_code discriminator to TaskMutationResult (not_found/version_conflict/validation/internal) - Fix _raise_typed_error to use error_code match instead of fragile string matching - Fix _processing_loop outer catch to resolve envelope future (prevents caller deadlock) - Guard happy-path set_result in _process_one with done() check - Fix _apply_update to use Task.model_validate() instead of model_copy() (runs validators) - Clean _IMMUTABLE_TASK_FIELDS: remove 4 non-existent timestamp fields (only id/status/created_by) - Rename _TERMINAL_STATUSES → _REPORTABLE_STATUSES (FAILED/INTERRUPTED are not strictly terminal) - Fix assigned_to=None override bug in transition_task controller - Add from_status to task transition audit log - Fix failure log to use API_TASK_TRANSITION_FAILED event instead of TASK_STATUS_CHANGED - Map TaskEngineNotRunningError/TaskEngineQueueFullError → ServiceUnavailableError (503) - Fix create_task missing error handling - Fix _on_expire broad exception handler to re-raise MemoryError/RecursionError - Add MemoryError/RecursionError re-raise to _publish_snapshot - Add API_TASK_TRANSITION_FAILED event constant - Fix AppState docstring and set_task_engine docstring - Add version_conflict test to TestTypedErrors - Add TestDrainTimeout: verify abandoned futures resolved on _fail_remaining_futures - Add TestMutationResultConsistency: validate success/error invariants enforced by Pydantic - Add test_memory_error_propagates to TestReportToTaskEngine - Fix engine.md: text lang specifier, version conflict description, _IMMUTABLE_TASK_FIELDS, asyncio.wait

…tness - Add TaskInternalError for internal engine faults (maps to 5xx vs 4xx) - Thread error_code through all TaskMutationResult failure paths - Extend _raise_typed_error to cover 'internal' code with -> Never return - Add TaskInternalError handling in all TaskController endpoints - Fix bridge leak in _cleanup_on_failure (started_bridge flag was missing) - Remove phantom 'created_at' from _IMMUTABLE_OVERRIDE_FIELDS (field absent on Task) - Change transition_task to return tuple[Task, TaskStatus | None] (eliminates extra get_task round-trip in controller) - Split 1140-line test_task_engine.py into three focused files + helpers - Move TaskEngine fixtures from helpers to conftest.py (auto-discovery, no F401 hacks) - Fix all ruff/mypy issues in new test files (TC001, I001, F811, E501, unused-ignore)

…ent flaky test

…handling, add tests - Extract VersionTracker into task_engine_version.py (optimistic concurrency) - Extract mutation apply logic into task_engine_apply.py (dispatch + apply_*) - Reduce task_engine.py from 976 to ~620 lines (well under 800 limit) - Make TaskInternalError a sibling of TaskMutationError, not a subtype - Add MemoryError/RecursionError re-raise guards in all except-Exception blocks - Add _try_stop helper in app.py to reduce C901 complexity - Add _extract_requester and _map_task_engine_errors helpers in tasks controller - Add deep-copy at system boundaries for mutable dict fields in frozen models - Add unknown-key validation in UpdateTaskMutation and TransitionTaskMutation - Track in-flight envelope for drain-timeout resolution - Fix _safe_shutdown argument ordering (task_engine first, mirrors cleanup) - Fix existing test bug: mock_te passed as bridge instead of task_engine - Update docs/design/engine.md for immutable-style updates and version tracking - Add 62 new tests: VersionTracker, apply functions, coverage edge cases, controller helpers, _try_stop, in-flight resolution, processing loop resilience

…overage Review fixes: - create_task now uses _raise_typed_error for proper error dispatch - Add PydanticValidationError catch in all controller write methods - Use _extract_requester for create_task audit trail consistency - Add logging for TaskInternalError and TaskEngineQueueFullError in controller - Add TaskEngineError catch in agent_engine _report_to_task_engine - Extract hardcoded values to class constants (_POLL_INTERVAL_SECONDS, etc.) - Add failed_count tracking and logging in _fail_remaining_futures - Add reason field to TaskStateChanged event for audit trail - Add bounds check to VersionTracker.set_initial - Fix _try_stop parameter type from object to Awaitable[None] - Fix State Coordination docs (model_copy -> model_validate/with_transition) - Improve docstrings for _processing_loop, _publish_snapshot, apply_cancel, etc. New tests (33 total): - FIFO ordering guarantee - Default reason generation in transition_task - Delete snapshot new_status=None verification - Cancel version bump correctness - _raise_typed_error dispatch for all error codes - Snapshot reason propagation (transition, cancel, create, update) - MemoryError re-raise in processing loop - _fail_remaining_futures count tracking - Deep-copy isolation for UpdateTaskMutation and TransitionTaskMutation - Unknown field rejection in updates and overrides - VersionTracker.set_initial bounds check (zero and negative) - TaskStateChanged reason field (populated, default none, cancel) - create_task typed error dispatch (internal, validation) - Transition overrides and previous_status via engine

- Add TaskVersionConflictError → ConflictError(409) mapping in _map_task_engine_errors (was silently falling through to 422) - Wrap PydanticValidationError in update_task/transition_task convenience methods so it raises TaskMutationError instead of escaping - Sanitize Pydantic error messages via _format_validation_error helper - Fix VersionTracker docstrings: clarify version is seeded at 1 heuristically, not loaded from persistence - Update engine design doc optimistic concurrency section to match reality - Replace timing-based asyncio.sleep with Event synchronization in drain test - Add _snapshot_content helper for type-safe snapshot access in tests - Add 15 new tests: version conflict via convenience methods, PydanticValidation wrapping, cancel/delete not-found, start/stop lifecycle, is_running property, 409 mapping (6913 total, up from 6898)

…ert #10 Pre-reviewed by 10 agents (code-reviewer, python-reviewer, pr-test-analyzer, silent-failure-hunter, comment-analyzer, type-design-analyzer, logging-audit, resilience-audit, security-reviewer, docs-consistency), 34 findings addressed. Key fixes: - CRITICAL: drain timeout in-flight future now resolved (saved before cancel) - Fix TaskInternalError for invariant violations (was TaskMutationError → wrong HTTP status) - Sanitize 5xx error messages (no internal details leaked to API clients) - Map TaskVersionConflictError to 409 Conflict - Add MutationType/TaskErrorCode type aliases, length constraints on CreateTaskData - Add logging in _raise_typed_error, read-through error wrapping, safety cap on list_tasks - Fix code scanning alert #10: remove explicit PR head SHA checkout in pages-preview - Replace Any with concrete types in API test fake repositories - Add TaskEngineConfig boundary validation, RecursionError re-raise, TaskEngineError branch tests - Split test_task_engine_extended.py (was 870 lines → 540 + 335) - Add apply_cancel docstring with Args/Returns

coderabbitai · 2026-03-12T19:54:31Z

Caution

Review failed

Pull request was closed or merged during review

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Summary by CodeRabbit

New Features
- Central Task Engine: start/stop lifecycle, single-writer task mutations, read-through task queries, snapshot publishing, and a new cancel-task API.
Refactor
- API routes task operations through the Task Engine and supports optimistic concurrency (expected version) for updates/transitions.
Documentation
- Architecture and engine design docs updated to describe the new coordination and mutation model.
Tests
- Extensive unit and integration tests added for engine behavior, lifecycle, concurrency, and error handling.
Chores
- Added descriptive comment to CI preview workflow for security/context.

Walkthrough

Adds a centralized single-writer TaskEngine and integrates it across the codebase: new engine modules (models, apply logic, version tracker, config), typed task-engine errors/events, AppState and lifecycle wiring, API controller usage and DTOs, AgentEngine reporting, documentation and CI comment, and extensive tests.

Changes

Cohort / File(s)	Summary
Core TaskEngine implementation `src/ai_company/engine/task_engine.py`, `src/ai_company/engine/task_engine_apply.py`, `src/ai_company/engine/task_engine_models.py`, `src/ai_company/engine/task_engine_version.py`, `src/ai_company/engine/task_engine_config.py`	Adds TaskEngine single-writer asyncio actor, queue processing loop, mutation dispatch/apply logic, immutable Pydantic mutation models and results/events, in-memory VersionTracker, config model, snapshot publishing, error mapping, and convenience APIs (create/update/transition/delete/cancel).
Engine public API & errors `src/ai_company/engine/__init__.py`, `src/ai_company/engine/errors.py`	Re-exports TaskEngine and TaskEngineConfig; introduces task-engine-specific error hierarchy and exposes mutation/event types on engine public surface.
AgentEngine integration `src/ai_company/engine/agent_engine.py`	AgentEngine accepts optional `task_engine`, stores it, and reports terminal task statuses via new internal `_report_to_task_engine` (best-effort, logged).
API wiring, AppState & controllers `src/ai_company/api/app.py`, `src/ai_company/api/state.py`, `src/ai_company/api/controllers/tasks.py`, `src/ai_company/api/dto.py`	create_app, lifecycle builders, and AppState updated to accept/store TaskEngine; controllers route task CRUD/transition/cancel via TaskEngine with error mapping; DTOs add expected_version fields and CancelTaskRequest.
Config & defaults `src/ai_company/config/schema.py`, `src/ai_company/config/defaults.py`	Adds `task_engine` subtree to RootConfig and default config; adjusts default_config_dict typing.
Observability events `src/ai_company/observability/events/task_engine.py`, `src/ai_company/observability/events/api.py`	Adds task-engine-specific and new API event constants for instrumentation and error reporting.
Docs & CI `docs/design/engine.md`, `docs/architecture/tech-stack.md`, `CLAUDE.md`, `.github/workflows/pages-preview.yml`	Documents TaskEngine design and adopted coordination approach (model_validate/with_transition), updates agent/engine docs and logging guidance, and adds a descriptive CI comment (no functional change).
API pagination tweak `src/ai_company/api/pagination.py`	Adds optional `total` parameter to paginate(...) so callers can supply true total when upstream truncation occurred.
Tests — engine `tests/unit/engine/...`, `tests/unit/engine/conftest.py`, `tests/unit/engine/task_engine_helpers.py`	Large new test suite and helpers: lifecycle, concurrency, FIFO ordering, typed errors, mutation application, snapshots, drain/stop semantics, VersionTracker and extensive integration/unit coverage.
Tests — API & state `tests/unit/api/conftest.py`, `tests/unit/api/test_app.py`, `tests/unit/api/controllers/test_task_helpers.py`, `tests/unit/api/test_state.py`	Test fixtures and clients wired to fake TaskEngine; tests for AppState task_engine accessors, lifecycle cleanup on engine failures, controller helper behavior, and engine→API error mapping.
Tests — models & observability `tests/unit/engine/test_task_engine_models.py`, `tests/unit/observability/test_events.py`	Comprehensive model validation tests and updated observability discovery expectations to include the new task_engine domain.
Test typing/fixtures adjustments `tests/unit/config/conftest.py`, `tests/unit/config/test_defaults.py`, `tests/unit/api/conftest.py`	Minor typing/fixture updates to support new config and TaskEngine wiring.

Sequence Diagram(s)

sequenceDiagram
  participant Client as Client
  participant API as API Controller
  participant Engine as TaskEngine
  participant Persist as Persistence
  participant Bus as MessageBus

  Client->>API: POST /tasks (CreateTaskData)
  API->>API: build CreateTaskMutation (request_id, requested_by)
  API->>Engine: submit(CreateTaskMutation)
  activate Engine
  Note right of Engine: enqueue mutation, return Future
  Engine->>Engine: _processing_loop processes envelope
  Engine->>Persist: persist created Task
  Persist-->>Engine: saved Task
  alt publish_snapshots enabled
    Engine->>Bus: publish TaskStateChanged (snapshot)
    Bus-->>Engine: ack / error (logged)
  end
  Engine-->>API: Future resolves (TaskMutationResult)
  deactivate Engine
  API->>Client: 201 Created (Task)

Estimated code review effort

🎯 5 (Critical) | ⏱️ ~120 minutes

Possibly related issues

feat: implement centralized single-writer state coordination (TaskEngine) #204 — Implements the centralized single-writer TaskEngine described in that issue; this PR appears to realize that feature request.

Possibly related PRs

refactor: harden BudgetEnforcer with error handling and review fixes #182 — Modifies AgentEngine; related at the code level because both PRs extend AgentEngine constructor/behavior to accept injectable runtime dependencies.

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 39.74% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title check	✅ Passed	Title clearly summarizes the main feature: a centralized TaskEngine with CRUD API, capturing the primary architectural change.
Description check	✅ Passed	Description is comprehensive and directly related to the changeset, providing context on TaskEngine implementation, API design, testing, and code review.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings (stacked PR)
📝 Generate docstrings (commit on current branch)

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feat/task-engine

✨ Simplify code

Create PR with simplified code
Commit simplified code in branch feat/task-engine

📝 Coding Plan

Generate coding plan for human review comments

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Keep main's refactored workflow (workflow_dispatch support, PR metadata validation step). Add zizmor ignore comment for untrusted-checkout — the checkout is intentional for preview builds and mitigated (no secrets, read-only perms, persist-credentials: false, deploy uses artifacts).

gemini-code-assist · 2026-03-12T19:59:02Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a robust and centralized task management system, significantly enhancing the application's ability to handle task state mutations reliably. By consolidating all task operations through a single, asynchronous engine, it eliminates potential race conditions and provides a consistent, version-controlled API for task lifecycle management. The changes also improve error handling, ensure graceful service operation, and integrate seamlessly with existing agent execution flows.

Highlights

Centralized TaskEngine: Introduced a new TaskEngine component that acts as a single-writer, actor-pattern-based state manager for all task mutations, utilizing asyncio.Queue to prevent race conditions and ensure data consistency.
Full CRUD API: Implemented a comprehensive CRUD (Create, Read, Update, Delete) API for tasks via a Litestar controller (/tasks), including endpoints for creating, reading, updating, deleting, transitioning, and canceling tasks.
Optimistic Concurrency: Integrated optimistic concurrency control using a VersionTracker which supports expected_version for mutations, allowing the system to detect and prevent stale writes.
Typed Error Handling: Established a typed error hierarchy (TaskNotFoundError, TaskVersionConflictError, TaskInternalError) that maps to appropriate HTTP status codes (404, 409, 503) for precise error classification and improved API responses.
Snapshot Publishing: Enabled the publishing of TaskStateChanged events to the message bus after each successful task mutation, providing real-time updates and audit trails (configurable via TaskEngineConfig.publish_snapshots).
Graceful Shutdown: Implemented graceful shutdown for the TaskEngine, ensuring that all pending mutations are processed within a configurable drain timeout, and in-flight futures are resolved upon timeout.
AgentEngine Integration: Updated the AgentEngine to report final execution statuses (COMPLETED, FAILED, INTERRUPTED, CANCELLED) to the centralized TaskEngine, ensuring consistent task state across the system.

Changelog

CLAUDE.md
- Updated the description of the engine/ module to reflect the inclusion of the centralized single-writer task state engine.
- Added TASK_ENGINE_STARTED to the list of required event constants for observability.
docs/architecture/tech-stack.md
- Updated the status of 'State coordination' to 'Adopted' and refined its mechanism description to reflect the use of model_validate / with_transition.
docs/design/engine.md
- Added a new comprehensive section detailing the TaskEngine architecture, mutation types, error handling, and lifecycle.
- Updated references from asyncio.wait_for to asyncio.wait and model_copy(update=...) to model_validate / with_transition in relevant sections.
src/ai_company/api/app.py
- Imported TaskEngine to integrate it into the application lifecycle.
- Modified exception handling in _on_expire to specifically re-raise MemoryError and RecursionError.
- Updated _build_lifecycle, _safe_startup, and _safe_shutdown functions to manage the TaskEngine's lifecycle.
- Enhanced _cleanup_on_failure to include task_engine in the cleanup process.
- Modified create_app to require task_engine as a dependency, ensuring its availability for controllers.
src/ai_company/api/controllers/tasks.py
- Refactored the task controller to use the new TaskEngine for all CRUD operations instead of TaskRepository.
- Introduced helper functions _extract_requester for user identity and _map_task_engine_errors to translate TaskEngine exceptions into appropriate API errors.
- Updated list_tasks, get_task, create_task, update_task, transition_task, and delete_task methods to interact with the TaskEngine and handle its specific error types.
src/ai_company/api/state.py
- Imported TaskEngine and added _task_engine as a service field to AppState.
- Implemented task_engine property for access, has_task_engine for checking availability, and set_task_engine for deferred initialization.
- Updated docstrings to reflect the inclusion of task_engine as a service field.
src/ai_company/config/defaults.py
- Added a default empty dictionary for task_engine configuration.
src/ai_company/config/schema.py
- Imported TaskEngineConfig and added a task_engine field to RootConfig with a default factory.
src/ai_company/engine/init.py
- Updated the module docstring to include task engine in the re-exported public API description.
- Exported new TaskEngine related errors (TaskEngineError, TaskEngineNotRunningError, TaskEngineQueueFullError, TaskMutationError, TaskNotFoundError, TaskVersionConflictError, TaskInternalError).
- Exported TaskEngine class and its associated models (CancelTaskMutation, CreateTaskData, CreateTaskMutation, DeleteTaskMutation, TaskMutation, TaskMutationResult, TaskStateChanged, TransitionTaskMutation, UpdateTaskMutation).
src/ai_company/engine/agent_engine.py
- Imported TaskEngine and its specific error types.
- Defined _REPORTABLE_STATUSES to specify which task statuses trigger reporting to the TaskEngine.
- Added task_engine as an optional parameter to the AgentEngine constructor.
- Implemented _report_to_task_engine to update the centralized TaskEngine with final execution statuses, handling various error scenarios gracefully.
src/ai_company/engine/errors.py
- Added a new hierarchy of exceptions for the task engine: TaskEngineError, TaskEngineNotRunningError, TaskEngineQueueFullError, TaskMutationError, TaskNotFoundError, TaskVersionConflictError, and TaskInternalError.
src/ai_company/engine/task_engine.py
- Added the TaskEngine module, implementing a centralized single-writer task engine with an asyncio.Queue for sequential mutation processing.
- Included lifecycle methods (start, stop), convenience CRUD methods (create_task, update_task, transition_task, delete_task, cancel_task), and read-through methods (get_task, list_tasks).
- Implemented snapshot publishing to a message bus and graceful shutdown with drain timeout.
src/ai_company/engine/task_engine_apply.py
- Added the task_engine_apply module, containing the core logic for dispatching and applying different types of TaskMutation objects.
- Provided functions for apply_create, apply_update, apply_transition, apply_delete, and apply_cancel, along with helpers for validation error formatting and not-found results.
src/ai_company/engine/task_engine_config.py
- Added the TaskEngineConfig module, defining the Pydantic model for configuring the TaskEngine, including max_queue_size, drain_timeout_seconds, and publish_snapshots.
src/ai_company/engine/task_engine_models.py
- Added the task_engine_models module, defining Pydantic models for CreateTaskData, various TaskMutation requests (Create, Update, Transition, Delete, Cancel), TaskMutationResult, and TaskStateChanged events.
- Included validation logic for immutable fields and deep-copy isolation for mutable dictionary fields.
src/ai_company/engine/task_engine_version.py
- Added the VersionTracker module, providing an in-memory mechanism for optimistic concurrency control by tracking task versions with seed, bump, check, and remove operations.
src/ai_company/observability/events/api.py
- Added new API event constants: API_TASK_TRANSITION_FAILED and API_AUTH_FALLBACK.
src/ai_company/observability/events/task_engine.py
- Added new event constants specifically for the TaskEngine's lifecycle and operations, such as TASK_ENGINE_CREATED, TASK_ENGINE_STARTED, TASK_ENGINE_MUTATION_RECEIVED, TASK_ENGINE_SNAPSHOT_PUBLISHED, and TASK_ENGINE_DRAIN_TIMEOUT.
tests/unit/api/conftest.py
- Updated test_client fixture to include fake_task_engine and changed its return type to a Generator.
- Added fake_task_engine fixture to provide a test instance of TaskEngine.
- Refined type hints in FakeLifecycleEventRepository, FakeTaskMetricRepository, and FakeCollaborationMetricRepository for better type safety.
tests/unit/api/controllers/test_task_helpers.py
- Added new unit tests for the _extract_requester and _map_task_engine_errors helper functions within the task controller, ensuring correct user extraction and error mapping.
tests/unit/api/test_app.py
- Updated calls to _safe_startup and _safe_shutdown to pass the task_engine instance.
- Added tests to verify cleanup behavior when task_engine startup fails and that task_engine stop failures during shutdown are logged but not propagated.
- Introduced tests for the _try_stop helper function, covering success, swallowed exceptions, and re-raised critical errors.
tests/unit/api/test_state.py
- Added unit tests for the AppState's task_engine property, has_task_engine property, and set_task_engine method, covering various scenarios including service unavailability and re-configuration attempts.
tests/unit/config/conftest.py
- Added escalation_paths to the RootConfigFactory for test configuration generation.
tests/unit/engine/agent_engine.py
- Updated imports to include new TaskEngine errors.
- Added extensive unit tests for the _report_to_task_engine method, covering scenarios where no TaskEngine is configured, non-terminal statuses are skipped, terminal statuses are reported, and various TaskEngine errors are handled (logged and swallowed, or re-raised for critical errors like MemoryError).
tests/unit/engine/conftest.py
- Imported TaskEngine and TaskEngineConfig for use in engine-related test fixtures.
- Added new pytest fixtures: persistence (FakePersistence), message_bus (FakeMessageBus), config (TaskEngineConfig), engine (TaskEngine instance), and engine_with_bus (TaskEngine with message bus), to facilitate testing of the new TaskEngine.
tests/unit/engine/task_engine_helpers.py
- Added new helper classes for TaskEngine testing: FakeTaskRepository, FakePersistence, FakeMessageBus, and FailingMessageBus.
- Included a make_create_data helper function to easily create CreateTaskData instances for tests.
tests/unit/engine/test_task_engine_apply.py
- Added comprehensive unit tests for the dispatch and apply_* functions in task_engine_apply.py, covering create, update, transition, delete, and cancel mutations, including validation, not-found scenarios, and version conflicts.
tests/unit/engine/test_task_engine_convenience.py
- Added unit tests for TaskEngine convenience methods, focusing on typed error dispatch for all error codes, transition overrides, Pydantic validation wrapping, version conflicts, and handling of not-found errors for cancel and delete operations.
- Included tests for TaskEngine lifecycle edge cases such as starting an already running engine, stop idempotency, and submitting to a stopped engine.
tests/unit/engine/test_task_engine_coverage.py
- Added additional coverage tests for TaskEngine, including in-flight envelope resolution during drain timeout, _process_one exception handling (e.g., dispatch exceptions returning internal errors), snapshot publishing failures (logged but not raised), and MemoryError/RecursionError re-propagation in the processing loop.
tests/unit/engine/test_task_engine_extended.py
- Added extended coverage tests for TaskEngine, verifying FIFO ordering of mutations, default reason generation for transitions, correct new_status=None for delete snapshots, version bumping for cancel mutations, proper _raise_typed_error dispatch for create_task, and reason propagation in snapshot events.
tests/unit/engine/test_task_engine_integration.py
- Added integration tests for TaskEngine, covering snapshot publishing behavior (on/off, failure handling), sequential ordering of concurrent submits, queue backpressure (max_queue_size), version tracking (increment, conflict, reset on delete), and drain timeout behavior for pending futures.
tests/unit/engine/test_task_engine_lifecycle.py
- Added unit tests for the TaskEngine's lifecycle (start, stop, double-start, idempotency, restart) and TaskEngineConfig validation (defaults, custom values, frozen state, range constraints).
tests/unit/engine/test_task_engine_models.py
- Added unit tests for all TaskEngine request, response, and event models, including CreateTaskData (construction, validation, immutability), CreateTaskMutation, UpdateTaskMutation (with expected version, empty updates), TransitionTaskMutation (with overrides), DeleteTaskMutation, CancelTaskMutation, TaskMutationResult (success/failure consistency), and TaskStateChanged (construction, transition, delete, reason propagation, serialization).
- Included tests for deep-copy isolation of mutable dictionary fields and rejection of unknown/immutable fields in update/transition mutations.
tests/unit/engine/test_task_engine_mutations.py
- Added comprehensive CRUD mutation tests for TaskEngine, covering create (with/without assignee, version 1), update (fields, empty, not found), transition (valid/invalid, not found), delete (success, not found), and cancel (valid/invalid).
- Included tests for read-through methods (get_task, list_tasks), previous status tracking in results, immutable field rejection, and typed error propagation for not-found and version conflict scenarios.
tests/unit/engine/test_task_engine_version.py
- Added unit tests for VersionTracker, covering seeding, bumping, getting, removing, and checking versions, including idempotency, auto-seeding, conflict detection, and validation of initial version values.
tests/unit/observability/test_events.py
- Added task_engine to the list of discovered domain modules, ensuring its event constants are properly recognized.

Ignored Files

Ignored by pattern: .github/workflows/** (1)
- .github/workflows/pages-preview.yml

Activity

The pull request introduces a major new feature, the TaskEngine, which centralizes task state management.
The author, Aureliolo, has implemented a robust solution with extensive test coverage, including unit, integration, and lifecycle tests.
The PR description indicates that the code has been pre-reviewed by 10 specialized agents, leading to 34 findings being addressed, including critical and major issues, demonstrating a thorough review process before submission.
A code scanning alert (Implement agent-to-agent messaging with channels and topics #10) related to untrusted checkout in pages-preview.yml was fixed as part of this PR, indicating attention to security practices.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

Copilot

Pull request overview

Introduces a centralized, single-writer TaskEngine (actor pattern over an asyncio.Queue) and wires it into the API/app lifecycle so all task mutations are serialized, version-tracked, and can publish TaskStateChanged snapshots.

Changes:

Added TaskEngine core modules (engine, config, mutation models, apply/dispatch, optimistic version tracker) plus new observability event constants.
Switched /tasks API controller and app startup/shutdown to use TaskEngine + typed error mapping.
Added extensive unit/integration test coverage for lifecycle, mutations, snapshot publishing, drain-timeout behavior, and AgentEngine reporting.

Reviewed changes

Copilot reviewed 37 out of 37 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
tests/unit/observability/test_events.py	Includes `task_engine` in domain-module discovery assertions.
tests/unit/engine/test_task_engine_version.py	Unit tests for `VersionTracker` behavior.
tests/unit/engine/test_task_engine_mutations.py	CRUD + typed-error + consistency tests for `TaskEngine` convenience methods.
tests/unit/engine/test_task_engine_models.py	Tests for mutation/result/event Pydantic models and validators.
tests/unit/engine/test_task_engine_lifecycle.py	Start/stop/idempotency/config tests for `TaskEngine`.
tests/unit/engine/test_task_engine_integration.py	Integration tests for publishing, ordering, queue backpressure, versioning, drain.
tests/unit/engine/test_task_engine_extended.py	Extra edge-case coverage (FIFO, reasons, snapshots, OOM propagation, shutdown failures).
tests/unit/engine/test_task_engine_coverage.py	Additional coverage for drain/in-flight cleanup, snapshot publish failures, loop resilience.
tests/unit/engine/test_task_engine_convenience.py	Focused tests for typed error mapping + convenience method behaviors.
tests/unit/engine/test_task_engine_apply.py	Tests for mutation dispatch + apply_* functions and persistence/version interactions.
tests/unit/engine/test_agent_engine.py	Adds tests for AgentEngine → TaskEngine reporting behavior and error handling.
tests/unit/engine/task_engine_helpers.py	Adds shared fake persistence/message bus helpers for engine tests.
tests/unit/engine/conftest.py	Adds fixtures for `TaskEngine` and supporting fakes.
tests/unit/config/conftest.py	Updates RootConfigFactory defaults to include `escalation_paths`.
tests/unit/api/test_state.py	Tests new `AppState.task_engine` accessors and setters.
tests/unit/api/test_app.py	Updates startup/shutdown helpers to manage `TaskEngine` and tests cleanup behavior.
tests/unit/api/controllers/test_task_helpers.py	Tests requester extraction and TaskEngine→API error mapping helpers.
tests/unit/api/conftest.py	Adds `TaskEngine` to API test app wiring; improves typing of fake repos.
src/ai_company/observability/events/task_engine.py	New event constants for TaskEngine lifecycle/mutations/drain/publishing.
src/ai_company/observability/events/api.py	Adds API event constants for auth fallback and task transition failures.
src/ai_company/engine/task_engine_version.py	Implements in-memory optimistic version tracking (`VersionTracker`).
src/ai_company/engine/task_engine_models.py	Adds mutation request/result models + `TaskStateChanged` event model.
src/ai_company/engine/task_engine_config.py	Adds config model for queue sizing, drain timeout, snapshot publishing.
src/ai_company/engine/task_engine_apply.py	Implements mutation application logic + dispatch with typed error results.
src/ai_company/engine/task_engine.py	Implements the single-writer engine loop, CRUD convenience API, draining shutdown, snapshot publishing.
src/ai_company/engine/errors.py	Adds typed TaskEngine error hierarchy for API + engine callers.
src/ai_company/engine/agent_engine.py	Adds best-effort reporting of final run status back to centralized TaskEngine.
src/ai_company/engine/init.py	Re-exports TaskEngine APIs and new error types from `ai_company.engine`.
src/ai_company/config/schema.py	Adds `task_engine: TaskEngineConfig` to RootConfig.
src/ai_company/config/defaults.py	Seeds default config dict with `"task_engine": {}`.
src/ai_company/api/state.py	Adds AppState storage/access for `task_engine` with 503 behavior when unset.
src/ai_company/api/controllers/tasks.py	Migrates task CRUD endpoints from direct repo writes to TaskEngine + error mapping.
src/ai_company/api/app.py	Starts/stops TaskEngine during app lifecycle; adds safe shutdown helper.
docs/design/engine.md	Documents TaskEngine architecture/behavior and updates loop timeout description.
docs/architecture/tech-stack.md	Marks state coordination via TaskEngine as “Adopted” and updates mechanism description.
CLAUDE.md	Updates architecture notes and event-name examples to include TaskEngine.
.github/workflows/pages-preview.yml	Removes explicit PR head SHA checkout for security hardening.

Comments suppressed due to low confidence (1)

src/ai_company/api/controllers/tasks.py:136

The controller claims “full CRUD via TaskEngine”, but there is no cancel endpoint and no API support for expected_version/optimistic concurrency (UpdateTaskRequest/TransitionTaskRequest don’t expose it and the controller doesn’t accept it). This diverges from the PR description; either add the missing endpoints/parameters or adjust the description/scope to match the actual API surface.

class TaskController(Controller):
    """Full CRUD for tasks via ``TaskEngine``."""

    path = "/tasks"
    tags = ("tasks",)
    guards = [require_read_access]  # noqa: RUF012

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-12T19:59:01Z

tests/unit/engine/test_task_engine_coverage.py

+        assert eng._in_flight is None
+
+        # Release the block and clean up
+        block.set()  # type: ignore[unreachable]


# type: ignore[unreachable] on block.set() looks incorrect/unnecessary here (the line is reachable, and there’s no obvious mypy unreachable warning to suppress). Consider removing the ignore to avoid masking future static-analysis issues.

Suggested change

block.set() # type: ignore[unreachable]

block.set()

Copilot · 2026-03-12T19:59:01Z

src/ai_company/api/controllers/tasks.py

+                id=task_id,
+            )
+        return NotFoundError(str(exc))
+    if isinstance(exc, TaskEngineNotRunningError | TaskEngineQueueFullError):


isinstance() does not accept a PEP 604 union (A | B) as the second argument; this will raise TypeError at runtime (and will break the helper + its tests). Use a tuple of exception classes instead (e.g. (TaskEngineNotRunningError, TaskEngineQueueFullError)).

Suggested change

if isinstance(exc, TaskEngineNotRunningError | TaskEngineQueueFullError):

if isinstance(exc, (TaskEngineNotRunningError, TaskEngineQueueFullError)):

Copilot · 2026-03-12T19:59:02Z

src/ai_company/api/controllers/tasks.py

+        if data.created_by != requester:
+            logger.info(
+                API_TASK_UPDATED,
+                note="created_by differs from authenticated requester",
+                created_by=data.created_by,
+                requester=requester,
+            )


This audit log for created_by vs authenticated requester uses the API_TASK_UPDATED event constant even though this is part of task creation. That makes downstream event analytics/alerts misleading; consider introducing a dedicated event (or reusing the existing task-created event) for this mismatch audit log.

Copilot · 2026-03-12T19:59:02Z

src/ai_company/api/controllers/tasks.py

+        logger.error(
+            API_TASK_TRANSITION_FAILED,
+            resource="task",
+            task_id=task_id,
+            error=str(exc),
+            error_type=type(exc).__name__,
+        )
+        return ServiceUnavailableError("Service temporarily unavailable")
+    if isinstance(exc, TaskInternalError):
+        logger.error(
+            API_TASK_TRANSITION_FAILED,
+            resource="task",
+            task_id=task_id,


API_TASK_TRANSITION_FAILED is emitted here for non-transition failures too (create/update/delete can all flow through _map_task_engine_errors). This makes logs/events ambiguous. Consider renaming/adding an event like API_TASK_MUTATION_FAILED and include mutation_type/endpoint context in the log fields.

greptile-apps · 2026-03-12T20:00:16Z

Greptile Summary

This PR introduces a centralized single-writer TaskEngine using an actor pattern (asyncio.Queue + background processing loop) for all task state mutations, together with a full CRUD Litestar controller at /tasks. The architecture is sound — the lifecycle lock cleanly prevents race conditions between submit() and stop(), the VersionTracker provides optimistic concurrency within an engine lifetime, and the typed error hierarchy maps cleanly to HTTP status codes. Test coverage is extensive (7 042 tests, 94.5% coverage).

Key findings:

Dead processing loop leaves _running=True — When MemoryError/RecursionError escapes _processing_loop (e.g. from a deeply nested task data structure hitting Python's recursion limit), self._running is never cleared. New submit() calls pass the guard, enqueue mutations, and await envelope.future indefinitely since no consumer remains. Adding self._running = False before re-raising in the _processing_loop exception handler would close this window.
stop() re-raises stale background task exception, aborting shutdown — If the processing loop died prior to stop() being called, asyncio.shield(done_task_with_exception) immediately re-raises the stored exception. This propagates through _try_stop (which re-raises MemoryError/RecursionError) and _safe_shutdown, so the bridge, message bus, and persistence backend are never cleanly shut down. Distinguishing a stale task exception from a new drain-time exception in stop() would avoid this.
apply_cancel discards mutation.reason — The cancellation reason is logged but not forwarded to task.with_transition(). If Task supports storing a reason field, this is a silent data loss on every cancellation.
start() reuses TASK_ENGINE_STARTED event constant for the double-start warning — minor observability concern; a dedicated event constant would improve log filtering.

Confidence Score: 3/5

Safe to merge with low risk for normal operation, but two edge-case logic bugs in the catastrophic error path could cause hung callers or incomplete shutdown under RecursionError/MemoryError conditions.
The core actor-pattern design, lifecycle lock, and CRUD API are well-implemented and extensively tested. The two logic issues are in the rare MemoryError/RecursionError error path — they won't affect normal operation but are real bugs that could surface under production load with deeply nested task payloads. The apply_cancel reason loss is a functional correctness concern but lower severity. Combined, these warrant addressing before merge.
src/ai_company/engine/task_engine.py (processing loop zombie state and stop() exception propagation), src/ai_company/engine/task_engine_apply.py (cancel reason discarded)

Important Files Changed

Filename	Overview
src/ai_company/engine/task_engine.py	Core new file — well-structured actor-pattern engine with lifecycle lock and graceful drain; two logic issues: dead loop leaves `_running=True` (hung futures after RecursionError), and `stop()` re-raises background task exceptions aborting the shutdown sequence.
src/ai_company/engine/task_engine_apply.py	Mutation dispatch logic is clean and well-validated; `apply_cancel` silently discards the `reason` field from the mutation — it is logged but not forwarded to `task.with_transition()`.
src/ai_company/engine/task_engine_models.py	Typed mutation and result models are thorough — frozen Pydantic models, immutability enforcement via MappingProxyType, and a consistency validator on TaskMutationResult. No issues found.
src/ai_company/engine/task_engine_version.py	Volatile in-memory version tracker; documented limitation about post-restart seeds is clear. No issues found beyond the acknowledged limitation.
src/ai_company/api/controllers/tasks.py	Full CRUD controller with correct exception mapping and consistent error handling across all endpoints; `_map_task_engine_errors` is well-factored and covers all error types.
src/ai_company/api/app.py	Lifecycle management updated to include TaskEngine start/stop in correct order; shutdown sequence can be aborted if task engine re-raises a stale background task exception.
src/ai_company/engine/errors.py	Clean typed error hierarchy added; sibling placement of TaskInternalError relative to TaskMutationError is intentional and well-documented. No issues found.
src/ai_company/engine/agent_engine.py	TaskEngine integration in `_report_to_task_engine` is best-effort with correct exception branching; intentionally discarded return value is now documented with a comment.
src/ai_company/engine/task_engine_config.py	Minimal, well-constrained configuration model with sensible defaults and validation bounds. No issues found.

Sequence Diagram

sequenceDiagram
    participant Caller
    participant TE as TaskEngine
    participant Q as asyncio.Queue
    participant BG as _processing_loop
    participant Apply as dispatch
    participant DB as Persistence
    participant MB as MessageBus

    Caller->>TE: submit(mutation)
    Note over TE: acquire _lifecycle_lock
    TE->>Q: put_nowait(envelope)
    Note over TE: release _lifecycle_lock
    TE-->>Caller: await envelope.future (suspended)

    BG->>Q: get() with poll timeout
    Q-->>BG: envelope
    BG->>Apply: dispatch(mutation, persistence, versions)
    Apply->>DB: get/save/delete task
    DB-->>Apply: Task or None
    Apply-->>BG: TaskMutationResult

    alt mutation succeeded
        BG->>MB: publish TaskStateChanged
    end

    BG->>Caller: future.set_result(result) — caller resumes

    Note over TE: stop() called
    TE->>BG: set _running=False
    BG->>Q: drain pending envelopes
    alt drain timeout
        TE->>BG: cancel()
        TE->>Caller: fail remaining futures
    end

Prompt To Fix All With AI

This is a comment left during a code review.
Path: src/ai_company/engine/task_engine.py
Line: 628-631

Comment:
**Dead loop leaves `_running=True`, causing hung submissions**

When `MemoryError`/`RecursionError` propagates out of `_processing_loop`, the background task dies but `self._running` is never set to `False`. Any subsequent `submit()` call passes the `if not self._running` guard, enqueues its envelope, and then `await envelope.future` hangs indefinitely — the queue has no consumer.

This window exists until `stop()` is called. `RecursionError` is survivable (e.g., deeply nested task data hitting Python's recursion limit), so the engine can remain in this zombie state for an extended period while the application continues to issue mutations that silently stall all callers.

The fix is to set `self._running = False` before re-raising, so `submit()` correctly raises `TaskEngineNotRunningError` instead of hanging:

```python
except (MemoryError, RecursionError) as exc:
    if not envelope.future.done():
        envelope.future.set_exception(exc)
    self._running = False  # prevent new submissions from hanging
    raise
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: src/ai_company/engine/task_engine.py
Line: 186-192

Comment:
**`stop()` re-raises background task exceptions, aborting the shutdown sequence**

When `_processing_loop` previously died with `MemoryError`/`RecursionError`, `self._processing_task` is already a done task with that exception stored. When `stop()` is subsequently called:

1. `asyncio.shield(self._processing_task)` creates a future that immediately adopts the stored exception.
2. `asyncio.wait_for(...)` raises the exception immediately (no timeout).
3. The `except BaseException: ...; raise` block correctly fails remaining futures but then **re-raises** `MemoryError`/`RecursionError`.

This propagates through `_try_stop` in `_safe_shutdown` (which also re-raises these errors), meaning the bridge, message bus, and persistence backend are **never shut down cleanly** when a catastrophic engine fault precedes shutdown.

`RecursionError` in particular is process-survivable, so clean shutdown matters. Consider distinguishing between an exception that originated *from the background task's prior run* (should be logged-and-absorbed during shutdown) versus one that originates from the drain logic itself (should propagate):

```python
except BaseException as exc:
    self._fail_remaining_futures(self._in_flight)
    # Only re-raise if this is a new error, not a stale task exception
    if self._processing_task is not None and self._processing_task.done():
        task_exc = self._processing_task.exception() if not self._processing_task.cancelled() else None
        if task_exc is exc:
            logger.error(TASK_ENGINE_LOOP_ERROR, error=f"Processing task died: {exc!r}")
            return  # don't abort shutdown for a pre-existing task failure
    raise
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: src/ai_company/engine/task_engine_apply.py
Line: 393-394

Comment:
**`apply_cancel` silently discards `mutation.reason`**

The cancellation reason from `CancelTaskMutation.reason` is logged at line 420 but is **not passed** into the task transition. Compare with `apply_transition`, where `mutation.overrides` (including any reason-related overrides) are forwarded to `task.with_transition()`.

If `Task.with_transition()` accepts a `reason` keyword argument, or if a `cancellation_reason` / `reason` field exists on `Task`, the cancellation reason will not be persisted on the task object. Callers retrieving the task via `get_task()` after a cancellation will see no record of why it was cancelled.

If `with_transition` does not accept a `reason` parameter at all, this is only a logging concern. Either way, it's worth verifying the intent and documenting it explicitly in the `apply_cancel` docstring.

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: src/ai_company/engine/task_engine.py
Line: 128-146

Comment:
**`start()` uses `TASK_ENGINE_STARTED` for both warning and info**

On line 136, the `logger.warning(TASK_ENGINE_STARTED, error=msg)` call fires `TASK_ENGINE_STARTED` for the double-start error path. Using the same event constant for a `WARNING` (error case) and an `INFO` (success case on line 143) makes log filtering and alerting ambiguous. Consider introducing a dedicated `TASK_ENGINE_ALREADY_RUNNING` constant for the warning path so operators can distinguish a misconfigured double-start from a normal start event.

How can I resolve this? If you propose a fix, please make it concise.

_{Last reviewed commit: d23952a}

src/ai_company/api/controllers/tasks.py

Local agents (6 agents) + external reviewers (Copilot, Gemini, Greptile, CodeRabbit) identified 14 findings. All implemented: - Add error handling to list_tasks/get_task read endpoints (CRITICAL) - Add error handling to get_task read endpoint (CRITICAL) - Add dedicated event constants for reads, list cap, and futures failure - Merge duplicate except blocks in transition_task - Replace generic API_TASK_TRANSITION_FAILED with API_TASK_MUTATION_FAILED - Add created_by mismatch audit event - Narrow mutation_type to specific Literal per mutation class - Add error_code consistency checks to TaskMutationResult validator - Expand _map_task_engine_errors docstring with mapping table - Replace TASK_ENGINE_DRAIN_TIMEOUT with TASK_ENGINE_FUTURES_FAILED - Add read-through error wrapping tests (4 tests) - Add list_tasks safety cap test - Remove misleading computed_field reference in docstring - Add CreateTaskData max-length boundary tests (4 tests) CodeRabbit follow-up (3 additional items): - Assert error_code on version conflict integration tests - Assert sanitized 503 messages don't leak internal errors - PEP 758 except syntax: not applicable with `as` clause (ruff limitation)

github-actions · 2026-03-12T20:29:31Z

Dependency Review

✅ No vulnerabilities or license issues or OpenSSF Scorecard issues found.

Scanned Files

None

codecov · 2026-03-12T20:32:10Z

Codecov Report

❌ Patch coverage is 94.70014% with 38 lines in your changes missing coverage. Please review.
✅ Project coverage is 93.75%. Comparing base (9a9c600) to head (d23952a).
⚠️ Report is 1 commits behind head on main.
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
src/ai_company/engine/task_engine.py	93.90%	8 Missing and 7 partials ⚠️
src/ai_company/api/controllers/tasks.py	80.00%	13 Missing and 1 partial ⚠️
src/ai_company/api/app.py	80.55%	5 Missing and 2 partials ⚠️
src/ai_company/engine/agent_engine.py	91.66%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #328      +/-   ##
==========================================
+ Coverage   93.64%   93.75%   +0.10%     
==========================================
  Files         427      433       +6     
  Lines       19177    19831     +654     
  Branches     1846     1911      +65     
==========================================
+ Hits        17959    18592     +633     
- Misses        943      957      +14     
- Partials      275      282       +7

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

src/ai_company/engine/agent_engine.py

coderabbitai

Actionable comments posted: 3

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

src/ai_company/api/controllers/tasks.py (1)

316-404: ⚠️ Potential issue | 🟠 Major

Add the missing cancel route.

The controller exposes list/get/create/update/transition/delete, but nothing maps to TaskEngine.cancel_task(). That leaves the dedicated cancel mutation path unreachable from /tasks, despite it being part of this feature's API surface.

Suggested endpoint shape

`@post`("/{task_id:str}/cancel", guards=[require_write_access])
async def cancel_task(
    self,
    state: State,
    task_id: str,
    data: CancelTaskRequest,
) -> ApiResponse[Task]:
    app_state: AppState = state.app_state
    task = await app_state.task_engine.cancel_task(
        task_id,
        requested_by=_extract_requester(state),
        reason=data.reason,
    )
    return ApiResponse(data=task)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/ai_company/api/controllers/tasks.py` around lines 316 - 404, The
controller is missing an endpoint to expose TaskEngine.cancel_task, so add a new
POST route method named cancel_task in the same controller with decorators
`@post`("/{task_id:str}/cancel", guards=[require_write_access]) that accepts
(state: State, task_id: str, data: CancelTaskRequest) and calls
app_state.task_engine.cancel_task(task_id,
requested_by=_extract_requester(state), reason=data.reason), returns
ApiResponse(data=task) and wrap TaskEngine exceptions with
_map_task_engine_errors like the other handlers; include a logger.info call
(e.g. API_TASK_CANCELED, task_id=task_id) consistent with existing logging
patterns.

♻️ Duplicate comments (8)

src/ai_company/engine/task_engine_models.py (2)

332-374: ⚠️ Potential issue | 🟠 Major

Add a stable task_id to TaskStateChanged.

task is explicitly None on delete, and _publish_snapshot() currently has no other identifier to serialize. Consumers cannot tell which task was removed, so delete snapshots are not actionable.

Suggested model change

 class TaskStateChanged(BaseModel):
@@
+    task_id: NotBlankStr = Field(description="Affected task identifier")
     task: Task | None = Field(
         default=None,
         description="Task snapshot after mutation",
     )

Populate it from result.task.id when present, or from mutation.task_id for deletes.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/ai_company/engine/task_engine_models.py` around lines 332 - 374,
TaskStateChanged needs a stable task identifier so delete events remain
actionable: add a new field task_id: NotBlankStr | None (or appropriate type) to
the TaskStateChanged model and ensure _publish_snapshot() populates it from
result.task.id when result.task exists, otherwise use mutation.task_id for
delete events; update any constructors/creation sites that build
TaskStateChanged to set task_id accordingly while leaving task (snapshot) None
on deletes.

148-170: ⚠️ Potential issue | 🟠 Major

Freeze updates and overrides after validation.

The deep copy only severs aliasing with caller input. Both fields remain directly mutable on the frozen model, so code can still modify a queued mutation after validation and before the engine consumes it.

Suggested fix

+from collections.abc import Mapping
+from types import MappingProxyType
@@
-    updates: dict[str, object] = Field(description="Field-value pairs to apply")
+    updates: Mapping[str, object] = Field(description="Field-value pairs to apply")
@@
-        object.__setattr__(self, "updates", copy.deepcopy(self.updates))
+        object.__setattr__(
+            self,
+            "updates",
+            MappingProxyType(copy.deepcopy(dict(self.updates))),
+        )
@@
-    overrides: dict[str, object] = Field(
+    overrides: Mapping[str, object] = Field(
         default_factory=dict,
         description="Additional field overrides",
     )
@@
-        object.__setattr__(self, "overrides", copy.deepcopy(self.overrides))
+        object.__setattr__(
+            self,
+            "overrides",
+            MappingProxyType(copy.deepcopy(dict(self.overrides))),
+        )

As per coding guidelines "Use immutability by creating new objects, never mutating existing ones. For non-Pydantic internal collections, use copy.deepcopy() at construction + MappingProxyType wrapping."

Also applies to: 202-227

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/ai_company/engine/task_engine_models.py` around lines 148 - 170, The
updates (and similarly overrides) dicts are deep-copied but left mutable on the
model; after validation freeze them by wrapping the deepcopy in
types.MappingProxyType and assign via object.__setattr__ so the model exposes an
immutable mapping; ensure MappingProxyType is imported and apply the same
pattern for both the "updates" field handling in __init__ and the analogous code
for "overrides" (referencing the _reject_immutable_fields validator, the
"updates" attribute, and the overrides handling block around lines ~202-227).

src/ai_company/engine/task_engine.py (3)

622-631: 🛠️ Refactor suggestion | 🟠 Major

Log successful mutation application at INFO.

This is the single authoritative writer, but _process_one() only emits a DEBUG receipt and optional snapshot logs. The committed mutation itself should produce an INFO event with task/version/status context.

Suggested fix

+# also import TASK_ENGINE_MUTATION_APPLIED from
+# ai_company.observability.events.task_engine
@@
             if not envelope.future.done():
                 envelope.future.set_result(result)
+            if result.success:
+                logger.info(
+                    TASK_ENGINE_MUTATION_APPLIED,
+                    mutation_type=mutation.mutation_type,
+                    request_id=mutation.request_id,
+                    task_id=(
+                        result.task.id
+                        if result.task is not None
+                        else getattr(mutation, "task_id", None)
+                    ),
+                    previous_status=(
+                        result.previous_status.value
+                        if result.previous_status is not None
+                        else None
+                    ),
+                    new_status=(
+                        result.task.status.value
+                        if result.task is not None
+                        else None
+                    ),
+                    version=result.version,
+                )
             if result.success and self._config.publish_snapshots:
                 await self._publish_snapshot(mutation, result)

Based on learnings "All state transitions must log at INFO level" and "Use DEBUG logging for object creation, internal flow, entry/exit of key functions."

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/ai_company/engine/task_engine.py` around lines 622 - 631, Add an
INFO-level log when a mutation is successfully applied inside _process_one after
awaiting _dispatch_mutation (the block handling result and envelope.future);
specifically, when result is returned and before calling _publish_snapshot, emit
an info log that includes the mutation id/context (from mutation), task id or
name (if available on mutation), the resulting version (from self._versions or
result), and the result.status/success to provide authoritative state-transition
telemetry; keep the existing DEBUG receipt and snapshot logs intact and only add
this single INFO log entry adjacent to the existing envelope.future.set_result
and _publish_snapshot calls.

166-183: ⚠️ Potential issue | 🔴 Critical

Don't let wait_for() clear _in_flight before the timeout handler runs.

asyncio.wait_for(self._processing_task, ...) cancels the worker on timeout before the except TimeoutError block executes. By the time saved_in_flight is read, _process_one() has already hit its finally block and cleared it to None, so the blocked caller's future is never resolved.

Suggested fix

             try:
                 await asyncio.wait_for(
-                    self._processing_task,
+                    asyncio.shield(self._processing_task),
                     timeout=effective_timeout,
                 )
                 logger.info(TASK_ENGINE_DRAIN_COMPLETE)
             except TimeoutError:
                 logger.warning(
                     TASK_ENGINE_DRAIN_TIMEOUT,
                     remaining=self._queue.qsize(),
                 )
-                # Capture in-flight ref before cancel — the finally block
-                # in _process_one clears self._in_flight on CancelledError.
+                # Shield keeps the task alive long enough for us to capture
+                # the in-flight envelope before explicit cancellation.
                 saved_in_flight = self._in_flight
                 self._processing_task.cancel()
                 with contextlib.suppress(asyncio.CancelledError):
                     await self._processing_task
                 self._fail_remaining_futures(saved_in_flight)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/ai_company/engine/task_engine.py` around lines 166 - 183, The worker task
is being cancelled by asyncio.wait_for on timeout which lets _process_one reach
its finally block and clear self._in_flight before the except handler can
capture it; change the wait_for call to await
asyncio.wait_for(asyncio.shield(self._processing_task),
timeout=effective_timeout) so the underlying _processing_task is protected from
automatic cancellation, then in the TimeoutError handler capture saved_in_flight
= self._in_flight, cancel self._processing_task explicitly, await it inside
contextlib.suppress(asyncio.CancelledError), and call
_fail_remaining_futures(saved_in_flight) as you already do.

101-122: ⚠️ Potential issue | 🔴 Critical

Guard shutdown and enqueue with the same lifecycle lock.

A submitter can pass the running check, then stop() can drain and tear down the worker, and only afterward the envelope gets queued. That future then has no consumer and can hang forever.

Suggested fix

     def __init__(
         self,
         *,
         persistence: PersistenceBackend,
         message_bus: MessageBus | None = None,
         config: TaskEngineConfig | None = None,
     ) -> None:
+        self._lifecycle_lock = asyncio.Lock()
         self._persistence = persistence
         self._message_bus = message_bus
@@
-        if not self._running:
-            return
-        self._running = False
+        async with self._lifecycle_lock:
+            if not self._running or self._processing_task is None:
+                return
+            self._running = False
+            processing_task = self._processing_task
@@
-                await asyncio.wait_for(
-                    self._processing_task,
-                    timeout=effective_timeout,
-                )
+                await asyncio.wait_for(processing_task, timeout=effective_timeout)
@@
-        if not self._running:
-            logger.warning(
-                TASK_ENGINE_NOT_RUNNING,
-                mutation_type=mutation.mutation_type,
-                request_id=mutation.request_id,
-            )
-            msg = "TaskEngine is not running"
-            raise TaskEngineNotRunningError(msg)
-
-        envelope = _MutationEnvelope(mutation=mutation)
-        try:
-            self._queue.put_nowait(envelope)
-        except asyncio.QueueFull:
-            logger.warning(
-                TASK_ENGINE_QUEUE_FULL,
-                mutation_type=mutation.mutation_type,
-                request_id=mutation.request_id,
-                queue_size=self._queue.qsize(),
-            )
-            msg = "TaskEngine queue is full"
-            raise TaskEngineQueueFullError(msg) from None
+        async with self._lifecycle_lock:
+            if not self._running or self._processing_task is None:
+                logger.warning(
+                    TASK_ENGINE_NOT_RUNNING,
+                    mutation_type=mutation.mutation_type,
+                    request_id=mutation.request_id,
+                )
+                msg = "TaskEngine is not running"
+                raise TaskEngineNotRunningError(msg)
+
+            envelope = _MutationEnvelope(mutation=mutation)
+            try:
+                self._queue.put_nowait(envelope)
+            except asyncio.QueueFull:
+                logger.warning(
+                    TASK_ENGINE_QUEUE_FULL,
+                    mutation_type=mutation.mutation_type,
+                    request_id=mutation.request_id,
+                    queue_size=self._queue.qsize(),
+                )
+                msg = "TaskEngine queue is full"
+                raise TaskEngineQueueFullError(msg) from None

Also applies to: 146-184, 237-272

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/ai_company/engine/task_engine.py` around lines 101 - 122, The
submit/enqueue race occurs because the running check and queue.put are not
atomic with stop(); add a lifecycle asyncio.Lock (e.g., self._lifecycle_lock =
asyncio.Lock()) in __init__ and use it to guard the check-and-enqueue path in
the submit/enqueue functions and the shutdown path in stop() so that stop() can
drain/teardown only after acquiring the same lock; ensure you hold the lock
while checking self._running and performing self._queue.put_nowait/_queue.put
and while stop() flips self._running, cancels/awaits self._processing_task, and
drains the queue to avoid enqueuing envelopes with no consumer (also apply same
locking to other enqueue/submit sites referenced in the ranges 146-184 and
237-272).

tests/unit/engine/test_task_engine_integration.py (1)

99-115: 🧹 Nitpick | 🔵 Trivial

Prefer TaskGroup for this concurrent fan-out.

If one create fails before the final join, the remaining tasks keep running until gather(). TaskGroup gives the same coverage with structured cancellation and cleaner failure handling.

Suggested refactor

-        create_tasks = [
-            asyncio.create_task(
-                eng.create_task(make_create_data(), requested_by="alice")
-            )
-            for _ in range(5)
-        ]
-
-        # Yield to the event loop so the tasks can enqueue their mutations before stop()
-        await asyncio.sleep(0)
-
-        # Stop while tasks may still be in flight — drain timeout is generous
-        await eng.stop(timeout=5.0)
-
-        # All futures resolved (drained during stop or completed before stop)
-        results = await asyncio.gather(*create_tasks)
+        async with asyncio.TaskGroup() as tg:
+            create_tasks = [
+                tg.create_task(
+                    eng.create_task(make_create_data(), requested_by="alice")
+                )
+                for _ in range(5)
+            ]
+
+            # Yield to the event loop so the tasks can enqueue their mutations before stop()
+            await asyncio.sleep(0)
+
+            # Stop while tasks may still be in flight — drain timeout is generous
+            await eng.stop(timeout=5.0)
+
+        # All futures resolved (drained during stop or completed before stop)
+        results = [task.result() for task in create_tasks]

As per coding guidelines "Prefer asyncio.TaskGroup for fan-out/fan-in parallel operations in new code (multiple tool invocations, parallel agent calls). Prefer structured concurrency over bare create_task."

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/unit/engine/test_task_engine_integration.py` around lines 99 - 115,
Replace the ad-hoc create_task/gather pattern with structured concurrency using
asyncio.TaskGroup: instead of building create_tasks with asyncio.create_task and
later awaiting asyncio.gather, open an asyncio.TaskGroup, spawn tasks calling
eng.create_task(make_create_data(), requested_by="alice") inside it, and rely on
the TaskGroup to propagate failures and cancel siblings; keep the yield to the
event loop (await asyncio.sleep(0)) if needed before calling await
eng.stop(timeout=5.0), and remove the final asyncio.gather by letting the
TaskGroup scope ensure all tasks are resolved before asserting the number of
completed tasks results from the group.

src/ai_company/api/controllers/tasks.py (1)

185-186: 🧹 Nitpick | 🔵 Trivial

Use the repository's Python 3.14 multi-except form consistently.

These handlers still use tuple-style except (...) as exc: blocks instead of the except A, B as exc: form required by the project rule.

As per coding guidelines "Use except A, B: syntax (no parentheses) for multiple exception types — PEP 758 on Python 3.14."

Also applies to: 211-212, 261-267, 305-312, 354-362, 395-402
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/ai_company/api/controllers/tasks.py` around lines 185 - 186, Replace
tuple-style multi-except blocks with the Python 3.14 multi-except form used
across the repo: change occurrences like the except (TaskInternalError,
TaskEngineNotRunningError) as exc: to the form except TaskInternalError,
TaskEngineNotRunningError as exc: so the same exception object is captured;
apply the same change to the other listed handlers in this file (the blocks
around lines referenced: the handlers at 211-212, 261-267, 305-312, 354-362,
395-402) and ensure each uses the unparenthesized `except A, B as exc:` syntax
and still re-raises via raise _map_task_engine_errors(exc) from exc.

tests/unit/api/controllers/test_task_helpers.py (1)

72-88: ⚠️ Potential issue | 🟡 Minor

Assert the exact sanitized 503 strings.

These cases only prove the raw engine text is hidden. They'll still pass if the mapping regresses to some other 503 message, which weakens the API contract the controller is deliberately sanitizing.

Suggested assertions

     def test_not_running_maps_to_service_unavailable(self) -> None:
         exc = TaskEngineNotRunningError("not running")
         result = _map_task_engine_errors(exc)
         assert isinstance(result, ServiceUnavailableError)
-        assert "not running" not in str(result)
+        assert str(result) == "Service temporarily unavailable"

     def test_queue_full_maps_to_service_unavailable(self) -> None:
         exc = TaskEngineQueueFullError("queue full")
         result = _map_task_engine_errors(exc)
         assert isinstance(result, ServiceUnavailableError)
-        assert "queue full" not in str(result)
+        assert str(result) == "Service temporarily unavailable"

     def test_internal_error_maps_to_service_unavailable(self) -> None:
         exc = TaskInternalError("internal fault")
         result = _map_task_engine_errors(exc)
         assert isinstance(result, ServiceUnavailableError)
-        assert "internal fault" not in str(result)
+        assert str(result) == "Internal server error"
+        assert "internal fault" not in str(result)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/unit/api/controllers/test_task_helpers.py` around lines 72 - 88, The
tests only check that the original engine text is removed; change each test to
assert the exact sanitized 503 text produced by the mapping function so the API
contract is enforced. Locate the sanitized message constant (the string used
inside _map_task_engine_errors, e.g. SANITIZED_SERVICE_UNAVAILABLE_MESSAGE or
similar) and replace the current negative assertions with equality checks like
assert str(result) == SANITIZED_SERVICE_UNAVAILABLE_MESSAGE for
TaskEngineNotRunningError, TaskEngineQueueFullError, and TaskInternalError to
ensure the mapping returns the exact expected ServiceUnavailableError message.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/ai_company/api/controllers/tasks.py`:
- Around line 296-312: The updates dict produced by
data.model_dump(exclude_none=True) drops expected_version from the call even
though TaskEngine.update_task and transition_task accept optimistic concurrency;
change the controller to extract expected_version from the incoming DTO (e.g.,
data.expected_version or data.model_dump()["expected_version"]) and pass it
through to the engine call (add an expected_version=expected_version argument to
the TaskEngine.update_task and transition_task invocations) while keeping the
existing updates payload and requester (see updates variable, update_task call,
transition_task usage and _extract_requester); also ensure
UpdateTaskRequest/TransitionTaskRequest expose expected_version if missing.

In `@src/ai_company/engine/task_engine.py`:
- Around line 294-299: Create_task(), delete_task(), and cancel_task() currently
allow raw PydanticValidationError to escape when constructing
CreateTaskMutation/DeleteTaskMutation/CancelTaskMutation, breaking the engine
contract; wrap the mutation construction in a try/except that catches
pydantic.ValidationError (or PydanticValidationError) and re-raises as
TaskMutationError with a clear message and the original error as context,
mirroring how update_task() and transition_task() handle model-construction
failures; apply the same fix to the other occurrences where mutations are built
(the other Create/Delete/Cancel mutation sites referenced) so all mutation
construction paths normalize to TaskMutationError before calling self.submit().

In `@tests/unit/engine/test_task_engine_convenience.py`:
- Around line 72-90: The tests for typed-error fallbacks are not exercising the
intended branches; update test_none_code_falls_through to set
TaskMutationResult.error_code=None (so TaskEngine._raise_typed_error will fall
through the typed-error path) and update test_missing_error_uses_default_message
to omit or set an empty/None error on TaskMutationResult (so the fallback
default message is used) and assert TaskMutationError matches that default
message; reference the TaskMutationResult construction in each test and the
TaskEngine._raise_typed_error / TaskMutationError expectations when making the
changes.

---

Outside diff comments:
In `@src/ai_company/api/controllers/tasks.py`:
- Around line 316-404: The controller is missing an endpoint to expose
TaskEngine.cancel_task, so add a new POST route method named cancel_task in the
same controller with decorators `@post`("/{task_id:str}/cancel",
guards=[require_write_access]) that accepts (state: State, task_id: str, data:
CancelTaskRequest) and calls app_state.task_engine.cancel_task(task_id,
requested_by=_extract_requester(state), reason=data.reason), returns
ApiResponse(data=task) and wrap TaskEngine exceptions with
_map_task_engine_errors like the other handlers; include a logger.info call
(e.g. API_TASK_CANCELED, task_id=task_id) consistent with existing logging
patterns.

---

Duplicate comments:
In `@src/ai_company/api/controllers/tasks.py`:
- Around line 185-186: Replace tuple-style multi-except blocks with the Python
3.14 multi-except form used across the repo: change occurrences like the except
(TaskInternalError, TaskEngineNotRunningError) as exc: to the form except
TaskInternalError, TaskEngineNotRunningError as exc: so the same exception
object is captured; apply the same change to the other listed handlers in this
file (the blocks around lines referenced: the handlers at 211-212, 261-267,
305-312, 354-362, 395-402) and ensure each uses the unparenthesized `except A, B
as exc:` syntax and still re-raises via raise _map_task_engine_errors(exc) from
exc.

In `@src/ai_company/engine/task_engine_models.py`:
- Around line 332-374: TaskStateChanged needs a stable task identifier so delete
events remain actionable: add a new field task_id: NotBlankStr | None (or
appropriate type) to the TaskStateChanged model and ensure _publish_snapshot()
populates it from result.task.id when result.task exists, otherwise use
mutation.task_id for delete events; update any constructors/creation sites that
build TaskStateChanged to set task_id accordingly while leaving task (snapshot)
None on deletes.
- Around line 148-170: The updates (and similarly overrides) dicts are
deep-copied but left mutable on the model; after validation freeze them by
wrapping the deepcopy in types.MappingProxyType and assign via
object.__setattr__ so the model exposes an immutable mapping; ensure
MappingProxyType is imported and apply the same pattern for both the "updates"
field handling in __init__ and the analogous code for "overrides" (referencing
the _reject_immutable_fields validator, the "updates" attribute, and the
overrides handling block around lines ~202-227).

In `@src/ai_company/engine/task_engine.py`:
- Around line 622-631: Add an INFO-level log when a mutation is successfully
applied inside _process_one after awaiting _dispatch_mutation (the block
handling result and envelope.future); specifically, when result is returned and
before calling _publish_snapshot, emit an info log that includes the mutation
id/context (from mutation), task id or name (if available on mutation), the
resulting version (from self._versions or result), and the result.status/success
to provide authoritative state-transition telemetry; keep the existing DEBUG
receipt and snapshot logs intact and only add this single INFO log entry
adjacent to the existing envelope.future.set_result and _publish_snapshot calls.
- Around line 166-183: The worker task is being cancelled by asyncio.wait_for on
timeout which lets _process_one reach its finally block and clear
self._in_flight before the except handler can capture it; change the wait_for
call to await asyncio.wait_for(asyncio.shield(self._processing_task),
timeout=effective_timeout) so the underlying _processing_task is protected from
automatic cancellation, then in the TimeoutError handler capture saved_in_flight
= self._in_flight, cancel self._processing_task explicitly, await it inside
contextlib.suppress(asyncio.CancelledError), and call
_fail_remaining_futures(saved_in_flight) as you already do.
- Around line 101-122: The submit/enqueue race occurs because the running check
and queue.put are not atomic with stop(); add a lifecycle asyncio.Lock (e.g.,
self._lifecycle_lock = asyncio.Lock()) in __init__ and use it to guard the
check-and-enqueue path in the submit/enqueue functions and the shutdown path in
stop() so that stop() can drain/teardown only after acquiring the same lock;
ensure you hold the lock while checking self._running and performing
self._queue.put_nowait/_queue.put and while stop() flips self._running,
cancels/awaits self._processing_task, and drains the queue to avoid enqueuing
envelopes with no consumer (also apply same locking to other enqueue/submit
sites referenced in the ranges 146-184 and 237-272).

In `@tests/unit/api/controllers/test_task_helpers.py`:
- Around line 72-88: The tests only check that the original engine text is
removed; change each test to assert the exact sanitized 503 text produced by the
mapping function so the API contract is enforced. Locate the sanitized message
constant (the string used inside _map_task_engine_errors, e.g.
SANITIZED_SERVICE_UNAVAILABLE_MESSAGE or similar) and replace the current
negative assertions with equality checks like assert str(result) ==
SANITIZED_SERVICE_UNAVAILABLE_MESSAGE for TaskEngineNotRunningError,
TaskEngineQueueFullError, and TaskInternalError to ensure the mapping returns
the exact expected ServiceUnavailableError message.

In `@tests/unit/engine/test_task_engine_integration.py`:
- Around line 99-115: Replace the ad-hoc create_task/gather pattern with
structured concurrency using asyncio.TaskGroup: instead of building create_tasks
with asyncio.create_task and later awaiting asyncio.gather, open an
asyncio.TaskGroup, spawn tasks calling eng.create_task(make_create_data(),
requested_by="alice") inside it, and rely on the TaskGroup to propagate failures
and cancel siblings; keep the yield to the event loop (await asyncio.sleep(0))
if needed before calling await eng.stop(timeout=5.0), and remove the final
asyncio.gather by letting the TaskGroup scope ensure all tasks are resolved
before asserting the number of completed tasks results from the group.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 3ca1a4ac-36f4-45e4-b992-a019195b7ad3

📥 Commits

Reviewing files that changed from the base of the PR and between ce3c594 and 327681e.

📒 Files selected for processing (12)

.github/workflows/pages-preview.yml
CLAUDE.md
src/ai_company/api/controllers/tasks.py
src/ai_company/engine/task_engine.py
src/ai_company/engine/task_engine_models.py
src/ai_company/observability/events/api.py
src/ai_company/observability/events/task_engine.py
tests/unit/api/controllers/test_task_helpers.py
tests/unit/engine/test_task_engine_convenience.py
tests/unit/engine/test_task_engine_integration.py
tests/unit/engine/test_task_engine_models.py
tests/unit/engine/test_task_engine_mutations.py

📜 Review details

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: Greptile Review
GitHub Check: Test (Python 3.14)

🧰 Additional context used

📓 Path-based instructions (5)

**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

**/*.py: No from __future__ import annotations — Python 3.14 has PEP 649
Use except A, B: syntax (no parentheses) for multiple exception types — PEP 758 on Python 3.14
All public functions require type hints; mypy strict mode enforced
Docstrings must use Google style and are required on public classes and functions — enforced by ruff D rules
Use immutability by creating new objects, never mutating existing ones. For non-Pydantic internal collections, use copy.deepcopy() at construction + MappingProxyType wrapping. For Pydantic frozen models, use model_copy(update=...) for runtime state evolution.
Use frozen Pydantic models for config/identity; use mutable-via-copy models for runtime state that evolves. Never mix static config fields with mutable runtime fields in one model.
Adopt Pydantic v2 conventions: use @computed_field for derived values instead of storing redundant fields; use NotBlankStr for all identifier/name fields (including optional and tuple variants) instead of manual whitespace validators
Prefer asyncio.TaskGroup for fan-out/fan-in parallel operations in new code (multiple tool invocations, parallel agent calls). Prefer structured concurrency over bare create_task.
Line length must be 88 characters — enforced by ruff
Functions must be less than 50 lines; files must be less than 800 lines
Handle errors explicitly, never silently swallow errors
Validate at system boundaries (user input, external APIs, config files)

Files:

src/ai_company/observability/events/task_engine.py
src/ai_company/api/controllers/tasks.py
tests/unit/engine/test_task_engine_integration.py
src/ai_company/observability/events/api.py
tests/unit/engine/test_task_engine_convenience.py
src/ai_company/engine/task_engine_models.py
src/ai_company/engine/task_engine.py
tests/unit/api/controllers/test_task_helpers.py
tests/unit/engine/test_task_engine_models.py
tests/unit/engine/test_task_engine_mutations.py

src/ai_company/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

src/ai_company/**/*.py: Every module with business logic MUST import from ai_company.observability import get_logger and create logger = get_logger(__name__)
Never use import logging / logging.getLogger() / print() in application code — only use the structured logger from ai_company.observability
Logger variable must always be named logger (not _logger, not log)
Use event name constants from domain-specific modules under ai_company.observability.events (e.g., PROVIDER_CALL_START from events.provider, BUDGET_RECORD_ADDED from events.budget). Import directly: from ai_company.observability.events.<domain> import EVENT_CONSTANT
Always use structured logging: logger.info(EVENT, key=value) — never logger.info('msg %s', val)
All error paths must log at WARNING or ERROR with context before raising
All state transitions must log at INFO level
Use DEBUG logging for object creation, internal flow, entry/exit of key functions
Pure data models, enums, and re-exports do NOT need logging

Files:

src/ai_company/observability/events/task_engine.py
src/ai_company/api/controllers/tasks.py
src/ai_company/observability/events/api.py
src/ai_company/engine/task_engine_models.py
src/ai_company/engine/task_engine.py

**/{src,tests}/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

NEVER use real vendor names (Anthropic, OpenAI, Claude, GPT, etc.) in project-owned code, docstrings, comments, tests, or config examples. Use generic names: example-provider, example-large-001, example-medium-001, example-small-001. Tests must use test-provider, test-small-001, etc. Vendor names only in: (1) docs/design/operations.md, (2) .claude/ files, (3) third-party import paths.

Files:

src/ai_company/observability/events/task_engine.py
src/ai_company/api/controllers/tasks.py
tests/unit/engine/test_task_engine_integration.py
src/ai_company/observability/events/api.py
tests/unit/engine/test_task_engine_convenience.py
src/ai_company/engine/task_engine_models.py
src/ai_company/engine/task_engine.py
tests/unit/api/controllers/test_task_helpers.py
tests/unit/engine/test_task_engine_models.py
tests/unit/engine/test_task_engine_mutations.py

tests/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

tests/**/*.py: Markers for tests must be: @pytest.mark.unit, @pytest.mark.integration, @pytest.mark.e2e, @pytest.mark.slow
Prefer @pytest.mark.parametrize for testing similar cases

Files:

tests/unit/engine/test_task_engine_integration.py
tests/unit/engine/test_task_engine_convenience.py
tests/unit/api/controllers/test_task_helpers.py
tests/unit/engine/test_task_engine_models.py
tests/unit/engine/test_task_engine_mutations.py

src/ai_company/{providers,engine}/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

All provider calls go through BaseCompletionProvider which applies retry + rate limiting automatically. Never implement retry logic in driver subclasses or calling code.

Files:

src/ai_company/engine/task_engine_models.py
src/ai_company/engine/task_engine.py

🧠 Learnings (13)