feat: add HR engine and performance tracking (#45, #47) #193
Conversation
Dependency Review: ✅ No vulnerabilities, license issues, or OpenSSF Scorecard issues found. Scanned files: none.
Caution: Review failed. The pull request is closed.
ℹ️ Recent review info. ⚙️ Run configuration: Organization UI Review profile: ASSERTIVE · Plan: Pro · Run ID:
📒 Files selected for processing (65)
📝 Walkthrough (Summary by CodeRabbit)
Walkthrough

Adds a new modular HR package and performance subsystem: hiring, onboarding, offboarding, archival and registry services; pluggable protocols and strategies for reassignment, archival, quality, collaboration, windowing and trend detection; SQLite-backed HR persistence (v2 migration); observability event constants; and extensive unit tests and fixtures.

Changes
Sequence Diagram(s)

sequenceDiagram
actor User
participant HiringService as "HiringService"
participant ApprovalStore as "ApprovalStore (optional)"
participant AgentRegistry as "AgentRegistryService"
participant Onboarding as "OnboardingService (optional)"
User->>HiringService: create_request(...)
HiringService->>HiringService: persist request
User->>HiringService: generate_candidate(request)
HiringService->>HiringService: append candidate
User->>HiringService: submit_for_approval(request,candidate_id)
alt ApprovalStore configured
HiringService->>ApprovalStore: create approval item
ApprovalStore-->>HiringService: approval result (async)
else auto-approve
HiringService->>HiringService: mark APPROVED
end
User->>HiringService: instantiate_agent(request)
HiringService->>AgentRegistry: register(agent_identity)
AgentRegistry-->>HiringService: registered
opt OnboardingService present
HiringService->>Onboarding: start_onboarding(agent_id)
Onboarding-->>HiringService: checklist
end
HiringService-->>User: AgentIdentity
sequenceDiagram
actor User
participant OffboardingService as "OffboardingService"
participant Reassign as "TaskReassignmentStrategy"
participant Archival as "MemoryArchivalStrategy"
participant MessageBus as "MessageBus (optional)"
participant AgentRegistry as "AgentRegistryService"
User->>OffboardingService: offboard(firing_request)
OffboardingService->>AgentRegistry: get(agent_id)
OffboardingService->>Reassign: reassign(agent_id, active_tasks)
Reassign-->>OffboardingService: updated_tasks
OffboardingService->>Archival: archive(agent_id,...)
Archival-->>OffboardingService: ArchivalResult
alt MessageBus configured
OffboardingService->>MessageBus: publish(notification)
end
OffboardingService->>AgentRegistry: unregister(agent_id)
AgentRegistry-->>OffboardingService: removed
OffboardingService-->>User: OffboardingRecord
sequenceDiagram
participant Tracker as "PerformanceTracker"
participant Quality as "QualityScoringStrategy"
participant Collaboration as "CollaborationScoringStrategy"
participant Window as "MetricsWindowStrategy"
participant Trend as "TrendDetectionStrategy"
Tracker->>Tracker: record_task_metric(record)
Tracker->>Quality: score(agent_id, task_id, task_result, criteria)
Quality-->>Tracker: QualityScoreResult
Tracker->>Tracker: record_collaboration_event(record)
Tracker->>Collaboration: score(agent_id, collab_records)
Collaboration-->>Tracker: CollaborationScoreResult
Tracker->>Window: compute_windows(records, now)
Window-->>Tracker: tuple[WindowMetrics]
Tracker->>Trend: detect(metric_name, values, window_size)
Trend-->>Tracker: TrendResult
Tracker-->>Tracker: assemble AgentPerformanceSnapshot
Estimated code review effort: 🎯 5 (Critical) | ⏱️ ~120 minutes

Possibly related PRs
Greptile Summary

This PR adds a complete HR engine (hiring pipeline, onboarding/offboarding orchestration, agent registry) and a performance-tracking subsystem (task metrics, collaboration scoring, Theil–Sen trend detection, multi-window rolling aggregates) backed by a new SQLite schema migration (v2). The overall architecture is sound: protocols are well-defined, error handling is explicit, and the previous round of pre-PR reviews has visibly cleaned up broad exception clauses and lock usage. Key findings:
Confidence Score: 2/5
Important Files Changed
Sequence Diagram

sequenceDiagram
participant Caller
participant HiringService
participant AgentRegistryService
participant OnboardingService
participant PerformanceTracker
participant OffboardingService
Note over Caller,OffboardingService: Hiring Pipeline
Caller->>HiringService: create_request(role, dept, level, ...)
HiringService-->>Caller: HiringRequest (PENDING)
Caller->>HiringService: generate_candidate(request)
HiringService-->>Caller: HiringRequest (+ CandidateCard)
Caller->>HiringService: submit_for_approval(request, candidate_id)
alt approval_store present
HiringService->>ApprovalStore: add(ApprovalItem)
else auto-approve
HiringService-->>Caller: HiringRequest (APPROVED)
end
Caller->>HiringService: instantiate_agent(request)
HiringService->>AgentRegistryService: register(identity)
HiringService->>OnboardingService: start_onboarding(agent_id)
HiringService-->>Caller: AgentIdentity (ACTIVE)
Note over Caller,PerformanceTracker: Performance Tracking
Caller->>PerformanceTracker: record_task_metric(record)
PerformanceTracker-->>Caller: TaskMetricRecord (quality_score=None)
Caller->>PerformanceTracker: score_task_quality(task_result, criteria)
PerformanceTracker-->>Caller: TaskMetricRecord (quality_score=X)
Note right of PerformanceTracker: ⚠️ stored record NOT updated
Caller->>PerformanceTracker: get_snapshot(agent_id)
PerformanceTracker-->>Caller: AgentPerformanceSnapshot (quality=None!)
Note over Caller,OffboardingService: Offboarding Pipeline
Caller->>OffboardingService: offboard(FiringRequest)
OffboardingService->>TaskRepository: list_tasks + reassign
OffboardingService->>ArchivalStrategy: archive(agent_id, ...)
OffboardingService->>MessageBus: publish(HR_NOTIFICATION)
OffboardingService->>AgentRegistryService: update_status(TERMINATED)
OffboardingService-->>Caller: OffboardingRecord (memory_archive_id=None)
Last reviewed commit: f2c49f7
Pull request overview
Adds a new HR subsystem (hiring/onboarding/offboarding/registry) plus performance tracking (metrics, scoring, rolling windows, trend detection), and wires persistence/observability support for the new domain.
Changes:
- Introduces `ai_company.hr` package: lifecycle services (registry, hiring, onboarding, offboarding) + pluggable reassignment/archival strategies.
- Introduces `ai_company.hr.performance` package: metric models, quality/collaboration scoring, multi-window aggregation, Theil–Sen trend detection, and a tracker service.
- Extends SQLite persistence schema to v2 and adds SQLite repositories + persistence backend/protocol surface area for HR/performance tables and events.
Reviewed changes
Copilot reviewed 63 out of 65 changed files in this pull request and generated 7 comments.
Show a summary per file
| File | Description |
|---|---|
| tests/unit/persistence/test_protocol.py | Extends persistence protocol compliance fakes to include HR/performance repos. |
| tests/unit/persistence/test_migrations_v2.py | Adds unit tests validating schema v2 migrations, tables, and indexes. |
| tests/unit/observability/test_events.py | Updates expected observability event modules to include hr and performance. |
| tests/unit/hr/test_registry.py | Adds unit tests for AgentRegistryService behavior. |
| tests/unit/hr/test_queue_return_strategy.py | Adds unit tests for queue-return task reassignment strategy. |
| tests/unit/hr/test_persistence.py | Adds unit tests for HR SQLite repositories round-trips and filters. |
| tests/unit/hr/test_onboarding_service.py | Adds unit tests for onboarding checklist lifecycle and activation. |
| tests/unit/hr/test_offboarding_service.py | Adds unit tests for offboarding pipeline orchestration with fakes. |
| tests/unit/hr/test_models.py | Adds unit tests for HR domain model validation and immutability. |
| tests/unit/hr/test_hiring_service.py | Adds unit tests for hiring pipeline stages and error cases. |
| tests/unit/hr/test_full_snapshot_strategy.py | Adds unit tests for snapshot archival strategy, including promotion behavior. |
| tests/unit/hr/test_errors.py | Adds unit tests for HR error hierarchy. |
| tests/unit/hr/test_enums.py | Adds unit tests for HR enums and expected values. |
| tests/unit/hr/performance/test_theil_sen_strategy.py | Adds unit tests for Theil–Sen trend detection behavior. |
| tests/unit/hr/performance/test_multi_window_strategy.py | Adds unit tests for multi-window aggregation and edge cases. |
| tests/unit/hr/performance/test_ci_quality_strategy.py | Adds unit tests for CI-signal quality scoring strategy. |
| tests/unit/hr/performance/test_behavioral_collaboration_strategy.py | Adds unit tests for behavioral telemetry collaboration scoring. |
| tests/unit/hr/performance/conftest.py | Adds helpers/factories for performance tracking unit tests. |
| tests/unit/hr/performance/__init__.py | Marks performance unit test package. |
| tests/unit/hr/conftest.py | Adds shared HR fixtures and factory builders for unit tests. |
| tests/unit/hr/__init__.py | Marks HR unit test package. |
| tests/unit/core/test_enums.py | Updates expected AgentStatus enum member count. |
| tests/unit/communication/test_enums.py | Updates expected MessageType enum member count. |
| tests/unit/api/conftest.py | Extends fake persistence backend with HR/performance repositories. |
| src/ai_company/persistence/sqlite/repositories.py | Updates module docstring to point HR repos to hr_repositories.py. |
| src/ai_company/persistence/sqlite/migrations.py | Bumps schema to v2 and adds v2 migration statements for HR tables/indexes. |
| src/ai_company/persistence/sqlite/hr_repositories.py | Adds SQLite repositories for lifecycle events, task metrics, collaboration metrics. |
| src/ai_company/persistence/sqlite/backend.py | Wires new HR repositories into SQLite persistence backend. |
| src/ai_company/persistence/repositories.py | Re-exports new HR repository protocols. |
| src/ai_company/persistence/protocol.py | Extends PersistenceBackend protocol with HR repository properties. |
| src/ai_company/observability/events/persistence.py | Adds structured event constants for HR persistence operations. |
| src/ai_company/observability/events/performance.py | Adds structured event constants for performance tracking operations. |
| src/ai_company/observability/events/hr.py | Adds structured event constants for HR domain operations. |
| src/ai_company/hr/registry.py | Introduces an in-memory agent registry service with status updates and queries. |
| src/ai_company/hr/reassignment_protocol.py | Adds protocol for task reassignment strategies. |
| src/ai_company/hr/queue_return_strategy.py | Implements queue-return reassignment strategy (interrupt + clear assignee). |
| src/ai_company/hr/persistence_protocol.py | Adds repository protocols for lifecycle events and performance metrics. |
| src/ai_company/hr/performance/window_protocol.py | Adds protocol for rolling-window aggregation strategies. |
| src/ai_company/hr/performance/trend_protocol.py | Adds protocol for trend detection strategies. |
| src/ai_company/hr/performance/tracker.py | Adds in-memory performance tracker for recording/scoring/querying snapshots. |
| src/ai_company/hr/performance/theil_sen_strategy.py | Implements Theil–Sen robust slope trend detector. |
| src/ai_company/hr/performance/quality_protocol.py | Adds protocol for quality scoring strategies. |
| src/ai_company/hr/performance/multi_window_strategy.py | Implements multi-window rolling aggregation (7d/30d/90d, etc.). |
| src/ai_company/hr/performance/models.py | Adds performance tracking Pydantic models (records, results, windows, snapshots). |
| src/ai_company/hr/performance/config.py | Adds performance configuration model (windows, thresholds, weights). |
| src/ai_company/hr/performance/collaboration_protocol.py | Adds protocol for collaboration scoring strategies. |
| src/ai_company/hr/performance/ci_quality_strategy.py | Implements CI-signal-based quality scoring strategy. |
| src/ai_company/hr/performance/behavioral_collaboration_strategy.py | Implements behavioral telemetry collaboration scoring strategy. |
| src/ai_company/hr/performance/__init__.py | Marks performance package. |
| src/ai_company/hr/onboarding_service.py | Implements onboarding checklist management and activation on completion. |
| src/ai_company/hr/offboarding_service.py | Implements offboarding pipeline orchestration (reassign/archive/notify/terminate). |
| src/ai_company/hr/models.py | Adds HR Pydantic models for hiring/firing/onboarding/offboarding/lifecycle events. |
| src/ai_company/hr/hiring_service.py | Implements hiring pipeline (request → candidates → approval → instantiation). |
| src/ai_company/hr/full_snapshot_strategy.py | Implements full-snapshot memory archival with optional org-memory promotion. |
| src/ai_company/hr/errors.py | Adds HR error hierarchy (hiring/offboarding/registry/performance). |
| src/ai_company/hr/enums.py | Adds HR enums (request status, reasons, steps, event types, trend direction). |
| src/ai_company/hr/archival_protocol.py | Adds protocol + result model for memory archival strategies. |
| src/ai_company/hr/__init__.py | Marks HR package. |
| src/ai_company/core/enums.py | Adds AgentStatus.ONBOARDING. |
| src/ai_company/communication/enums.py | Adds MessageType.HR_NOTIFICATION. |
| README.md | Updates milestone feature list to include HR engine and performance tracking. |
| DESIGN_SPEC.md | Updates package tree documentation to reflect new HR/performance modules and persistence files. |
| CLAUDE.md | Updates repository overview and logging conventions to include HR/performance event constants. |
src/ai_company/hr/registry.py
    name_lower = name.lower()
    for identity in self._agents.values():
        if str(identity.name).lower() == name_lower:
            return identity
    return None
These iterations over self._agents.values() are not protected by self._lock. If another task registers/unregisters concurrently, Python can raise RuntimeError: dictionary changed size during iteration. Consider taking a snapshot of self._agents.values() under the lock (or holding the lock during the loop).
src/ai_company/hr/registry.py
        Returns:
            Tuple of active agent identities.
        """
        return tuple(a for a in self._agents.values() if a.status == AgentStatus.ACTIVE)
list_active() reads/iterates registry state without holding self._lock, so concurrent register()/unregister() can trigger RuntimeError: dictionary changed size during iteration or return inconsistent results. Consider snapshotting under the lock (or holding the lock while building the tuple).
src/ai_company/hr/registry.py
    dept_lower = department.lower()
    return tuple(
        a for a in self._agents.values() if str(a.department).lower() == dept_lower
    )
list_by_department() iterates self._agents.values() without self._lock. Concurrent writes can raise RuntimeError: dictionary changed size during iteration. Consider snapshotting the values under the lock before filtering.
src/ai_company/hr/hiring_service.py
    except Exception as exc:
        msg = f"Failed to instantiate agent for request {request.id!r}"
        logger.exception(
            HR_HIRING_INSTANTIATION_FAILED,
            request_id=str(request.id),
            error=str(exc),
        )
        raise HiringError(msg) from exc
Catching a blanket Exception here will also swallow unexpected programmer errors (e.g., AttributeError/TypeError) and re-raise them as HiringError, which can make debugging harder. Consider catching the specific exception types you expect from AgentIdentity/ModelConfig validation and registry.register() (e.g., Pydantic validation errors, AgentAlreadyRegisteredError, etc.), and let genuinely unexpected exceptions propagate.
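A minimal sketch of the narrower exception handling suggested here. `HiringError` appears in the PR; `AgentAlreadyRegisteredError` and the specific tuple of caught types are assumptions about what validation and `registry.register()` can raise:

```python
class HiringError(Exception):
    """Domain error surfaced to callers of the hiring pipeline."""


class AgentAlreadyRegisteredError(Exception):
    """Stand-in for the registry's duplicate-registration error."""


def instantiate_agent(register_fn):
    try:
        return register_fn()
    # Catch only the failures expected from validation and registration;
    # programmer errors (TypeError, AttributeError, ...) propagate unchanged
    # instead of being masked as HiringError.
    except (ValueError, AgentAlreadyRegisteredError) as exc:
        raise HiringError("Failed to instantiate agent") from exc
```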
    def __init__(
        self,
        *,
        quality_strategy: QualityScoringStrategy,
        collaboration_strategy: CollaborationScoringStrategy,
        window_strategy: MetricsWindowStrategy,
        trend_strategy: TrendDetectionStrategy,
        config: PerformanceConfig | None = None,
    ) -> None:
        self._quality_strategy = quality_strategy
        self._collaboration_strategy = collaboration_strategy
        self._window_strategy = window_strategy
        self._trend_strategy = trend_strategy
        self._config = config or PerformanceConfig()
        self._task_metrics: dict[str, list[TaskMetricRecord]] = {}
        self._collab_metrics: dict[str, list[CollaborationMetricRecord]] = {}
PerformanceConfig includes windows, improving_threshold, declining_threshold, and collaboration_weights, but this tracker only uses config.min_data_points. This can lead to confusing/ineffective configuration (changing thresholds/windows has no effect). Either wire these config values into the injected strategies (or construct default strategies from config) or remove the unused fields/config parameter from PerformanceTracker.
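One way to act on this, sketched with simplified stand-ins (the real `PerformanceConfig` fields and strategy protocols differ from this toy version): construct the strategy from the config, so that changing thresholds actually has an effect.

```python
class PerformanceConfig:  # simplified stand-in for the real config model
    def __init__(self, improving_threshold=0.05, declining_threshold=-0.05):
        self.improving_threshold = improving_threshold
        self.declining_threshold = declining_threshold


def make_trend_classifier(config: PerformanceConfig):
    """Build the trend classifier from config so its thresholds are wired in."""
    def classify(slope: float) -> str:
        if slope >= config.improving_threshold:
            return "improving"
        if slope <= config.declining_threshold:
            return "declining"
        return "stable"
    return classify
```

The same factory approach would apply to windows and collaboration weights: either the tracker builds default strategies from its config, or the unused config fields are dropped.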
    # Compute all pairwise slopes.
    slopes: list[float] = []
    for (t1, v1), (t2, v2) in combinations(values, 2):
        dt_days = (t2.timestamp() - t1.timestamp()) / _SECONDS_PER_DAY
        if abs(dt_days) < _MIN_DELTA_DAYS:
            continue
        slope = (v2 - v1) / dt_days
        slopes.append(slope)
Theil-Sen slope calculation depends on time ordering. combinations(values, 2) uses the incoming order, so if values isn’t sorted by timestamp you can get negative dt_days and inverted slopes, which can flip the detected trend direction. Consider sorting by timestamp (or otherwise ensuring x2 > x1) before computing slopes.
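The suggested fix, sketched as a self-contained Theil–Sen slope estimator (function name and constants are illustrative, not the PR's implementation): sorting by timestamp first guarantees every `dt_days` is non-negative, so slope signs cannot flip with input order.

```python
from itertools import combinations
from statistics import median

_SECONDS_PER_DAY = 86400.0
_MIN_DELTA_DAYS = 1e-9


def theil_sen_slope(values):
    """values: (timestamp, value) pairs; returns the median pairwise slope
    per day, or None if no valid pair exists."""
    # Sort by timestamp so every pair satisfies t2 >= t1.
    ordered = sorted(values, key=lambda pair: pair[0])
    slopes: list[float] = []
    for (t1, v1), (t2, v2) in combinations(ordered, 2):
        dt_days = (t2.timestamp() - t1.timestamp()) / _SECONDS_PER_DAY
        if dt_days < _MIN_DELTA_DAYS:  # after sorting, dt is never negative
            continue
        slopes.append((v2 - v1) / dt_days)
    return median(slopes) if slopes else None
```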
    org_category = _CATEGORY_MAP[entry.category]
    author = OrgFactAuthor(agent_id=agent_id)
    write_req = OrgFactWriteRequest(
        content=NotBlankStr(entry.content),
        category=org_category,
    )
    await org_memory_backend.write(write_req, author=author)
Org memory promotion appears to be effectively disabled: OrgFactAuthor(agent_id=agent_id) will fail validation because non-human authors require both agent_id and seniority (see OrgFactAuthor validator). Since the exception is caught, promotable entries will be skipped and promoted_count will stay 0, contradicting the docstring. Pass an author with seniority (e.g., derive from AgentIdentity/registry) or change the promotion API to accept the author explicitly.
Actionable comments posted: 30
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
tests/unit/communication/test_enums.py (1)
19-31: ⚠️ Potential issue | 🟡 Minor

Add assertion for the new HR_NOTIFICATION value. The member count was updated to 10, but test_values doesn't verify the new HR_NOTIFICATION member. For consistency with other enum tests in this file (which verify all member values), add the missing assertion.

💚 Proposed fix

    def test_values(self) -> None:
        assert MessageType.TASK_UPDATE.value == "task_update"
        assert MessageType.QUESTION.value == "question"
        assert MessageType.ANNOUNCEMENT.value == "announcement"
        assert MessageType.REVIEW_REQUEST.value == "review_request"
        assert MessageType.APPROVAL.value == "approval"
        assert MessageType.DELEGATION.value == "delegation"
        assert MessageType.STATUS_REPORT.value == "status_report"
        assert MessageType.ESCALATION.value == "escalation"
        assert MessageType.MEETING_CONTRIBUTION.value == "meeting_contribution"
    +   assert MessageType.HR_NOTIFICATION.value == "hr_notification"

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@tests/unit/communication/test_enums.py` around lines 19 - 31, The test suite added a new enum member but test_values doesn't assert it; update test_values in tests/unit/communication/test_enums.py to include an assertion that MessageType.HR_NOTIFICATION.value == "hr_notification" (alongside the other MessageType assertions) so the new member is verified and the member count matches test_member_count.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@DESIGN_SPEC.md`:
- Line 2827: The structure map entry incorrectly points to hr.enums for
AgentStatus; update the map so the AgentStatus reference points to the module
where it is actually defined (core.enums) instead of hr.enums, i.e., replace the
AgentStatus module reference in the map from hr.enums to core.enums so readers
are directed to the correct definition.
In `@src/ai_company/hr/archival_protocol.py`:
- Around line 51-70: The archive method's docstring is missing a Raises section
documenting MemoryArchivalError; update the docstring for async def archive (in
archival_protocol.py) to include a "Raises:" block that lists
MemoryArchivalError (and when it is raised, e.g., on archival failure or backend
errors) to match TaskReassignmentStrategy's style and the project's errors.py
hierarchy so callers know to handle this exception.
In `@src/ai_company/hr/full_snapshot_strategy.py`:
- Around line 65-186: The archive() method is too long and should be broken into
helpers; extract the phases into private methods to improve readability and
testability: implement a _retrieve_entries(agent_id, memory_backend) that wraps
memory_backend.retrieve and error handling, an _archive_entries(entries,
archival_store, agent_id, now) that constructs ArchivalEntry, calls
archival_store.archive and returns (archived_count, deleted_ids), a
_promote_to_org(entries, org_memory_backend, agent_id) that handles category
mapping via _CATEGORY_MAP, OrgFactAuthor/OrgFactWriteRequest and returns
promoted_count, and a _clean_hot_store(deleted_ids, memory_backend, agent_id)
that deletes ids and returns hot_store_cleaned; update archive() to call these
helpers and assemble the ArchivalResult while preserving existing logging and
exception behavior (use function names: archive, _retrieve_entries,
_archive_entries, _promote_to_org, _clean_hot_store, ArchivalEntry,
archival_store.archive, org_memory_backend.write, memory_backend.delete).
- Around line 156-169: The delete loop is re-wrapping already-validated IDs with
NotBlankStr; remove the redundant wrap and pass memory_id directly to
memory_backend.delete from the hot store cleanup loop (symbols: deleted_ids,
memory_id, memory_backend.delete, NotBlankStr, hot_store_cleaned); ensure
consistency with how deleted_ids is populated (either keep deleted_ids as
list[NotBlankStr] and pass the elements unchanged, or keep them as list[str] and
pass the raw str) and update the code that builds deleted_ids accordingly so
validation happens only once when entries are read from the backend.
In `@src/ai_company/hr/hiring_service.py`:
- Around line 318-329: The new AgentIdentity is not copying the candidate's
skills, causing AgentIdentity.skills to remain the default empty SkillSet;
update the AgentIdentity construction (AgentIdentity(...)) to set its skills
from the CandidateCard (e.g., use candidate.skills or convert candidate.skills
into the expected SkillSet type) so the hired agent retains the
CandidateCard.skills for downstream skill-based routing and gap-filling logic.
Ensure any necessary conversion or validation is applied so the types match
between CandidateCard.skills and AgentIdentity.skills.
- Around line 129-168: The generate_candidate method is mutating _requests using
the caller-supplied HiringRequest snapshot (request) which can cause
lost-updates; change it to re-load the authoritative request from self._requests
by id (self._requests[str(request.id)]) before creating the CandidateCard, then
apply changes to that fresh instance (use model_copy or a helper like
_apply_update_to_request) and write back into self._requests; encapsulate the
read-modify-write in a small helper (e.g., _get_current_request and
_save_updated_request) and use those helpers here (and similarly in the other
methods flagged at lines 170-244) to avoid overwriting concurrent updates and
preserve existing candidates/approval fields.
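The read-modify-write pattern this prompt asks for, sketched with plain dicts standing in for the Pydantic models and `model_copy` (all names hypothetical): the method re-reads the stored request by id and updates that copy, so a stale caller-held snapshot cannot clobber concurrent changes.

```python
def append_candidate(requests: dict, request_id: str, candidate: dict) -> dict:
    """Re-read the authoritative request before mutating, then write back."""
    current = requests[request_id]  # authoritative version, not the caller's snapshot
    updated = {**current, "candidates": [*current["candidates"], candidate]}
    requests[request_id] = updated
    return updated
```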
- Around line 327-349: The code always creates the new identity with
AgentStatus.ONBOARDING which leaves agents stuck if no onboarding service exists
or if start_onboarding fails; fix by: when instantiating the identity choose
status = AgentStatus.ONBOARDING only if self._onboarding_service is not None,
otherwise use AgentStatus.ACTIVE; additionally wrap the await
self._onboarding_service.start_onboarding(...) call in a try/except so that if
start_onboarding raises you perform compensation: call the registry
rollback/unregister method (e.g., self._registry.unregister(identity.id) or
appropriate removal), remove or revert the request entry in self._requests, and
raise a HiringError from the exception; reference symbols:
AgentStatus.ONBOARDING, AgentStatus.ACTIVE, _onboarding_service,
_registry.register, start_onboarding, self._requests,
HiringRequestStatus.INSTANTIATED.
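A synchronous sketch of the status choice plus compensation pattern described above; the registry is a plain dict and `OnboardingFailed` stands in for `HiringError`:

```python
class OnboardingFailed(Exception):  # illustrative stand-in for HiringError
    pass


def instantiate(registry: dict, onboarding_start, identity: dict) -> dict:
    """Register the agent, then roll back (unregister) if onboarding fails.
    Without an onboarding service, activate the agent immediately so it
    cannot get stuck in an ONBOARDING state nothing will ever advance."""
    identity = {**identity,
                "status": "onboarding" if onboarding_start else "active"}
    registry[identity["id"]] = identity
    if onboarding_start is not None:
        try:
            onboarding_start(identity["id"])
        except Exception as exc:
            registry.pop(identity["id"], None)  # compensate partial state
            raise OnboardingFailed("onboarding failed") from exc
    return identity
```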
- Line 154: The assignment for estimated_monthly_cost currently uses
"request.budget_limit_monthly or 50.0", which treats 0.0 as missing; change it
to preserve an explicit 0.0 by using a None-check (e.g., use
request.budget_limit_monthly if request.budget_limit_monthly is not None else
50.0) so estimated_monthly_cost uses the provided 0.0 but falls back to 50.0
only when budget_limit_monthly is None; update the expression where
estimated_monthly_cost is set (referencing request.budget_limit_monthly and
estimated_monthly_cost) accordingly.
In `@src/ai_company/hr/models.py`:
- Around line 270-273: Replace the plain string type on the tasks_reassigned
field with the NotBlankStr type so identifiers are validated: change the
annotation from tuple[str, ...] to tuple[NotBlankStr, ...] on the
tasks_reassigned Field, keep the default=() and description, and add an import
for NotBlankStr from core.types if it’s not already imported; ensure any type
hints/usages that rely on tasks_reassigned reflect the new NotBlankStr element
type.
In `@src/ai_company/hr/offboarding_service.py`:
- Around line 109-114: The warning logs the wrong event constant when the agent
lookup fails: replace the HR_FIRING_INITIATED constant used in logger.warning
with HR_FIRING_COMPLETE and include the error context (msg) as before; locate
the agent lookup around the self._registry.get(agent_id) call where identity is
checked, update the logger.warning call that currently references
HR_FIRING_INITIATED to use HR_FIRING_COMPLETE, keeping agent_id and error=msg,
then raise AgentNotFoundError as before.
In `@src/ai_company/hr/onboarding_service.py`:
- Around line 46-78: Check registry membership for agent_id at the start of
start_onboarding and reject missing/invalid agents before creating the
OnboardingChecklist; add a lookup (e.g., via existing registry/service used
elsewhere) and raise OnboardingError with a clear message if not found. Extract
the finalization logic that persists/completes a checklist into a small helper
(e.g., _finalize_checklist or finalize_checklist_write) and call it from
complete_step/update_status so the write-after-await path isn't embedded in the
long method. Ensure start_onboarding uses the registry check and only constructs
and stores the checklist after validation, and update
complete_step/update_status to call the new helper to persist the final
checklist.
In `@src/ai_company/hr/performance/multi_window_strategy.py`:
- Around line 111-159: The code currently computes success_rate for any non-zero
count even when the window is below the minimum sample size; change it so
success_rate is only computed and returned when has_enough is True (otherwise
set it to None). Concretely, remove or stop using the unconditional success_rate
= completed / count assignment and instead compute success_rate inside the
has_enough branch (or set success_rate = None when not has_enough), then pass
round(success_rate, 4) only when success_rate is not None in the WindowMetrics
constructor (refer to the has_enough variable, the success_rate name, and the
final WindowMetrics return block to locate the change).
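The gating logic described above can be sketched as follows (names are illustrative, not the real strategy's API): the rate is only reported when the window meets the minimum sample size.

```python
def window_success_rate(completed: int, count: int, min_samples: int):
    """Return (has_enough, success_rate); rate is None below min_samples."""
    has_enough = count >= min_samples
    if not has_enough or count == 0:
        return has_enough, None
    return has_enough, round(completed / count, 4)
```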
In `@src/ai_company/hr/performance/quality_protocol.py`:
- Around line 25-27: Change the name property return type from plain str to
NotBlankStr to enforce non-blank strategy names: update the annotation on the
name property in the protocol (def name(self) -> NotBlankStr), add the required
import from core.types (NotBlankStr), and ensure any callers/implementations and
tests return/consume NotBlankStr-compatible values (matching existing
QualityScoreResult.strategy_name usage).
In `@src/ai_company/hr/performance/tracker.py`:
- Around line 302-324: get_collaboration_metrics currently only supports a since
filter while get_task_metrics supports both since and until; add an optional
until: AwareDatetime | None = None parameter to get_collaboration_metrics,
update the docstring to document it, and apply the same filter logic used in
get_task_metrics (filter records by recorded_at >= since and recorded_at <=
until when those args are provided) using the same _collab_metrics traversal
(preserve behavior when agent_id is provided); ensure the default remains None
for backward compatibility and update any callers/tests that expect the new
signature.
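The shared since/until filter could look like this sketch (dict records stand in for the Pydantic metric models; the helper name is hypothetical):

```python
def filter_by_window(records, *, since=None, until=None):
    """Keep records with since <= recorded_at <= until; None means unbounded,
    so the default call is backward compatible with a since-only filter."""
    return tuple(
        r for r in records
        if (since is None or r["recorded_at"] >= since)
        and (until is None or r["recorded_at"] <= until)
    )
```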
- Around line 221-272: _compute_trends contains duplicated logic for building
(timestamp, value) tuples and calling self._trend_strategy.detect for
quality_score and cost_usd; extract that into a small helper (e.g.,
_compute_metric_trend or compute_and_append_trend) that accepts metric_name
(NotBlankStr), a value extractor (lambda or attr name) and window.window_size,
builds the values tuple from window_records, calls self._trend_strategy.detect
and appends the result when values exist; update _compute_trends to call this
helper for "quality_score" (skip None quality_score) and "cost_usd" to remove
the duplicated blocks while preserving filtering, window_size param and
appending behavior.
In `@src/ai_company/hr/queue_return_strategy.py`:
- Around line 75-87: The code uses the same event constant
HR_FIRING_TASKS_REASSIGNED for both the failure warning and the success info
log, which harms observability; update the failure path in the logger.warning
call inside the except block to use a distinct constant (e.g.,
HR_FIRING_TASK_REASSIGN_FAILED) or create that constant if it doesn't exist,
keep the same structured fields (agent_id, task_id, error=msg), and leave the
raise TaskReassignmentError(msg) from exc unchanged so success logs (logger.info
with HR_FIRING_TASKS_REASSIGNED) remain distinct from failures.
In `@src/ai_company/hr/reassignment_protocol.py`:
- Around line 27-42: The protocol method reassign lacks a Raises docstring
entry; update its docstring to document that implementations (e.g.,
QueueReturnStrategy) may raise TaskReassignmentError when task state transitions
fail—add a Raises section to reassign describing TaskReassignmentError, when it
is raised (on transition failures/failed clearing of assigned_to) and any
conditions or guarantees callers can expect.
In `@src/ai_company/hr/registry.py`:
- Around line 39-47: The constructor stores a MessageBus in self._message_bus
but register(), unregister(), and update_status() only log events; integrate the
bus so lifecycle changes are published: in AgentRegistry.register,
AgentRegistry.unregister, and AgentRegistry.update_status (and any helper that
mutates self._agents) after acquiring self._lock and updating self._agents, call
self._message_bus.publish/send (guarding for None) with a concise event object
(include event type like
"agent_registered"/"agent_unregistered"/"agent_status_updated", the
AgentIdentity and new status) so callers wired with a MessageBus receive
notifications; preserve existing locking and error handling and make publishing
best-effort (log but don’t raise if bus fails).
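A minimal sketch of the best-effort publish pattern. The registry below is heavily simplified (the real service validates, logs, and carries full models); the event-dict shape and `_notify` helper are illustrative assumptions.

```python
import threading

class _RecordingBus:  # stand-in for communication.bus_protocol.MessageBus
    def __init__(self) -> None:
        self.published: list[dict] = []
    def publish(self, event: dict) -> None:
        self.published.append(event)

class AgentRegistry:
    def __init__(self, message_bus=None) -> None:
        self._message_bus = message_bus
        self._agents: dict[str, str] = {}
        self._lock = threading.Lock()

    def _notify(self, event_type: str, agent_id: str, status: str) -> None:
        if self._message_bus is None:  # guard: bus is optional
            return
        try:
            self._message_bus.publish(
                {"type": event_type, "agent_id": agent_id, "status": status}
            )
        except Exception:
            # best-effort: the real code would log the failure, never raise
            pass

    def register(self, agent_id: str) -> None:
        with self._lock:
            self._agents[agent_id] = "active"
        self._notify("agent_registered", agent_id, "active")

    def update_status(self, agent_id: str, status: str) -> None:
        with self._lock:
            self._agents[agent_id] = status
        self._notify("agent_status_updated", agent_id, status)

bus = _RecordingBus()
registry = AgentRegistry(message_bus=bus)
registry.register("agent-001")
registry.update_status("agent-001", "inactive")
```

Publishing outside the lock keeps the critical section small and avoids holding the lock across an external call.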
In `@src/ai_company/persistence/sqlite/hr_repositories.py`:
- Around line 307-343: The CollaborationMetricRepository.query method lacks a
deterministic ordering; update the SQL in query (method name: query of
CollaborationMetricRepository) to append an ORDER BY clause (e.g., ORDER BY
recorded_at ASC, id ASC) after the WHERE block (or on the base SELECT when no
WHERE) so results are consistently ordered before executing self._db.execute;
keep parameters and error handling unchanged and still convert rows via
self._row_to_record.
- Around line 204-244: The TaskMetricRepository.query method (and the analogous
method in CollaborationMetricRepository) returns rows without a deterministic
order; update the SQL constructed in TaskMetricRepository.query to append an
ORDER BY clause (e.g., ORDER BY completed_at DESC, id DESC or your chosen
deterministic columns) when building the final sql string so results are
consistently ordered, ensure the clause is added after the WHERE when clauses
exist (in the same place where sql is currently appended), and keep the existing
parameter handling and error logging (PERSISTENCE_TASK_METRIC_QUERY_FAILED /
PERSISTENCE_TASK_METRIC_QUERIED) unchanged.
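The ordering fix for both repositories reduces to appending one clause when building the SQL. A self-contained illustration against an in-memory SQLite table (column names mirror the migration; the query-building style is an assumption):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE task_metrics (id INTEGER PRIMARY KEY, agent_id TEXT,"
    " completed_at TEXT)"
)
rows = [("agent-001", "2026-01-02"), ("agent-001", "2026-01-01"),
        ("agent-001", "2026-01-02")]
conn.executemany(
    "INSERT INTO task_metrics (agent_id, completed_at) VALUES (?, ?)", rows
)

sql = "SELECT id, completed_at FROM task_metrics"
clauses, params = ["agent_id = ?"], ["agent-001"]
if clauses:
    sql += " WHERE " + " AND ".join(clauses)
# deterministic ordering: timestamp first, id as tie-breaker for equal
# timestamps (rows with id 1 and 3 share "2026-01-02" below)
sql += " ORDER BY completed_at ASC, id ASC"
result = conn.execute(sql, params).fetchall()
```

Without the `id` tie-breaker, rows sharing a timestamp could come back in any order between runs.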
In `@src/ai_company/persistence/sqlite/migrations.py`:
- Around line 123-140: Add composite indexes so agent+time predicates are
covered: create "idx_tm_agent_id_completed_at" on task_metrics(agent_id,
completed_at) and "idx_cm_agent_id_recorded_at" on
collaboration_metrics(agent_id, recorded_at). Insert these CREATE INDEX IF NOT
EXISTS statements into the migrations list alongside the existing single-column
indexes (near idx_tm_agent_id / idx_tm_completed_at and idx_cm_agent_id /
idx_cm_recorded_at) so TaskMetricRepository.query and
CollaborationMetricRepository.query (which filter by agent_id plus time range)
can use the composite indexes; ensure the column order is agent_id then the
timestamp (completed_at / recorded_at).
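The two composite indexes can be exercised in isolation like this; the table schemas are trimmed to the columns the indexes touch, so treat this as a sketch rather than the real migration:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE task_metrics (
        id INTEGER PRIMARY KEY, agent_id TEXT, completed_at TEXT
    );
    CREATE TABLE collaboration_metrics (
        id INTEGER PRIMARY KEY, agent_id TEXT, recorded_at TEXT
    );
    -- agent_id first (equality), timestamp second (range), matching the
    -- agent_id = ? AND completed_at >= ? predicates in the query methods
    CREATE INDEX IF NOT EXISTS idx_tm_agent_id_completed_at
        ON task_metrics (agent_id, completed_at);
    CREATE INDEX IF NOT EXISTS idx_cm_agent_id_recorded_at
        ON collaboration_metrics (agent_id, recorded_at);
    """
)
indexes = [
    r[0] for r in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'index'"
    )
]
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM task_metrics"
    " WHERE agent_id = ? AND completed_at >= ?",
    ("agent-001", "2026-01-01"),
).fetchall()
```

`EXPLAIN QUERY PLAN` should report a `SEARCH ... USING INDEX idx_tm_agent_id_completed_at`, confirming the planner can serve the combined predicate from one index.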
In `@tests/unit/api/conftest.py`:
- Around line 116-138: The list_events implementation in
FakeLifecycleEventRepository ignores the since parameter; update list_events
(and the in-memory filtering logic in the FakeLifecycleEventRepository class) to
filter events by their timestamp/creation time: when since is provided, only
include events whose event.timestamp (or created_at if your event model uses
that attribute) is >= since; handle both datetime and numeric timestamps
consistently (convert types as needed) and ensure the result variable is updated
before returning so time-based tests behave correctly.
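A sketch of the `since` filter, assuming the event model exposes a datetime `timestamp` attribute (the stand-in `_Event` below is not the real model):

```python
from dataclasses import dataclass
from datetime import datetime, timezone

@dataclass(frozen=True)
class _Event:  # stand-in for the lifecycle event model
    agent_id: str
    timestamp: datetime

class FakeLifecycleEventRepository:
    def __init__(self) -> None:
        self._events: list[_Event] = []

    def add(self, event: _Event) -> None:
        self._events.append(event)

    def list_events(self, agent_id: str, since=None) -> tuple:
        result = [e for e in self._events if e.agent_id == agent_id]
        if since is not None:
            # keep only events at or after the cutoff
            result = [e for e in result if e.timestamp >= since]
        return tuple(result)

repo = FakeLifecycleEventRepository()
t0 = datetime(2026, 1, 1, tzinfo=timezone.utc)
t1 = datetime(2026, 1, 2, tzinfo=timezone.utc)
repo.add(_Event("agent-001", t0))
repo.add(_Event("agent-001", t1))
events = repo.list_events("agent-001", since=t1)
```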
- Around line 162-180: The query method of FakeCollaborationMetricRepository
currently ignores the since parameter; update query (in class
FakeCollaborationMetricRepository, method query) to filter self._records by the
since timestamp when provided (e.g., keep only records where record.created_at
or record.timestamp >= since), in addition to the existing agent_id filter, and
return the filtered tuple; ensure you handle both datetime and string/ISO inputs
consistently (convert/compare as appropriate) and don't modify other method
signatures.
- Around line 140-159: The query method of FakeTaskMetricRepository currently
ignores the since and until parameters; update the
FakeTaskMetricRepository.query implementation to apply time-range filtering
after the agent_id filter by comparing each record's timestamp field (e.g.,
record.timestamp or record.created_at — use the actual timestamp attribute used
by your task metric objects) against the since and until values, only including
records where (since is None or record.timestamp >= since) and (until is None or
record.timestamp <= until); keep the existing agent_id filter and return a tuple
of the filtered list so tests relying on time-based filtering behave correctly.
In `@tests/unit/core/test_enums.py`:
- Around line 42-43: Update the test_agent_status_values test to include an
assertion for the new AgentStatus.ONBOARDING member: locate the
test_agent_status_values function and add one more assertion that checks
AgentStatus.ONBOARDING exists and equals the expected enum value using the same
assertion style as the other values in that test (i.e., mirror how
AgentStatus.ACTIVE/INACTIVE/OTHER are asserted to ensure consistency).
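The added assertion would follow the existing style; the enum below is a simplified stand-in for `core.enums.AgentStatus` (member values are assumptions):

```python
from enum import Enum

class AgentStatus(Enum):  # simplified stand-in for core.enums.AgentStatus
    ACTIVE = "active"
    INACTIVE = "inactive"
    ONBOARDING = "onboarding"

def test_agent_status_values() -> None:
    assert AgentStatus.ACTIVE.value == "active"
    assert AgentStatus.INACTIVE.value == "inactive"
    # new assertion mirroring the existing style
    assert AgentStatus.ONBOARDING.value == "onboarding"

test_agent_status_values()
```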
In `@tests/unit/hr/performance/test_behavioral_collaboration_strategy.py`:
- Around line 81-106: Update the docstring on test_all_components_none to match
the assertions: replace the incorrect "All optional components None -> neutral
5.0, confidence 0.0." with a description that reflects the actual scenario and
expected values, e.g., "Only loop_prevention present -> score 10.0, confidence
0.1." in the test_all_components_none test function (referencing the use of
make_collab_metric, NOW, and the call to strategy.score with
NotBlankStr("agent-001")).
In `@tests/unit/hr/performance/test_ci_quality_strategy.py`:
- Around line 129-157: Merge the two separate tests
test_cost_efficiency_zero_cost and test_cost_efficiency_high_cost into the
existing parametrized test_cost_efficiency_parametrized: add parameter rows for
(cost_usd=0.0, expected=10.0) and (cost_usd=15.0, expected=0.0), use the same
setup via self._make_strategy() and make_task_metric, call
strategy.score(agent_id=NotBlankStr("agent-001"),
task_id=NotBlankStr("task-001"), task_result=task_result,
acceptance_criteria=()), and assert dict(result.breakdown)["cost_efficiency"] ==
expected; then remove the now-duplicate test_cost_efficiency_zero_cost and
test_cost_efficiency_high_cost (and the other duplicate pair noted) so only the
parametrized test covers these cases.
In `@tests/unit/hr/performance/test_tracker.py`:
- Around line 253-285: The test test_snapshot_with_windows_and_trends asserts
trends are present despite only recording one TaskMetricRecord and not supplying
the time-series that TrendStrategy.detect uses; either (A) make the scenario
satisfy the trend-detection threshold by adding enough TaskMetricRecord entries
(call tracker.record_task_metric multiple times at different timestamps so
WindowMetrics.data_point_count and TrendStrategy.detect receive sufficient
samples) before calling tracker.get_snapshot, or (B) change the assertion to
expect no trends (assert snapshot.trends == ()) to verify the threshold guard;
update references in the test around
MockWindowStrategy/WindowMetrics.data_point_count, tracker.record_task_metric,
and tracker.get_snapshot accordingly.
In `@tests/unit/hr/test_offboarding_service.py`:
- Around line 38-52: The test double list_tasks in
tests/unit/hr/test_offboarding_service.py declares assigned_to: str | None but
OffboardingService calls it with NotBlankStr(agent_id); change the signature to
assigned_to: NotBlankStr | None to match the expected type, add the appropriate
import for NotBlankStr from the module where it’s defined, and update any
related type hints in that file so the fake ListTasks function signature aligns
with the OffboardingService usage (function name: list_tasks).
In `@tests/unit/persistence/test_protocol.py`:
- Around line 76-117: Add protocol-compliance tests for the new fake repos so
they assert isinstance against the repository protocols: add three tests in the
TestProtocolCompliance suite that check _FakeLifecycleEventRepository() is a
LifecycleEventRepository, _FakeTaskMetricRepository() is a TaskMetricRepository,
and _FakeCollaborationMetricRepository() is a CollaborationMetricRepository;
import the protocols (LifecycleEventRepository, TaskMetricRepository,
CollaborationMetricRepository) and place the assertions alongside the existing
protocol tests (e.g., the block around TestProtocolCompliance where other
isinstance checks live).
---
Outside diff comments:
In `@tests/unit/communication/test_enums.py`:
- Around line 19-31: The test suite added a new enum member but test_values
doesn't assert it; update test_values in tests/unit/communication/test_enums.py
to include an assertion that MessageType.HR_NOTIFICATION.value ==
"hr_notification" (alongside the other MessageType assertions) so the new member
is verified and the member count matches test_member_count.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: ASSERTIVE
Plan: Pro
Run ID: e95b47f7-a83d-4db9-853b-3da1d0d5e8a2
📒 Files selected for processing (65)
CLAUDE.md, DESIGN_SPEC.md, README.md, src/ai_company/communication/enums.py, src/ai_company/core/enums.py, src/ai_company/hr/__init__.py, src/ai_company/hr/archival_protocol.py, src/ai_company/hr/enums.py, src/ai_company/hr/errors.py, src/ai_company/hr/full_snapshot_strategy.py, src/ai_company/hr/hiring_service.py, src/ai_company/hr/models.py, src/ai_company/hr/offboarding_service.py, src/ai_company/hr/onboarding_service.py, src/ai_company/hr/performance/__init__.py, src/ai_company/hr/performance/behavioral_collaboration_strategy.py, src/ai_company/hr/performance/ci_quality_strategy.py, src/ai_company/hr/performance/collaboration_protocol.py, src/ai_company/hr/performance/config.py, src/ai_company/hr/performance/models.py, src/ai_company/hr/performance/multi_window_strategy.py, src/ai_company/hr/performance/quality_protocol.py, src/ai_company/hr/performance/theil_sen_strategy.py, src/ai_company/hr/performance/tracker.py, src/ai_company/hr/performance/trend_protocol.py, src/ai_company/hr/performance/window_protocol.py, src/ai_company/hr/persistence_protocol.py, src/ai_company/hr/queue_return_strategy.py, src/ai_company/hr/reassignment_protocol.py, src/ai_company/hr/registry.py, src/ai_company/observability/events/hr.py, src/ai_company/observability/events/performance.py, src/ai_company/observability/events/persistence.py, src/ai_company/persistence/protocol.py, src/ai_company/persistence/repositories.py, src/ai_company/persistence/sqlite/backend.py, src/ai_company/persistence/sqlite/hr_repositories.py, src/ai_company/persistence/sqlite/migrations.py, src/ai_company/persistence/sqlite/repositories.py, tests/unit/api/conftest.py, tests/unit/communication/test_enums.py, tests/unit/core/test_enums.py, tests/unit/hr/__init__.py, tests/unit/hr/conftest.py, tests/unit/hr/performance/__init__.py, tests/unit/hr/performance/conftest.py, tests/unit/hr/performance/test_behavioral_collaboration_strategy.py, tests/unit/hr/performance/test_ci_quality_strategy.py, tests/unit/hr/performance/test_models.py, tests/unit/hr/performance/test_multi_window_strategy.py, tests/unit/hr/performance/test_theil_sen_strategy.py, tests/unit/hr/performance/test_tracker.py, tests/unit/hr/test_enums.py, tests/unit/hr/test_errors.py, tests/unit/hr/test_full_snapshot_strategy.py, tests/unit/hr/test_hiring_service.py, tests/unit/hr/test_models.py, tests/unit/hr/test_offboarding_service.py, tests/unit/hr/test_onboarding_service.py, tests/unit/hr/test_persistence.py, tests/unit/hr/test_queue_return_strategy.py, tests/unit/hr/test_registry.py, tests/unit/observability/test_events.py, tests/unit/persistence/test_migrations_v2.py, tests/unit/persistence/test_protocol.py
📜 Review details
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
- GitHub Check: CI Pass
- GitHub Check: Agent
- GitHub Check: Greptile Review
🧰 Additional context used
📓 Path-based instructions (4)
**/*.py
📄 CodeRabbit inference engine (CLAUDE.md)
**/*.py: No `from __future__ import annotations` — Python 3.14 has PEP 649 native lazy annotations
Use PEP 758 except syntax: use `except A, B:` (no parentheses) — ruff enforces this on Python 3.14
All public functions must have type hints; mypy strict mode is enforced
Public classes and functions must have Google-style docstrings (enforced by ruff D rules)
Every module with business logic must include logging setup: `from ai_company.observability import get_logger` then `logger = get_logger(__name__)`. Never use `import logging` or `print()` in application code. Variable name must always be `logger`
Use structured logging with event name constants: `logger.info(EVENT_CONSTANT, key=value)` — never use string interpolation like `logger.info('msg %s', val)`. Import event constants directly from `ai_company.observability.events.<domain>`
All error paths must log at WARNING or ERROR with context before raising; all state transitions must log at INFO; DEBUG for object creation, internal flow, and entry/exit of key functions
Line length maximum is 88 characters (enforced by ruff)
Functions must be < 50 lines; files must be < 800 lines
Handle errors explicitly; never silently swallow exceptions
Validate at system boundaries: user input, external APIs, and config files. Do not validate unnecessarily at internal boundaries
Files:
src/ai_company/hr/performance/window_protocol.py, src/ai_company/persistence/protocol.py, tests/unit/hr/test_hiring_service.py, src/ai_company/observability/events/performance.py, tests/unit/hr/performance/test_tracker.py, src/ai_company/core/enums.py, src/ai_company/hr/performance/__init__.py, tests/unit/hr/performance/conftest.py, tests/unit/hr/test_errors.py, tests/unit/hr/performance/test_behavioral_collaboration_strategy.py, tests/unit/hr/performance/test_ci_quality_strategy.py, tests/unit/hr/performance/test_models.py, src/ai_company/persistence/sqlite/migrations.py, src/ai_company/hr/performance/ci_quality_strategy.py, tests/unit/hr/test_offboarding_service.py, src/ai_company/hr/queue_return_strategy.py, src/ai_company/hr/hiring_service.py, src/ai_company/hr/performance/trend_protocol.py, src/ai_company/communication/enums.py, tests/unit/hr/test_enums.py, src/ai_company/hr/performance/config.py, src/ai_company/hr/persistence_protocol.py, src/ai_company/hr/enums.py, src/ai_company/hr/performance/behavioral_collaboration_strategy.py, src/ai_company/observability/events/persistence.py, src/ai_company/hr/performance/collaboration_protocol.py, tests/unit/hr/performance/test_theil_sen_strategy.py, src/ai_company/persistence/repositories.py, tests/unit/hr/test_onboarding_service.py, src/ai_company/hr/performance/theil_sen_strategy.py, src/ai_company/hr/offboarding_service.py, src/ai_company/persistence/sqlite/backend.py, src/ai_company/hr/models.py, src/ai_company/hr/full_snapshot_strategy.py, src/ai_company/hr/registry.py, tests/unit/hr/test_models.py, src/ai_company/persistence/sqlite/repositories.py, src/ai_company/hr/errors.py, tests/unit/hr/test_queue_return_strategy.py, tests/unit/communication/test_enums.py, tests/unit/core/test_enums.py, tests/unit/hr/performance/test_multi_window_strategy.py, src/ai_company/hr/reassignment_protocol.py, src/ai_company/hr/onboarding_service.py, src/ai_company/hr/archival_protocol.py, src/ai_company/persistence/sqlite/hr_repositories.py, tests/unit/hr/test_registry.py, tests/unit/persistence/test_migrations_v2.py, src/ai_company/hr/__init__.py, tests/unit/hr/conftest.py, tests/unit/hr/test_full_snapshot_strategy.py, src/ai_company/observability/events/hr.py, tests/unit/api/conftest.py, src/ai_company/hr/performance/quality_protocol.py, src/ai_company/hr/performance/models.py, tests/unit/observability/test_events.py, tests/unit/hr/test_persistence.py, tests/unit/persistence/test_protocol.py, src/ai_company/hr/performance/multi_window_strategy.py, src/ai_company/hr/performance/tracker.py
src/ai_company/**/*.py
📄 CodeRabbit inference engine (CLAUDE.md)
src/ai_company/**/*.py: Use Pydantic v2 with frozen models for config/identity; use separate mutable-via-copy models (with `model_copy(update=...)`) for runtime state that evolves. Never mix static config fields with mutable runtime fields in one model
Use `@computed_field` for derived values instead of storing + validating redundant fields (e.g. `TokenUsage.total_tokens`); use `NotBlankStr` from `core.types` for all identifier/name fields—including optional (`NotBlankStr | None`) and tuple variants—instead of manual whitespace validators
Create new objects for immutability; never mutate existing ones. For non-Pydantic internal collections (registries, `BaseTool`), use `copy.deepcopy()` at construction + `MappingProxyType` for read-only enforcement. For `dict`/`list` fields in frozen Pydantic models, rely on `frozen=True` and `copy.deepcopy()` at system boundaries (tool execution, LLM provider serialization, inter-agent delegation, persistence serialization)
Prefer `asyncio.TaskGroup` for fan-out/fan-in parallel operations in new code (multiple tool invocations, parallel agent calls). Prefer structured concurrency over bare `create_task`
Files:
src/ai_company/hr/performance/window_protocol.py, src/ai_company/persistence/protocol.py, src/ai_company/observability/events/performance.py, src/ai_company/core/enums.py, src/ai_company/hr/performance/__init__.py, src/ai_company/persistence/sqlite/migrations.py, src/ai_company/hr/performance/ci_quality_strategy.py, src/ai_company/hr/queue_return_strategy.py, src/ai_company/hr/hiring_service.py, src/ai_company/hr/performance/trend_protocol.py, src/ai_company/communication/enums.py, src/ai_company/hr/performance/config.py, src/ai_company/hr/persistence_protocol.py, src/ai_company/hr/enums.py, src/ai_company/hr/performance/behavioral_collaboration_strategy.py, src/ai_company/observability/events/persistence.py, src/ai_company/hr/performance/collaboration_protocol.py, src/ai_company/persistence/repositories.py, src/ai_company/hr/performance/theil_sen_strategy.py, src/ai_company/hr/offboarding_service.py, src/ai_company/persistence/sqlite/backend.py, src/ai_company/hr/models.py, src/ai_company/hr/full_snapshot_strategy.py, src/ai_company/hr/registry.py, src/ai_company/persistence/sqlite/repositories.py, src/ai_company/hr/errors.py, src/ai_company/hr/reassignment_protocol.py, src/ai_company/hr/onboarding_service.py, src/ai_company/hr/archival_protocol.py, src/ai_company/persistence/sqlite/hr_repositories.py, src/ai_company/hr/__init__.py, src/ai_company/observability/events/hr.py, src/ai_company/hr/performance/quality_protocol.py, src/ai_company/hr/performance/models.py, src/ai_company/hr/performance/multi_window_strategy.py, src/ai_company/hr/performance/tracker.py
**/*.md
📄 CodeRabbit inference engine (CLAUDE.md)
DESIGN_SPEC.md is MANDATORY reading before implementing any feature. The spec is the starting point for architecture, data models, and behavior. If implementation deviates from the spec, alert the user and explain why — user decides whether to proceed or update the spec. Do not silently diverge. When spec sections are referenced (e.g. 'Section 10.2'), read that section verbatim before coding. When approved deviations occur, update DESIGN_SPEC.md to reflect the new reality
Files:
CLAUDE.md, README.md, DESIGN_SPEC.md
tests/**/*.py
📄 CodeRabbit inference engine (CLAUDE.md)
tests/**/*.py: Use pytest markers: `@pytest.mark.unit`, `@pytest.mark.integration`, `@pytest.mark.e2e`, `@pytest.mark.slow`. Minimum coverage 80% (enforced in CI). Async mode: `asyncio_mode = 'auto'` — no manual `@pytest.mark.asyncio` needed. Per-test timeout: 30 seconds. Use `pytest-xdist` via `-n auto` for parallelism. Prefer `@pytest.mark.parametrize` for testing similar cases
NEVER use real vendor names (Anthropic, OpenAI, Claude, GPT, etc.) in project-owned code, docstrings, comments, tests, or config examples. Use generic names: `example-provider`, `example-large-001`, `example-medium-001`, `example-small-001`, `large`/`medium`/`small` as aliases. Tests must use `test-provider`, `test-small-001`, etc. Vendor names may only appear in: (1) DESIGN_SPEC.md provider list, (2) `.claude/` files, (3) third-party import paths
Files:
tests/unit/hr/test_hiring_service.py, tests/unit/hr/performance/test_tracker.py, tests/unit/hr/performance/conftest.py, tests/unit/hr/test_errors.py, tests/unit/hr/performance/test_behavioral_collaboration_strategy.py, tests/unit/hr/performance/test_ci_quality_strategy.py, tests/unit/hr/performance/test_models.py, tests/unit/hr/test_offboarding_service.py, tests/unit/hr/test_enums.py, tests/unit/hr/performance/test_theil_sen_strategy.py, tests/unit/hr/test_onboarding_service.py, tests/unit/hr/test_models.py, tests/unit/hr/test_queue_return_strategy.py, tests/unit/communication/test_enums.py, tests/unit/core/test_enums.py, tests/unit/hr/performance/test_multi_window_strategy.py, tests/unit/hr/test_registry.py, tests/unit/persistence/test_migrations_v2.py, tests/unit/hr/conftest.py, tests/unit/hr/test_full_snapshot_strategy.py, tests/unit/api/conftest.py, tests/unit/observability/test_events.py, tests/unit/hr/test_persistence.py, tests/unit/persistence/test_protocol.py
🧠 Learnings (8)
All eight learnings below share the same source: Learnt from CR, Repo: Aureliolo/ai-company PR: 0, File: CLAUDE.md:0-0, Timestamp: 2026-03-09T20:32:32.549Z.
- 📚 Applies to **/*.py: Every module with business logic must include logging setup: `from ai_company.observability import get_logger` then `logger = get_logger(__name__)`. Never use `import logging` or `print()` in application code. Variable name must always be `logger`. (Applied to: CLAUDE.md)
- 📚 Applies to **/*.py: Use structured logging with event name constants: `logger.info(EVENT_CONSTANT, key=value)` — never use string interpolation like `logger.info('msg %s', val)`. Import event constants directly from `ai_company.observability.events.<domain>`. (Applied to: CLAUDE.md, src/ai_company/observability/events/performance.py, src/ai_company/observability/events/hr.py)
- 📚 Applies to **/*.py: All error paths must log at WARNING or ERROR with context before raising; all state transitions must log at INFO; DEBUG for object creation, internal flow, and entry/exit of key functions. (Applied to: CLAUDE.md)
- 📚 Applies to tests/**/*.py: Use pytest markers: `pytest.mark.unit`, `pytest.mark.integration`, `pytest.mark.e2e`, `pytest.mark.slow`. Minimum coverage 80% (enforced in CI). Async mode: `asyncio_mode = 'auto'` — no manual `pytest.mark.asyncio` needed. Per-test timeout: 30 seconds. Use `pytest-xdist` via `-n auto` for parallelism. Prefer `pytest.mark.parametrize` for testing similar cases. (Applied to: tests/unit/hr/performance/test_ci_quality_strategy.py)
- 📚 Applies to src/ai_company/**/*.py: Use Pydantic v2 with frozen models for config/identity; use separate mutable-via-copy models (with `model_copy(update=...)`) for runtime state that evolves. Never mix static config fields with mutable runtime fields in one model. (Applied to: src/ai_company/hr/performance/config.py, src/ai_company/hr/models.py, src/ai_company/hr/performance/models.py)
- 📚 Applies to src/ai_company/**/*.py: Use `computed_field` for derived values instead of storing + validating redundant fields (e.g. `TokenUsage.total_tokens`); use `NotBlankStr` from `core.types` for all identifier/name fields—including optional (`NotBlankStr | None`) and tuple variants—instead of manual whitespace validators. (Applied to: src/ai_company/hr/models.py, src/ai_company/hr/performance/models.py)
- 📚 Applies to src/ai_company/**/*.py: Prefer `asyncio.TaskGroup` for fan-out/fan-in parallel operations in new code (multiple tool invocations, parallel agent calls). Prefer structured concurrency over bare `create_task`. (Applied to: src/ai_company/hr/reassignment_protocol.py)
- 📚 Applies to src/ai_company/**/*.py: Create new objects for immutability; never mutate existing ones. For non-Pydantic internal collections (registries, `BaseTool`), use `copy.deepcopy()` at construction + `MappingProxyType` for read-only enforcement. For `dict`/`list` fields in frozen Pydantic models, rely on `frozen=True` and `copy.deepcopy()` at system boundaries (tool execution, LLM provider serialization, inter-agent delegation, persistence serialization). (Applied to: src/ai_company/hr/performance/models.py)
🧬 Code graph analysis (43)
src/ai_company/hr/performance/window_protocol.py (5)
- src/ai_company/hr/performance/models.py (2): TaskMetricRecord (22-64), WindowMetrics (195-249)
- src/ai_company/hr/performance/theil_sen_strategy.py (1): name (69-71)
- src/ai_company/hr/archival_protocol.py (1): name (47-49)
- src/ai_company/hr/performance/collaboration_protocol.py (1): name (25-27)
- src/ai_company/hr/performance/multi_window_strategy.py (2): name (66-68), compute_windows (75-100)
src/ai_company/persistence/protocol.py (4)
- src/ai_company/hr/persistence_protocol.py (3): CollaborationMetricRepository (98-130), LifecycleEventRepository (22-56), TaskMetricRepository (60-94)
- src/ai_company/persistence/sqlite/backend.py (3): lifecycle_events (241-247), task_metrics (250-256), collaboration_metrics (259-267)
- tests/unit/api/conftest.py (3): lifecycle_events (228-229), task_metrics (232-233), collaboration_metrics (236-237)
- tests/unit/persistence/test_protocol.py (3): lifecycle_events (154-155), task_metrics (158-159), collaboration_metrics (162-163)
tests/unit/hr/performance/test_tracker.py (6)
- src/ai_company/hr/performance/models.py (6): CollaborationMetricRecord (67-122), CollaborationScoreResult (150-172), QualityScoreResult (125-147), TaskMetricRecord (22-64), TrendResult (175-192), WindowMetrics (195-249)
- src/ai_company/hr/performance/config.py (1): PerformanceConfig (10-59)
- src/ai_company/hr/performance/collaboration_protocol.py (2): name (25-27), score (29-46)
- src/ai_company/hr/performance/quality_protocol.py (2): name (26-28), score (30-49)
- src/ai_company/hr/performance/trend_protocol.py (2): name (25-27), detect (29-46)
- src/ai_company/hr/performance/window_protocol.py (3): name (27-29), min_data_points (32-34), compute_windows (36-51)
tests/unit/hr/performance/conftest.py (3)
- src/ai_company/core/enums.py (2): Complexity (247-253), TaskType (227-235)
- src/ai_company/core/task.py (1): AcceptanceCriterion (26-42)
- src/ai_company/hr/performance/models.py (2): CollaborationMetricRecord (67-122), TaskMetricRecord (22-64)
tests/unit/hr/test_errors.py (1)
- src/ai_company/hr/errors.py (15): AgentAlreadyRegisteredError (64-65), AgentNotFoundError (60-61), AgentRegistryError (56-57), FiringError (30-31), HiringApprovalRequiredError (15-16), HiringError (11-12), HiringRejectedError (19-20), HRError (4-5), InsufficientDataError (75-76), InvalidCandidateError (23-24), MemoryArchivalError (42-43), OffboardingError (34-35), OnboardingError (49-50), PerformanceError (71-72), TaskReassignmentError (38-39)
tests/unit/hr/performance/test_behavioral_collaboration_strategy.py (2)
- src/ai_company/hr/performance/behavioral_collaboration_strategy.py (2): name (58-60), score (62-140)
- tests/unit/hr/performance/conftest.py (1): make_collab_metric (44-65)
tests/unit/hr/performance/test_ci_quality_strategy.py (1)
- src/ai_company/hr/performance/ci_quality_strategy.py (3): CISignalQualityStrategy (28-114), name (41-43), score (45-114)
tests/unit/hr/performance/test_models.py (3)
- src/ai_company/core/enums.py (2): Complexity (247-253), TaskType (227-235)
- src/ai_company/hr/enums.py (1): TrendDirection (42-48)
- src/ai_company/hr/performance/models.py (6): AgentPerformanceSnapshot (252-287), CollaborationMetricRecord (67-122), CollaborationScoreResult (150-172), QualityScoreResult (125-147), TrendResult (175-192), WindowMetrics (195-249)
src/ai_company/persistence/sqlite/migrations.py (1)
- tests/unit/hr/test_persistence.py (1): db (29-35)
src/ai_company/hr/performance/ci_quality_strategy.py (2)
- src/ai_company/hr/performance/models.py (2): QualityScoreResult (125-147), TaskMetricRecord (22-64)
- src/ai_company/core/task.py (1): AcceptanceCriterion (26-42)
tests/unit/hr/test_offboarding_service.py (2)
- src/ai_company/hr/errors.py (1): AgentNotFoundError (60-61)
- src/ai_company/hr/offboarding_service.py (2): OffboardingService (44-228), offboard (85-228)
src/ai_company/hr/queue_return_strategy.py (3)
- src/ai_company/core/enums.py (1): TaskStatus (198-224)
- src/ai_company/hr/errors.py (1): TaskReassignmentError (38-39)
- src/ai_company/core/task.py (1): Task (45-261)
src/ai_company/hr/hiring_service.py (8)
- src/ai_company/core/agent.py (2): AgentIdentity (265-323), ModelConfig (149-178)
- src/ai_company/core/approval.py (1): ApprovalItem (24-96)
- src/ai_company/core/enums.py (4): ActionType (372-384), AgentStatus (67-73), ApprovalRiskLevel (419-425), SeniorityLevel (6-21)
- src/ai_company/hr/enums.py (1): HiringRequestStatus (6-12)
- src/ai_company/hr/models.py (2): CandidateCard (29-66), HiringRequest (69-141)
- src/ai_company/hr/registry.py (2): AgentRegistryService (29-193), register (48-74)
- src/ai_company/api/approval_store.py (1): ApprovalStore (30-156)
- src/ai_company/hr/onboarding_service.py (2): OnboardingService (27-166), start_onboarding (46-78)
src/ai_company/hr/performance/trend_protocol.py (3)
- src/ai_company/hr/performance/models.py (1): TrendResult (175-192)
- src/ai_company/hr/performance/theil_sen_strategy.py (2): name (69-71), detect (73-143)
- tests/unit/hr/performance/test_tracker.py (5): name (35-36), name (62-63), name (93-94), name (120-121), detect (123-136)
tests/unit/hr/test_enums.py (1)
- src/ai_company/hr/enums.py (5): FiringReason (15-21), HiringRequestStatus (6-12), LifecycleEventType (32-39), OnboardingStep (24-29), TrendDirection (42-48)
src/ai_company/hr/performance/config.py (4)
- src/ai_company/hr/performance/multi_window_strategy.py (1): min_data_points (71-73)
- src/ai_company/hr/performance/window_protocol.py (1): min_data_points (32-34)
- tests/unit/hr/performance/test_tracker.py (1): min_data_points (97-98)
- src/ai_company/tools/base.py (1): description (115-117)
src/ai_company/hr/persistence_protocol.py (3)
- src/ai_company/hr/enums.py (1): LifecycleEventType (32-39)
- src/ai_company/hr/models.py (1): AgentLifecycleEvent (302-331)
- src/ai_company/hr/performance/models.py (2): CollaborationMetricRecord (67-122), TaskMetricRecord (22-64)
src/ai_company/hr/performance/behavioral_collaboration_strategy.py (1)
- src/ai_company/hr/performance/models.py (2): CollaborationMetricRecord (67-122), CollaborationScoreResult (150-172)
src/ai_company/hr/performance/collaboration_protocol.py (2)
- src/ai_company/hr/performance/models.py (2): CollaborationMetricRecord (67-122), CollaborationScoreResult (150-172)
- src/ai_company/hr/performance/behavioral_collaboration_strategy.py (2): name (58-60), score (62-140)
tests/unit/hr/performance/test_theil_sen_strategy.py (1)
- src/ai_company/hr/performance/theil_sen_strategy.py (3): TheilSenTrendStrategy (43-143), name (69-71), detect (73-143)
src/ai_company/persistence/repositories.py (1)
- src/ai_company/hr/persistence_protocol.py (3): CollaborationMetricRepository (98-130), LifecycleEventRepository (22-56), TaskMetricRepository (60-94)
tests/unit/hr/test_onboarding_service.py (5)
- src/ai_company/core/enums.py (1): AgentStatus (67-73)
- src/ai_company/hr/enums.py (1): OnboardingStep (24-29)
- src/ai_company/hr/errors.py (1): OnboardingError (49-50)
- tests/unit/hr/conftest.py (2): registry (174-176), make_agent_identity (37-55)
- src/ai_company/hr/registry.py (3): AgentRegistryService (29-193), register (48-74), get (102-111)
src/ai_company/hr/performance/theil_sen_strategy.py (3)
- src/ai_company/hr/enums.py (1): TrendDirection (42-48)
- src/ai_company/hr/performance/models.py (1): TrendResult (175-192)
- src/ai_company/observability/_logger.py (1): get_logger (8-28)
src/ai_company/persistence/sqlite/backend.py (2)
- src/ai_company/persistence/sqlite/hr_repositories.py (3): SQLiteCollaborationMetricRepository (247-343), SQLiteLifecycleEventRepository (45-145), SQLiteTaskMetricRepository (148-244)
- src/ai_company/persistence/protocol.py (3): lifecycle_events (99-101), task_metrics (104-106), collaboration_metrics (109-111)
src/ai_company/hr/models.py (2)
- src/ai_company/core/enums.py (1): SeniorityLevel (6-21)
- src/ai_company/hr/enums.py (4): FiringReason (15-21), HiringRequestStatus (6-12), LifecycleEventType (32-39), OnboardingStep (24-29)
src/ai_company/hr/full_snapshot_strategy.py (7)
- src/ai_company/core/enums.py (2): MemoryCategory (101-108), OrgFactCategory (120-130)
- src/ai_company/hr/archival_protocol.py (3): ArchivalResult (17-34), name (47-49), archive (51-70)
- src/ai_company/memory/consolidation/models.py (1): ArchivalEntry (56-91)
- src/ai_company/memory/models.py (1): MemoryQuery (153-230)
- src/ai_company/memory/org/models.py (2): OrgFactAuthor (19-88), OrgFactWriteRequest (113-124)
- src/ai_company/memory/org/protocol.py (1): OrgMemoryBackend (20-110)
- src/ai_company/memory/protocol.py (1): MemoryBackend (20-182)
src/ai_company/hr/registry.py (4)
- src/ai_company/core/enums.py (1): AgentStatus (67-73)
- src/ai_company/hr/errors.py (2): AgentAlreadyRegisteredError (64-65), AgentNotFoundError (60-61)
- src/ai_company/communication/bus_protocol.py (1): MessageBus (20-209)
- src/ai_company/core/agent.py (1): AgentIdentity (265-323)
tests/unit/hr/test_models.py (4)
- src/ai_company/core/enums.py (1): SeniorityLevel (6-21)
- src/ai_company/hr/enums.py (4): FiringReason (15-21), HiringRequestStatus (6-12), LifecycleEventType (32-39), OnboardingStep (24-29)
- src/ai_company/hr/models.py (5): AgentLifecycleEvent (302-331), OffboardingRecord (250-299), OnboardingChecklist (219-247), OnboardingStepRecord (187-216), is_complete (245-247)
- tests/unit/hr/conftest.py (3): make_candidate_card (58-83), make_firing_request (121-137), make_hiring_request (86-118)
tests/unit/hr/test_queue_return_strategy.py (1)
- src/ai_company/hr/queue_return_strategy.py (2): reassign (42-88), name (38-40)
tests/unit/communication/test_enums.py (1)
- src/ai_company/communication/enums.py (1): MessageType (6-21)
tests/unit/core/test_enums.py (1)
- src/ai_company/core/enums.py (1): AgentStatus (67-73)
tests/unit/hr/performance/test_multi_window_strategy.py (1)
- src/ai_company/hr/performance/multi_window_strategy.py (4): MultiWindowStrategy (44-160), min_data_points (71-73), name (66-68), compute_windows (75-100)
src/ai_company/hr/reassignment_protocol.py (2)
- src/ai_company/core/task.py (1): Task (45-261)
- src/ai_company/hr/queue_return_strategy.py (2): name (38-40), reassign (42-88)
src/ai_company/hr/onboarding_service.py (5)
src/ai_company/core/enums.py (1)
AgentStatus(67-73)src/ai_company/hr/enums.py (1)
OnboardingStep(24-29)src/ai_company/hr/errors.py (1)
OnboardingError(49-50)src/ai_company/hr/models.py (3)
OnboardingChecklist(219-247)OnboardingStepRecord(187-216)is_complete(245-247)src/ai_company/hr/registry.py (2)
AgentRegistryService(29-193)update_status(153-188)
src/ai_company/hr/archival_protocol.py (3)
src/ai_company/memory/consolidation/archival.py (1)
ArchivalStore(15-75)src/ai_company/memory/org/protocol.py (1)
OrgMemoryBackend(20-110)src/ai_company/memory/protocol.py (1)
MemoryBackend(20-182)
tests/unit/hr/test_registry.py (3)
src/ai_company/core/enums.py (1)
AgentStatus(67-73)src/ai_company/hr/errors.py (2)
AgentAlreadyRegisteredError(64-65)AgentNotFoundError(60-61)tests/unit/hr/conftest.py (2)
registry(174-176)make_agent_identity(37-55)
tests/unit/persistence/test_migrations_v2.py (1)
src/ai_company/persistence/sqlite/migrations.py (4)
_apply_v1(175-178)get_user_version(146-150)run_migrations(195-263)set_user_version(153-172)
tests/unit/hr/conftest.py (5)
src/ai_company/core/agent.py (2)
AgentIdentity(265-323)ModelConfig(149-178)src/ai_company/hr/enums.py (2)
FiringReason(15-21)HiringRequestStatus(6-12)src/ai_company/hr/models.py (3)
CandidateCard(29-66)FiringRequest(144-184)HiringRequest(69-141)src/ai_company/hr/onboarding_service.py (1)
OnboardingService(27-166)src/ai_company/hr/registry.py (1)
AgentRegistryService(29-193)
tests/unit/hr/test_full_snapshot_strategy.py (1)
src/ai_company/hr/full_snapshot_strategy.py (3)
FullSnapshotStrategy(47-186)archive(65-186)name(61-63)
tests/unit/api/conftest.py (2)
src/ai_company/persistence/sqlite/backend.py (3)
lifecycle_events(241-247)task_metrics(250-256)collaboration_metrics(259-267)src/ai_company/persistence/protocol.py (3)
lifecycle_events(99-101)task_metrics(104-106)collaboration_metrics(109-111)
src/ai_company/hr/performance/quality_protocol.py (3)
src/ai_company/core/task.py (1)
AcceptanceCriterion(26-42)src/ai_company/hr/performance/models.py (2)
QualityScoreResult(125-147)TaskMetricRecord(22-64)src/ai_company/hr/performance/ci_quality_strategy.py (2)
name(41-43)score(45-114)
src/ai_company/hr/performance/multi_window_strategy.py (1)
src/ai_company/hr/performance/models.py (2)
TaskMetricRecord(22-64)WindowMetrics(195-249)
src/ai_company/hr/performance/tracker.py (6)
src/ai_company/hr/performance/models.py (6)
AgentPerformanceSnapshot(252-287)CollaborationMetricRecord(67-122)CollaborationScoreResult(150-172)TaskMetricRecord(22-64)TrendResult(175-192)WindowMetrics(195-249)src/ai_company/hr/performance/collaboration_protocol.py (2)
CollaborationScoringStrategy(17-46)score(29-46)src/ai_company/hr/performance/quality_protocol.py (2)
QualityScoringStrategy(18-49)score(30-49)src/ai_company/hr/performance/trend_protocol.py (2)
TrendDetectionStrategy(17-46)detect(29-46)src/ai_company/hr/performance/window_protocol.py (3)
MetricsWindowStrategy(19-51)compute_windows(36-51)min_data_points(32-34)src/ai_company/hr/performance/multi_window_strategy.py (2)
compute_windows(75-100)min_data_points(71-73)
🪛 LanguageTool
CLAUDE.md
[style] ~87-~87: A comma is missing here.
Context: ...nder ai_company.observability.events (e.g. PROVIDER_CALL_START from `events.prov...
(EG_NO_COMMA)
README.md
[typographical] ~31-~31: To join two clauses or introduce examples, consider using an em dash.
Context: ...uth planned for M7) - HR Engine (M7) - Hiring pipeline (request → generate cand...
(DASH_RULE)
[typographical] ~32-~32: To join two clauses or introduce examples, consider using an em dash.
Context: ...registry - Performance Tracking (M7) - Task metrics, CI-based quality scoring, ...
(DASH_RULE)
[grammar] ~32-~32: Please add a punctuation mark at the end of paragraph.
Context: ... detection, multi-window rolling metric aggregation ### Not implemented yet (planned miles...
(PUNCTUATION_PARAGRAPH_END)
[typographical] ~36-~36: To join two clauses or introduce examples, consider using an em dash.
Context: ...ones) - Memory Backend Adapter (M5) - Memory protocols, retrieval pipeline, or...
(DASH_RULE)
[typographical] ~37-~37: To join two clauses or introduce examples, consider using an em dash.
Context: ..., Temporal KG) planned - CLI Surface - cli/ package is placeholder-only - **S...
(DASH_RULE)
[typographical] ~38-~38: To join two clauses or introduce examples, consider using an em dash.
Context: ...only - Security/Approval System (M7) - security/ package is placeholder-only;...
(DASH_RULE)
[typographical] ~39-~39: To join two clauses or introduce examples, consider using an em dash.
Context: ...Ops agent - Advanced Product Surface - web dashboard, external integrations ##...
(DASH_RULE)
DESIGN_SPEC.md
Outdated
│ │ └── hr_engine.py # Hiring, firing, performance (M7)
│ ├── hr/ # HR engine: hiring, firing, onboarding, offboarding, agent registry, performance tracking
│ │ ├── __init__.py # Package exports
│ │ ├── enums.py # HR enumerations (AgentStatus, OnboardingStep, etc.)
Fix the AgentStatus module reference in the structure map.
AgentStatus is defined in src/ai_company/core/enums.py in this PR, not in src/ai_company/hr/enums.py. Leaving this entry as-is will send readers to the wrong module.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@DESIGN_SPEC.md` at line 2827, The structure map entry incorrectly points to
hr.enums for AgentStatus; update the map so the AgentStatus reference points to
the module where it is actually defined (core.enums) instead of hr.enums, i.e.,
replace the AgentStatus module reference in the map from hr.enums to core.enums
so readers are directed to the correct definition.
async def archive(
    self,
    *,
    agent_id: NotBlankStr,
    memory_backend: MemoryBackend,
    archival_store: ArchivalStore,
    org_memory_backend: OrgMemoryBackend | None = None,
) -> ArchivalResult:
    """Archive all memories for a departing agent.

    Args:
        agent_id: Agent whose memories to archive.
        memory_backend: Hot memory store.
        archival_store: Cold archival storage.
        org_memory_backend: Optional org memory for promotion.

    Returns:
        Result of the archival operation.

    Raises:
        MemoryArchivalError: If retrieval from hot store fails.
    """
    try:
        entries = await memory_backend.retrieve(
            agent_id,
            MemoryQuery(limit=_MAX_MEMORIES_PER_ARCHIVAL),
        )
    except Exception as exc:
        msg = f"Failed to retrieve memories for agent {agent_id!r}"
        logger.error(  # noqa: TRY400
            HR_ARCHIVAL_ENTRY_FAILED,
            agent_id=agent_id,
            phase="retrieve",
            error=str(exc),
            error_type=type(exc).__name__,
        )
        raise MemoryArchivalError(msg) from exc

    now = datetime.now(UTC)
    archived_count = 0
    promoted_count = 0
    deleted_ids: list[str] = []

    for entry in entries:
        # Archive to cold storage.
        try:
            archival_entry = ArchivalEntry(
                original_id=entry.id,
                agent_id=agent_id,
                content=NotBlankStr(entry.content),
                category=entry.category,
                metadata=entry.metadata,
                created_at=entry.created_at,
                archived_at=now,
            )
            await archival_store.archive(archival_entry)
            archived_count += 1
            deleted_ids.append(str(entry.id))
        except (OSError, ValueError, ValidationError) as exc:
            logger.warning(
                HR_ARCHIVAL_ENTRY_FAILED,
                agent_id=agent_id,
                memory_id=str(entry.id),
                phase="archive",
                error=str(exc),
            )
            continue

        # Promote to org memory if eligible.
        if (
            org_memory_backend is not None
            and entry.category in _PROMOTABLE_CATEGORIES
        ):
            try:
                org_category = _CATEGORY_MAP[entry.category]
                author = OrgFactAuthor(agent_id=agent_id)
                write_req = OrgFactWriteRequest(
                    content=NotBlankStr(entry.content),
                    category=org_category,
                )
                await org_memory_backend.write(write_req, author=author)
                promoted_count += 1
            except (OSError, ValueError, KeyError) as exc:
                logger.warning(
                    HR_ARCHIVAL_ENTRY_FAILED,
                    agent_id=agent_id,
                    memory_id=str(entry.id),
                    phase="promote",
                    error=str(exc),
                )

    # Clean hot store.
    hot_store_cleaned = True
    for memory_id in deleted_ids:
        try:
            await memory_backend.delete(agent_id, NotBlankStr(memory_id))
        except (OSError, ValueError) as exc:
            hot_store_cleaned = False
            logger.warning(
                HR_ARCHIVAL_ENTRY_FAILED,
                agent_id=agent_id,
                memory_id=memory_id,
                phase="delete",
                error=str(exc),
            )

    result = ArchivalResult(
        agent_id=agent_id,
        total_archived=archived_count,
        promoted_to_org=promoted_count,
        hot_store_cleaned=hot_store_cleaned,
        strategy_name=NotBlankStr(self.name),
    )

    logger.info(
        HR_FIRING_MEMORY_ARCHIVED,
        agent_id=agent_id,
        archived=archived_count,
        promoted=promoted_count,
        cleaned=hot_store_cleaned,
    )
    return result
🧹 Nitpick | 🔵 Trivial
Consider extracting helper methods to reduce function length.
The archive() method spans ~120 lines, exceeding the 50-line guideline. The distinct phases (retrieve → archive to cold → promote to org → clean hot store) are natural candidates for extraction into private helper methods (e.g., _archive_entries(), _promote_to_org(), _clean_hot_store()). This would improve readability and testability.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@src/ai_company/hr/full_snapshot_strategy.py` around lines 65 - 186, The
archive() method is too long and should be broken into helpers; extract the
phases into private methods to improve readability and testability: implement a
_retrieve_entries(agent_id, memory_backend) that wraps memory_backend.retrieve
and error handling, an _archive_entries(entries, archival_store, agent_id, now)
that constructs ArchivalEntry, calls archival_store.archive and returns
(archived_count, deleted_ids), a _promote_to_org(entries, org_memory_backend,
agent_id) that handles category mapping via _CATEGORY_MAP,
OrgFactAuthor/OrgFactWriteRequest and returns promoted_count, and a
_clean_hot_store(deleted_ids, memory_backend, agent_id) that deletes ids and
returns hot_store_cleaned; update archive() to call these helpers and assemble
the ArchivalResult while preserving existing logging and exception behavior (use
function names: archive, _retrieve_entries, _archive_entries, _promote_to_org,
_clean_hot_store, ArchivalEntry, archival_store.archive,
org_memory_backend.write, memory_backend.delete).
# Clean hot store.
hot_store_cleaned = True
for memory_id in deleted_ids:
    try:
        await memory_backend.delete(agent_id, NotBlankStr(memory_id))
    except (OSError, ValueError) as exc:
        hot_store_cleaned = False
        logger.warning(
            HR_ARCHIVAL_ENTRY_FAILED,
            agent_id=agent_id,
            memory_id=memory_id,
            phase="delete",
            error=str(exc),
        )
🧩 Analysis chain
🏁 Script executed:
#!/bin/bash
# Description: Check MemoryBackend.delete signature for memory_id type
rg -n --type=py -A 3 'async def delete.*agent_id.*memory_id' src/ai_company/memory/

Repository: Aureliolo/ai-company
Length of output: 46
🏁 Script executed:
#!/bin/bash
# Find MemoryBackend definition
rg -n --type=py 'class MemoryBackend|def delete.*agent_id' src/ai_company/ | head -20

Repository: Aureliolo/ai-company
Length of output: 131
🏁 Script executed:
#!/bin/bash
# Search for memory_backend usage and imports
rg -n --type=py 'memory_backend.delete' src/ai_company/ -B 2 -A 2

Repository: Aureliolo/ai-company
Length of output: 527
🏁 Script executed:
#!/bin/bash
# List files in memory directory
fd . src/ai_company/memory/ --type f -name "*.py" | head -20

Repository: Aureliolo/ai-company
Length of output: 294
🏁 Script executed:
#!/bin/bash
cat src/ai_company/memory/protocol.py

Repository: Aureliolo/ai-company
Length of output: 5317
🏁 Script executed:
#!/bin/bash
# Read the relevant sections of full_snapshot_strategy.py to see where deleted_ids comes from
head -200 src/ai_company/hr/full_snapshot_strategy.py | tail -100

Repository: Aureliolo/ai-company
Length of output: 3727
🏁 Script executed:
#!/bin/bash
# Search for where deleted_ids is defined/assigned
rg -n --type=py 'deleted_ids.*=' src/ai_company/hr/full_snapshot_strategy.py -B 3 -A 1

Repository: Aureliolo/ai-company
Length of output: 206
🏁 Script executed:
#!/bin/bash
# Check MemoryEntry model to see what type entry.id is
rg -n --type=py 'class MemoryEntry' src/ai_company/memory/ -A 20

Repository: Aureliolo/ai-company
Length of output: 1677
Remove redundant NotBlankStr wrapping in delete call.
deleted_ids stores str(entry.id) (line 119), but entry.id is already NotBlankStr from the backend response. Re-wrapping with NotBlankStr(memory_id) at line 160 re-validates unnecessarily at an internal boundary. Either keep deleted_ids as list[NotBlankStr] and pass directly, or if storing as str, pass it directly without re-wrapping—validation already occurred when the entry was retrieved from the backend.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@src/ai_company/hr/full_snapshot_strategy.py` around lines 156 - 169, The
delete loop is re-wrapping already-validated IDs with NotBlankStr; remove the
redundant wrap and pass memory_id directly to memory_backend.delete from the hot
store cleanup loop (symbols: deleted_ids, memory_id, memory_backend.delete,
NotBlankStr, hot_store_cleaned); ensure consistency with how deleted_ids is
populated (either keep deleted_ids as list[NotBlankStr] and pass the elements
unchanged, or keep them as list[str] and pass the raw str) and update the code
that builds deleted_ids accordingly so validation happens only once when entries
are read from the backend.
async def test_cost_efficiency_zero_cost(self) -> None:
    """Zero cost -> max cost efficiency score."""
    strategy = self._make_strategy()
    task_result = make_task_metric(cost_usd=0.0)

    result = await strategy.score(
        agent_id=NotBlankStr("agent-001"),
        task_id=NotBlankStr("task-001"),
        task_result=task_result,
        acceptance_criteria=(),
    )

    breakdown_dict = dict(result.breakdown)
    assert breakdown_dict["cost_efficiency"] == 10.0

async def test_cost_efficiency_high_cost(self) -> None:
    """Cost >= 10 -> zero cost efficiency score."""
    strategy = self._make_strategy()
    task_result = make_task_metric(cost_usd=15.0)

    result = await strategy.score(
        agent_id=NotBlankStr("agent-001"),
        task_id=NotBlankStr("task-001"),
        task_result=task_result,
        acceptance_criteria=(),
    )

    breakdown_dict = dict(result.breakdown)
    assert breakdown_dict["cost_efficiency"] == 0.0
🧹 Nitpick | 🔵 Trivial
Collapse the cost-efficiency assertions into the parametrized test.
test_cost_efficiency_zero_cost and test_cost_efficiency_high_cost hit the same mapping branches that test_cost_efficiency_parametrized already covers, so keeping both forms only adds drift and maintenance.
As per coding guidelines "Prefer @pytest.mark.parametrize for testing similar cases".
Also applies to: 159-185
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tests/unit/hr/performance/test_ci_quality_strategy.py` around lines 129 -
157, Merge the two separate tests test_cost_efficiency_zero_cost and
test_cost_efficiency_high_cost into the existing parametrized
test_cost_efficiency_parametrized: add parameter rows for (cost_usd=0.0,
expected=10.0) and (cost_usd=15.0, expected=0.0), use the same setup via
self._make_strategy() and make_task_metric, call
strategy.score(agent_id=NotBlankStr("agent-001"),
task_id=NotBlankStr("task-001"), task_result=task_result,
acceptance_criteria=()), and assert dict(result.breakdown)["cost_efficiency"] ==
expected; then remove the now-duplicate test_cost_efficiency_zero_cost and
test_cost_efficiency_high_cost (and the other duplicate pair noted) so only the
parametrized test covers these cases.
async def test_snapshot_with_windows_and_trends(self) -> None:
    """Snapshot includes windows and trends from strategies."""
    window = WindowMetrics(
        window_size=NotBlankStr("7d"),
        data_point_count=10,
        tasks_completed=8,
        tasks_failed=2,
        success_rate=0.8,
    )
    tracker = _make_tracker(
        window=MockWindowStrategy(
            windows=(window,),
            min_data_points=5,
        ),
    )
    # Add a scored task record so trend computation has data
    record = make_task_metric(
        completed_at=NOW - timedelta(hours=1),
        quality_score=7.5,
    )
    await tracker.record_task_metric(record)

    snapshot = await tracker.get_snapshot(
        NotBlankStr("agent-001"),
        now=NOW,
    )

    assert len(snapshot.windows) == 1
    assert snapshot.windows[0].window_size == "7d"
    # Trends should be computed (quality_score + cost_usd)
    assert len(snapshot.trends) == 2
    assert snapshot.overall_quality_score == 7.5
Don't assert trends from a single task sample.
This test only records one TaskMetricRecord, but the feature contract requires minimum data points before trend detection. The mocked WindowMetrics.data_point_count=10 does not supply the time-series values that TrendStrategy.detect(...) consumes, so len(snapshot.trends) == 2 bakes in under-sampled trend output.
Make the scenario satisfy the threshold instead
- # Add a scored task record so trend computation has data
- record = make_task_metric(
- completed_at=NOW - timedelta(hours=1),
- quality_score=7.5,
- )
- await tracker.record_task_metric(record)
+ # Seed at least min_data_points scored records so trend computation is valid.
+ for hours_ago, quality_score in enumerate(
+ (7.1, 7.2, 7.3, 7.4, 7.5),
+ start=1,
+ ):
+ await tracker.record_task_metric(
+ make_task_metric(
+ task_id=f"task-{hours_ago:03d}",
+ completed_at=NOW - timedelta(hours=hours_ago),
+ quality_score=quality_score,
+ )
+ )

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tests/unit/hr/performance/test_tracker.py` around lines 253 - 285, The test
test_snapshot_with_windows_and_trends asserts trends are present despite only
recording one TaskMetricRecord and not supplying the time-series that
TrendStrategy.detect uses; either (A) make the scenario satisfy the
trend-detection threshold by adding enough TaskMetricRecord entries (call
tracker.record_task_metric multiple times at different timestamps so
WindowMetrics.data_point_count and TrendStrategy.detect receive sufficient
samples) before calling tracker.get_snapshot, or (B) change the assertion to
expect no trends (assert snapshot.trends == ()) to verify the threshold guard;
update references in the test around
MockWindowStrategy/WindowMetrics.data_point_count, tracker.record_task_metric,
and tracker.get_snapshot accordingly.
async def list_tasks(
    self,
    *,
    status: TaskStatus | None = None,
    assigned_to: str | None = None,
    project: str | None = None,
) -> tuple[Task, ...]:
    result = list(self._tasks.values())
    if status is not None:
        result = [t for t in result if t.status == status]
    if assigned_to is not None:
        result = [t for t in result if t.assigned_to == assigned_to]
    if project is not None:
        result = [t for t in result if t.project == project]
    return tuple(result)
🧹 Nitpick | 🔵 Trivial
Type hint for assigned_to parameter should be NotBlankStr | None.
The list_tasks method signature uses assigned_to: str | None = None, but looking at the OffboardingService (line 121 in offboarding_service.py), it calls list_tasks(assigned_to=NotBlankStr(agent_id)). The fake should match the expected interface more closely. However, since this is a test double and Python's duck typing allows it to work, this is minor.
🧹 Proposed fix for type consistency
async def list_tasks(
self,
*,
status: TaskStatus | None = None,
- assigned_to: str | None = None,
+ assigned_to: NotBlankStr | None = None,
project: str | None = None,
) -> tuple[Task, ...]:

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@tests/unit/hr/test_offboarding_service.py` around lines 38 - 52, The test
double list_tasks in tests/unit/hr/test_offboarding_service.py declares
assigned_to: str | None but OffboardingService calls it with
NotBlankStr(agent_id); change the signature to assigned_to: NotBlankStr | None
to match the expected type, add the appropriate import for NotBlankStr from the
module where it’s defined, and update any related type hints in that file so the
fake ListTasks function signature aligns with the OffboardingService usage
(function name: list_tasks).
Summary of ChangesHello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request introduces a significant new "HR Engine" and "Performance Tracking" system, fundamentally expanding the capabilities of the AI company platform. The HR engine manages the entire lifecycle of AI agents, from their initial hiring and onboarding to their eventual offboarding, including task reassignment and memory archival. Complementing this, the performance tracking system provides detailed insights into agent efficiency, quality, and collaboration through various metrics and trend analysis. These additions are supported by new persistence layers and enhanced observability, making the platform more robust and manageable for operating a virtual organization of AI agents. Highlights
Changelog
Activity
Code Review
This pull request introduces a comprehensive HR engine and performance tracking system, including a full hiring pipeline, onboarding/offboarding processes, an agent registry, and detailed performance metrics. The persistence layer has been extended with SQLite repositories for new HR-related data. However, two significant race conditions were identified in the HiringService and OnboardingService due to asynchronous operations between reading and updating state, which can lead to duplicate agent instantiation and inconsistent onboarding checklists. It is recommended to use asynchronous locks to ensure atomic state transitions. Additionally, several key services (HiringService, OnboardingService, PerformanceTracker, AgentRegistryService) are currently using in-memory storage, leading to state loss on restart, and should be refactored to utilize the new persistence layer. There are also opportunities for more specific exception handling.
self._task_metrics: dict[str, list[TaskMetricRecord]] = {}
self._collab_metrics: dict[str, list[CollaborationMetricRecord]] = {}
The PerformanceTracker service uses in-memory dictionaries for storing task and collaboration metrics. This is a critical issue as all performance data will be lost upon service restart. The PR includes the implementation of SQLiteTaskMetricRepository and SQLiteCollaborationMetricRepository. This service should be refactored to use these repositories for persistent storage.
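The repository-injection refactor this comment asks for can be sketched like this; every name below (the TaskMetricRepository protocol shape, PerformanceTrackerSketch, dict-typed metrics) is illustrative rather than the PR's actual interface, but it shows the core idea: the tracker takes a repository at construction time and never owns a dict, so swapping in SQLiteTaskMetricRepository makes metrics survive a restart.

```python
import asyncio
from typing import Protocol


class TaskMetricRepository(Protocol):
    async def add(self, agent_id: str, metric: dict) -> None: ...
    async def list_for_agent(self, agent_id: str) -> list[dict]: ...


class InMemoryTaskMetricRepository:
    """Test double; production would inject the SQLite-backed repository."""

    def __init__(self) -> None:
        self._rows: dict[str, list[dict]] = {}

    async def add(self, agent_id: str, metric: dict) -> None:
        self._rows.setdefault(agent_id, []).append(metric)

    async def list_for_agent(self, agent_id: str) -> list[dict]:
        return list(self._rows.get(agent_id, []))


class PerformanceTrackerSketch:
    def __init__(self, task_metrics: TaskMetricRepository) -> None:
        self._task_metrics = task_metrics  # injected store, no owned dict

    async def record_task_metric(self, agent_id: str, metric: dict) -> None:
        await self._task_metrics.add(agent_id, metric)


async def main() -> int:
    repo = InMemoryTaskMetricRepository()
    tracker = PerformanceTrackerSketch(repo)
    await tracker.record_task_metric("agent-001", {"quality_score": 7.5})
    return len(await repo.list_for_agent("agent-001"))


print(asyncio.run(main()))  # prints 1
```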
    *,
    message_bus: MessageBus | None = None,
) -> None:
    self._agents: dict[str, AgentIdentity] = {}
The AgentRegistryService stores agent identities in an in-memory dictionary. This is a critical flaw, as the entire state of the company's workforce will be lost if the application restarts. The agent registry must be backed by a persistent store. The PR includes persistence layers for other HR entities, and the agent registry should be treated with the same importance.
self._registry = registry
self._approval_store = approval_store
self._onboarding_service = onboarding_service
self._requests: dict[str, HiringRequest] = {}
The HiringService uses an in-memory dictionary self._requests to store hiring requests. This makes the state of hiring requests volatile, as it will be lost if the service restarts. The PR correctly adds persistence repositories for HR data, but this service doesn't seem to use them. The service should be refactored to accept and use a repository for HiringRequest persistence.
    registry: AgentRegistryService,
) -> None:
    self._registry = registry
    self._checklists: dict[str, OnboardingChecklist] = {}
src/ai_company/hr/hiring_service.py
Outdated
if request.status == HiringRequestStatus.INSTANTIATED:
    msg = f"Hiring request {request.id!r} is already instantiated"
    logger.warning(
        HR_HIRING_INSTANTIATION_FAILED,
        request_id=str(request.id),
        error=msg,
    )
    raise HiringError(msg)
if request.status == HiringRequestStatus.REJECTED:
    msg = f"Hiring request {request.id!r} was rejected"
    logger.warning(
        HR_HIRING_INSTANTIATION_FAILED,
        request_id=str(request.id),
        error=msg,
    )
    raise HiringRejectedError(msg)
if request.status == HiringRequestStatus.PENDING:
    msg = f"Hiring request {request.id!r} requires approval"
    logger.warning(
        HR_HIRING_INSTANTIATION_FAILED,
        request_id=str(request.id),
        error=msg,
    )
    raise HiringApprovalRequiredError(msg)
if request.selected_candidate_id is None:
    msg = f"No candidate selected on request {request.id!r}"
    logger.warning(
        HR_HIRING_INSTANTIATION_FAILED,
        request_id=str(request.id),
        error=msg,
    )
    raise InvalidCandidateError(msg)

candidate = next(
    (
        c
        for c in request.candidates
        if str(c.id) == request.selected_candidate_id
    ),
    None,
)
if candidate is None:
    msg = (
        f"Selected candidate {request.selected_candidate_id!r} "
        f"not found on request {request.id!r}"
    )
    logger.warning(
        HR_HIRING_INSTANTIATION_FAILED,
        request_id=str(request.id),
        error=msg,
    )
    raise InvalidCandidateError(msg)

try:
    identity = AgentIdentity(
        name=candidate.name,
        role=candidate.role,
        department=candidate.department,
        level=candidate.level,
        model=ModelConfig(
            provider=NotBlankStr("default-provider"),
            model_id=NotBlankStr("default-model-001"),
        ),
        status=AgentStatus.ONBOARDING,
        hiring_date=datetime.now(UTC).date(),
    )
    await self._registry.register(identity)
except Exception as exc:
    msg = f"Failed to instantiate agent for request {request.id!r}"
    logger.exception(
        HR_HIRING_INSTANTIATION_FAILED,
        request_id=str(request.id),
        error=str(exc),
    )
    raise HiringError(msg) from exc

# Update request status.
updated = request.model_copy(
    update={"status": HiringRequestStatus.INSTANTIATED},
)
self._requests[str(updated.id)] = updated
The instantiate_agent method is vulnerable to a race condition. It checks if a hiring request has already been instantiated at the beginning of the method, but then performs an asynchronous operation (await self._registry.register(identity)) before updating the request status to INSTANTIATED. In a concurrent environment, multiple tasks could pass the initial status check before the first one completes and updates the status, leading to multiple agents being hired for a single approved request, potentially resulting in unauthorized resource allocation and budget bypass. Additionally, the except Exception block is too broad and can hide specific issues during agent registration, such as AgentAlreadyRegisteredError. Consider catching more specific exceptions first, and use except Exception as a final catch-all if necessary.
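One way to close the check-then-act gap is a service-level asyncio.Lock held across the status check, the registry call, and the status write, so a second caller cannot pass the INSTANTIATED check mid-flight. The sketch below is hypothetical and heavily simplified (request states as plain strings, a no-op standing in for registry.register), not the PR's HiringService.

```python
import asyncio


class HiringServiceSketch:
    def __init__(self) -> None:
        self._instantiate_lock = asyncio.Lock()
        self._requests = {"req-1": "approved"}  # simplified state machine
        self.instantiations = 0

    async def instantiate_agent(self, request_id: str) -> bool:
        # Lock covers check + await + state write, making the sequence atomic.
        async with self._instantiate_lock:
            if self._requests[request_id] == "instantiated":
                return False  # duplicate caller becomes a no-op
            await asyncio.sleep(0)  # stands in for registry.register()
            self._requests[request_id] = "instantiated"
            self.instantiations += 1
            return True


async def main() -> int:
    svc = HiringServiceSketch()
    results = await asyncio.gather(
        *(svc.instantiate_agent("req-1") for _ in range(5))
    )
    assert results.count(True) == 1  # exactly one caller wins
    return svc.instantiations


print(asyncio.run(main()))  # prints 1
```

In the real service the duplicate path would raise HiringError as today; the lock only guarantees that at most one coroutine reaches the registry call per request.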
checklist = self._checklists.get(agent_id)
if checklist is None:
    msg = f"No onboarding checklist for agent {agent_id!r}"
    logger.warning(
        HR_ONBOARDING_STEP_COMPLETE,
        agent_id=agent_id,
        error=msg,
    )
    raise OnboardingError(msg)

now = datetime.now(UTC)
step_found = any(s.step == step and not s.completed for s in checklist.steps)
if not step_found:
    logger.debug(
        HR_ONBOARDING_STEP_COMPLETE,
        agent_id=agent_id,
        step=step.value,
        skipped="step_not_found_or_already_completed",
    )
    return checklist

updated_steps = tuple(
    s.model_copy(
        update={
            "completed": True,
            "completed_at": now,
            "notes": notes,
        },
    )
    if s.step == step and not s.completed
    else s
    for s in checklist.steps
)

updated = checklist.model_copy(update={"steps": updated_steps})

# Check if all steps are now complete.
if updated.is_complete and not checklist.is_complete:
    updated = updated.model_copy(update={"completed_at": now})
    await self._registry.update_status(agent_id, AgentStatus.ACTIVE)
    logger.info(HR_ONBOARDING_COMPLETE, agent_id=agent_id)

self._checklists[agent_id] = updated
The complete_step method contains a race condition that can lead to state inconsistency. It reads the current checklist from self._checklists, performs an asynchronous operation (await self._registry.update_status(...)) if the checklist is completed, and then saves the updated checklist back to the dictionary. If multiple steps for the same agent are completed concurrently, one update may overwrite another because each task is working with a potentially stale copy of the checklist read before the other task's update was saved. This could result in completed steps being reverted to a pending state.
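One way to serialize checklist updates is a per-agent `asyncio.Lock`, so that every read-modify-write sees the latest state while updates for different agents still run in parallel. A minimal sketch with frozensets standing in for the real checklist models (all names here are hypothetical):

```python
import asyncio
from collections import defaultdict


class OnboardingSketch:
    """Hypothetical stand-in for OnboardingService (names assumed)."""

    def __init__(self) -> None:
        self._pending: dict[str, frozenset[str]] = {}
        # One lock per agent: updates to a single checklist are serialized,
        # but different agents do not contend with each other.
        self._locks: defaultdict[str, asyncio.Lock] = defaultdict(asyncio.Lock)

    async def complete_step(self, agent_id: str, step: str) -> None:
        async with self._locks[agent_id]:
            pending = self._pending[agent_id]  # fresh read under the lock
            await asyncio.sleep(0)  # simulated async work (registry update)
            self._pending[agent_id] = pending - {step}


async def demo() -> frozenset[str]:
    svc = OnboardingSketch()
    svc._pending["agent-1"] = frozenset({"setup", "training", "review"})
    # Complete all three steps concurrently for the same agent.
    await asyncio.gather(
        *(svc.complete_step("agent-1", s) for s in ("setup", "training", "review"))
    )
    return svc._pending["agent-1"]
```

Under the lock, all three concurrent completions land; with the unguarded read-await-write pattern, tasks holding stale copies would silently revert each other's updates.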
```python
except Exception as exc:
    msg = f"Failed to retrieve memories for agent {agent_id!r}"
    logger.error(  # noqa: TRY400
        HR_ARCHIVAL_ENTRY_FAILED,
        agent_id=agent_id,
        phase="retrieve",
        error=str(exc),
        error_type=type(exc).__name__,
    )
    raise MemoryArchivalError(msg) from exc
```
This except Exception block is too broad and can mask unexpected errors. The PR description mentions that broad exception blocks were narrowed, but this one seems to have been missed. It's better to catch more specific exceptions that you expect memory_backend.retrieve to raise, such as IOError, ConnectionError, or a custom MemoryRetrievalError if one exists in the project.
Suggested change:

```python
except (IOError, ConnectionError) as exc:  # Or other specific expected errors
    msg = f"Failed to retrieve memories for agent {agent_id!r}"
    logger.error(  # noqa: TRY400
        HR_ARCHIVAL_ENTRY_FAILED,
        agent_id=agent_id,
        phase="retrieve",
        error=str(exc),
        error_type=type(exc).__name__,
    )
    raise MemoryArchivalError(msg) from exc
```
Implement the full HR module (agent lifecycle management) and performance tracking system for M7. Includes:

- Performance models, protocols, and 4 pluggable strategies (CI quality scoring, behavioral collaboration, multi-window metrics, Theil-Sen trend detection)
- PerformanceTracker service with in-memory storage
- HR models (CandidateCard, HiringRequest, FiringRequest, onboarding/offboarding records, lifecycle events)
- AgentRegistryService with asyncio.Lock-protected hot-pluggable registry
- HiringService (request → candidate → approval → instantiation)
- OnboardingService (checklist tracking, auto-activation)
- OffboardingService (task reassignment → memory archival → team notify)
- QueueReturnStrategy (D9) and FullSnapshotStrategy (D10)
- SQLite schema v2 migration (lifecycle_events, task_metrics, collaboration_metrics tables)
- 3 new SQLite repository implementations
- HR and performance persistence protocols
- AgentStatus.ONBOARDING and MessageType.HR_NOTIFICATION enums
- 16 HR + 6 performance observability event constants
- 286 unit tests at 91.6% coverage
Pre-reviewed by 9 agents, 57 findings addressed:

- Fix critical bug: trend detection now filters records per time window
- Fix re-instantiation vulnerability: guard against INSTANTIATED status
- Narrow except Exception blocks to specific types across 5+ locations
- Add missing logger.warning() before raises per CLAUDE.md rules
- Extract HR repos to hr_repositories.py (800-line file limit)
- Fix since/until types from object to AwareDatetime in repo queries
- Add model validators (threshold ordering, temporal order, step consistency)
- Fix NotBlankStr type violations in 5 model fields
- Update DESIGN_SPEC.md, CLAUDE.md, README.md for HR module
- Fix ActionType.HIRING → ActionType.ORG_HIRE (new enum from SecOps merge)
- Resolve rebase conflicts in CLAUDE.md, DESIGN_SPEC.md, README.md
- Merge security event constants into CLAUDE.md logging section
- Update implementation snapshot to include SecOps + Docker/MCP/HR
- Fix test_events.py to include security, hr, and performance modules
- Fix hiring_service: _get_request pattern, specific exception handling, budget_limit_monthly validation, configurable default_model_config, refactored into small helpers
- Fix offboarding_service: default hot_store_cleaned=False, termination raises OffboardingError, broader exception handling, refactored helpers
- Fix full_snapshot_strategy: OrgFactAuthor receives agent_seniority, narrowed exception handling, refactored into helpers
- Fix performance tracker: correct default strategy imports, optional strategy params with PerformanceConfig defaults, window label warning
- Fix ci_quality_strategy: log-scaled cost scoring replaces broken linear
- Fix multi_window_strategy: success_rate=None below min_data_points
- Fix collaboration_protocol docstring accuracy
- Fix theil_sen_strategy _median docstring for empty-list sentinel
- Lint fixes: TRY400, TC001, PLC0415, EM101, TRY003, N817, I001, E501
- Test updates for all source changes + new edge-case tests
```python
# Cost trend.
cost_values = tuple((r.completed_at, r.cost_usd) for r in window_records)
if cost_values:
    trends.append(
        self._trend_strategy.detect(
            metric_name=NotBlankStr("cost_usd"),
            values=cost_values,
            window_size=window.window_size,
        )
    )
```
**Cost trend direction is semantically inverted**

`TheilSenTrendStrategy` uses the same sign convention for every metric: positive slope → `IMPROVING`, negative slope → `DECLINING`. This is correct for `quality_score` (higher is better), but wrong for `cost_usd`, where a positive slope (rising costs) should be `DECLINING`.

As written, an agent whose `cost_usd` trend is increasing over a window will be reported as `IMPROVING`, and a cost-efficient agent reducing spend will be reported as `DECLINING`. The result is that cost trend data in `AgentPerformanceSnapshot` is actively misleading.

The fix requires either:

- Passing a `higher_is_better: bool` flag into `TrendDetectionStrategy.detect()` so it can invert the direction for cost metrics, or
- Inverting the values before passing them (e.g., pass `-cost_usd`) for metrics where lower is better.
```python
# Cost trend — negate cost so that declining cost = positive slope = IMPROVING.
cost_values = tuple((r.completed_at, -r.cost_usd) for r in window_records)
```
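The first option (a `higher_is_better` flag) can be sketched as a small direction classifier; the `TrendDirection` enum and `classify` helper below are illustrative stand-ins, not the project's actual API:

```python
from enum import Enum


class TrendDirection(Enum):
    IMPROVING = "improving"
    DECLINING = "declining"
    STABLE = "stable"


def classify(slope: float, *, higher_is_better: bool = True,
             stable_eps: float = 1e-9) -> TrendDirection:
    """Map a fitted slope to a direction, flipping lower-is-better metrics."""
    if abs(slope) <= stable_eps:
        return TrendDirection.STABLE
    if not higher_is_better:
        slope = -slope  # rising cost becomes a negative (declining) signal
    return TrendDirection.IMPROVING if slope > 0 else TrendDirection.DECLINING
```

With this, the caller passes `higher_is_better=False` for `cost_usd` and the raw metric values stay unmodified, which keeps the slope magnitude meaningful in reports.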
```python
msg = f"Hiring request {request_id!r} not found"
logger.warning(
    HR_HIRING_REQUEST_CREATED,
    request_id=request_id,
    error=msg,
)
raise HiringError(msg)
```
Wrong log event constant used for "not found" error
_get_request logs HR_HIRING_REQUEST_CREATED when a hiring request is not found. Any monitoring rule or structured-log query watching HR_HIRING_REQUEST_CREATED will incorrectly count this error path as a successful request creation event.
The correct constant here should be something that signals a lookup failure. If no dedicated "not found" constant exists in events.hr, using the downstream failure event (e.g. HR_HIRING_INSTANTIATION_FAILED) or creating a new HR_HIRING_REQUEST_NOT_FOUND constant would be more accurate.
Suggested change (replace the `logger.warning` call):

```python
logger.warning(
    HR_HIRING_INSTANTIATION_FAILED,
    request_id=request_id,
    error=msg,
)
```
```python
agent_name=identity.name,
firing_request_id=request.id,
tasks_reassigned=tasks_reassigned,
memory_archive_id=None,
```
memory_archive_id is always hardcoded to None
OffboardingRecord.memory_archive_id exists to capture the ID of the cold-storage archive created during offboarding, but it is unconditionally set to None here. The archival_result object (of type ArchivalResult) is available at this point and carries total_archived and promoted_to_org, but ArchivalResult itself does not expose an archive ID field — meaning there is currently no way to populate this field.
This is a gap between the data model (OffboardingRecord has the field; ArchivalResult does not). If the archive ID is needed for auditability or downstream references, ArchivalResult needs an archive_id: NotBlankStr | None field that implementing strategies can populate. Otherwise the field in OffboardingRecord is permanently dead and creates misleading API surface.
At minimum, consider adding archive_id: NotBlankStr | None = None to ArchivalResult so that FullSnapshotStrategy (or any other archival implementation) can return the ID of the archive record it creates.
```python
    """
    result = await self._quality_strategy.score(
        agent_id=agent_id,
        task_id=task_id,
        task_result=task_result,
        acceptance_criteria=acceptance_criteria,
    )
    return task_result.model_copy(update={"quality_score": result.score})
```
score_task_quality does not update the stored record
score_task_quality computes the quality score and returns a new TaskMetricRecord with quality_score populated, but it never writes the updated record back into self._task_metrics. The in-memory store continues to hold the original record with quality_score=None.
Consequence: get_snapshot() aggregates overall_quality_score from self._task_metrics, so it will always compute None (no scored records) even after score_task_quality has been called. Trend detection on quality_score in _compute_trends is similarly broken — quality_values will always be empty because no stored record ever has a non-None quality_score.
The only workaround for callers today is to take the returned scored record and call record_task_metric(scored_record) a second time — which also duplicates the entry in the list. There is no "replace" API.
The method should write the scored record back into self._task_metrics, replacing the original entry by matching on record.id, before returning it.
🤖 I have created a release *beep* *boop*

---

## [0.1.1](ai-company-v0.1.0...ai-company-v0.1.1) (2026-03-10)

### Features

* add autonomy levels and approval timeout policies ([#42](#42), [#126](#126)) ([#197](#197)) ([eecc25a](eecc25a))
* add CFO cost optimization service with anomaly detection, reports, and approval decisions ([#186](#186)) ([a7fa00b](a7fa00b))
* add code quality toolchain (ruff, mypy, pre-commit, dependabot) ([#63](#63)) ([36681a8](36681a8))
* add configurable cost tiers and subscription/quota-aware tracking ([#67](#67)) ([#185](#185)) ([9baedfa](9baedfa))
* add container packaging, Docker Compose, and CI pipeline ([#269](#269)) ([435bdfe](435bdfe)), closes [#267](#267)
* add coordination error taxonomy classification pipeline ([#146](#146)) ([#181](#181)) ([70c7480](70c7480))
* add cost-optimized, hierarchical, and auction assignment strategies ([#175](#175)) ([ce924fa](ce924fa)), closes [#173](#173)
* add design specification, license, and project setup ([8669a09](8669a09))
* add env var substitution and config file auto-discovery ([#77](#77)) ([7f53832](7f53832))
* add FastestStrategy routing + vendor-agnostic cleanup ([#140](#140)) ([09619cb](09619cb)), closes [#139](#139)
* add HR engine and performance tracking ([#45](#45), [#47](#47)) ([#193](#193)) ([2d091ea](2d091ea))
* add issue auto-search and resolution verification to PR review skill ([#119](#119)) ([deecc39](deecc39))
* add memory retrieval, ranking, and context injection pipeline ([#41](#41)) ([873b0aa](873b0aa))
* add pluggable MemoryBackend protocol with models, config, and events ([#180](#180)) ([46cfdd4](46cfdd4))
* add pluggable MemoryBackend protocol with models, config, and events ([#32](#32)) ([46cfdd4](46cfdd4))
* add pluggable PersistenceBackend protocol with SQLite implementation ([#36](#36)) ([f753779](f753779))
* add progressive trust and promotion/demotion subsystems ([#43](#43), [#49](#49)) ([3a87c08](3a87c08))
* add retry handler, rate limiter, and provider resilience ([#100](#100)) ([b890545](b890545))
* add SecOps security agent with rule engine, audit log, and ToolInvoker integration ([#40](#40)) ([83b7b6c](83b7b6c))
* add shared org memory and memory consolidation/archival ([#125](#125), [#48](#48)) ([4a0832b](4a0832b))
* design unified provider interface ([#86](#86)) ([3e23d64](3e23d64))
* expand template presets, rosters, and add inheritance ([#80](#80), [#81](#81), [#84](#84)) ([15a9134](15a9134))
* implement agent runtime state vs immutable config split ([#115](#115)) ([4cb1ca5](4cb1ca5))
* implement AgentEngine core orchestrator ([#11](#11)) ([#143](#143)) ([f2eb73a](f2eb73a))
* implement basic tool system (registry, invocation, results) ([#15](#15)) ([c51068b](c51068b))
* implement built-in file system tools ([#18](#18)) ([325ef98](325ef98))
* implement communication foundation — message bus, dispatcher, and messenger ([#157](#157)) ([8e71bfd](8e71bfd))
* implement company template system with 7 built-in presets ([#85](#85)) ([cbf1496](cbf1496))
* implement conflict resolution protocol ([#122](#122)) ([#166](#166)) ([e03f9f2](e03f9f2))
* implement core entity and role system models ([#69](#69)) ([acf9801](acf9801))
* implement crash recovery with fail-and-reassign strategy ([#149](#149)) ([e6e91ed](e6e91ed))
* implement engine extensions — Plan-and-Execute loop and call categorization ([#134](#134), [#135](#135)) ([#159](#159)) ([9b2699f](9b2699f))
* implement enterprise logging system with structlog ([#73](#73)) ([2f787e5](2f787e5))
* implement graceful shutdown with cooperative timeout strategy ([#130](#130)) ([6592515](6592515))
* implement hierarchical delegation and loop prevention ([#12](#12), [#17](#17)) ([6be60b6](6be60b6))
* implement LiteLLM driver and provider registry ([#88](#88)) ([ae3f18b](ae3f18b)), closes [#4](#4)
* implement LLM decomposition strategy and workspace isolation ([#174](#174)) ([aa0eefe](aa0eefe))
* implement meeting protocol system ([#123](#123)) ([ee7caca](ee7caca))
* implement message and communication domain models ([#74](#74)) ([560a5d2](560a5d2))
* implement model routing engine ([#99](#99)) ([d3c250b](d3c250b))
* implement parallel agent execution ([#22](#22)) ([#161](#161)) ([65940b3](65940b3))
* implement per-call cost tracking service ([#7](#7)) ([#102](#102)) ([c4f1f1c](c4f1f1c))
* implement personality injection and system prompt construction ([#105](#105)) ([934dd85](934dd85))
* implement single-task execution lifecycle ([#21](#21)) ([#144](#144)) ([c7e64e4](c7e64e4))
* implement subprocess sandbox for tool execution isolation ([#131](#131)) ([#153](#153)) ([3c8394e](3c8394e))
* implement task assignment subsystem with pluggable strategies ([#172](#172)) ([c7f1b26](c7f1b26)), closes [#26](#26) [#30](#30)
* implement task decomposition and routing engine ([#14](#14)) ([9c7fb52](9c7fb52))
* implement Task, Project, Artifact, Budget, and Cost domain models ([#71](#71)) ([81eabf1](81eabf1))
* implement tool permission checking ([#16](#16)) ([833c190](833c190))
* implement YAML config loader with Pydantic validation ([#59](#59)) ([ff3a2ba](ff3a2ba))
* implement YAML config loader with Pydantic validation ([#75](#75)) ([ff3a2ba](ff3a2ba))
* initialize project with uv, hatchling, and src layout ([39005f9](39005f9))
* initialize project with uv, hatchling, and src layout ([#62](#62)) ([39005f9](39005f9))
* Litestar REST API, WebSocket feed, and approval queue (M6) ([#189](#189)) ([29fcd08](29fcd08))
* make TokenUsage.total_tokens a computed field ([#118](#118)) ([c0bab18](c0bab18)), closes [#109](#109)
* parallel tool execution in ToolInvoker.invoke_all ([#137](#137)) ([58517ee](58517ee))
* testing framework, CI pipeline, and M0 gap fixes ([#64](#64)) ([f581749](f581749))
* wire all modules into observability system ([#97](#97)) ([f7a0617](f7a0617))

### Bug Fixes

* address Greptile post-merge review findings from PRs [#170](https://github.com/Aureliolo/ai-company/issues/170)-[#175](https://github.com/Aureliolo/ai-company/issues/175) ([#176](#176)) ([c5ca929](c5ca929))
* address post-merge review feedback from PRs [#164](https://github.com/Aureliolo/ai-company/issues/164)-[#167](https://github.com/Aureliolo/ai-company/issues/167) ([#170](#170)) ([3bf897a](3bf897a)), closes [#169](#169)
* enforce strict mypy on test files ([#89](#89)) ([aeeff8c](aeeff8c))
* harden Docker sandbox, MCP bridge, and code runner ([#50](#50), [#53](#53)) ([d5e1b6e](d5e1b6e))
* harden git tools security + code quality improvements ([#150](#150)) ([000a325](000a325))
* harden subprocess cleanup, env filtering, and shutdown resilience ([#155](#155)) ([d1fe1fb](d1fe1fb))
* incorporate post-merge feedback + pre-PR review fixes ([#164](#164)) ([c02832a](c02832a))
* pre-PR review fixes for post-merge findings ([#183](#183)) ([26b3108](26b3108))
* strengthen immutability for BaseTool schema and ToolInvoker boundaries ([#117](#117)) ([7e5e861](7e5e861))

### Performance

* harden non-inferable principle implementation ([#195](#195)) ([02b5f4e](02b5f4e)), closes [#188](#188)

### Refactoring

* adopt NotBlankStr across all models ([#108](#108)) ([#120](#120)) ([ef89b90](ef89b90))
* extract _SpendingTotals base class from spending summary models ([#111](#111)) ([2f39c1b](2f39c1b))
* harden BudgetEnforcer with error handling, validation extraction, and review fixes ([#182](#182)) ([c107bf9](c107bf9))
* harden personality profiles, department validation, and template rendering ([#158](#158)) ([10b2299](10b2299))
* pre-PR review improvements for ExecutionLoop + ReAct loop ([#124](#124)) ([8dfb3c0](8dfb3c0))
* split events.py into per-domain event modules ([#136](#136)) ([e9cba89](e9cba89))

### Documentation

* add ADR-001 memory layer evaluation and selection ([#178](#178)) ([db3026f](db3026f)), closes [#39](#39)
* add agent scaling research findings to DESIGN_SPEC ([#145](#145)) ([57e487b](57e487b))
* add CLAUDE.md, contributing guide, and dev documentation ([#65](#65)) ([55c1025](55c1025)), closes [#54](#54)
* add crash recovery, sandboxing, analytics, and testing decisions ([#127](#127)) ([5c11595](5c11595))
* address external review feedback with MVP scope and new protocols ([#128](#128)) ([3b30b9a](3b30b9a))
* expand design spec with pluggable strategy protocols ([#121](#121)) ([6832db6](6832db6))
* finalize 23 design decisions (ADR-002) ([#190](#190)) ([8c39742](8c39742))
* update project docs for M2.5 conventions and add docs-consistency review agent ([#114](#114)) ([99766ee](99766ee))

### Tests

* add e2e single agent integration tests ([#24](#24)) ([#156](#156)) ([f566fb4](f566fb4))
* add provider adapter integration tests ([#90](#90)) ([40a61f4](40a61f4))

### CI/CD

* add Release Please for automated versioning and GitHub Releases ([#278](#278)) ([a488758](a488758))
* bump actions/checkout from 4 to 6 ([#95](#95)) ([1897247](1897247))
* bump actions/upload-artifact from 4 to 7 ([#94](#94)) ([27b1517](27b1517))
* harden CI/CD pipeline ([#92](#92)) ([ce4693c](ce4693c))
* split vulnerability scans into critical-fail and high-warn tiers ([#277](#277)) ([aba48af](aba48af))

### Maintenance

* add /worktree skill for parallel worktree management ([#171](#171)) ([951e337](951e337))
* add design spec context loading to research-link skill ([8ef9685](8ef9685))
* add post-merge-cleanup skill ([#70](#70)) ([f913705](f913705))
* add pre-pr-review skill and update CLAUDE.md ([#103](#103)) ([92e9023](92e9023))
* add research-link skill and rename skill files to SKILL.md ([#101](#101)) ([651c577](651c577))
* bump aiosqlite from 0.21.0 to 0.22.1 ([#191](#191)) ([3274a86](3274a86))
* bump pyyaml from 6.0.2 to 6.0.3 in the minor-and-patch group ([#96](#96)) ([0338d0c](0338d0c))
* bump ruff from 0.15.4 to 0.15.5 ([a49ee46](a49ee46))
* fix M0 audit items ([#66](#66)) ([c7724b5](c7724b5))
* pin setup-uv action to full SHA ([#281](#281)) ([4448002](4448002))
* post-audit cleanup — PEP 758, loggers, bug fixes, refactoring, tests, hookify rules ([#148](#148)) ([c57a6a9](c57a6a9))

---

This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).
🤖 I have created a release *beep* *boop*

---

## [0.1.0](v0.0.0...v0.1.0) (2026-03-11)

### Features

* add autonomy levels and approval timeout policies ([#42](#42), [#126](#126)) ([#197](#197)) ([eecc25a](eecc25a))
* add CFO cost optimization service with anomaly detection, reports, and approval decisions ([#186](#186)) ([a7fa00b](a7fa00b))
* add code quality toolchain (ruff, mypy, pre-commit, dependabot) ([#63](#63)) ([36681a8](36681a8))
* add configurable cost tiers and subscription/quota-aware tracking ([#67](#67)) ([#185](#185)) ([9baedfa](9baedfa))
* add container packaging, Docker Compose, and CI pipeline ([#269](#269)) ([435bdfe](435bdfe)), closes [#267](#267)
* add coordination error taxonomy classification pipeline ([#146](#146)) ([#181](#181)) ([70c7480](70c7480))
* add cost-optimized, hierarchical, and auction assignment strategies ([#175](#175)) ([ce924fa](ce924fa)), closes [#173](#173)
* add design specification, license, and project setup ([8669a09](8669a09))
* add env var substitution and config file auto-discovery ([#77](#77)) ([7f53832](7f53832))
* add FastestStrategy routing + vendor-agnostic cleanup ([#140](#140)) ([09619cb](09619cb)), closes [#139](#139)
* add HR engine and performance tracking ([#45](#45), [#47](#47)) ([#193](#193)) ([2d091ea](2d091ea))
* add issue auto-search and resolution verification to PR review skill ([#119](#119)) ([deecc39](deecc39))
* add mandatory JWT + API key authentication ([#256](#256)) ([c279cfe](c279cfe))
* add memory retrieval, ranking, and context injection pipeline ([#41](#41)) ([873b0aa](873b0aa))
* add pluggable MemoryBackend protocol with models, config, and events ([#180](#180)) ([46cfdd4](46cfdd4))
* add pluggable MemoryBackend protocol with models, config, and events ([#32](#32)) ([46cfdd4](46cfdd4))
* add pluggable output scan response policies ([#263](#263)) ([b9907e8](b9907e8))
* add pluggable PersistenceBackend protocol with SQLite implementation ([#36](#36)) ([f753779](f753779))
* add progressive trust and promotion/demotion subsystems ([#43](#43), [#49](#49)) ([3a87c08](3a87c08))
* add retry handler, rate limiter, and provider resilience ([#100](#100)) ([b890545](b890545))
* add SecOps security agent with rule engine, audit log, and ToolInvoker integration ([#40](#40)) ([83b7b6c](83b7b6c))
* add shared org memory and memory consolidation/archival ([#125](#125), [#48](#48)) ([4a0832b](4a0832b))
* design unified provider interface ([#86](#86)) ([3e23d64](3e23d64))
* expand template presets, rosters, and add inheritance ([#80](#80), [#81](#81), [#84](#84)) ([15a9134](15a9134))
* implement agent runtime state vs immutable config split ([#115](#115)) ([4cb1ca5](4cb1ca5))
* implement AgentEngine core orchestrator ([#11](#11)) ([#143](#143)) ([f2eb73a](f2eb73a))
* implement AuditRepository for security audit log persistence ([#279](#279)) ([94bc29f](94bc29f))
* implement basic tool system (registry, invocation, results) ([#15](#15)) ([c51068b](c51068b))
* implement built-in file system tools ([#18](#18)) ([325ef98](325ef98))
* implement communication foundation — message bus, dispatcher, and messenger ([#157](#157)) ([8e71bfd](8e71bfd))
* implement company template system with 7 built-in presets ([#85](#85)) ([cbf1496](cbf1496))
* implement conflict resolution protocol ([#122](#122)) ([#166](#166)) ([e03f9f2](e03f9f2))
* implement core entity and role system models ([#69](#69)) ([acf9801](acf9801))
* implement crash recovery with fail-and-reassign strategy ([#149](#149)) ([e6e91ed](e6e91ed))
* implement engine extensions — Plan-and-Execute loop and call categorization ([#134](#134), [#135](#135)) ([#159](#159)) ([9b2699f](9b2699f))
* implement enterprise logging system with structlog ([#73](#73)) ([2f787e5](2f787e5))
* implement graceful shutdown with cooperative timeout strategy ([#130](#130)) ([6592515](6592515))
* implement hierarchical delegation and loop prevention ([#12](#12), [#17](#17)) ([6be60b6](6be60b6))
* implement LiteLLM driver and provider registry ([#88](#88)) ([ae3f18b](ae3f18b)), closes [#4](#4)
* implement LLM decomposition strategy and workspace isolation ([#174](#174)) ([aa0eefe](aa0eefe))
* implement meeting protocol system ([#123](#123)) ([ee7caca](ee7caca))
* implement message and communication domain models ([#74](#74)) ([560a5d2](560a5d2))
* implement model routing engine ([#99](#99)) ([d3c250b](d3c250b))
* implement parallel agent execution ([#22](#22)) ([#161](#161)) ([65940b3](65940b3))
* implement per-call cost tracking service ([#7](#7)) ([#102](#102)) ([c4f1f1c](c4f1f1c))
* implement personality injection and system prompt construction ([#105](#105)) ([934dd85](934dd85))
* implement single-task execution lifecycle ([#21](#21)) ([#144](#144)) ([c7e64e4](c7e64e4))
* implement subprocess sandbox for tool execution isolation ([#131](#131)) ([#153](#153)) ([3c8394e](3c8394e))
* implement task assignment subsystem with pluggable strategies ([#172](#172)) ([c7f1b26](c7f1b26)), closes [#26](#26) [#30](#30)
* implement task decomposition and routing engine ([#14](#14)) ([9c7fb52](9c7fb52))
* implement Task, Project, Artifact, Budget, and Cost domain models ([#71](#71)) ([81eabf1](81eabf1))
* implement tool permission checking ([#16](#16)) ([833c190](833c190))
* implement YAML config loader with Pydantic validation ([#59](#59)) ([ff3a2ba](ff3a2ba))
* implement YAML config loader with Pydantic validation ([#75](#75)) ([ff3a2ba](ff3a2ba))
* initialize project with uv, hatchling, and src layout ([39005f9](39005f9))
* initialize project with uv, hatchling, and src layout ([#62](#62)) ([39005f9](39005f9))
* Litestar REST API, WebSocket feed, and approval queue (M6) ([#189](#189)) ([29fcd08](29fcd08))
* make TokenUsage.total_tokens a computed field ([#118](#118)) ([c0bab18](c0bab18)), closes [#109](#109)
* parallel tool execution in ToolInvoker.invoke_all ([#137](#137)) ([58517ee](58517ee))
* testing framework, CI pipeline, and M0 gap fixes ([#64](#64)) ([f581749](f581749))
* wire all modules into observability system ([#97](#97)) ([f7a0617](f7a0617))

### Bug Fixes

* address Greptile post-merge review findings from PRs [#170](https://github.com/Aureliolo/ai-company/issues/170)-[#175](https://github.com/Aureliolo/ai-company/issues/175) ([#176](#176)) ([c5ca929](c5ca929))
* address post-merge review feedback from PRs [#164](https://github.com/Aureliolo/ai-company/issues/164)-[#167](https://github.com/Aureliolo/ai-company/issues/167) ([#170](#170)) ([3bf897a](3bf897a)), closes [#169](#169)
* enforce strict mypy on test files ([#89](#89)) ([aeeff8c](aeeff8c))
* harden Docker sandbox, MCP bridge, and code runner ([#50](#50), [#53](#53)) ([d5e1b6e](d5e1b6e))
* harden git tools security + code quality improvements ([#150](#150)) ([000a325](000a325))
* harden subprocess cleanup, env filtering, and shutdown resilience ([#155](#155)) ([d1fe1fb](d1fe1fb))
* incorporate post-merge feedback + pre-PR review fixes ([#164](#164)) ([c02832a](c02832a))
* pre-PR review fixes for post-merge findings ([#183](#183)) ([26b3108](26b3108))
* resolve circular imports, bump litellm, fix release tag format ([#286](#286)) ([a6659b5](a6659b5))
* strengthen immutability for BaseTool schema and ToolInvoker boundaries ([#117](#117)) ([7e5e861](7e5e861))

### Performance

* harden non-inferable principle implementation ([#195](#195)) ([02b5f4e](02b5f4e)), closes [#188](#188)

### Refactoring

* adopt NotBlankStr across all models ([#108](#108)) ([#120](#120)) ([ef89b90](ef89b90))
* extract _SpendingTotals base class from spending summary models ([#111](#111)) ([2f39c1b](2f39c1b))
* harden BudgetEnforcer with error handling, validation extraction, and review fixes ([#182](#182)) ([c107bf9](c107bf9))
* harden personality profiles, department validation, and template rendering ([#158](#158)) ([10b2299](10b2299))
* pre-PR review improvements for ExecutionLoop + ReAct loop ([#124](#124)) ([8dfb3c0](8dfb3c0))
* split events.py into per-domain event modules ([#136](#136)) ([e9cba89](e9cba89))

### Documentation

* add ADR-001 memory layer evaluation and selection ([#178](#178)) ([db3026f](db3026f)), closes [#39](#39)
* add agent scaling research findings to DESIGN_SPEC ([#145](#145)) ([57e487b](57e487b))
* add CLAUDE.md, contributing guide, and dev documentation ([#65](#65)) ([55c1025](55c1025)), closes [#54](#54)
* add crash recovery, sandboxing, analytics, and testing decisions ([#127](#127)) ([5c11595](5c11595))
* address external review feedback with MVP scope and new protocols ([#128](#128)) ([3b30b9a](3b30b9a))
* expand design spec with pluggable strategy protocols ([#121](#121)) ([6832db6](6832db6))
* finalize 23 design decisions (ADR-002) ([#190](#190)) ([8c39742](8c39742))
* update project docs for M2.5 conventions and add docs-consistency review agent ([#114](#114)) ([99766ee](99766ee))

### Tests

* add e2e single agent integration tests ([#24](#24)) ([#156](#156)) ([f566fb4](f566fb4))
* add provider adapter integration tests ([#90](#90)) ([40a61f4](40a61f4))

### CI/CD

* add Release Please for automated versioning and GitHub Releases ([#278](#278)) ([a488758](a488758))
* bump actions/checkout from 4 to 6 ([#95](#95)) ([1897247](1897247))
* bump actions/upload-artifact from 4 to 7 ([#94](#94)) ([27b1517](27b1517))
* bump anchore/scan-action from 6.5.1 to 7.3.2 ([#271](#271)) ([80a1c15](80a1c15))
* bump docker/build-push-action from 6.19.2 to 7.0.0 ([#273](#273)) ([dd0219e](dd0219e))
* bump docker/login-action from 3.7.0 to 4.0.0 ([#272](#272)) ([33d6238](33d6238))
* bump docker/metadata-action from 5.10.0 to 6.0.0 ([#270](#270)) ([baee04e](baee04e))
* bump docker/setup-buildx-action from 3.12.0 to 4.0.0 ([#274](#274)) ([5fc06f7](5fc06f7))
* bump sigstore/cosign-installer from 3.9.1 to 4.1.0 ([#275](#275)) ([29dd16c](29dd16c))
* harden CI/CD pipeline ([#92](#92)) ([ce4693c](ce4693c))
* split vulnerability scans into critical-fail and high-warn tiers ([#277](#277)) ([aba48af](aba48af))

### Maintenance

* add /worktree skill for parallel worktree
management ([#171](#171)) ([951e337](951e337)) * add design spec context loading to research-link skill ([8ef9685](8ef9685)) * add post-merge-cleanup skill ([#70](#70)) ([f913705](f913705)) * add pre-pr-review skill and update CLAUDE.md ([#103](#103)) ([92e9023](92e9023)) * add research-link skill and rename skill files to SKILL.md ([#101](#101)) ([651c577](651c577)) * bump aiosqlite from 0.21.0 to 0.22.1 ([#191](#191)) ([3274a86](3274a86)) * bump pyyaml from 6.0.2 to 6.0.3 in the minor-and-patch group ([#96](#96)) ([0338d0c](0338d0c)) * bump ruff from 0.15.4 to 0.15.5 ([a49ee46](a49ee46)) * fix M0 audit items ([#66](#66)) ([c7724b5](c7724b5)) * **main:** release ai-company 0.1.1 ([#282](#282)) ([2f4703d](2f4703d)) * pin setup-uv action to full SHA ([#281](#281)) ([4448002](4448002)) * post-audit cleanup — PEP 758, loggers, bug fixes, refactoring, tests, hookify rules ([#148](#148)) ([c57a6a9](c57a6a9)) --- This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please). --------- Signed-off-by: Aurelio <19254254+Aureliolo@users.noreply.github.com>
Summary
* … `hr_repositories.py`)
* narrowed `except Exception` blocks to specific types across 5+ locations
* added `logger.warning()` before raises per CLAUDE.md rules
* fixed `NotBlankStr` type violations, added model validators

Closes #45, closes #47
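A minimal sketch of the exception-narrowing and log-before-raise items above. The table name `hr_agents`, the function `fetch_agent_row`, and the error type `HRRepositoryError` are illustrative assumptions, not the PR's actual code; only the pattern (catch a specific exception type instead of `Exception`, and `logger.warning()` before every raise) reflects the summary.

```python
import logging
import sqlite3

logger = logging.getLogger(__name__)


class HRRepositoryError(Exception):
    """Hypothetical domain error raised by the repository layer."""


def fetch_agent_row(conn: sqlite3.Connection, agent_id: str) -> tuple:
    """Fetch one agent row, failing loudly with a domain error."""
    try:
        row = conn.execute(
            "SELECT id, name FROM hr_agents WHERE id = ?", (agent_id,)
        ).fetchone()
    except sqlite3.OperationalError as exc:  # narrowed from `except Exception`
        # Warn before raising, per the CLAUDE.md rule noted in the summary.
        logger.warning("hr_agents lookup failed for %s: %s", agent_id, exc)
        raise HRRepositoryError(str(exc)) from exc
    if row is None:
        logger.warning("no hr_agents row for agent %s", agent_id)
        raise HRRepositoryError(f"unknown agent: {agent_id}")
    return row
```

Catching `sqlite3.OperationalError` rather than `Exception` keeps programming errors (e.g. a `TypeError` in the calling code) visible instead of silently converting them into repository failures.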
Test plan
* tests pass (`uv run pytest tests/ -n auto`)
* strict mypy passes (`uv run mypy src/ tests/`)
* `test_circuit_breaker_after_max_errors` (not related to this PR)

Review coverage
Pre-reviewed by 9 agents: code-reviewer, python-reviewer, pr-test-analyzer, silent-failure-hunter, comment-analyzer, type-design-analyzer, logging-audit, resilience-audit, docs-consistency. 57 findings triaged and implemented.
🤖 Generated with Claude Code