feat: implement tool permission checking (#16) by Aureliolo · Pull Request #147 · Aureliolo/synthorg

Aureliolo · 2026-03-07T07:10:30Z

Summary

ToolPermissionChecker enforces access-level gating with priority-based resolution: denied list → allowed list → level categories → deny
BaseTool.category attribute (ToolCategory enum) enables per-tool categorization for permission checks
ToolInvoker integrates permission checking: filters tool definitions for LLM prompt + checks at invocation time (defense-in-depth)
AgentEngine creates permission-aware invokers from AgentIdentity.tools at the start of each run() call
5 access levels: sandboxed → restricted → standard → elevated (hierarchical), plus custom (allow-list only)
New ToolPermissionDeniedError in the tool error hierarchy
ToolAccessLevel and ToolCategory enums added to core.enums
ToolPermissions.access_level field added to the agent identity card

Pre-PR Review Fixes (10 agents, 18 findings addressed)

Fixed integration tests missing ToolCategory on test tools
Added integration test for permission-denied tool call path (E2E)
Added filter_definitions tests (denied list, sort order)
Fixed invoke() docstring to include permission-check step
Fixed ToolAccessLevel docstring re: CUSTOM level semantics
Made filter_definitions sort explicit + added DEBUG log for filtered-out tools
Changed .get() to direct dict access for fail-loud on unmapped access levels
Fixed denial_reason / _check_permission docstrings
Extracted format_task_instruction to prompt.py (agent_engine.py under 800 lines)
Updated DESIGN_SPEC.md: §3.1 access_level, §11.1.1 permission model, §11.2 M3 scope note, §15.3 project structure, §15.5 conventions

Closes #16

Test plan

2053 tests pass (3 new tests added)
95.40% coverage (80% minimum)
ruff lint + format clean
mypy strict clean
All pre-commit hooks pass
CI pipeline (lint + type-check + test)

Review coverage

Pre-reviewed by 10 agents: code-reviewer, python-reviewer, pr-test-analyzer, silent-failure-hunter, comment-analyzer, type-design-analyzer, logging-audit, resilience-audit, security-reviewer, docs-consistency. 27 findings triaged, 18 implemented, 9 skipped (by design / future scope / too minor).

Add access-level gating for tool invocations per DESIGN_SPEC §11.2. Permission resolution uses priority order: denied list > allowed list > access level categories. Progressive trust is disabled for M3 (static access levels only). New types: ToolAccessLevel (5 levels), ToolCategory (12 categories), ToolPermissionChecker, ToolPermissionDeniedError. Permission checking integrates at both prompt filtering (LLM sees only permitted tools) and runtime enforcement (belt-and-suspenders deny on invoke). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Pre-reviewed by 10 agents, 18 findings addressed: - Fix integration tests missing ToolCategory on test tools - Add permission-denied integration test (E2E denial path) - Add filter_definitions tests (denied list, sort order) - Fix invoke() docstring to include permission-check step - Fix ToolAccessLevel docstring re: CUSTOM level semantics - Make filter_definitions sort explicit + add DEBUG log - Change .get() to direct dict access for fail-loud on unmapped levels - Fix denial_reason/check_permission docstrings - Extract format_task_instruction to prompt.py (agent_engine < 800 lines) - Update DESIGN_SPEC.md: §3.1 access_level, §11.1.1 permission model, §11.2 M3 scope note, §15.3 project structure, §15.5 conventions Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

github-actions · 2026-03-07T07:10:39Z

Dependency Review

✅ No vulnerabilities or license issues or OpenSSF Scorecard issues found.

Scanned Files

None

coderabbitai · 2026-03-07T07:10:52Z

Caution

Review failed

The pull request is closed.

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: e158bbf4-cb74-41a0-a875-c31e5d8373d0

📥 Commits

Reviewing files that changed from the base of the PR and between 2e1abdc and 63f8b23.

📒 Files selected for processing (10)

DESIGN_SPEC.md
src/ai_company/core/enums.py
src/ai_company/engine/agent_engine.py
src/ai_company/tools/base.py
src/ai_company/tools/permissions.py
tests/unit/engine/test_react_loop.py
tests/unit/tools/conftest.py
tests/unit/tools/test_base.py
tests/unit/tools/test_invoker.py
tests/unit/tools/test_permissions.py

📝 Walkthrough

Summary by CodeRabbit

New Features
- Permission-based tool access with hierarchical access levels (Sandboxed, Restricted, Standard, Elevated, Custom).
- Tools now have category classifications (e.g., Code Execution, File System, Web).
- Runtime enforcement filters tools shown to the assistant and blocks unauthorized invocations with clear denial reasons.
Documentation
- Updated design notes describing category-level gating, permission behavior, and planned finer-grained sandboxing controls.

Walkthrough

Adds category- and access-level-based tool permissioning: new enums (ToolAccessLevel, ToolCategory), a ToolPermissionChecker, permission-aware ToolInvoker flow (pre-validation checks and filtering), BaseTool category support, observability events, and tests/integration wiring to enforce and surface permission denials.

Changes

Cohort / File(s)	Summary
Design & Public API `DESIGN_SPEC.md`, `src/ai_company/core/enums.py`, `src/ai_company/core/__init__.py`	Adds ToolAccessLevel and ToolCategory enums; documents M3 category-level gating and new public permission/sandboxing types.
Agent Permissions Model `src/ai_company/core/agent.py`	Adds `ToolPermissions.access_level: ToolAccessLevel` to model agent permission level.
Permission Checker `src/ai_company/tools/permissions.py`	New `ToolPermissionChecker` implementing priority-based resolution (denied > allowed > level/category > deny), filter_definitions(), denial_reason(), and from_permissions().
Tool API & Errors `src/ai_company/tools/base.py`, `src/ai_company/tools/errors.py`, `src/ai_company/tools/__init__.py`	BaseTool now accepts/exposes a `category`; adds `ToolPermissionDeniedError` and exports permission types from tools package.
Invoker Integration `src/ai_company/tools/invoker.py`	ToolInvoker gains optional `permission_checker`, `get_permitted_definitions()`, `_check_permission()`, and pre-validation permission check returning a denial ToolResult.
Engine Integration & Prompts `src/ai_company/engine/agent_engine.py`, `src/ai_company/engine/prompt.py`, `src/ai_company/engine/react_loop.py`	AgentEngine constructs a permission-aware ToolInvoker (`_make_tool_invoker`) and threads it into context/execute paths; replaces internal formatting helper with `format_task_instruction`; react loop now uses `get_permitted_definitions()`.
Observability `src/ai_company/observability/events/tool.py`	Adds events: `TOOL_PERMISSION_DENIED`, `TOOL_PERMISSION_CHECKER_CREATED`, `TOOL_PERMISSION_FILTERED`.
Examples & Tests `src/ai_company/tools/examples/echo.py`, `tests/...` (many files)	EchoTool updated to set category; tests and fixtures updated/added extensively for permission scenarios and to exercise filtering and denial behavior (unit + integration).

Sequence Diagram(s)

sequenceDiagram
    participant Agent as AgentEngine
    participant Invoker as ToolInvoker
    participant Checker as ToolPermissionChecker
    participant Registry as ToolRegistry
    participant Tool as BaseTool

    Agent->>Invoker: new(registry, permission_checker)
    Agent->>Invoker: get_permitted_definitions()
    Invoker->>Checker: filter_definitions(registry)
    Checker-->>Invoker: permitted ToolDefinitions

    Agent->>Invoker: invoke(tool_call)
    Invoker->>Checker: _check_permission(tool_name, category)
    alt denied
        Checker-->>Invoker: raises/returns denial
        Invoker-->>Agent: ToolResult(error=ToolPermissionDeniedError)
    else allowed
        Invoker->>Tool: validate params / execute
        Tool-->>Invoker: ToolResult(success)
        Invoker-->>Agent: ToolResult
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~50 minutes

Possibly related PRs

feat: implement tool permission checking (#16) #147 — Implements overlapping tool-permissions feature (ToolPermissionChecker, enums, invoker filtering); likely the same or closely related work.
feat: implement basic tool system (registry, invocation, results) (#15) #104 — Core tool subsystem changes (ToolInvoker, BaseTool, errors) that this PR extends with permission-aware behavior.
feat: implement AgentEngine core orchestrator (#11) #143 — AgentEngine/tooling flow changes; this PR threads a permission-aware ToolInvoker into those execution paths.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 49.48% which is insufficient. The required threshold is 100.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title 'feat: implement tool permission checking (`#16`)' clearly and concisely summarizes the main change—implementing a tool permission system—and directly corresponds to the primary objective of the PR.
Description check	✅ Passed	The description is comprehensive and directly related to the changeset, detailing the ToolPermissionChecker implementation, access levels, integration points, and pre-PR fixes—all substantive changes present in the PR.
Linked Issues check	✅ Passed	The PR fully addresses all coding requirements of issue `#16`: implements ToolPermissionChecker with priority-based resolution, adds ToolAccessLevel and ToolCategory enums, integrates permission checks at tool invocation, enforces access levels and allow/deny lists, provides clear denial messages, ensures fast config-based checks, and includes comprehensive unit and integration tests.
Out of Scope Changes check	✅ Passed	All changes are directly aligned with the stated objectives of issue `#16`. The extraction of format_task_instruction to prompt.py, DESIGN_SPEC.md updates, and test additions are all in service of the permission-checking feature and code organization goals mentioned in the PR description.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings (stacked PR)
📝 Generate docstrings (commit on current branch)

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feat/tool-permission-checking

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

gemini-code-assist · 2026-03-07T07:10:56Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request implements a robust tool permission checking system, enhancing the security and control over which tools AI agents can access and execute. By introducing explicit access levels, tool categorization, and a dedicated permission checker, the system ensures that agents only interact with approved functionalities, both at the prompt generation stage and during actual tool invocation. This provides a layered defense mechanism, preventing unauthorized tool usage and improving the overall reliability and safety of agent operations.

Highlights

Tool Permission Checking: Introduced a ToolPermissionChecker to enforce access-level gating for tools. This checker uses a priority-based system: explicitly denied tools are blocked first, then explicitly allowed tools are permitted, followed by category-based access levels, and finally, denial by default. Name matching for allowed/denied lists is case-insensitive.
ToolInvoker Integration: The ToolInvoker now integrates the ToolPermissionChecker. It filters tool definitions presented to the LLM based on permissions and performs a defense-in-depth check at invocation time, returning a ToolPermissionDeniedError if an unpermitted tool is called.
Agent Identity and Tool Permissions: The AgentIdentity model now includes a ToolPermissions.access_level field, allowing agents to be configured with one of five hierarchical access levels (sandboxed, restricted, standard, elevated) or a custom level, which dictates the categories of tools they can use.
Tool Categorization: A new BaseTool.category attribute (using the ToolCategory enum) has been added to enable per-tool categorization, which is fundamental for the access-level gating mechanism.
Refactoring and Error Handling: The _format_task_instruction helper function was extracted from agent_engine.py to prompt.py for better modularity. A new ToolPermissionDeniedError was added to the tool error hierarchy for specific permission-related failures.

Changelog

DESIGN_SPEC.md
- Updated the tools configuration to include an access_level field.
- Added a detailed section on 'Permission checking (M3)' in 11.1.1, outlining the new ToolPermissionChecker and its integration.
- Included an 'M3 implementation note' in 11.2 clarifying the current category-level gating.
- Updated the project structure to reflect the new permissions.py file and ToolCategory in base.py.
- Added 'Tool permission checking' to the conventions table.
src/ai_company/core/init.py
- Imported ToolAccessLevel and ToolCategory enums.
src/ai_company/core/agent.py
- Imported ToolAccessLevel.
- Added access_level field to the ToolPermissions model with a default of ToolAccessLevel.STANDARD.
src/ai_company/core/enums.py
- Defined new ToolAccessLevel and ToolCategory StrEnum classes.
src/ai_company/engine/agent_engine.py
- Imported format_task_instruction and ToolPermissionChecker.
- Modified run and _execute methods to accept and pass tool_invoker.
- Updated _prepare_context to use the tool_invoker for retrieving permitted tool definitions.
- Refactored _format_task_instruction by moving it to prompt.py and updating its call site.
- Modified _make_tool_invoker to instantiate a ToolPermissionChecker based on agent identity and pass it to the ToolInvoker.
src/ai_company/engine/prompt.py
- Moved the _format_task_instruction function from agent_engine.py and renamed it to format_task_instruction.
src/ai_company/engine/react_loop.py
- Updated _get_tool_definitions to retrieve permitted tool definitions via tool_invoker.get_permitted_definitions().
src/ai_company/observability/events/tool.py
- Added new event constants: TOOL_PERMISSION_DENIED, TOOL_PERMISSION_CHECKER_CREATED, and TOOL_PERMISSION_FILTERED.
src/ai_company/tools/init.py
- Updated the module docstring to reflect the inclusion of permissions.
- Imported and exposed ToolPermissionDeniedError and ToolPermissionChecker.
src/ai_company/tools/base.py
- Imported ToolCategory.
- Added a category attribute to the BaseTool class, defaulting to ToolCategory.OTHER.
src/ai_company/tools/errors.py
- Added the ToolPermissionDeniedError class to the tool error hierarchy.
src/ai_company/tools/examples/echo.py
- Added category=ToolCategory.OTHER to the EchoTool initialization.
src/ai_company/tools/invoker.py
- Imported TOOL_PERMISSION_DENIED and ToolPermissionChecker.
- Added an optional permission_checker parameter to the constructor.
- Implemented get_permitted_definitions to filter tools based on permissions.
- Added _check_permission to perform permission validation before tool execution.
- Integrated permission checking into the invoke method's workflow.
src/ai_company/tools/permissions.py
- Added new file permissions.py containing the ToolPermissionChecker class.
- Implemented logic for ToolPermissionChecker including access level categories, explicit allow/deny lists, and priority-based resolution.
- Provided methods for checking permissions (is_permitted, check), generating denial reasons (denial_reason), and filtering tool definitions (filter_definitions).
tests/integration/engine/test_agent_engine_integration.py
- Imported ToolPermissions, ToolAccessLevel, and ToolCategory.
- Updated UppercaseTool initialization to include a category.
- Added a new integration test case TestPermissionDeniedToolCall to verify that a sandboxed agent is correctly denied access to a deployment tool.
tests/unit/engine/test_agent_engine.py
- Updated EchoTool initialization to include a category.
tests/unit/engine/test_run_result.py
- Updated the module docstring.
- Removed the import of _format_task_instruction from agent_engine.py.
- Imported format_task_instruction from prompt.py.
- Updated all calls to _format_task_instruction to use format_task_instruction.
tests/unit/tools/conftest.py
- Imported ToolAccessLevel, ToolCategory, and ToolPermissionChecker.
- Added _CategorizedTool for use in permission-related tests.
- Introduced new fixtures: permission_registry, permission_checker, and permission_invoker for testing tool permissions.
tests/unit/tools/test_invoker.py
- Imported ToolRegistry.
- Added TestInvokerPermissionCheck to test permission checking within the ToolInvoker.
- Added TestGetPermittedDefinitions to verify the filtering of tool definitions based on permissions.
tests/unit/tools/test_permissions.py
- Added new file test_permissions.py with comprehensive unit tests for the ToolPermissionChecker class.
- Included tests for various access levels, explicit allow/deny lists, resolution priority, denial reason generation, and definition filtering.

Activity

The pull request includes ToolPermissionChecker to enforce access-level gating with priority-based resolution: denied list → allowed list → level categories → deny.
The BaseTool.category attribute (ToolCategory enum) was added to enable per-tool categorization for permission checks.
The ToolInvoker now integrates permission checking, filtering tool definitions for LLM prompts and checking at invocation time for defense-in-depth.
The AgentEngine creates permission-aware invokers from AgentIdentity.tools at the start of each run() call.
Five access levels were introduced: sandboxed → restricted → standard → elevated (hierarchical), plus custom (allow-list only).
A new ToolPermissionDeniedError was added to the tool error hierarchy.
ToolAccessLevel and ToolCategory enums were added to core.enums.
The ToolPermissions.access_level field was added to the agent identity card.
Pre-PR review fixes addressed 18 findings from 10 agents, including integration tests for ToolCategory and permission-denied tool calls, filter_definitions tests, docstring fixes, and refactoring format_task_instruction.
The test plan confirms 2053 passing tests (3 new), 95.40% coverage, ruff lint/format clean, mypy strict clean, and all pre-commit hooks passing.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a comprehensive tool permission checking system, defining ToolAccessLevel and ToolCategory enums and integrating them into the AgentIdentity and ToolPermissions models. A new ToolPermissionChecker class is implemented to enforce these permissions, using a priority-based system of explicit deny/allow lists and hierarchical access levels (sandboxed, restricted, standard, elevated, custom) based on tool categories. The ToolInvoker now utilizes this checker to filter tool definitions presented to the LLM and to validate tool calls at invocation time, ensuring agents only use permitted tools. The AgentEngine has been updated to incorporate this new permission logic, including moving the format_task_instruction helper to prompt.py. Review comments identify a critical bug in agent_engine.py where RecursionError is not properly caught due to outdated Python 2 exception syntax, a prompt injection vulnerability in format_task_instruction due to direct embedding of untrusted task content, and suggest improving maintainability in ToolPermissionChecker by defining hierarchical permission sets incrementally.

gemini-code-assist · 2026-03-07T07:14:09Z

src/ai_company/engine/agent_engine.py

                timeout_seconds=timeout_seconds,
+                tool_invoker=tool_invoker,
            )
        except MemoryError, RecursionError:


The code uses the outdated Python 2 syntax for catching multiple exceptions: except MemoryError, RecursionError:. In Python 3, this is interpreted as except MemoryError as RecursionError:, which means it only catches MemoryError and assigns the exception object to the variable name RecursionError. Consequently, RecursionError is NOT caught by this block. This violates the design principle that RecursionError is a non-recoverable error that should be re-raised. Instead, it will be caught by subsequent except Exception blocks and handled as a recoverable error, which could lead to inconsistent system state.

except (MemoryError, RecursionError):

gemini-code-assist · 2026-03-07T07:14:09Z

src/ai_company/engine/prompt.py

+    Returns:
+        Markdown-formatted task instruction string.
+    """
+    parts = [f"# Task: {task.title}", "", task.description]


Untrusted task content (task.title and task.description) is directly included in the LLM prompt without proper sanitization or delimiting. An attacker who can control the task metadata could inject malicious instructions to manipulate the agent's behavior (Prompt Injection). It is recommended to use clear delimiters (e.g., XML tags) and instructions to the LLM to treat the content as data.

gemini-code-assist · 2026-03-07T07:14:09Z

src/ai_company/tools/permissions.py

+    _LEVEL_CATEGORIES: ClassVar[dict[ToolAccessLevel, frozenset[ToolCategory]]] = {
+        ToolAccessLevel.SANDBOXED: frozenset(
+            {
+                ToolCategory.FILE_SYSTEM,
+                ToolCategory.CODE_EXECUTION,
+                ToolCategory.VERSION_CONTROL,
+            }
+        ),
+        ToolAccessLevel.RESTRICTED: frozenset(
+            {
+                ToolCategory.FILE_SYSTEM,
+                ToolCategory.CODE_EXECUTION,
+                ToolCategory.VERSION_CONTROL,
+                ToolCategory.WEB,
+            }
+        ),
+        ToolAccessLevel.STANDARD: frozenset(
+            {
+                ToolCategory.FILE_SYSTEM,
+                ToolCategory.CODE_EXECUTION,
+                ToolCategory.VERSION_CONTROL,
+                ToolCategory.WEB,
+                ToolCategory.TERMINAL,
+                ToolCategory.ANALYTICS,
+            }
+        ),
+        ToolAccessLevel.ELEVATED: frozenset(ToolCategory),
+        ToolAccessLevel.CUSTOM: frozenset(),
+    }


For better maintainability, you could define the hierarchical permission sets incrementally. This avoids repeating the categories for each access level and reduces the chance of inconsistencies if categories are added or changed in the future.

_SANDBOXED_CATS = frozenset( { ToolCategory.FILE_SYSTEM, ToolCategory.CODE_EXECUTION, ToolCategory.VERSION_CONTROL, } ) _RESTRICTED_CATS = _SANDBOXED_CATS | {ToolCategory.WEB} _STANDARD_CATS = _RESTRICTED_CATS | { ToolCategory.TERMINAL, ToolCategory.ANALYTICS, } _LEVEL_CATEGORIES: ClassVar[dict[ToolAccessLevel, frozenset[ToolCategory]]] = { ToolAccessLevel.SANDBOXED: _SANDBOXED_CATS, ToolAccessLevel.RESTRICTED: _RESTRICTED_CATS, ToolAccessLevel.STANDARD: _STANDARD_CATS, ToolAccessLevel.ELEVATED: frozenset(ToolCategory), ToolAccessLevel.CUSTOM: frozenset(), }

coderabbitai

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

src/ai_company/tools/base.py (1)
70-77: ⚠️ Potential issue | 🟠 Major

Make tool categorization explicit instead of defaulting to OTHER.

The category parameter defaults to ToolCategory.OTHER, and at least 13 existing BaseTool subclasses in the codebase do not pass category= to super().__init__(). These tools will silently fall back to OTHER, creating a fail-open permission model where every uncategorized legacy tool is treated as intentionally low-risk. If access-level checks permit OTHER, this becomes a security boundary violation. Require an explicit category for every tool before shipping.
Suggested change
     def __init__(
         self,
         *,
         name: str,
         description: str = "",
         parameters_schema: dict[str, Any] | None = None,
-        category: ToolCategory = ToolCategory.OTHER,
+        category: ToolCategory,
     ) -> None:
Also applies to: 93-96, 107-110
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/ai_company/tools/base.py` around lines 70 - 77, The BaseTool constructor
currently defaults category to ToolCategory.OTHER which causes silent, unsafe
fallbacks; modify BaseTool.__init__ to require category (remove the default),
and add a defensive runtime check (raise ValueError) if category is missing or
None so instantiations fail loudly; update all BaseTool subclasses and any calls
that relied on the default to pass an explicit category argument to
super().__init__; also apply the same change to the other BaseTool constructor
overloads/variants (the other __init__ signatures around the class) so all entry
points enforce explicit ToolCategory.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Outside diff comments:
In `@src/ai_company/tools/base.py`:
- Around line 70-77: The BaseTool constructor currently defaults category to
ToolCategory.OTHER which causes silent, unsafe fallbacks; modify
BaseTool.__init__ to require category (remove the default), and add a defensive
runtime check (raise ValueError) if category is missing or None so
instantiations fail loudly; update all BaseTool subclasses and any calls that
relied on the default to pass an explicit category argument to super().__init__;
also apply the same change to the other BaseTool constructor overloads/variants
(the other __init__ signatures around the class) so all entry points enforce
explicit ToolCategory.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 7d354dca-9812-45d9-a126-3e5c49c54331

📥 Commits

Reviewing files that changed from the base of the PR and between 57e487b and 2e1abdc.

📒 Files selected for processing (20)

DESIGN_SPEC.md
src/ai_company/core/__init__.py
src/ai_company/core/agent.py
src/ai_company/core/enums.py
src/ai_company/engine/agent_engine.py
src/ai_company/engine/prompt.py
src/ai_company/engine/react_loop.py
src/ai_company/observability/events/tool.py
src/ai_company/tools/__init__.py
src/ai_company/tools/base.py
src/ai_company/tools/errors.py
src/ai_company/tools/examples/echo.py
src/ai_company/tools/invoker.py
src/ai_company/tools/permissions.py
tests/integration/engine/test_agent_engine_integration.py
tests/unit/engine/test_agent_engine.py
tests/unit/engine/test_run_result.py
tests/unit/tools/conftest.py
tests/unit/tools/test_invoker.py
tests/unit/tools/test_permissions.py

📜 Review details

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: Agent
GitHub Check: Greptile Review

🧰 Additional context used

📓 Path-based instructions (5)

**/*.py