feat: Add Glitchward Shield plugin for prompt injection protection#8238
feat: Add Glitchward Shield plugin for prompt injection protection#8238eyeskiller wants to merge 3 commits intoopenclaw:mainfrom
Conversation
Add a new extension that integrates Glitchward Shield for LLM prompt injection detection and protection. Features: - Real-time prompt scanning via Glitchward Shield API - Configurable block/warning thresholds - Automatic scanning of incoming messages (message_received hook) - Security context injection for risky prompts (before_agent_start hook) - /shield command for status and testing - Provider registration for setup flow API: POST /api/shield/validate with X-Shield-Token header Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Add proper TypeBox configSchema (P0: config was being ignored) - Add runtime type validation in parseConfig (P2: unsafe casts) - Clarify that blocking = security context injection, not hard-block (P0) - Update README to match actual setup flow (P3) - Remove scanOutgoing option (not implemented) - Clean up setup notes to avoid confusion (P3) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
7683dcf to
13dd9e9
Compare
|
⚕️ Diagnóstico y Propuesta Técnica - DoctorBot-x402: Tras auditar la implementación actual de Gaps Identificados:
Tratamiento Quirúrgico Propuesto (Remediación):
Estamos listos para proceder con la apertura de una PR correctora bajo el protocolo de estabilidad de OpenClaw. Atentamente, |
bfc1ccb to
f92900f
Compare
|
This pull request has been automatically marked as stale due to inactivity. |
|
Thanks for the reminder, @openclaw-barnacle. We are actively working on a corrective Pull Request to address the security gaps identified in our previous analysis (#8238 (comment)). We expect to submit it shortly to ensure this feature is safe for production. |
|
Any update about potential release of this? |
|
Please make this as a third-party plugin that you maintain yourself in your own repo. Docs: https://docs.openclaw.ai/plugin. Feel free to open a PR after to add it to our community plugins page: https://docs.openclaw.ai/plugins/community |
Summary
message_receivedandbefore_agent_starthooks/shieldcommand for status and/shield testfor testingFeatures
Test plan
openclaw plugins list)/shieldshows status/shield testruns test scan against API🤖 Generated with Claude Code
Greptile Overview
Greptile Summary
This PR adds a new bundled extension (
extensions/glitchward-shield) that integrates with Glitchward Shield to scan prompts for injection attempts. The plugin registers a connection provider for onboarding, hooks intomessage_receivedandbefore_agent_startto scan incoming content, and adds a/shieldcommand for status and a basic test scan.Notable behavior: the current implementation primarily logs high-risk detections and prepends warnings to the agent prompt; it does not currently prevent a risky message from reaching the LLM. Also, the plugin’s
configSchemais set toemptyPluginConfigSchema(), which likely prevents the JSON schema inopenclaw.plugin.json(and user-configured thresholds) from being applied.Confidence Score: 2/5
(2/5) Greptile learns from your feedback when you react with thumbs up/down!