
Issue 1959 [PERFORMANCE]: Fix critical performance issues in llm-guard plugin#2200

Merged
crivetimihai merged 1 commit into IBM:main from tedhabeck:issue-1959
Jan 24, 2026

Conversation

@tedhabeck
Collaborator

🐛 Bug-fix PR

Before opening this PR please:

  1. make lint - passes ruff, mypy, pylint
  2. make test - all unit + integration tests green
  3. make coverage - ≥ 90 %
  4. make docker docker-run-ssl or make podman podman-run-ssl
  5. Update relevant documentation.
  6. Tested with sqlite and postgres + redis.
  7. Manual regression no longer fails. Ensure the UI and /version work correctly.

📌 Summary

#1959

🔁 Reproduction Steps

LOG_LEVEL=ERROR python tests/performance/test_plugins_performance.py --details 2> /dev/null

🐞 Root Cause

#1959

💡 Fix Description

Followed the approach prescribed in epic #1958.

🧪 Verification

LOG_LEVEL=ERROR python tests/performance/test_plugins_performance.py --details 2> /dev/null

✅ Checklist

  • Code formatted (make black isort pre-commit)
  • No secrets/credentials committed

@crivetimihai crivetimihai changed the title Issue 1959 Issue 1959 [PERFORMANCE]: Fix critical performance issues in llm-guard plugin Jan 20, 2026
@tedhabeck
Collaborator Author

tedhabeck commented Jan 20, 2026

Analysis:

Plugin                            P:post       P:pre      R:post       R:pre      T:post       T:pre
----------------------------------------------------------------------------------------------------
Baseline:
LLMGuardPlugin                  22.748ms   116.705ms           —           —           —           —
After cache update:
LLMGuardPlugin                  23.611ms   130.061ms           —           —           —           —
After scanner threading update:
LLMGuardPlugin                   0.148ms     0.120ms           —           —           —           —
After pickle threading:
LLMGuardPlugin                   0.215ms     0.249ms           —           —           —           —
After eval() replaced with ast and orjson update:
LLMGuardPlugin                   0.153ms     0.139ms           —           —           —           —
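The last row refers to replacing raw `eval()` with an AST-based evaluator (the change merged from #2180). A minimal sketch of that technique, using only the stdlib `ast` module, might look like the following; the supported operator set and the `safe_eval` name are illustrative assumptions, not the plugin's actual grammar:

```python
import ast
import operator

# Hypothetical safe evaluator: walks the parsed AST and refuses any node
# type outside a small whitelist, unlike eval(), which executes anything.
_OPS = {
    ast.And: all, ast.Or: any,
    ast.Gt: operator.gt, ast.Lt: operator.lt,
    ast.GtE: operator.ge, ast.LtE: operator.le,
    ast.Eq: operator.eq, ast.NotEq: operator.ne,
}

def safe_eval(expr: str, names: dict):
    def visit(node):
        if isinstance(node, ast.Expression):
            return visit(node.body)
        if isinstance(node, ast.BoolOp):  # and / or
            return _OPS[type(node.op)](visit(v) for v in node.values)
        if isinstance(node, ast.Compare):  # supports chained comparisons
            left, result = visit(node.left), True
            for op, comp in zip(node.ops, node.comparators):
                right = visit(comp)
                result = result and _OPS[type(op)](left, right)
                left = right
            return result
        if isinstance(node, ast.Name):
            return names[node.id]
        if isinstance(node, ast.Constant):
            return node.value
        raise ValueError(f"disallowed expression node: {type(node).__name__}")
    return visit(ast.parse(expr, mode="eval"))
```

Function calls, attribute access, and subscripts are simply not in the whitelist, so expressions like `__import__('os')` raise instead of executing.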

@tedhabeck tedhabeck marked this pull request as draft January 20, 2026 19:04
@crivetimihai crivetimihai added this to the Release 1.0.0-RC1 milestone Jan 21, 2026
@tedhabeck tedhabeck marked this pull request as ready for review January 22, 2026 20:22
@monshri monshri self-requested a review January 22, 2026 21:08
…scanners

- Convert cache.py to use async redis (redis.asyncio) for non-blocking I/O
- Add parallel scanner execution using asyncio.gather in input/output filters
- Add asyncio.to_thread for CPU-bound scanner operations
- Quiet llm_guard logger to ERROR level to reduce noise
- Fix tests to use prompt_id instead of deprecated name parameter
- Update test to use environment variables for redis host/port

Security: Scanner errors now fail-closed (is_valid=False) instead of being
skipped, ensuring policy evaluation denies requests when scanners fail.

Closes IBM#1959

Signed-off-by: Mihai Criveti <crivetimihai@gmail.com>
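The `asyncio.gather` plus `asyncio.to_thread` combination the commit describes can be sketched roughly as follows. The scanner API (a `scan()` returning `(sanitized, is_valid, risk_score)`) and the result shape are assumptions modeled on llm_guard's style, not the plugin's actual code:

```python
import asyncio

def run_scanner(scanner, text):
    """CPU-bound scan; executed in a worker thread so the event loop stays free."""
    sanitized, is_valid, risk_score = scanner.scan(text)
    return {"is_valid": is_valid, "risk_score": risk_score}

async def scan_all(scanners, text):
    # asyncio.to_thread offloads each CPU-bound scan to the default thread
    # pool; asyncio.gather awaits them concurrently instead of sequentially.
    tasks = [asyncio.to_thread(run_scanner, s, text) for s in scanners]
    return await asyncio.gather(*tasks)
```

This only helps when scanners are safe to run concurrently, which is exactly the thread-safety question raised later in the review.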
@crivetimihai
Member

Rebase and Review Summary

I've rebased this PR onto main and made the following adjustments:

Changes Made During Rebase

  1. Resolved conflicts with merged PRs #2179 (llmguard: switch from pickle to orjson for cache serialization) and #2180 (llmguard: replace raw eval() with a safe AST-based evaluator)

  2. Security Fix: Fail-closed behavior for scanner errors

    • Original implementation skipped failed scanners, which caused a fail-open security issue
    • When a scanner failed, its result was missing from policy evaluation
    • Policy evaluation returned "Invalid expression" (truthy), causing requests to be allowed
    • Fix: Failed scanners now return is_valid=False with risk_score=1.0, ensuring denial
  3. Test fixes

    • Updated tests to use prompt_id instead of deprecated name parameter
    • Added environment variable support for redis host/port in tests
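The fail-closed behavior described in point 2 can be sketched as a wrapper around each scanner call; the function name and result keys here are illustrative assumptions:

```python
import asyncio

async def safe_scan(scanner, text):
    try:
        # llm_guard-style scanners return (sanitized, is_valid, risk_score)
        _, is_valid, risk_score = await asyncio.to_thread(scanner.scan, text)
        return {"is_valid": is_valid, "risk_score": risk_score}
    except Exception:
        # Fail closed: a crashing scanner contributes a denial with maximum
        # risk, instead of silently dropping out of policy evaluation.
        return {"is_valid": False, "risk_score": 1.0}
```

With this shape, a missing result can never make the policy expression evaluate to a truthy "Invalid expression": every scanner, including a failed one, contributes an explicit verdict.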

Remaining Questions

  1. Thread safety of llm_guard scanners: Are filter scanners guaranteed thread-safe when called concurrently via asyncio.to_thread? If scanners mutate shared state, concurrent requests could race. The parallelization only applies to filters (likely stateless), while sanitizers still run sequentially.

  2. Regression test for fail-closed path: Should we add a test that injects a failing scanner and asserts denial to prevent reintroducing fail-open behavior?
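The regression test suggested in question 2 could look something like this sketch: inject a scanner that always raises and assert the policy decision is a denial. All names here are hypothetical, not the plugin's real test helpers:

```python
import asyncio

class AlwaysFailingScanner:
    def scan(self, text):
        raise RuntimeError("injected scanner failure")

async def policy_allows(scanners, text):
    async def one(scanner):
        try:
            _, is_valid, _ = await asyncio.to_thread(scanner.scan, text)
            return is_valid
        except Exception:
            return False  # fail closed
    results = await asyncio.gather(*(one(s) for s in scanners))
    return all(results)  # policy: deny if any scanner denies

def test_failing_scanner_is_denied():
    assert asyncio.run(policy_allows([AlwaysFailingScanner()], "hi")) is False
```

Keeping such a test in the suite would catch any future change that reintroduces fail-open behavior by skipping failed scanners.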

Files Changed

  • cache.py - Async redis with orjson (not pickle)
  • llmguard.py - Parallel scanners with fail-closed error handling
  • plugin.py - Added await calls
  • tests/test_llmguardplugin.py - Fixed payload parameters

@crivetimihai crivetimihai merged commit 2b8f535 into IBM:main Jan 24, 2026
52 checks passed
kcostell06 pushed a commit to kcostell06/mcp-context-forge that referenced this pull request Feb 24, 2026
