fix(cache): enable prompt cache retention for Anthropic Vertex AI by affsantos · Pull Request #60888 · openclaw/openclaw

affsantos · 2026-04-04T13:50:20Z

Summary

Problem: resolveAnthropicCacheRetentionFamily does not recognize anthropic-vertex as a first-class Anthropic provider. Without explicit user config, resolveCacheRetention returns undefined instead of defaulting to "short" — unlike the direct anthropic provider which defaults automatically. Additionally, resolveAnthropicEphemeralCacheControl gates the 1-hour TTL ("1h") behind a URL check for api.anthropic.com, silently blocking the 1-hour cache for Vertex AI even when explicitly requested via cacheRetention: "long".
Why it matters: Vertex AI users who opt into cacheRetention: "long" silently get the 5-minute TTL instead of the expected 1-hour. The cache diagnostics (PR feat(agents): add prompt cache break diagnostics #60707) also cannot correctly observe retention for Vertex AI because cacheRetention resolves to undefined through the canonical path. Both Anthropic and Google Vertex AI docs confirm 1-hour TTL is supported on current Claude models on Vertex AI.
What changed: (1) Added "anthropic-vertex" to resolveAnthropicCacheRetentionFamily so it returns "anthropic-direct" — matching the direct Anthropic provider behavior. This gives Vertex AI default cacheRetention: "short" through the canonical path. (2) Expanded the URL check in resolveAnthropicEphemeralCacheControl to also allow aiplatform.googleapis.com endpoints for the 1-hour TTL.
What did NOT change (scope boundary): No changes to the Vertex AI stream function, the enableCacheControl: true hardcoding, payload shaping, system prompt boundary splitting, or any other provider transport. The 5-minute cache was already working for Vertex AI via a fallback in resolveAnthropicEphemeralCacheControl — this fix makes the canonical path consistent and unblocks 1-hour TTL.

🤖 AI-assisted

Marked as AI-assisted
Degree of testing: fully tested (5 new tests + 36 total pass across touched test files)
I understand what the code does
Bot review conversations will be resolved

Change Type (select all)

Scope (select all touched areas)

Linked Issue/PR

Related: fix(cache): compact newest tool results first to preserve prompt cache prefix (#58036)
Related: fix(cache): sort MCP tools deterministically to stabilize prompt cache (#58037)
Related: fix(cache): delay history image pruning to preserve prompt cache prefix (#58038)
Related: fix(agents): close remaining prompt cache boundary gaps (#60691)
Related: fix(agents): stabilize prompt cache fingerprints (#60731)
This PR fixes a bug or regression

Root Cause (if applicable)

Root cause: resolveAnthropicCacheRetentionFamily only checks for provider === "anthropic" for the "anthropic-direct" family. The "anthropic-vertex" provider falls through to the "custom-anthropic-api" branch, which requires hasExplicitCacheConfig: true — so without explicit user config, retention is undefined. Separately, resolveAnthropicEphemeralCacheControl only allows "1h" TTL when the base URL contains api.anthropic.com, which excludes the aiplatform.googleapis.com Vertex AI endpoints.
Missing detection / guardrail: No test coverage for anthropic-vertex in the cache retention resolution path. No test for Vertex AI URLs in the payload policy TTL gate.
Contributing context: The Vertex AI transport (anthropic-vertex-stream.ts) correctly sets enableCacheControl: true and passes through cacheRetention from stream options, so the transport itself is fine. The gap is in the resolution layer that feeds the transport.
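
The resolution gap described above can be illustrated with a minimal sketch. The function names and the family strings "anthropic-direct" and "custom-anthropic-api" come from this PR; the signatures, the CacheRetention type, and the two-branch simplification (the real code handles more providers) are assumptions for illustration, not the actual OpenClaw source.

```typescript
// Hypothetical, simplified types; the real OpenClaw definitions may differ.
type CacheRetention = "none" | "short" | "long";
type CacheFamily = "anthropic-direct" | "custom-anthropic-api";

// Sketch of the family lookup after the fix: "anthropic-vertex" is treated
// like the direct provider instead of falling through to the custom branch.
function resolveAnthropicCacheRetentionFamily(provider: string): CacheFamily {
  if (provider === "anthropic" || provider === "anthropic-vertex") {
    return "anthropic-direct";
  }
  return "custom-anthropic-api";
}

// Sketch of the default: direct-family providers get "short" without config;
// the custom family only resolves a value when the user set one explicitly.
function resolveCacheRetention(
  provider: string,
  explicit?: CacheRetention,
): CacheRetention | undefined {
  if (explicit !== undefined) return explicit;
  return resolveAnthropicCacheRetentionFamily(provider) === "anthropic-direct"
    ? "short"
    : undefined;
}
```

In this sketch, dropping "anthropic-vertex" from the first condition reproduces the pre-fix behavior: the provider lands in the custom family and resolves to undefined unless retention is configured explicitly.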

Regression Test Plan (if applicable)

Coverage level that should have caught this:
- Unit test
- Seam / integration test
- End-to-end test
- Existing coverage already sufficient
Target test or file: extra-params.cache-retention-default.test.ts, anthropic-payload-policy.test.ts
Scenario the test should lock in: (1) anthropic-vertex defaults to "short" without config; (2) anthropic-vertex honors explicit "long" and "none"; (3) Vertex AI URLs get ttl: "1h" with long retention; (4) Vertex AI URLs get plain ephemeral with short retention.
Why this is the smallest reliable guardrail: Direct unit tests on the two resolution functions that had the gap. No runtime/integration test needed because the transport layer already has coverage.

User-visible / Behavior Changes

Vertex AI users with cacheRetention: "long" (or PI_CACHE_RETENTION=long) now correctly get the 1-hour cache TTL instead of silently falling back to 5 minutes.
Vertex AI cache retention now resolves through the canonical path ("short" by default), making cache diagnostics and observability accurate.

Diagram (if applicable)

Before:
[anthropic-vertex + cacheRetention=long] -> resolveAnthropicCacheRetentionFamily -> "custom-anthropic-api"
  -> resolveAnthropicEphemeralCacheControl(baseUrl=*.aiplatform.googleapis.com, retention=long)
  -> URL check fails (not api.anthropic.com) -> { type: "ephemeral" } (5m, NOT 1h)

After:
[anthropic-vertex + cacheRetention=long] -> resolveAnthropicCacheRetentionFamily -> "anthropic-direct"
  -> resolveAnthropicEphemeralCacheControl(baseUrl=*.aiplatform.googleapis.com, retention=long)
  -> URL check passes (aiplatform.googleapis.com) -> { type: "ephemeral", ttl: "1h" } ✓
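
The last step of the diagram, turning a resolved retention into a cache_control value, can be sketched as follows. This is an illustrative stand-in rather than the real resolveAnthropicEphemeralCacheControl: the helper name, its parameters, and the boolean that replaces the base URL check are all assumptions.

```typescript
// Hypothetical types for illustration only.
type CacheRetention = "none" | "short" | "long";
type EphemeralCacheControl = { type: "ephemeral"; ttl?: "1h" };

// The boolean stands in for the endpoint check performed on the base URL.
function sketchEphemeralCacheControl(
  retention: CacheRetention,
  endpointAllowsLongTtl: boolean,
): EphemeralCacheControl | undefined {
  if (retention === "none") return undefined; // caching explicitly disabled
  if (retention === "long" && endpointAllowsLongTtl) {
    return { type: "ephemeral", ttl: "1h" }; // 1-hour TTL
  }
  return { type: "ephemeral" }; // no ttl field: 5-minute TTL
}
```

With endpointAllowsLongTtl set to false, a "long" request degrades to the plain ephemeral marker, which is the "Before" outcome for Vertex AI base URLs.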

Security Impact (required)

New permissions/capabilities? No
Secrets/tokens handling changed? No
New/changed network calls? No
Command/tool execution surface changed? No
Data access scope changed? No

Repro + Verification

Environment

OS: macOS
Runtime: Node 22
Model/provider: Claude Sonnet 4.6 via Anthropic Vertex AI
Relevant config: provider: anthropic-vertex, cacheRetention: "long" in extra params

Steps

Configure OpenClaw with anthropic-vertex provider and cacheRetention: "long" in model params
Send a multi-turn conversation
Inspect the API payload sent to Vertex AI

Expected

cache_control: { type: "ephemeral", ttl: "1h" } on system and last user message blocks

Actual (before fix)

cache_control: { type: "ephemeral" } — 5-minute TTL silently used despite requesting "long"
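
To compare Expected and Actual on a captured request, one option is to walk the request body and list every cache_control marker. The sketch below assumes the body follows the Anthropic Messages API shape (system and message content as arrays of blocks); the type names and the collectCacheControls helper are hypothetical and not part of OpenClaw.

```typescript
// Hypothetical, minimal view of a Messages API request body.
type CacheControl = { type: "ephemeral"; ttl?: string };
type Block = { type: string; text?: string; cache_control?: CacheControl };
type Message = { role: "user" | "assistant"; content: string | Block[] };
type Payload = { system?: string | Block[]; messages: Message[] };

// Lists where cache_control appears and which TTL each marker requests.
function collectCacheControls(payload: Payload): { where: string; ttl: string }[] {
  const found: { where: string; ttl: string }[] = [];
  const scan = (blocks: string | Block[] | undefined, where: string): void => {
    if (!Array.isArray(blocks)) return; // plain strings cannot carry cache_control
    for (const block of blocks) {
      if (block.cache_control) {
        // A marker without ttl uses the 5-minute default.
        found.push({ where, ttl: block.cache_control.ttl ?? "5m (default)" });
      }
    }
  };
  scan(payload.system, "system");
  payload.messages.forEach((m, i) => scan(m.content, `messages[${i}] (${m.role})`));
  return found;
}
```

Before the fix, every entry for a Vertex AI request with cacheRetention: "long" would report the 5-minute default; after the fix, the entries should report "1h".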

Evidence

Tested the patch in our OpenClaw instance and analysed the payloads
Failing test/log before + passing after
- 5 new tests pass: 3 in extra-params.cache-retention-default.test.ts (vertex default/long/none), 2 in anthropic-payload-policy.test.ts (vertex 1h TTL, vertex 5m TTL)
- All 36 tests pass across the 3 touched test files
- All 11 existing anthropic-vertex-stream.test.ts tests continue to pass

Human Verification (required)

Verified scenarios: Ran all 3 affected test files (36 tests total). Verified the resolveAnthropicCacheRetentionFamily change returns "anthropic-direct" for "anthropic-vertex". Verified the URL check matches both api.anthropic.com and aiplatform.googleapis.com (global and regional patterns).
Edge cases checked: Global Vertex AI endpoint (aiplatform.googleapis.com without region prefix), regional endpoint (us-east5-aiplatform.googleapis.com), custom proxy URL (still excluded from 1h TTL). Explicit "none" disables caching for Vertex AI. Existing Bedrock, OpenRouter, and custom provider behavior unchanged.
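
The endpoint edge cases above (global, regional, custom proxy) can be expressed as a hostname check. This is a minimal sketch in the spirit of the later commit "use hostname parsing for long-TTL endpoint eligibility"; the helper name and the exact matching rules are assumptions, not the merged implementation.

```typescript
// Hypothetical helper; the hosts reflect the endpoints named in this PR.
function isLongTtlEligibleBaseUrl(baseUrl: string): boolean {
  let hostname: string;
  try {
    hostname = new URL(baseUrl).hostname.toLowerCase();
  } catch {
    return false; // unparseable URLs never get the 1h TTL
  }
  if (hostname === "api.anthropic.com") return true;
  // Global endpoint:   aiplatform.googleapis.com
  // Regional endpoint: <region>-aiplatform.googleapis.com
  return (
    hostname === "aiplatform.googleapis.com" ||
    hostname.endsWith("-aiplatform.googleapis.com")
  );
}
```

Matching on the parsed hostname rather than on a substring of the full URL keeps a custom proxy excluded even when an eligible host name appears in its path or as a prefix of its domain.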

Review Conversations

I replied to or resolved every bot review conversation I addressed in this PR.
I left unresolved only the conversations that still need reviewer or maintainer judgment.

Compatibility / Migration

Backward compatible? Yes
Config/env changes? No — existing cacheRetention / PI_CACHE_RETENTION config now works correctly for Vertex AI
Migration needed? No

Risks and Mitigations

Risk: Sending ttl: "1h" to Vertex AI endpoints that run older Claude models (3.7 Sonnet, 3.5 Sonnet), which do not support the 1-hour TTL per Google docs (same behavior as the direct Anthropic API).
- Mitigation: The Vertex AI provider catalog (extensions/anthropic-vertex/provider-catalog.ts) only offers current models (Claude Opus 4.6, Claude Sonnet 4.6) which support 1-hour TTL. Older models are not in the catalog. If a user manually configures an unsupported model, the Vertex AI API would reject the request — same behavior as the direct Anthropic API.

greptile-apps · 2026-04-04T13:55:31Z

Greptile Summary

This PR fixes two related gaps in the Anthropic Vertex AI prompt caching path: resolveAnthropicCacheRetentionFamily now returns "anthropic-direct" for "anthropic-vertex" (matching its behavior in isAnthropicFamilyCacheTtlEligible which already included Vertex AI), and resolveAnthropicEphemeralCacheControl now allows "1h" TTL for aiplatform.googleapis.com base URLs alongside api.anthropic.com. Five new targeted unit tests lock in the corrected behavior.

Confidence Score: 5/5

This PR is safe to merge — it makes two small, targeted bug fixes with five new unit tests and no changes to transport, auth, or payload structure.

Both changes are additive and consistent with existing patterns: the anthropic-vertex family classification aligns with the pre-existing isAnthropicFamilyCacheTtlEligible check, and the URL substring guard follows the same String.includes style already used for api.anthropic.com. All remaining observations are P2 style-level notes. No blocking issues.

No files require special attention.

Reviews (2): Last reviewed commit: "Merge branch 'main' into fix/vertex-ai-p..."

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 6bd8e12269

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

affsantos · 2026-04-04T14:04:53Z

Addressed the Greptile finding by adding "anthropic-vertex" to isAnthropicFamilyCacheTtlEligible in 789ac78. Both functions in the file now consistently recognize anthropic-vertex as a first-class Anthropic provider.

affsantos · 2026-04-04T15:33:57Z

@greptile-apps review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d631ba8fc2


…enclaw#60888)
* fix(cache): enable prompt cache retention for Anthropic Vertex AI
* fix(cache): add anthropic-vertex to isAnthropicFamilyCacheTtlEligible
* fix(cache): use hostname parsing for long-TTL endpoint eligibility
* docs(changelog): note anthropic vertex cache ttl fix
---------
Co-authored-by: affsantos <andreffsantos91@gmail.com>
Co-authored-by: Vincent Koc <vincentkoc@ieee.org>

openclaw-barnacle Bot added the "agents" (Agent runtime and tooling) and "size: S" labels Apr 4, 2026

affsantos marked this pull request as ready for review April 4, 2026 13:52

chatgpt-codex-connector Bot reviewed Apr 4, 2026


Comment thread src/agents/anthropic-payload-policy.ts Outdated

chatgpt-codex-connector Bot reviewed Apr 4, 2026


Comment thread src/agents/anthropic-payload-policy.ts Outdated

affsantos force-pushed the fix/vertex-ai-prompt-cache-retention branch from d631ba8 to 86aea6c Compare April 4, 2026 15:51

affsantos and others added 4 commits April 5, 2026 07:45

fix(cache): enable prompt cache retention for Anthropic Vertex AI

2cf1843

fix(cache): add anthropic-vertex to isAnthropicFamilyCacheTtlEligible

9fd9598

fix(cache): use hostname parsing for long-TTL endpoint eligibility

2b59bdb

docs(changelog): note anthropic vertex cache ttl fix

d844348

vincentkoc force-pushed the fix/vertex-ai-prompt-cache-retention branch from 3b8c691 to d844348 Compare April 5, 2026 06:46

Merge branch 'main' into fix/vertex-ai-prompt-cache-retention

8ef766e

vincentkoc merged commit eb0f367 into openclaw:main Apr 5, 2026
7 checks passed

hxy91819 mentioned this pull request Apr 20, 2026

fix(agents): honor explicit long Anthropic cache TTL on custom hosts #67800

Merged

clawsweeper Bot mentioned this pull request May 2, 2026

[codex] Fix Anthropic Vertex npm audit regression #76221

Open

vincentkoc merged 5 commits into openclaw:main from affsantos:fix/vertex-ai-prompt-cache-retention
