feat(ai): implement three-layer caching optimization (#91, #92, #93) by hrygo · Pull Request #118 · hrygo/divinesense

hrygo · 2026-02-07T14:10:55Z

概述

实现三层 AI 缓存优化架构，提升响应速度、降低 API 成本、优化 Token 预算分配。

Issue [AI优化] 语义缓存层实现 - 基于 Embedding 相似度匹配 #91: 语义缓存层 (L2) - 基于 Embedding 相似度匹配
Issue [AI优化] 工具级检索结果缓存 - 减少重复数据库查询 #92: 工具级结果缓存 (L3) - 减少重复数据库查询
Issue [AI优化] 动态 Token 预算分配 - 按意图类型自适应调整 #93: 动态 Token 预算分配 - 按意图类型自适应调整

变更内容

新增文件

ai/cache/semantic.go - 语义缓存实现
ai/cache/semantic_test.go - 语义缓存测试
ai/agent/tools/cache.go - 工具结果缓存实现
ai/agent/tools/cache_test.go - 工具缓存测试
ai/context/budget_profiles.go - 预算配置文件

修改文件

ai/context/budget.go - 添加 AllocateForAgent() 方法
ai/context/builder_impl.go - 使用基于 AgentType 的预算分配
.env.example - 添加缓存配置说明和默认值

缓存架构

┌─────────────────────────────────────────────────────────────────┐
│                        用户查询                                  │
└─────────────────────────────────────────────────────────────────┘
                            │
        ┌───────────────┼───────────────┐
        ▼               ▼               ▼
┌──────────────┐ ┌──────────────┐ ┌──────────────┐
│ L1: 路由缓存  │ │ L2: 语义缓存  │ │ L3: 工具缓存  │
│ ~0ms 延迟    │ │ ~10ms 延迟   │ │ ~1ms 延迟    │
│ 70% 命中率    │ │ 30% 命中率    │ │ 40% 命中率    │
└──────────────┘ └──────────────┘ └──────────────┘

Token 预算配置

AgentType	ShortTerm	LongTerm	Retrieval	说明
memo_search	30%	10%	60%	检索优先
schedule_*	55%	25%	20%	对话为主
amazing	40%	15%	45%	平衡分配
geek/evolution	0%	0%	0%	CC 管理

测试计划

单元测试通过
golangci-lint 通过
包构建验证
环境变量加载测试

截图/演示

环境变量配置示例

# 语义缓存配置
DIVINESENSE_SEMANTIC_CACHE_MAX_ENTRIES=1000
DIVINESENSE_SEMANTIC_CACHE_SIMILARITY_THRESHOLD=0.95
DIVINESENSE_SEMANTIC_CACHE_TTL=24h

# 工具缓存配置
DIVINESENSE_TOOL_CACHE_MAX_ENTRIES=100

# 预算覆盖 (可选)
DIVINESENSE_BUDGET_MEMO_SEARCH_RETRIEVAL=0.70
DIVINESENSE_BUDGET_SCHEDULE_CREATE_SHORT_TERM=0.60

检查清单

Resolves #91, #92, #93

Co-Authored-By: Claude Opus 4.6 noreply@anthropic.com

- Add SemanticCache with two-layer lookup (SHA256 + cosine similarity) * Exact match cache using SHA256 hash keys * Semantic match cache using vector similarity (default threshold: 0.95) * LRU eviction with TTL support (default 24 hours) - Add ToolResultCache for tool execution results * Per-tool TTL configuration (schedule_query: 30s, memo_search: 5m) * Write operations not cached (TTL = 0) * Multi-tenant isolation by user ID - Add profile-based token budget allocation (#93) * 7 preset profiles: memo_search, schedule_create, schedule_query, amazing, geek, evolution, default * Environment variable overrides (DIVINESENSE_BUDGET_*) * GEEK and EVOLUTION modes use zero budget (Claude Code CLI manages context) - Add comprehensive unit tests for both cache layers - Fix errcheck issues with //nolint:errcheck comments for safe type assertions - Load budget profile overrides from environment on service initialization Refs #91, #92, #93 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

hrygo force-pushed the feat/91-92-93-ai-caching-optimization branch from fa9915c to 37de13c Compare February 7, 2026 14:20

hrygo force-pushed the feat/91-92-93-ai-caching-optimization branch from 37de13c to 7678595 Compare February 7, 2026 14:20

hrygo merged commit e1a2fc3 into main Feb 7, 2026
8 of 9 checks passed

hrygo deleted the feat/91-92-93-ai-caching-optimization branch February 7, 2026 14:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(ai): implement three-layer caching optimization (#91, #92, #93)#118

feat(ai): implement three-layer caching optimization (#91, #92, #93)#118
hrygo merged 1 commit into
mainfrom
feat/91-92-93-ai-caching-optimization

hrygo commented Feb 7, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

hrygo commented Feb 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

概述

变更内容

新增文件

修改文件

缓存架构

Token 预算配置

测试计划

截图/演示

环境变量配置示例

检查清单

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hrygo commented Feb 7, 2026 •

edited

Loading