✨ feat: add visual understanding tool by tjx666 · Pull Request #14376 · lobehub/lobehub

tjx666 · 2026-05-01T10:19:46Z

💻 Change Type

🔗 Related Issue

Related to LOBE-8387

🔀 Description of Change

Add the @lobechat/builtin-tool-lobe-agent package with the lobe-agent identifier.
Move visual media fallback execution into the Lobe Agent tool as analyzeVisualMedia.
Wire the client/server tool runtimes so non-visual models can use a configured visual model for image and video attachments.
Keep VISUAL_UNDERSTANDING_* env and server config names scoped to the visual capability.

🧪 How to Test

Tested locally
Added/updated tests
No tests needed

Commands:

bunx vitest run --silent='passed-only' 'src/store/tool/slices/builtin/executors/index.test.ts' 'src/server/services/toolExecution/serverRuntimes/__tests__/lobeAgent.test.ts' 'src/helpers/toolEngineering/index.test.ts' 'src/server/modules/Mecha/AgentToolsEngine/__tests__/index.test.ts' 'src/services/agentRuntime/__tests__/index.test.ts' 'src/components/DragUpload/useDragUpload.test.tsx' 'src/hooks/useVisualMediaUploadAbility.test.ts'
cd packages/prompts && bunx vitest run --silent='passed-only' 'src/prompts/files/index.test.ts'
bunx vitest run --silent='passed-only' 'src/services/chat/chat.test.ts' 'src/services/chat/mecha/contextEngineering.test.ts'
bunx eslint packages/builtin-tool-lobe-agent/src packages/builtin-tools/src/index.ts packages/builtin-tools/src/identifiers.ts packages/prompts/src/prompts/files/index.test.ts src/store/tool/slices/builtin/executors/index.ts src/store/tool/slices/builtin/executors/index.test.ts src/server/services/toolExecution/serverRuntimes/index.ts src/server/services/toolExecution/serverRuntimes/lobeAgent.ts src/server/services/toolExecution/serverRuntimes/__tests__/lobeAgent.test.ts src/helpers/toolEngineering/index.ts src/helpers/toolEngineering/index.test.ts src/server/modules/Mecha/AgentToolsEngine/index.ts src/server/modules/Mecha/AgentToolsEngine/__tests__/index.test.ts src/server/services/aiAgent/index.ts src/services/chat/chat.test.ts src/services/chat/mecha/contextEngineering.test.ts --max-warnings=0

📸 Screenshots / Videos

N/A

📝 Additional Information

Security review: checked the submodule diff for cloud/business/billing-specific implementation details; no new cloud-specific logic is exposed.

vercel · 2026-05-01T10:19:50Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
lobehub	Ready	Preview, Comment	May 1, 2026 10:48am

sourcery-ai

Sorry @tjx666, you have reached your weekly rate limit of 500000 diff characters.

Please try again later or upgrade to continue using Sourcery

codecov · 2026-05-01T10:25:17Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 84.89%. Comparing base (626d274) to head (f6f8f9a).
⚠️ Report is 1 commits behind head on canary.

Additional details and impacted files

@@             Coverage Diff             @@
##           canary   #14376       +/-   ##
===========================================
+ Coverage   68.96%   84.89%   +15.92%     
===========================================
  Files        2403      589     -1814     
  Lines      209462    42088   -167374     
  Branches    26268     6441    -19827     
===========================================
- Hits       144465    35732   -108733     
+ Misses      64854     6213    -58641     
  Partials      143      143

Flag	Coverage Δ
app	`?`
database	`?`
packages/agent-runtime	`79.93% <ø> (ø)`
packages/context-engine	`83.87% <ø> (ø)`
packages/conversation-flow	`92.40% <ø> (ø)`
packages/file-loaders	`87.60% <ø> (ø)`
packages/memory-user-memory	`74.74% <ø> (ø)`
packages/model-bank	`99.94% <ø> (ø)`
packages/model-runtime	`83.79% <ø> (ø)`
packages/prompts	`69.01% <100.00%> (ø)`
packages/python-interpreter	`92.90% <ø> (ø)`
packages/ssrf-safe-fetch	`0.00% <ø> (ø)`
packages/utils	`88.02% <ø> (ø)`
packages/web-crawler	`88.41% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Components	Coverage Δ
Store	`∅ <ø> (∅)`
Services	`∅ <ø> (∅)`
Server	`∅ <ø> (∅)`
Libs	`∅ <ø> (∅)`
Utils	`93.47% <ø> (+13.52%)`	⬆️

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f6f8f9a449

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-01T10:28:38Z

+    const [sourceMessage] = await messageModel.queryByIds([this.messageId], {
+      postProcessUrl: (path) => fileService.getFullFileUrl(path),


Resolve visual source message from user turn

This runtime assumes context.messageId is the user turn with attachments, but in resume/continue flows the parent message can be an assistant message, so querying only this.messageId can return no imageList/videoList and produce NO_VISUAL_FILES even when the preceding user turn has visuals. That breaks visual-fallback tool calls during regeneration/continuation for non-vision models; the runtime should walk to the related user message (as the client executor does) instead of reading only one message id.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-01T10:28:38Z

+  const fallbackSupportVision = useModelSupportVision(
+    visualUnderstanding?.model ?? '',
+    visualUnderstanding?.provider ?? '',
+  );
+  const fallbackSupportVideo = useModelSupportVideo(
+    visualUnderstanding?.model ?? '',
+    visualUnderstanding?.provider ?? '',


Do not block fallback uploads on unknown visual model cards

The fallback gate is derived from useModelSupportVision/useModelSupportVideo for the configured visualUnderstanding model, but those selectors return false when the model is not in enabledAiModels. In that case, uploads are rejected client-side even though server-side visual understanding is enabled by VISUAL_UNDERSTANDING_* envs and runtime execution can still proceed, so custom/hidden fallback model IDs become unusable from the UI.

Useful? React with 👍 / 👎.

♻️ refactor: introduce lobe agent builtin tool

f6f8f9a

dosubot Bot added the size:XL This PR changes 500-999 lines, ignoring generated files. label May 1, 2026

sourcery-ai Bot reviewed May 1, 2026

View reviewed changes

dosubot Bot added feature:agent-builder Agent builder feature:tool Tool calling and function execution feature:vision labels May 1, 2026

tjx666 changed the title ~~♻️ refactor: introduce lobe agent builtin tool~~ ✨ feat: add visual understanding tool May 1, 2026

chatgpt-codex-connector Bot reviewed May 1, 2026

View reviewed changes

vercel Bot deployed to Preview May 1, 2026 10:28 View deployment

🐛 fix: satisfy visual understanding executor types

f898e18

tjx666 closed this May 1, 2026

tjx666 deleted the refactor/lobe-agent-tool branch May 1, 2026 10:30

tjx666 mentioned this pull request May 1, 2026

✨ feat: add visual understanding tool #14378

Merged

12 tasks

vercel Bot deployed to Preview May 1, 2026 10:48 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

✨ feat: add visual understanding tool#14376

✨ feat: add visual understanding tool#14376
tjx666 wants to merge 2 commits into
canaryfrom
refactor/lobe-agent-tool

tjx666 commented May 1, 2026

Uh oh!

vercel Bot commented May 1, 2026 •

edited

Loading

Uh oh!

sourcery-ai Bot left a comment

Uh oh!

codecov Bot commented May 1, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 1, 2026

Uh oh!

chatgpt-codex-connector Bot May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		const [sourceMessage] = await messageModel.queryByIds([this.messageId], {
		postProcessUrl: (path) => fileService.getFullFileUrl(path),

Uh oh!

Conversation

tjx666 commented May 1, 2026

💻 Change Type

🔗 Related Issue

🔀 Description of Change

🧪 How to Test

📸 Screenshots / Videos

📝 Additional Information

Uh oh!

vercel Bot commented May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sourcery-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

codecov Bot commented May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 1, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 1, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vercel Bot commented May 1, 2026 •

edited

Loading

codecov Bot commented May 1, 2026 •

edited

Loading