Skip to content

✨ feat: add gpt-image-2 to LobeHub-hosted card#14039

Merged
tjx666 merged 7 commits into
canaryfrom
feat/add-gpt-image-2
Apr 22, 2026
Merged

✨ feat: add gpt-image-2 to LobeHub-hosted card#14039
tjx666 merged 7 commits into
canaryfrom
feat/add-gpt-image-2

Conversation

@tjx666

@tjx666 tjx666 commented Apr 22, 2026

Copy link
Copy Markdown
Member

πŸ’» Change Type

  • ✨ feat
  • πŸ› fix

πŸ”— Related Issue

N/A β€” ships OpenAI's gpt-image-2 (released 2026-04-21) on the LobeHub-hosted card, and fixes three pre-existing or newly-surfaced bugs along the way.

πŸ”€ Description of Change

1. Add gpt-image-2 card (lobehub/chat/image.ts + new gptImage2Schema)

  • Token-based pricing: textInput $5/M, textInput_cacheRead $1.25/M, imageInput $8/M, imageInput_cacheRead $2/M, imageOutput $30/M.
  • approximatePricePerImage: 0.053 (medium 1024Γ—1024 β‰ˆ 1767 output tokens Γ— $30/M).
  • size enum exposes the 8 official "Popular sizes" from the image-generation guide. Model actually accepts any WΓ—H satisfying max edge ≀ 3840, both edges multiples of 16, AR ≀ 3:1, pixels 655,360–8,294,400 β€” but our schema/UI does not yet support free-form inputs, so we stick to the documented Popular list for now.
  • releasedAt: '2026-04-21' (matches the gpt-image-2-2026-04-21 snapshot ID).

2. Fix: maxFileSize unit in gpt-image schemas

UploadCard compares file.size (bytes) against maxFileSize, but gptImage1Schema had maxFileSize: 5 (bare number), so every ref image upload was silently dropped by the front-end filter with no network request. Changed to 5 * 1024 * 1024. Fixes the same bug in gpt-image-1 that nobody noticed because imageUrls had no UI affordance until now.

3. Fix: do not send input_fidelity to gpt-image-2

gpt-image-2 dropped this parameter ("output is already high fidelity by default" β€” cookbook). The old model.includes('gpt-image-') && !model.includes('mini') check would attach an unsupported arg and break edit requests. Switched to an explicit allowlist (gpt-image-1 / gpt-image-1.5).

πŸ§ͺ How to Test

  • Added new test: should NOT send input_fidelity for gpt-image-2 (unsupported param)
  • Existing test should include input_fidelity parameter for gpt-image-1 model still passes
  • Tested locally on /image β€” ref image upload now makes network requests instead of being silently filtered

πŸ“ Additional Information

Breaking changes vs gpt-image-1.5:

  • input_fidelity removed (always high)
  • Per-image cost ↑ ~56% at medium 1024Γ—1024 (1056 β†’ 1767 output tokens, despite imageOutput rate dropping $32 β†’ $30/M)
  • size is now constraint-based, not enum (we still enumerate Popular sizes for UI)

Not addressed in this PR (potential follow-ups):

  • quality parameter (low/medium/high) is not exposed β€” 35Γ— price range ($0.006 / $0.053 / $0.211 @ 1024Γ—1024); users cannot opt into low-cost mode, and our approximatePricePerImage assumes medium.
  • background parameter (opaque β†’ pure white, not alpha transparent) β€” not exposed.
  • Azure variant (azureOpenai/index.ts:178) still unconditionally sends input_fidelity; fine for now since Azure endpoints are single-model, but needs the same allowlist treatment when gpt-image-2 lands on Azure.

tjx666 added 4 commits April 22, 2026 15:14
Add GPT Image 2 (released 2026-04-21) with its official "Popular sizes"
preset list (up to 4K) and standard-tier pricing. Keep gpt-image-1.5
enabled as the previous-generation sibling.
maxFileSize is compared against file.size (bytes) in UploadCard, but
gptImage1Schema/gptImage2Schema had it set to a bare `5` instead of
`5 * 1024 * 1024`, so every ref image upload was silently filtered out.
Medium quality 1024x1024 uses ~1767 output tokens at \$30/M,
which is \$0.053 per image β€” not \$0.032.
Source: https://developers.openai.com/api/docs/guides/image-generation#calculating-costs
gpt-image-2 dropped the input_fidelity parameter ("output is already
high fidelity by default"), so the old includes('gpt-image-') allowlist
would attach an unsupported arg and break edit requests.

Switch to an explicit allowlist of gpt-image-1 / gpt-image-1.5, and
attach { cause } to the pre-existing rethrows (lint gate).

Source: https://developers.openai.com/cookbook/examples/multimodal/image-gen-models-prompting-guide
@vercel

vercel Bot commented Apr 22, 2026

Copy link
Copy Markdown

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
lobehub Ready Ready Preview, Comment Apr 22, 2026 8:55am

Request Review

@sourcery-ai sourcery-ai Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We've reviewed this pull request using the Sourcery rules engine

Replace the generic "Image Generation" starter with a featured
"GPT Image 2 πŸ”₯" entry (mirroring the Seedance 2.0 treatment), and
land users directly on /image with gpt-image-2 pre-selected.

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

πŸ’‘ Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 8e2709187c

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with πŸ‘.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

// gpt-image-2 dropped input_fidelity ("output is already high fidelity by default").
// https://developers.openai.com/cookbook/examples/multimodal/image-gen-models-prompting-guide
const supportsInputFidelity =
isImageEdit && (model === 'gpt-image-1' || model === 'gpt-image-1.5');

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Preserve input_fidelity for gpt-image-1 snapshot aliases

The new supportsInputFidelity check only matches exact model IDs (gpt-image-1/gpt-image-1.5), but createImage forwards payload.model as-is, so snapshot or aliased IDs (for example dated snapshots of GPT Image 1) no longer get input_fidelity: 'high' on edit calls even though the previous logic covered them. In those cases image-edit requests regress in behavior/quality without any explicit error, so this should use alias normalization or a prefix-based match for the gpt-image-1 family instead of exact equality.

Useful? React with πŸ‘Β / πŸ‘Ž.

@codecov

codecov Bot commented Apr 22, 2026

Copy link
Copy Markdown

Codecov Report

βœ… All modified and coverable lines are covered by tests.
βœ… Project coverage is 66.98%. Comparing base (b8cd21a) to head (1b32a99).
⚠️ Report is 3 commits behind head on canary.

Additional details and impacted files
@@           Coverage Diff            @@
##           canary   #14039    +/-   ##
========================================
  Coverage   66.98%   66.98%            
========================================
  Files        2104     2104            
  Lines      179774   179774            
  Branches    21270    22051   +781     
========================================
+ Hits       120419   120421     +2     
+ Misses      59232    59230     -2     
  Partials      123      123            
Flag Coverage Ξ”
app 59.66% <100.00%> (+<0.01%) ⬆️
database 92.27% <ΓΈ> (ΓΈ)
packages/agent-runtime 79.82% <ΓΈ> (ΓΈ)
packages/context-engine 83.24% <ΓΈ> (ΓΈ)
packages/conversation-flow 92.40% <ΓΈ> (ΓΈ)
packages/file-loaders 87.02% <ΓΈ> (ΓΈ)
packages/memory-user-memory 74.74% <ΓΈ> (ΓΈ)
packages/model-bank 99.86% <ΓΈ> (ΓΈ)
packages/model-runtime 84.22% <100.00%> (ΓΈ)
packages/prompts 69.08% <ΓΈ> (ΓΈ)
packages/python-interpreter 92.90% <ΓΈ> (ΓΈ)
packages/ssrf-safe-fetch 0.00% <ΓΈ> (ΓΈ)
packages/utils 87.95% <ΓΈ> (ΓΈ)
packages/web-crawler 88.66% <ΓΈ> (ΓΈ)

Flags with carried forward coverage won't be shown. Click here to find out more.

Components Coverage Ξ”
Store 66.61% <ΓΈ> (ΓΈ)
Services 51.74% <ΓΈ> (ΓΈ)
Server 66.81% <ΓΈ> (+<0.01%) ⬆️
Libs 52.57% <ΓΈ> (ΓΈ)
Utils 80.59% <ΓΈ> (ΓΈ)
πŸš€ New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
  • πŸ“¦ JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

tjx666 added 2 commits April 22, 2026 16:42
Exact-equality allowlist missed dated snapshot IDs (e.g.
gpt-image-1-2026-01-15) that the old includes() check previously
covered, so edits against those aliases would silently regress.

Use a family-prefix regex with a delimiter guard, and keep the mini
exclusion. Covers both gpt-image-1 and gpt-image-1.5 including their
snapshot aliases; still excludes gpt-image-2.
Brand-colored Jimeng icon reads better alongside the "Seedance 2.0 πŸ”₯"
label than the generic lucide VideoIcon. Revert the icon portion of
6d0c8d7 while keeping the model query param navigation.
@tjx666 tjx666 merged commit 16f2b97 into canary Apr 22, 2026
34 of 35 checks passed
@tjx666 tjx666 deleted the feat/add-gpt-image-2 branch April 22, 2026 08:57
@7788jay

7788jay commented Apr 26, 2026

Copy link
Copy Markdown

I'm using the Canary version and have configured both gpt-image-1.5 and gpt-image-2, but in the drawing interface only gpt-image-1.5 is available to select, while gpt-image-2 shows up in the chat interface.

arvinxx added a commit that referenced this pull request Apr 27, 2026
# πŸš€ LobeHub v2.1.53 (20260427)

**Release Date:** April 27, 2026
**Since v2.1.52:** 194 merged PRs Β· 17 contributors

> Introduce Heterogeneous Agent β€” Claude Code and Codex run as
first-class desktop runtimes, paired with a new Agent Signal package,
sharper desktop UX, and a wave of flagship model additions.

---

## ✨ Highlights

- **Introduce Heterogeneous Agent** β€” Claude Code and Codex run as
first-class desktop agents: subagent rendering, partial-message
streaming, multi-turn resume, terminal error surfacing, rich tool
inspectors, and runtime polish. (#14162, #13754, #14067, #14001, #13970,
#13942)
- **Screen capture & Quick Chat tray** β€” New desktop screen capture
overlay (macOS permission-gated) with Quick Chat tray and upload
pipeline improvements; chat input auto-focuses on overlay mount.
(#13818, #14097, #14105)
- **Desktop topic & tab UX** β€” Dedicated topic popup window with
cross-window sync, Cmd+W/Cmd+T tab shortcuts, TabBar polish, recent
working directories expanded to 20, and human approval notifications.
(#13957, #13983, #13972, #14036, #14092)
- **Git workflow built-in** β€” One-click pull/push from the branch chip,
ahead/behind badge, and submodule/worktree repo detection. (#14041,
#13980, #13978)
- **Agent Signal package** β€” New `@lobechat/agent-signal` runtime for
dynamic memory feedback signals, with OTel metrics and self-iteration in
Lab. (#14157, #14170, #14159, #14169, #14187)
- **New models** β€” Claude Opus 4.7 with `xhigh` effort tier, GPT-5.5,
DeepSeek V4 Flash/Pro with reasoning slider, Kimi K2.6, MiMo-V2.5/Pro,
gpt-image-2, Qwen3.6 Flash/Plus, and Pixverse-c1. (#13903, #14147,
#14114, #14004, #14089, #14039, #13923)
- **New providers** β€” OpenCode Zen, OpenCode Go, and Azure OpenAI Router
runtime. (#13943, #14064, #13823)
- **Mobile settings overhaul** β€” Full settings menu and responsive
profile layout for mobile. (#14019)

---

## πŸ—οΈ Heterogeneous Agent

- Claude Code runtime, working-directory awareness, and sidebar polish.
(#13970)
- CC subagent rendering with persistent streamed text; parallel-tool
orphan fix. (#14001, #13968, #14024)
- Per-step usage persisted to each step assistant message. (#13964)
- Per-phase workflow expand defaults; full-expand toggle with
three-level expansion. (#14171, #13906)
- Hetero-mode actions bar; tool inspector polish. (#13963, #14034,
#14030)
- Codex desktop integration with rich tool rendering and devtools
preview. (#14067, #14100)
- Codex terminal error surfacing and CLI output tracing. (#14166)
- Tighten `isCanUseVision` default and add aggregator fallback. (#14172)
- Persist `ccSessionId` in topic metadata for CC multi-turn resume.
(#13902)
- CC account card, topic filter, and integration polish. (#13955,
#13942, #13950)
- Token-level deltas streamed via `--include-partial-messages`. (#13929)

---

## 🧠 Agent Signal & Self-Iteration

- New `@lobechat/agent-signal` package with dynamic feedback signals.
(#14157)
- AgentSignalRuntime wired through agent-tracing and observability-otel
metrics. (#14170, #14159)
- Self-iteration feature flag added to Lab; front-side flag check.
(#14169, #14186)
- Signal policy for receiving memory feedback dynamically. (#14187)

---

## πŸ’¬ Conversation

- Queue follow-up sends during running CC turns. (#14179)
- Persist per-topic chat scroll position; pin user message + fold long
messages. (#14191, #14056)
- Inline resend when editing last user message. (#14080)
- Disable first-block markdown streaming to prevent flicker. (#14193,
#13904)
- Prevent Markdown stream replay when vlist remounts streaming items.
(#14086)
- Stop repinning after manual scroll; unify scroll-to-user + spacer
hooks. (#14099, #14132)

---

## πŸ“± Platforms & Integrations

### Desktop / Electron

- Screen capture overlay, Quick Chat tray, and upload pipeline
improvements. (#13818)
- macOS permission gate for screen capture; auto-focus chat panel input.
(#14097, #14105)
- Dedicated topic popup window with cross-window sync. (#13957)
- TabBar polish: `+` button for new topic, dark theme blend, close icon
by default. (#13972, #14203, #13973)
- Recent working directories expanded from 5 to 20; submodule/worktree
repo detection. (#14036, #13978)
- Cmd+W / Cmd+T tab shortcuts and global shortcut consolidation.
(#13983, #13880)
- Linux icon configuration; human approval desktop notifications.
(#14042, #14092)

### Git Workflow

- One-click pull/push from branch chip; ahead/behind badge with
refactored GitCtr. (#14041, #13980)

### Mobile

- Full settings menu and responsive profile layout. (#14019)
- Agent route added to mobile router; mobile agent topic route
registered. (#14103, #14158)
- Session list skeleton row layout corrected. (#14040)

### Bot / Messaging

- DM strategy support; bot emoji and markdown render optimization.
(#14201, #14091, #14140)
- Slack webhook fix; bot platform setup guide reference. (#14052,
#14121)

---

## πŸ€– Models & Providers

### New models

- **Claude Opus 4.7** with `xhigh` effort tier; strip temperature/top_p.
(#13903, #13909)
- **GPT-5.5**. (#14147)
- **DeepSeek V4** Flash/Pro cards with reasoning slider; cache-hit and
Pro discount pricing. (#14114, #14209, #14196, #14131)
- **Kimi K2.6** model with LobeHub-hosted card. (#14004, #14006)
- **MiMo-V2.5 / V2.5-Pro**. (#14089)
- **gpt-image-2**, **Qwen3.6 Flash/Plus**, **Pixverse-c1**. (#14039,
#13923)

### New providers

- **OpenCode Zen** and **OpenCode Go** with env-var support. (#13943,
#14064)
- **Azure OpenAI Router** runtime support. (#13823)
- Model alias mapping for image and video runtimes. (#13896)
- Seedance video models migrated to Dreamina. (#14144)

### Runtime reliability

- Sanitize invalid tool_call arguments to unbreak strict providers.
(#14033)
- Tolerate null `function.name` in streaming tool_call deltas. (#14139)
- Preserve Gemini 3 `thoughtSignature` in `call_tools_batch`
normalization. (#14032)
- Downgrade `image_url` parts when target model lacks vision. (#14029)
- Preserve Cloudflare provider error context. (#14136)
- Use `safety_identifier` for OpenAI Responses API. (#14148)
- Unwrap underlying PG error in `formatErrorEventData`. (#14038)

---

## πŸ–₯️ User Experience

- **Onboarding** β€” Preset agent naming suggestions, structured hunk ops
for `updateDocument`, persona analytics snapshot, footer promotion
pipeline, wrap-up button. (#13931, #13989, #13930, #13853, #13934)
- **Document workflow** β€” Agent documents promoted as primary workspace
panel; history management and compare workflow; web-crawl docs
associated with agent documents. (#13924, #13725, #13893)
- **cmdk** β€” Agent identity surfaced on topic search results;
topic/message search scoped to current agent. (#14204, #13960)
- **Floating chat panel** and workspace improvements. (#13887)
- **Topic completion status** with dropdown action and filter. (#14005)

---

## πŸ”§ Tooling

- Redis-backed feature flag provider for runtime config. (#14098)
- Vite upgraded to 8.0.0 with Rolldown strict execution order. (#12720,
#14058)
- `@lobechat/model-bank` automated npm release with provenance. (#14015,
#14017, #14018)
- Skill activation fallback when `activateTools` cannot find identifier.
(#14010)
- Cron tool: timezone and existing jobs injected into system prompt;
clarified `lobe-gtd` and `lobe-cron` descriptions. (#14012, #14013)

---

## πŸ”’ Security & Reliability

- **Security:** uuid bumped to v14 (advisory). (#14083)
- **Security:** validate avatar URL and scope old-avatar deletion to
owner. (#13982)
- **Security:** clear OIDC sessions on better-auth signout; return 401
(not 500) for expired OIDC JWT. (#13916, #14014)
- **Reliability:** scope pending-approval check to current assistant
turn. (#14182)
- **Reliability:** sanitize heterogeneous-agent attachment cache
filenames. (#13937)
- **Reliability:** reduce subagent task status error noise. (#14026)

---

## πŸ‘₯ Contributors

Huge thanks to **17 contributors** who shipped **194 merged PRs** this
week.

@hardy Β· @shaun0927 Β· @hezhijie0327 Β· @sxjeru Β· @arvinxx Β· @Innei Β·
@tjx666 Β· @lijian Β· @neko Β· @rdmclin2 Β· @AmAzing129 Β· @sudongyuer Β·
@CanisMinor Β· @rivertwilight

Plus @lobehubbot and renovate[bot] for maintenance.

---

**Full Changelog**:
v2.1.52...v2.1.53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants