Skip to content

fix(cli): use byte length instead of string length for readStdin size limits#26224

Merged
cocosheng-g merged 1 commit intomainfrom
fix/issue-23417-read-stdin-byte-length
Apr 30, 2026
Merged

fix(cli): use byte length instead of string length for readStdin size limits#26224
cocosheng-g merged 1 commit intomainfrom
fix/issue-23417-read-stdin-byte-length

Conversation

@Adib234
Copy link
Copy Markdown
Contributor

@Adib234 Adib234 commented Apr 29, 2026

Summary

Fixes the size limitation logic in readStdin() to use byte length instead of string length. This ensures that the 8MB limit is accurately enforced for multi-byte characters (e.g., CJK characters, emojis) and prevents data corruption caused by splitting multi-byte characters at the truncation boundary.

Details

  • Replaced chunk.length with Buffer.byteLength(chunk, 'utf8') for accurate byte tracking.
  • Implemented truncateUtf8Bytes utility to safely truncate UTF-8 strings at a byte limit without splitting multi-byte characters (continuation bytes).
  • Aligned the implementation with the approach used in readStdinLines.ts (from PR feat(cli): allow -i/--prompt-interactive with piped stdin #23414).

Related Issues

Fixes #23417

How to Validate

  1. Run unit tests for readStdin:
    npm test -w @google/gemini-cli -- src/utils/readStdin.test.ts
  2. The new tests verify:
    • Truncation of 3-byte characters (e.g., '한') happens at a valid byte boundary (result size is a multiple of 3).
    • The string length can exceed the 8MB limit if characters were 1-byte, but for 3-byte characters, it correctly truncates earlier.
    • No replacement characters (\uFFFD) are present in the output.

Pre-Merge Checklist

  • Updated relevant documentation and README (if needed)
  • Added/updated tests (if needed)
  • Noted breaking changes (if any)
  • Validated on required platforms/methods:
    • MacOS
      • npm run
      • npx
      • Docker
      • Podman
      • Seatbelt
    • Windows
      • npm run
      • npx
      • Docker
    • Linux
      • npm run
      • npx
      • Docker

@Adib234 Adib234 requested a review from a team as a code owner April 29, 2026 20:28
@gemini-code-assist
Copy link
Copy Markdown
Contributor

Summary of Changes

Hello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request addresses a critical issue in the CLI's stdin processing where size limits were incorrectly calculated using string length instead of byte length. By implementing a robust truncation utility and switching to byte-based tracking, the changes prevent potential data corruption when handling multi-byte characters like emojis or CJK scripts, ensuring the 8MB limit is accurately and safely enforced.

Highlights

  • Accurate Byte Tracking: Updated the stdin reading logic to use Buffer.byteLength instead of string length, ensuring the 8MB limit is enforced based on actual byte size rather than character count.
  • Safe Truncation: Introduced a truncateUtf8Bytes utility to prevent data corruption by ensuring multi-byte UTF-8 characters are not split at the truncation boundary.
  • Enhanced Testing: Added unit tests to verify correct truncation behavior for multi-byte characters and confirm that the byte limit is strictly respected.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

@github-actions
Copy link
Copy Markdown

Size Change: +341 B (0%)

Total Size: 33.9 MB

Filename Size Change
./bundle/chunk-7ZN6GXL7.js 0 B -655 kB (removed) 🏆
./bundle/chunk-BD6OX4SV.js 0 B -19.5 kB (removed) 🏆
./bundle/chunk-E2R4MZ6Y.js 0 B -3.43 kB (removed) 🏆
./bundle/chunk-OIOKKUJV.js 0 B -3.8 kB (removed) 🏆
./bundle/chunk-OPA4WVYG.js 0 B -49.2 kB (removed) 🏆
./bundle/chunk-PJHLIBE4.js 0 B -12.6 kB (removed) 🏆
./bundle/chunk-TICW6OCB.js 0 B -2.72 MB (removed) 🏆
./bundle/chunk-V5P72VLL.js 0 B -14.7 MB (removed) 🏆
./bundle/core-OFHG7KTO.js 0 B -48.2 kB (removed) 🏆
./bundle/devtoolsService-CP3UFFJT.js 0 B -28 kB (removed) 🏆
./bundle/gemini-JN76QNQH.js 0 B -576 kB (removed) 🏆
./bundle/interactiveCli-7HNUA2FH.js 0 B -1.31 MB (removed) 🏆
./bundle/liteRtServerManager-CCWRD3CR.js 0 B -2.11 kB (removed) 🏆
./bundle/oauth2-provider-OY6XXPXN.js 0 B -9.16 kB (removed) 🏆
./bundle/chunk-46J7SEEX.js 655 kB +655 kB (new file) 🆕
./bundle/chunk-5P7IJU4A.js 12.6 kB +12.6 kB (new file) 🆕
./bundle/chunk-F7K7MC7M.js 3.43 kB +3.43 kB (new file) 🆕
./bundle/chunk-FW2E5CIG.js 19.5 kB +19.5 kB (new file) 🆕
./bundle/chunk-OFF52XJ2.js 2.72 MB +2.72 MB (new file) 🆕
./bundle/chunk-P6HGPNBX.js 49.2 kB +49.2 kB (new file) 🆕
./bundle/chunk-QBYP7U2D.js 14.7 MB +14.7 MB (new file) 🆕
./bundle/chunk-RYYKIE5U.js 3.8 kB +3.8 kB (new file) 🆕
./bundle/core-EKVTRB73.js 48.2 kB +48.2 kB (new file) 🆕
./bundle/devtoolsService-AC7WBMN5.js 28 kB +28 kB (new file) 🆕
./bundle/gemini-UFXMTGW6.js 576 kB +576 kB (new file) 🆕
./bundle/interactiveCli-T6UUALNJ.js 1.31 MB +1.31 MB (new file) 🆕
./bundle/liteRtServerManager-B5RLTDCS.js 2.11 kB +2.11 kB (new file) 🆕
./bundle/oauth2-provider-MMDT26M7.js 9.16 kB +9.16 kB (new file) 🆕
ℹ️ View Unchanged
Filename Size Change
./bundle/bundled/third_party/index.js 8 MB 0 B
./bundle/chunk-34MYV7JD.js 2.45 kB 0 B
./bundle/chunk-5AUYMPVF.js 858 B 0 B
./bundle/chunk-5PS3AYFU.js 1.18 kB 0 B
./bundle/chunk-664ZODQF.js 124 kB 0 B
./bundle/chunk-DAHVX5MI.js 206 kB 0 B
./bundle/chunk-IUUIT4SU.js 56.5 kB 0 B
./bundle/chunk-RJTRUG2J.js 39.8 kB 0 B
./bundle/chunk-XRLFHCHC.js 1.97 MB 0 B
./bundle/cleanup-PUFSBMNE.js 0 B -932 B (removed) 🏆
./bundle/devtools-36NN55EP.js 696 kB 0 B
./bundle/dist-T73EYRDX.js 356 B 0 B
./bundle/events-XB7DADIJ.js 418 B 0 B
./bundle/examples/hooks/scripts/on-start.js 188 B 0 B
./bundle/examples/mcp-server/example.js 1.43 kB 0 B
./bundle/gemini.js 5.1 kB 0 B
./bundle/getMachineId-bsd-TXG52NKR.js 1.55 kB 0 B
./bundle/getMachineId-darwin-7OE4DDZ6.js 1.55 kB 0 B
./bundle/getMachineId-linux-SHIFKOOX.js 1.34 kB 0 B
./bundle/getMachineId-unsupported-5U5DOEYY.js 1.06 kB 0 B
./bundle/getMachineId-win-6KLLGOI4.js 1.72 kB 0 B
./bundle/memoryDiscovery-FN3IAPBT.js 980 B 0 B
./bundle/multipart-parser-KPBZEGQU.js 11.7 kB 0 B
./bundle/node_modules/@google/gemini-cli-devtools/dist/client/main.js 222 kB 0 B
./bundle/node_modules/@google/gemini-cli-devtools/dist/src/_client-assets.js 229 kB 0 B
./bundle/node_modules/@google/gemini-cli-devtools/dist/src/index.js 13.4 kB 0 B
./bundle/node_modules/@google/gemini-cli-devtools/dist/src/types.js 132 B 0 B
./bundle/sandbox-macos-permissive-open.sb 890 B 0 B
./bundle/sandbox-macos-permissive-proxied.sb 1.31 kB 0 B
./bundle/sandbox-macos-restrictive-open.sb 3.36 kB 0 B
./bundle/sandbox-macos-restrictive-proxied.sb 3.56 kB 0 B
./bundle/sandbox-macos-strict-open.sb 4.82 kB 0 B
./bundle/sandbox-macos-strict-proxied.sb 5.02 kB 0 B
./bundle/src-QVCVGIUX.js 47 kB 0 B
./bundle/start-G4ZLYOAR.js 0 B -652 B (removed) 🏆
./bundle/tree-sitter-7U6MW5PS.js 274 kB 0 B
./bundle/tree-sitter-bash-34ZGLXVX.js 1.84 MB 0 B
./bundle/cleanup-GJCTUL3H.js 932 B +932 B (new file) 🆕
./bundle/start-D4R4JOGO.js 652 B +652 B (new file) 🆕

compressed-size-action

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request improves the handling of standard input truncation by switching from character-based length to byte-based length. It introduces a truncateUtf8Bytes utility to ensure that multi-byte UTF-8 characters are not split at the truncation boundary, preventing data corruption or invalid characters. New unit tests verify the correct truncation of multi-byte characters and the enforcement of the 8MB byte limit. I have no feedback to provide.

@gemini-cli gemini-cli Bot added the area/core Issues related to User Interface, OS Support, Core Functionality label Apr 29, 2026
@cocosheng-g cocosheng-g added this pull request to the merge queue Apr 30, 2026
Merged via the queue into main with commit 487fb21 Apr 30, 2026
49 of 50 checks passed
@cocosheng-g cocosheng-g deleted the fix/issue-23417-read-stdin-byte-length branch April 30, 2026 14:29
TirthNaik-99 pushed a commit to TirthNaik-99/gemini-cli that referenced this pull request May 4, 2026
kimjune01 pushed a commit to kimjune01/gemini-cli-claude that referenced this pull request May 6, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area/core Issues related to User Interface, OS Support, Core Functionality

Projects

None yet

Development

Successfully merging this pull request may close these issues.

fix(cli): readStdin uses string.length instead of byte length for size limits

2 participants