fix(cli): readStdin uses string.length instead of byte length for size limits #23417

## Bug

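`readStdin()` in `packages/cli/src/utils/readStdin.ts` defines the stdin size limit as `8 * 1024 * 1024` (8 MB, counted in bytes), but enforces it with `string.length` after calling `process.stdin.setEncoding('utf8')`. Once the encoding is set, chunks arrive as decoded strings, so the value compared against the limit is a count of UTF-16 code units rather than bytes.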

## Root cause

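```ts
// Chunks arrive as decoded strings, not Buffers.
process.stdin.setEncoding('utf8');

// chunk.length counts UTF-16 code units, but MAX_STDIN_SIZE is a byte count.
if (totalSize + chunk.length > MAX_STDIN_SIZE) {
  const remainingSize = MAX_STDIN_SIZE - totalSize;
  // Slicing at a code-unit index can cut a surrogate pair in half.
  data += chunk.slice(0, remainingSize);
}
```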

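* `string.length` counts UTF-16 code units, not UTF-8 bytes
* Multi-byte characters are undercounted: a CJK character is typically 1 code unit but 3 UTF-8 bytes, and an emoji outside the Basic Multilingual Plane is 2 code units but 4 bytes
* `string.slice()` cuts at code-unit indices, so it may split a surrogate pair and leave a lone surrogate, producing malformed output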
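The mismatch described in the points above can be reproduced in isolation. The following is a small illustrative sketch, not code from the repository; the variable names (`samples`, `report`, `half`) are made up for the demonstration, and it assumes a Node.js runtime where `Buffer` is available globally.

```typescript
// Compare UTF-16 code units (string.length) with UTF-8 bytes (Buffer.byteLength).
// Escapes are used so the result does not depend on the source file's encoding.
const samples: string[] = [
  'a',            // ASCII
  '\u00e9',       // é (precomposed)
  '\u6f22',       // 漢 (CJK)
  '\ud83d\ude00', // 😀 (outside the BMP, stored as a surrogate pair)
];

const report = samples.map((text) => ({
  text,
  codeUnits: text.length,
  utf8Bytes: Buffer.byteLength(text, 'utf8'),
}));

for (const row of report) {
  console.log(`${row.text}: ${row.codeUnits} code unit(s), ${row.utf8Bytes} byte(s)`);
}
// code units / bytes: 1/1, 1/2, 1/3, 2/4

// Slicing at a code-unit index can cut a surrogate pair in half.
const half = '\ud83d\ude00'.slice(0, 1);
const code = half.charCodeAt(0);
const isLoneSurrogate = half.length === 1 && code >= 0xd800 && code <= 0xdbff;

// Node encodes a lone surrogate as U+FFFD (bytes EF BF BD),
// so the original character cannot be recovered.
const encoded = Buffer.from(half, 'utf8');
console.log(isLoneSurrogate, encoded.toString('hex')); // true efbfbd
```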

## Impact

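* The 8 MB limit is not byte-accurate for non-ASCII input: one UTF-16 code unit can encode to as many as 3 UTF-8 bytes, so input of up to roughly three times the intended limit can pass the check
* Truncation may corrupt the character at the boundary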

## Suggested fix

* Use `Buffer.byteLength(chunk, 'utf8')` for byte-accurate size tracking
* Use a byte-safe truncation method (e.g., via `Buffer`) to avoid splitting multi-byte characters
* Align implementation with the approach used in `readStdinLines.ts` (PR #23414)
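One possible shape for the first two points is sketched below. This is only an illustration under stated assumptions: `truncateUtf8` and `appendWithLimit` are hypothetical helper names, the sketch has not been checked against the approach taken in `readStdinLines.ts`, and it assumes chunks are still delivered as strings (that is, `setEncoding('utf8')` stays in place).

```typescript
const MAX_STDIN_SIZE = 8 * 1024 * 1024; // limit in bytes, as in the issue

/**
 * Hypothetical helper: return the longest prefix of `text` whose UTF-8
 * encoding fits in `maxBytes`, without splitting a multi-byte character.
 */
function truncateUtf8(text: string, maxBytes: number): string {
  const buf = Buffer.from(text, 'utf8');
  if (buf.length <= maxBytes) return text;

  let end = Math.max(0, maxBytes);
  // If the byte at the cut position is a continuation byte (10xxxxxx),
  // the cut would land inside a character, so back up to its lead byte.
  while (end > 0 && (buf[end] & 0xc0) === 0x80) end--;
  return buf.toString('utf8', 0, end);
}

/**
 * Hypothetical helper: append `chunk` to `data` while tracking size in
 * UTF-8 bytes rather than UTF-16 code units.
 */
function appendWithLimit(
  data: string,
  totalBytes: number,
  chunk: string,
  maxBytes: number = MAX_STDIN_SIZE,
): { data: string; totalBytes: number; truncated: boolean } {
  const chunkBytes = Buffer.byteLength(chunk, 'utf8');
  if (totalBytes + chunkBytes <= maxBytes) {
    return { data: data + chunk, totalBytes: totalBytes + chunkBytes, truncated: false };
  }
  // Over the limit: keep only what fits, cut on a character boundary.
  const kept = truncateUtf8(chunk, maxBytes - totalBytes);
  return {
    data: data + kept,
    totalBytes: totalBytes + Buffer.byteLength(kept, 'utf8'),
    truncated: true,
  };
}
```

With a 4-byte limit, `appendWithLimit('', 0, '\u6f22\u6f22', 4)` keeps one CJK character (3 bytes) and reports `truncated: true`, instead of emitting a partial second character. An alternative design would be to drop `setEncoding` and accumulate raw `Buffer` chunks, which makes byte counting direct but moves decoding to the end.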

## Scope

This issue applies to:

* `packages/cli/src/utils/readStdin.ts`

Note:
`readStdinLines.ts` was addressed separately in PR #23414.

