Text decoding should be specified for JS, JSON & text imports #117

ECMA-262 does not define how the bytes of a file are decoded into source text when it's imported into JS as a module. For browsers, this is defined by the HTML standard, which [includes this](https://html.spec.whatwg.org/#fetch-a-single-module-script):
> Let _sourceText_ be the result of [UTF-8 decoding](https://encoding.spec.whatwg.org/#utf-8-decode) _bodyBytes_.

That applies to all imports, whether for JS modules, JSON modules, or the upcoming [text modules](https://github.com/tc39/proposal-import-text).

For text modules, the intent of the corresponding [HTML standard changes](https://github.com/whatwg/html/pull/11933) is to match the behaviour of Fetch's `.text()`, which already [mandates](https://fetch.spec.whatwg.org/#dom-body-text) using the [UTF-8 decode](https://encoding.spec.whatwg.org/#utf-8-decode) algorithm for all content.
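As a rough illustration of what UTF-8 decode implies in practice, here is a minimal sketch using `TextDecoder` with its default options, which as far as I can tell behaves the same way for this purpose (a leading UTF-8 BOM is stripped, invalid bytes become U+FFFD). The helper name `utf8Decode` and the sample bytes are mine, not from any spec:

```javascript
// Sketch only: approximates the Encoding standard's "UTF-8 decode" with
// TextDecoder defaults (ignoreBOM: false, fatal: false).
// `utf8Decode` is a hypothetical helper name used for illustration.
function utf8Decode(bytes) {
  return new TextDecoder("utf-8").decode(bytes);
}

// Plain ASCII bytes for "hi".
const plainText = utf8Decode(new Uint8Array([0x68, 0x69]));

// The same content preceded by a UTF-8 BOM (EF BB BF); the BOM is dropped.
const bomText = utf8Decode(new Uint8Array([0xef, 0xbb, 0xbf, 0x68, 0x69]));

// 0xFF can never appear in UTF-8, so it is replaced rather than rejected.
const invalidText = utf8Decode(new Uint8Array([0x68, 0xff, 0x69]));

console.log(plainText === bomText);            // true
console.log(invalidText === "h\uFFFDi");       // true
```

Note that nothing here throws: malformed input is silently replaced, so a file in the wrong encoding still imports, just with different contents.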

I'd be surprised if no current runtime is at least detecting a BOM, and potentially decoding UTF-16 content as UTF-16 rather than UTF-8.
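To make the potential divergence concrete, here is a sketch comparing strict UTF-8 decoding with a hypothetical BOM-sniffing decoder. `sniffBomThenDecode` is an assumption about what such a runtime might do, not the behaviour of any specific implementation, and the `utf-16be` label may not be available in every `TextDecoder` build:

```javascript
// What the HTML standard's text implies: always decode as UTF-8.
function strictUtf8(bytes) {
  return new TextDecoder("utf-8").decode(bytes);
}

// Hypothetical runtime behaviour: pick the encoding from a leading BOM,
// falling back to UTF-8. This is an illustrative assumption only.
function sniffBomThenDecode(bytes) {
  let encoding = "utf-8";
  if (bytes.length >= 2 && bytes[0] === 0xff && bytes[1] === 0xfe) {
    encoding = "utf-16le";
  } else if (bytes.length >= 2 && bytes[0] === 0xfe && bytes[1] === 0xff) {
    encoding = "utf-16be"; // may be unsupported in builds without ICU
  }
  return new TextDecoder(encoding).decode(bytes);
}

// "hi" encoded as UTF-16LE with a BOM: FF FE 68 00 69 00.
const utf16Bytes = new Uint8Array([0xff, 0xfe, 0x68, 0x00, 0x69, 0x00]);

const sniffed = sniffBomThenDecode(utf16Bytes); // "hi"
const strict = strictUtf8(utf16Bytes);          // two U+FFFD, then "h", NUL, "i", NUL

console.log(sniffed === "hi");                          // true
console.log(strict === "\uFFFD\uFFFDh\u0000i\u0000");   // true
```

The same file would therefore yield different module contents depending on which approach a runtime takes, which is why this seems worth specifying.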
