[ES|QL] Read many large keyword or text fields can take a ton of untracked memory

This issue was found during adding new heap attack tests for subqueries. When there are multiple subqueries that read many large keyword fields or only one single giant text field, the issue is exposed.

The memory consumed in the following places are not tracked properly yet:

- `ValuesSourceReaderOperator.FieldWork`
- `BlockSourceReader.scratch`

There are two indices referenced by the two queries below

- Index `manybigfields` has 1000 keyword fields, each field is a random 1KB string, each document is 1MB, and there are 500 documents.
- Index `bigtext` has 1 text field, each field/document is a random 5MB string, and there are 40 documents.

**Query #1**
```
FROM
    (FROM manybigfields)
    , (FROM manybigfields)
    , (FROM manybigfields)
    , (FROM manybigfields)
    , (FROM manybigfields)
    , (FROM manybigfields)
    , (FROM manybigfields)
    , (FROM manybigfields)
```

`ValuesSourceReaderOperator` seems to have some untracked memory consumed by lucene, the size in the dominator tree does not quite reflect it. There are 1000 `ValuesSourceReaderOperator.FieldWork`, although heap dump says they are tiny(which is questionable?), `Block[]#1` in the screenshot is populated by `ValuesSourceReaderOperator#3`, heap dump says `ValuesSourceReaderOperator#3` is about 92KB itself, however `Block[]#1` is 6MB in the heap dump. There could be some hidden memory usage not shown yet.

<img width="1694" height="817" alt="Image" src="https://github.com/user-attachments/assets/aefe08e2-de82-420d-8b73-6f6c9ce37967" />

<img width="1521" height="895" alt="Image" src="https://github.com/user-attachments/assets/8ace4ad6-02cb-4462-b669-4b20188e7e56" />

<img width="1690" height="868" alt="Image" src="https://github.com/user-attachments/assets/21b5da66-e9dd-4526-85eb-4565453d25ef" />

<img width="1710" height="841" alt="Image" src="https://github.com/user-attachments/assets/035d2acf-b7b5-4a29-9f07-ceaffc2aa367" />

**Query #2**
```
FROM
    (FROM bigtext)
    , (FROM bigtext)
    , (FROM bigtext)
    , (FROM bigtext)
    , (FROM bigtext)
    , (FROM bigtext)
    , (FROM bigtext)
    , (FROM bigtext)
| LIMIT 30
```

`BlockSourceReader.scratch` is about 15MB in the heap dump and it is not tracked by circuit breaker

<img width="1695" height="836" alt="Image" src="https://github.com/user-attachments/assets/febe8154-256e-40d3-9943-34fbfe90ad49" />

<img width="1692" height="867" alt="Image" src="https://github.com/user-attachments/assets/fa1dd1b8-89e6-4abd-822d-f43c5aed65a0" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ES|QL] Read many large keyword or text fields can take a ton of untracked memory #140218

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[ES|QL] Read many large keyword or text fields can take a ton of untracked memory #140218

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions