[Feature]: Hermes codex_responses stream fails when codex.rate_limits arrives before response.created

### Problem or Use Case

When Hermes is used with `provider=custom` and `api_mode=codex_responses` against a codex-lb `/v1/responses` backend, the streaming path can fail if the backend emits `codex.rate_limits` before `response.created`.

Observed error:

```text
No response received: Expected to have received response.created before codex.rate_limits
```

In regression testing, the core RuntimeError is:

```text
Expected to have received `response.created` before `codex.rate_limits`
```

Hermes already handles one Responses streaming edge case by falling back when `response.completed` is missing. However, it does not currently handle another real-world compatibility case: a provider-specific prelude event arriving before `response.created`.

This makes the current stream event-order assumption too strict for some OpenAI-compatible backends such as codex-lb, causing the whole request to fail even though a valid final response could still be obtained through a fallback path.

### Proposed Solution

Add a fallback branch in `_run_codex_stream(...)` for errors matching:

```text
Expected to have received `response.created` before ...
```

Instead of aborting the conversation, Hermes should fall back to a non-streaming `responses.create(...)` call without `stream=True`.

Suggested behavior:
- keep the existing fallback for missing `response.completed`
- add detection for `response.created` / prelude ordering mismatches
- fall back to non-stream `responses.create(...)` for this class of stream-protocol mismatch
- add a regression test covering the `codex.rate_limits` prelude case

This approach is low-risk, preserves existing behavior, and improves compatibility with OpenAI-compatible backends that emit provider-specific prelude events before `response.created`.

### Alternatives Considered

_No response_

### Feature Type

Performance / reliability

### Scope

None

### Contribution

- [ ] I'd like to implement this myself and submit a PR

### Debug Report (optional)

```shell

```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: Hermes codex_responses stream fails when codex.rate_limits arrives before response.created #14634

Problem or Use Case

Proposed Solution

Alternatives Considered

Feature Type

Scope

Contribution

Debug Report (optional)

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[Feature]: Hermes codex_responses stream fails when codex.rate_limits arrives before response.created #14634

Description

Problem or Use Case

Proposed Solution

Alternatives Considered

Feature Type

Scope

Contribution

Debug Report (optional)

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions