Add configurable timeout to `Inference.complete`

The current implementation uses `urllib.request` with no timeout. A hung LLM provider will hang the Vera runtime indefinitely.

Add a configurable timeout (default 30s) overridable via `VERA_INFERENCE_TIMEOUT` env var. Return `Err("inference timeout after 30s")` on expiry.

```python
req = _urlreq.Request(url, data=body, headers=headers, method="POST")
resp = _urlreq.urlopen(req, timeout=timeout_secs)
```

Apply the same timeout to the JS browser runtime's XHR call.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add configurable timeout to `Inference.complete` #378

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Add configurable timeout to Inference.complete #378

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

Add configurable timeout to `Inference.complete` #378