Skip to content

Commit df99db7

Browse files
committed
doc updates
1 parent 29a03c8 commit df99db7

5 files changed

Lines changed: 14 additions & 41 deletions

File tree

docs/content/configuration/examples.md

Lines changed: 5 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -124,10 +124,7 @@ proxy:
124124
profile: "auto"
125125
load_balancer: "least-connections"
126126
connection_timeout: 45s
127-
retry:
128-
enabled: true
129-
on_connection_failure: true
130-
max_attempts: 0 # Try all available endpoints
127+
# Note: Retry is automatic and built-in for connection failures
131128

132129
discovery:
133130
type: "static"
@@ -212,10 +209,7 @@ proxy:
212209
profile: "auto"
213210
load_balancer: "round-robin" # Test all endpoints equally
214211
connection_timeout: 10s
215-
retry:
216-
enabled: true # Enable to test retry logic
217-
on_connection_failure: true
218-
max_attempts: 2 # Limited retries for debugging
212+
# Note: Retry is automatic and built-in for connection failures
219213

220214
discovery:
221215
type: "static"
@@ -353,11 +347,7 @@ proxy:
353347
profile: "standard" # No streaming for public API
354348
load_balancer: "least-connections"
355349
connection_timeout: 20s
356-
# Automatic retry with failover to other endpoints
357-
retry:
358-
enabled: true
359-
on_connection_failure: true
360-
max_attempts: 2 # Limit retries for public API
350+
# Note: Automatic retry with failover to other endpoints is built-in
361351

362352
discovery:
363353
type: "static"
@@ -442,10 +432,7 @@ proxy:
442432
profile: "auto"
443433
load_balancer: "least-connections"
444434
connection_timeout: 30s
445-
retry:
446-
enabled: true
447-
on_connection_failure: true
448-
max_attempts: 0 # Try all endpoints for maximum availability
435+
# Note: Automatic retry tries all endpoints for maximum availability
449436

450437
discovery:
451438
type: "static"
@@ -517,10 +504,7 @@ proxy:
517504
load_balancer: "priority"
518505
connection_timeout: 30s
519506
response_timeout: 900s
520-
retry:
521-
enabled: true # Automatic retry on failures
522-
on_connection_failure: true
523-
max_attempts: 0 # Try all available endpoints
507+
# Note: Automatic retry on failures tries all available endpoints
524508

525509
discovery:
526510
type: "static"

docs/content/configuration/practices/overview.md

Lines changed: 5 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -398,17 +398,16 @@ proxy:
398398

399399
### Retry Strategy
400400

401-
Automatic retry is enabled by default for connection failures:
401+
Automatic retry on connection failures is built-in as of v0.0.16:
402402

403403
```yaml
404404
proxy:
405-
retry:
406-
enabled: true # Automatic failover
407-
on_connection_failure: true
408-
max_attempts: 0 # Try all endpoints (or set limit)
405+
# Note: Retry is automatic and built-in for connection failures
406+
engine: "olla" # Circuit breaker integration
407+
load_balancer: "priority" # Failover to next endpoint
409408
```
410409

411-
The retry mechanism intelligently:
410+
The automatic retry mechanism intelligently:
412411
- Only retries connection failures (not application errors)
413412
- Automatically tries different endpoints
414413
- Marks failed endpoints as unhealthy

docs/content/configuration/practices/performance.md

Lines changed: 2 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -359,10 +359,7 @@ proxy:
359359
profile: "auto" # Dynamic selection
360360
load_balancer: "least-connections"
361361
connection_timeout: 60s # Long connection reuse
362-
retry:
363-
enabled: true
364-
on_connection_failure: true
365-
max_attempts: 2 # Limit retries for performance
362+
# Note: Automatic retry on connection failures is built-in
366363
367364
discovery:
368365
model_discovery:
@@ -388,10 +385,7 @@ proxy:
388385
profile: "streaming" # Optimise for streaming
389386
load_balancer: "priority" # Fastest decisions
390387
connection_timeout: 120s # Reuse connections
391-
retry:
392-
enabled: true
393-
on_connection_failure: true
394-
max_attempts: 1 # Fast failure
388+
# Note: Automatic retry on connection failures is built-in
395389
396390
discovery:
397391
static:

docs/content/getting-started/quickstart.md

Lines changed: 1 addition & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -180,11 +180,7 @@ proxy:
180180
engine: "olla" # High-performance engine
181181
load_balancer: "least-connections"
182182
connection_timeout: 30s
183-
# Automatic retry on connection failures is enabled by default
184-
retry:
185-
enabled: true
186-
on_connection_failure: true
187-
max_attempts: 0 # Try all available endpoints
183+
# Note: Automatic retry on connection failures is built-in
188184
```
189185

190186
### Rate Limiting

docs/content/index.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -91,7 +91,7 @@ Olla provides detailed response headers for observability:
9191
|--------|-------------|
9292
| `X-Olla-Endpoint` | Backend endpoint name |
9393
| `X-Olla-Model` | Model used for the request |
94-
| `X-Olla-Backend-Type` | Backend type (ollama/openai/lmstudio) |
94+
| `X-Olla-Backend-Type` | Backend type (ollama/openai/lmstudio/vllm) |
9595
| `X-Olla-Request-ID` | Unique request identifier |
9696
| `X-Olla-Response-Time` | Total processing time |
9797

0 commit comments

Comments
 (0)