Skip to content

Commit f27da83

Browse files
committed
docs: remove Aegis-AI integration block temporarily to prepare for new hero section
1 parent 0c3288b commit f27da83

1 file changed

Lines changed: 0 additions & 20 deletions

File tree

README.md

Lines changed: 0 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -97,27 +97,7 @@ curl http://localhost:5413/v1/chat/completions \
9797
```
9898
---
9999

100-
## 🛡️ Aegis-AI & System Integration
101100

102-
`mlx-server` is designed to be a completely transparent, drop-in substitution for `llama-server` or cloud VLM gateways within local intelligence platforms like **Aegis-AI**, offering dramatically faster zero-latency inference on macOS instances.
103-
104-
When configuring local inference workflows (e.g., within `~/.aegis-ai/llm-config.json`), apply the following integration details:
105-
106-
### 1. Gateway Emulation
107-
`mlx-server` exposes a fully standard OpenAI-compatible API layer:
108-
- **`GET /health`**: Exposes advanced JSON containing GPU metrics and VRAM allocations.
109-
- **`GET /v1/models`**: Lists actively loaded topologies.
110-
- **`POST /v1/chat/completions`**: Supports both `stream: true` and `stream: false`. Natively handles tool-calls, system prompts, and temperature variables.
111-
112-
### 2. Process Configuration (The Sidecar)
113-
When your agent engine spins up `mlx-server` using standard subprocess management, you must be explicitly aware of the memory requirements for *Mixture of Expert* (MoE) models.
114-
115-
> [!CAUTION]
116-
> **Critical Memory Routing (`--stream-experts`)**
117-
> When managing MoE models (e.g., `Qwen3.5-122B-A10B` where active parameters are significantly smaller than the total payload), you **must artificially append** the `--stream-experts true` flag to the process arguments. If omitted, macOS will inevitably suffer a `Data Abort` memory-mapping fault when mapping > 100GB of tensors onto the unified GPU hardware.
118-
119-
120-
---
121101
## ⚙️ CLI Options
122102

123103
| Option | Default | Description |

0 commit comments

Comments
 (0)