
GPT-RAG Solution Accelerator
GPT-RAG is an enterprise-grade accelerator for building conversational AI assistants on Azure, powered by intelligent agents that understand questions, find the right information, and deliver clear, accurate answers using trusted enterprise data.
Designed with Zero-Trust security and Infrastructure as Code (IaC) principles from the ground up, GPT-RAG accelerates production deployments while ensuring consistency, governance, and operational excellence. It supports text, image, and voice scenarios, enabling organizations to rapidly create rich multimodal experiences.
Architecture at a glance
GPT-RAG can start as a basic deployment and expand into Zero Trust, public ingress, existing-platform integration, or optional AI capabilities as needed. See the Architecture page for the required-vs-configurable deployment table.
Full Zero Trust reference architecture. This is the complete network-isolated view, not the minimum basic deployment.
The complementary modular view below separates the basic deployment from optional add-on layers.
Runtime Services
| Services | Description |
|---|---|
| Orchestrator | Manages agentic workflows with Microsoft Agent Framework, Azure AI, and strategy-specific integrations. |
| Web UI | User interface for chat interactions, supports streaming and custom themes. |
| Data Ingestion | Extracts, chunks, and indexes enterprise data for optimized retrieval. |
| MCP Server | Optional Model Context Protocol service for tool hosting and business logic integration. |
Contributing
We welcome contributions from the community! Check our Contribution Guidelines for CLA, code of conduct, and PR guidelines.