Skip to content
@doublewordai

doublewordai

Popular repositories Loading

  1. control-layer control-layer Public

    The world’s fastest AI model gateway (450x less overhead than LiteLLM). Unified access to LLMs across endpoints (openAI, self-hosted, etc.) behind a single authentication layer - with API key gener…

    Rust 52 7

  2. deepseek-reddit-agent deepseek-reddit-agent Public

    An example notebook which shows how you can build a LLM agent that scrapes information from Reddit and summarize key bullets using a self-hosted DeepSeek-R1-Distill-Llama-8B deployed with Titan Tak…

    Jupyter Notebook 11 2

  3. autobatcher autobatcher Public

    Drop-in AsyncOpenAI replacement that transparently batches requests

    Python 10 1

  4. zerodp zerodp Public

    ZeroDP implements an efficient zero-copy data parallel approach for serving Mixture-of-Experts (MoE) models, where expert weights are shared across data parallel ranks via CUDA IPC (Inter-Process C…

    Python 3 1

  5. inference-stack inference-stack Public

    The Doubleword Inference Stack is the easiest & most performant way to run genAI infrastructure in your private environment.

    Go Template 2

  6. outlet outlet Public

    A high-performance Axum middleware for capturing and correlating HTTP requests and responses with full streaming support.

    Rust 2

Repositories

Showing 10 of 38 repositories
  • fusillade Public

    Batched LLM request processing daemon with efficient request coalescing and per-model concurrency control

    doublewordai/fusillade’s past year of commit activity
    Rust 1 0 3 1 Updated Mar 11, 2026
  • control-layer Public

    The world’s fastest AI model gateway (450x less overhead than LiteLLM). Unified access to LLMs across endpoints (openAI, self-hosted, etc.) behind a single authentication layer - with API key generation, user management, request logging, and more

    doublewordai/control-layer’s past year of commit activity
    Rust 52 Apache-2.0 7 19 8 Updated Mar 11, 2026
  • onwards Public

    A router for openAI compatible endpoints

    doublewordai/onwards’s past year of commit activity
    Rust 1 MIT 1 3 5 Updated Mar 11, 2026
  • control-layer-chart Public

    A Helm chart for the Doubleword control layer

    doublewordai/control-layer-chart’s past year of commit activity
    Go Template 0 Apache-2.0 0 1 1 Updated Mar 11, 2026
  • outlet-postgres Public

    A plugin for the https://github.com/doublewordai/outlet middleware, for publishing requests & responses through an axum server to postgres

    doublewordai/outlet-postgres’s past year of commit activity
    Rust 0 MIT 0 2 2 Updated Mar 11, 2026
  • outlet Public

    A high-performance Axum middleware for capturing and correlating HTTP requests and responses with full streaming support.

    doublewordai/outlet’s past year of commit activity
    Rust 2 MIT 0 2 7 Updated Mar 11, 2026
  • inference-lab Public

    High-performance LLM inference simulator for analyzing serving systems

    doublewordai/inference-lab’s past year of commit activity
    Rust 1 MIT 0 1 10 Updated Mar 11, 2026
  • shenron-configs Public

    Public Shenron release configs

    doublewordai/shenron-configs’s past year of commit activity
    0 0 0 0 Updated Mar 10, 2026
  • silt Public
    doublewordai/silt’s past year of commit activity
    Rust 1 MIT 0 1 10 Updated Mar 8, 2026
  • arbiter Public

    DeBERTa inference server for sequence classification

    doublewordai/arbiter’s past year of commit activity
    Rust 0 0 2 10 Updated Mar 8, 2026

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…