KwaaiNet — Sovereign AI Infrastructure

KwaaiNet is a decentralized AI node architecture for Layer 8 — the trust and intelligence layer above the traditional network stack — built by the Kwaai Foundation, a 501(c)(3) nonprofit AI lab focused on democratizing AI.

Each KwaaiNet node combines:

  • A decentralized trust graph (cryptographic identity, verifiable credentials, local trust scores).
  • Shared, sharded LLM compute over heterogeneous CPUs/GPUs using Petals-style distributed inference. Apple Silicon Macs use llama.cpp with Metal for 30+ tok/s local inference; Linux nodes use CUDA-accelerated block sharding.
  • Secure multi-tenant knowledge storage via Virtual Private Knowledge (VPK) with encrypted vector search.
  • Intent-based, peer-to-peer networking that routes based on "what I need" (model, trust tier, latency), not just IP addresses.

From an app's point of view, KwaaiNet looks like a familiar chat-completion style HTTP API. Under the hood, it is a person-anchored Layer 8 fabric where every node is tied to an accountable human or organization.


Why KwaaiNet?

Today's "Layer 8" — the AI and agent layer that mediates how people see information and act in the world — is mostly provided by closed platforms you rent and cannot inspect.

KwaaiNet offers an alternative:

  • Owners, not renters — Run intelligent agents on infrastructure you and your community own and govern, instead of renting access to proprietary stacks.
  • Trust-first, not anonymous compute — Every node carries an Ed25519-anchored identity, W3C Verifiable Credentials, and a local, time-decayed trust score; there is no central trust registry.
  • Knowledge as a first-class, private citizen — VPK lets you shard encrypted knowledge across nodes and query it without exposing raw content.
  • Intent-based networking — Nodes route requests based on intents like "model X, minimum trust tier Verified, max latency Y," making the network semantic and economic, not just transport. See docs/network-and-intent-routing.md for the full intent lifecycle.
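To make the intent-routing idea concrete, here is a minimal sketch in Python of how a node might match an intent like "model X, minimum trust tier Verified, max latency Y" against known peers. All names, fields, and values here are illustrative assumptions, not the actual KwaaiNet data model; see docs/network-and-intent-routing.md for the real lifecycle.

```python
from dataclasses import dataclass

# Trust tiers as named in this README, ordered weakest to strongest.
TIERS = ["Unknown", "Known", "Verified", "Trusted"]

@dataclass
class Peer:
    peer_id: str
    models: tuple
    trust_tier: str
    latency_ms: float

@dataclass
class Intent:
    model: str
    min_trust_tier: str
    max_latency_ms: float

def resolve_intent(intent, peers):
    """Return peers satisfying the intent, lowest latency first."""
    floor = TIERS.index(intent.min_trust_tier)
    matches = [
        p for p in peers
        if intent.model in p.models
        and TIERS.index(p.trust_tier) >= floor
        and p.latency_ms <= intent.max_latency_ms
    ]
    return sorted(matches, key=lambda p: p.latency_ms)

peers = [
    Peer("12D3...a", ("llama-3-8b",), "Verified", 40.0),
    Peer("12D3...b", ("llama-3-8b",), "Known", 15.0),
    Peer("12D3...c", ("llama-3-8b",), "Trusted", 90.0),
]
intent = Intent(model="llama-3-8b", min_trust_tier="Verified", max_latency_ms=100.0)
print([p.peer_id for p in resolve_intent(intent, peers)])
# → ['12D3...a', '12D3...c'] — the Known-tier peer is excluded despite its low latency
```

The point is that routing becomes a semantic query over capability and trust, not a lookup over IP addresses.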

For the full architectural and philosophical context, see:

  • Layer 8: The Decentralized AI Trust Layer (whitepaper) — available via the Kwaai website.
  • KwaaiNet: Decentralized AI Node Architecture for Layer 8 (technical architecture) — available via the Kwaai website.

Project status: where we are now

KwaaiNet is under active development. The Rust CLI and node implementation already ship many core capabilities; others are in progress or still at the research stage.

Today, a KwaaiNet node can:

  • Run as a native Rust binary (kwaainet) with pre-built cross-platform releases.
  • Generate a persistent Ed25519 keypair at ~/.kwaainet/identity.key and derive a stable PeerId / did:peer: DID.
  • Maintain a local W3C Verifiable Credential wallet under ~/.kwaainet/credentials/ with credential types like FiduciaryPledgeVC, VerifiedNodeVC, UptimeVC, ThroughputVC, EventAttendeeVC, and PeerEndorsementVC.
  • Compute a local, time-decayed trust score for peers, grouped into tiers (Unknown, Known, Verified, Trusted).
  • Join a libp2p + Kademlia DHT swarm compatible with Petals/Hivemind for node discovery and health checks.
  • Serve and consume block-sharded LLM inference (CandleEngine): SafeTensors loading, RoPE, GQA, SwiGLU, per-session KV-cache, and temperature/top-k/top-p sampling, exposed through an OpenAI-compatible HTTP API.
  • Run distributed inference across multiple machines with session-pinned peer paths that keep KV-caches coherent, automatic gap-filling, and graceful failover when peers go offline.
  • Download models selectively with kwaainet shard download --start-block N --blocks M — fetch only the weight files needed for your block range (up to a 10x reduction in download size for large models).
  • Choose between dual inference backends: llama.cpp with Metal GPU for 30+ tok/s on Apple Silicon (GGUF models), or candle with CUDA for distributed block sharding on Linux.
  • Pre-form inference circuits (kwaainet shard circuit create) for stable, reusable peer paths across multiple chat completions.
  • Auto-detect local models and network state to smart-select what to serve, and appear on the public map at map.kwaai.ai when properly configured.
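The time-decayed trust score mentioned above can be pictured as an exponential decay mapped onto tiers. The sketch below is illustrative only: the half-life and tier thresholds are hypothetical values, not the node's actual constants (see docs/reputation.md for the real model).

```python
import math

# Hypothetical half-life; the node's actual decay constant may differ.
HALF_LIFE_DAYS = 30.0

def decayed_score(base_score, days_since_last_interaction):
    """Exponentially decay a trust score by elapsed time."""
    return base_score * 0.5 ** (days_since_last_interaction / HALF_LIFE_DAYS)

def tier_for(score):
    """Map a decayed score onto the tiers named in this README (thresholds illustrative)."""
    if score >= 0.8:
        return "Trusted"
    if score >= 0.5:
        return "Verified"
    if score > 0.1:
        return "Known"
    return "Unknown"

s = decayed_score(0.9, 30.0)   # one half-life elapsed
print(round(s, 2), tier_for(s))
# → 0.45 Known — a once-strong peer drifts down a tier without fresh interactions
```

Decay means trust must be continually re-earned through recent behavior rather than accumulated once and held forever.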

See the latest GitHub Release for the most recent feature list and release notes.
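The selective download above (kwaainet shard download --start-block N --blocks M) boils down to mapping a block range onto weight files. Here is a minimal Python sketch under the assumption of a hypothetical manifest recording which transformer blocks each SafeTensors file contains; the real CLI's file layout and selection logic may differ.

```python
def files_for_block_range(manifest, start_block, num_blocks):
    """Select only the weight files overlapping [start_block, start_block + num_blocks)."""
    lo, hi = start_block, start_block + num_blocks
    return [
        name for name, (first, last) in manifest.items()
        if first < hi and last >= lo   # overlap test; `last` is inclusive
    ]

# Hypothetical manifest: file name -> (first block, last block) it contains.
manifest = {
    "model-00001.safetensors": (0, 7),
    "model-00002.safetensors": (8, 15),
    "model-00003.safetensors": (16, 23),
    "model-00004.safetensors": (24, 31),
}
print(files_for_block_range(manifest, 24, 8))
# → ['model-00004.safetensors'] — one file instead of four for blocks 24-31
```

This is what makes serving a narrow block range cheap: a node never has to fetch weights for blocks it will not serve.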


Quickstart: run a node and make a request

This quickstart shows how to install the native Rust CLI, start a node, and send a simple chat-completion request against its OpenAI-compatible endpoint.

Note: Exact flags and defaults may evolve. Check kwaainet --help for current options.

1. Install the kwaainet CLI

Shell installer (macOS / Linux):

curl --proto '=https' --tlsv1.2 -LsSf https://github.com/Kwaai-AI-Lab/KwaaiNet/releases/latest/download/kwaainet-installer.sh | sh

PowerShell installer (Windows):

powershell -ExecutionPolicy Bypass -c "irm https://github.com/Kwaai-AI-Lab/KwaaiNet/releases/latest/download/kwaainet-installer.ps1 | iex"

Homebrew (macOS / Linux — optional):

brew install kwaai-ai-lab/tap/kwaainet

cargo binstall (downloads prebuilt binary):

cargo binstall kwaainet

Nix (reproducible build):

nix build github:Kwaai-AI-Lab/KwaaiNet
./result/bin/kwaainet --help

Or enter a development shell with all dependencies pinned:

nix develop github:Kwaai-AI-Lab/KwaaiNet

See nix/README.md for the full Nix guide.

RISC-V (cross-compile via Nix):

nix build github:Kwaai-AI-Lab/KwaaiNet#kwaainet-riscv64-linux-gnu
file result-kwaainet-riscv64-linux-gnu/bin/kwaainet
# → ELF 64-bit LSB pie executable, UCB RISC-V

Copy the binary to your RISC-V board and run. See nix/README.md for all cross-compilation targets (aarch64-musl, x86_64-musl, riscv64-gnu).

Build from source:

cargo install --git https://github.com/Kwaai-AI-Lab/KwaaiNet kwaainet

Then confirm:

kwaainet --help

2. Initialize and start a node

Initialize node identity and config:

kwaainet setup

This generates ~/.kwaainet/identity.key (Ed25519 keypair) and creates a default config with a smart default node name (e.g. alice-linux-aarch64).

If kwaainet start later reports that p2pd is missing (for example, after a manual install from a .tar.xz archive), run kwaainet setup --get-deps to download and install it automatically.

Start the node:

kwaainet start --daemon

The node will:

  • Connect to bootstrap peers and announce itself on the DHT.
  • Load or download model shards (depending on your configuration).
  • Expose an HTTP API compatible with the OpenAI chat-completion interface.

3. Call the OpenAI-compatible API

curl http://localhost:11435/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "your-model-id",
    "messages": [
      {"role": "user", "content": "Hello, KwaaiNet!"}
    ]
  }'

This sends a chat-completion request to your local node, which may route it through a shard chain of other nodes depending on configuration and trust requirements.
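The same request can be issued from Python using only the standard library. This is a sketch against the endpoint shown above; the model id is a placeholder, and the actual network call is wrapped in a try/except since it only succeeds when a node is running locally.

```python
import json
import urllib.request

def build_chat_request(model, prompt, base_url="http://localhost:11435"):
    """Build an OpenAI-style chat-completion request for a local KwaaiNet node."""
    payload = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request("your-model-id", "Hello, KwaaiNet!")
try:
    with urllib.request.urlopen(req, timeout=3) as resp:
        body = json.load(resp)
        print(body["choices"][0]["message"]["content"])
except OSError as err:  # node not running / unreachable
    print("no local node:", err)
```

Because the endpoint is OpenAI-compatible, existing OpenAI client libraries should also work by pointing their base URL at the local node.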

For a full walkthrough including platform specifics, model discovery, and Python/JS examples see docs/getting-started-node.md and docs/api-quickstart.md.

4. Distributed inference across the network

Download the model (or just the blocks you need):

kwaainet shard download

Run inference across the live KwaaiNet peer network:

kwaainet shard run "What is the capital of France?"

The coordinator discovers block servers via DHT, pins a stable peer path for the session, and forwards activations through the chain:

Pinned path:
  [ 1] blocks   0– 23  john-linux-draak-x86_64/v0.3.27
  [ 2] blocks  24– 31  john-linux-draca-x86_64/v0.3.27

  Assistant: The capital of France is Paris.

Add --stats to see per-token timing breakdown (prefill, decode, throughput). For local-only inference without networking: kwaainet shard run "prompt" --local.
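The figures a --stats style report derives can be reproduced from raw timings. The function and sample numbers below are illustrative assumptions, not the CLI's actual output format:

```python
def inference_stats(prefill_s, decode_s, prompt_tokens, generated_tokens):
    """Derive per-phase throughput numbers from raw inference timings."""
    return {
        "prefill_tok_per_s": prompt_tokens / prefill_s,
        "decode_tok_per_s": generated_tokens / decode_s,
        "ms_per_token": 1000.0 * decode_s / generated_tokens,
        "total_s": prefill_s + decode_s,
    }

# Hypothetical run: 12-token prompt prefilled in 0.4 s, 60 tokens decoded in 2.0 s.
stats = inference_stats(prefill_s=0.4, decode_s=2.0, prompt_tokens=12, generated_tokens=60)
print(stats)  # decode throughput here works out to 30 tok/s
```

Prefill and decode are reported separately because they stress different parts of the pipeline: prefill is compute-bound over the whole prompt, while decode speed is dominated by per-token weight and KV-cache traffic.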

On Apple Silicon Macs with a GGUF model, local inference uses llama.cpp with Metal GPU acceleration (~30 tok/s). Build with --features llama-cpp to enable.

See docs/sharded-llm-processing.md for the full architecture of block-sharded inference, KV-cache management, and data flow diagrams.


Roadmap: destination vs current implementation

KwaaiNet's roadmap is defined as the gap between the aspirational Layer 8 architecture in the whitepapers and the currently shipping Rust implementation.

| Area | Aspirational (whitepapers) | Current implementation (Rust node) |
| --- | --- | --- |
| Trust | 5-layer trust pipeline including Testable Credentials (PVP-1) and EigenTrust propagation. | Identity + VC wallet + local time-decayed trust scores shipped; ToIP work in progress. |
| Compute | Sharded inference, decentralized training, safe tool-calling with trust-gated policies. | Dual backend: llama.cpp for 30+ tok/s local on Apple Silicon, candle for distributed block sharding on Linux/CUDA. Inference circuits, session-pinned paths, selective download, OpenAI-compatible API shipped. |
| Storage | Fully distributed personal AI memory via cross-node VPK sharding and DHT-backed resolution. | VPK process, roles (bob/eve/both), encrypted vector search, and DHT advertisement shipped. |
| Network | Intent-casting as a Layer 8 business protocol with economic settlement and neutrality guarantees. | libp2p + Kademlia DHT, trust-gated routing by model/trust/latency shipped. |

See docs/roadmap.md for the full living roadmap with contribution ideas for each area.


Who is building KwaaiNet?

KwaaiNet is developed by the Kwaai Foundation, a 501(c)(3) nonprofit AI lab and proud signatory of the GliaNet Fiduciary Pledge.

  • Mission: democratize AI by building open, person-anchored infrastructure and Personal AI systems.
  • Values: personal control, self-sovereign identity, transparency, openness.
  • Role of KwaaiNet: serve as the decentralized AI trust and compute layer (Layer 8) for the broader Kwaai ecosystem and allied open-source projects.

Kwaai is working closely with the Linux Foundation Trust Over IP (ToIP) – Decentralized Trust Graph Working Group, which defines socio-technical standards for decentralized trust graphs that span people, organizations, and AI agents. This collaboration helps align KwaaiNet's Layer 8 trust fabric with emerging open standards for decentralized identifiers, verifiable credentials, and trust graphs at Internet scale.

Kwaai is also collaborating with:

  • Mozilla / Mozilla.ai — on shared aims around trustworthy, user-controlled AI and open tooling for agentic systems.
  • SingularityNET — exploring best-of-breed combinations of decentralized AI infrastructure and open model ecosystems.
  • IEEE P7012 — Standard for Machine Readable Personal Privacy Terms, bringing Layer 8's person-anchored agents and trust fabric into conversation with machine-readable privacy and consent standards.

Learn more at kwaai.ai and the Kwaai-AI-Lab GitHub organization.


Documentation

| Document | Description |
| --- | --- |
| docs/README.md | Docs index — audience map and navigation guide |
| docs/getting-started-node.md | Install, initialize, and run your first node |
| docs/api-quickstart.md | Call the OpenAI-compatible API from curl, Python, and JS |
| docs/roadmap.md | Layer 8 destination vs current implementation vs gaps |
| docs/reputation.md | Local trust scores, EigenTrust propagation, endorsement accountability |
| docs/sharded-llm-processing.md | Block-sharded inference pipeline, KV-cache, and activation data flows |
| docs/network-and-intent-routing.md | P2P fabric, trust-gated routing, and the full intent lifecycle |
| docs/METAL_PERFORMANCE_ANALYSIS.md | Metal GPU performance analysis and optimization roadmap |
| docs/MLX_BACKEND_PLAN.md | MLX backend research — investigation results and path forward |
| docs/ARCHITECTURE.md | Node architecture, lobes, and Layer 8 stack |
| docs/WHITEPAPER.md | Layer 8: The Decentralized AI Trust Layer (whitepaper) |
| nix/README.md | Nix build, dev shell, and test infrastructure |
| docs/contributor-guide.md | How to contribute — 1 hour / 1 day / 1 week paths |
| CONTRIBUTING.md | Development workflow and code contribution guidelines |
| CONTRIBUTORS.md | Project contributors |
| CHANGELOG.md | Release history |

Contributing

KwaaiNet welcomes contributions from node operators, application developers, protocol researchers, and documentation writers.