Clarion

Clarion is an AI-powered litigation tool. It ingests case evidence such as PDFs, audio, images, and video, then:

Indexes facts - Builds citation-backed evidence references for specific claims.
Finds contradictions - Flags conflicts between sources like witness statements and official records.
Generates reports - Produces courtroom-ready reports that can include AI-generated images and reconstructions.

You create a case, upload evidence, analyze it, and generate a report through the API or the deployed web app. Clarion also includes a voice workflow for asking case questions and editing reports by speech.

Quick Start

cd backend
pip install -r requirements.txt
cp .env.example .env
PYTHONPATH=. uvicorn app.main:app --reload

API: http://127.0.0.1:8000
Docs: http://127.0.0.1:8000/docs

System Overview

Clarion currently runs as a small multi-service system on Google Cloud Run.

Core runtime pieces:

clarion-experience - Next.js frontend for case intake, report viewing, and editing.
clarion-api - Public FastAPI service. Handles uploads, case APIs, report/job APIs, exports, and voice endpoints.
clarion-intelligence-worker - Private FastAPI worker service. Runs long-lived report generation and case analysis work.

Supporting infrastructure:

Cloud Tasks - Queues report and analysis work for the worker service.
Firestore - Persists case state, report jobs, workflow progress, and analysis metadata.
GCS - Stores uploads, generated reports, media assets, and manifests.
Signed URL delivery - The API converts private GCS artifact URIs into short-lived browser-safe URLs.
Optional reconstruction job path - Reconstruction still has a separate Cloud Run Job path available when needed.

The backend codebase is deployed in different service modes:

CLARION_SERVICE_MODE=api for clarion-api
CLARION_SERVICE_MODE=worker for clarion-intelligence-worker

In API mode, the app exposes public routes like /cases, /generate, /upload, and /voice. In worker mode, it exposes only authenticated internal routes under /internal/*.

Tech

Backend: Python, FastAPI, Pydantic
Frontend: Next.js
AI: Google Gemini, Imagen, and Veo
Storage: Firestore and GCS
Execution: Cloud Tasks dispatches a warm Cloud Run worker service for report and analysis; reconstruction remains separate

Google Cloud Deployment

Current production shape

The active backend deployment is now two Cloud Run services, not one service plus report/analysis jobs:

clarion-api - public backend service
clarion-intelligence-worker - private backend worker service

Live services currently deployed in us-central1:

clarion-api
clarion-intelligence-worker
clarion-experience

Cloud Run Jobs may still exist in the project:

clarion-report-worker
clarion-analysis-worker
clarion-reconstruction-worker

Only the reconstruction job is still aligned with the active design. Report and analysis are now intended to run through the warm worker service.

Config files

The repo already includes separate env files for the two backend services:

backend/cloudrun.env.yaml - API service config
backend/cloudrun.worker.env.yaml - worker service config

Important settings:

CLARION_SERVICE_MODE=api on the public API
CLARION_SERVICE_MODE=worker on the worker service
INTELLIGENCE_WORKER_BASE_URL on the API must point to the deployed worker service URL
INTELLIGENCE_WORKER_AUDIENCE should usually match that same worker URL

Async queues and worker endpoints

Clarion currently uses these queues:

clarion-report-jobs
clarion-analysis-jobs
clarion-reconstruction-jobs

The main async path is:

Report tasks call POST /internal/report-jobs/{job_id} on clarion-intelligence-worker
Analysis tasks call POST /internal/case-analysis/{case_id} on clarion-intelligence-worker

Cloud Tasks should send OIDC tokens using the task-runner service account, and the worker service should grant that principal roles/run.invoker.

Backend deploy flow

Deploy the worker service first:

cd backend

gcloud run deploy clarion-intelligence-worker \
  --source=. \
  --region=us-central1 \
  --service-account=clarion-runtime@YOUR_PROJECT.iam.gserviceaccount.com \
  --no-allow-unauthenticated \
  --concurrency=1 \
  --min-instances=1 \
  --timeout=1800 \
  --env-vars-file=cloudrun.worker.env.yaml

Then put the worker URL into INTELLIGENCE_WORKER_BASE_URL and INTELLIGENCE_WORKER_AUDIENCE inside backend/cloudrun.env.yaml, and deploy the API:

gcloud run deploy clarion-api \
  --source=. \
  --region=us-central1 \
  --service-account=clarion-runtime@YOUR_PROJECT.iam.gserviceaccount.com \
  --allow-unauthenticated \
  --min-instances=1 \
  --env-vars-file=cloudrun.env.yaml

The API and worker use the same source tree; only the env file and service mode differ.

Report Workflow

flowchart TD
    U["User / Experience UI"] --> API["clarion-api<br/>POST /cases/{caseId}/report-jobs"]
    API --> STORE["ReportJobStore<br/>create queued job + save request"]
    API --> TASKS["Cloud Tasks<br/>enqueue report task"]
    TASKS --> WORKER["clarion-intelligence-worker<br/>POST /internal/report-jobs/{job_id}"]
    WORKER --> ORCH["ReportGenerationOrchestrator.run_job(...)"]
    ORCH --> STORE

    ORCH --> ADKRT["AdkReportingPipeline.run(...)"]

    subgraph ADK["ADK + Gemini reporting workflow"]
        PLANNER["TimelinePlannerAgent<br/>Gemini text model"]
        GREVIEW["GroundingReviewerAgent<br/>Gemini text model"]
        GREFINE["TimelineRefinerAgent<br/>Gemini text model"]
        CTX["ContextEnrichmentAgent<br/>Gemini helper + Google Search"]
        MEDIA["MediaPlannerAgent<br/>Gemini helper"]
        COMPOSER["FinalComposerAgent<br/>Gemini text model"]
        CREVIEW["CompositionReviewerAgent<br/>Gemini helper"]
        CREFINE["CompositionRefinerAgent<br/>Gemini text model"]
        RESULT["PipelineResult<br/>blocks + image_requests + reconstruction_requests"]
    end

    ADKRT --> PLANNER
    PLANNER --> GREVIEW
    GREVIEW -- "issues found" --> GREFINE
    GREFINE --> GREVIEW
    GREVIEW -- "approved" --> CTX
    GREVIEW -- "approved" --> MEDIA
    CTX --> COMPOSER
    MEDIA --> COMPOSER
    COMPOSER --> CREVIEW
    CREVIEW -- "issues found" --> CREFINE
    CREFINE --> CREVIEW
    CREVIEW -- "approved" --> RESULT

    ADKRT -- "ADK failure" --> FALLBACK["HeuristicReportingPipeline"]
    FALLBACK --> RESULT

    RESULT --> REPORT["create_initial_report(...)<br/>text blocks + media placeholders"]

    subgraph MEDIAEXEC["Media execution"]
        IMG["GeminiImageGenerator<br/>Imagen"]
        RECON["ReconstructionMediaService<br/>Veo"]
        IMGASSET["image asset + manifest"]
        VIDASSET["video asset + manifest"]
    end

    RESULT --> IMG
    RESULT --> RECON
    IMG --> IMGASSET
    RECON --> VIDASSET

    IMGASSET --> ATTACH["attach_media_asset(...)"]
    VIDASSET --> ATTACH
    ATTACH --> FINAL["finalize_report(...)<br/>persist report.json + manifest"]
    FINAL --> STORE

    STORE --> STATUS["GET /generate/jobs/{job_id}<br/>SSE + polling"]
    STATUS --> U

Private GCS Artifact Delivery

Cloud Run serves report and reconstruction artifacts from a private GCS bucket by generating V4 signed URLs at request time.

Set SIGNED_URL_SERVICE_ACCOUNT_EMAIL to the service account that should sign artifact URLs.
Enable iamcredentials.googleapis.com in the same project as the signer.
Grant the API runtime service account roles/iam.serviceAccountTokenCreator on SIGNED_URL_SERVICE_ACCOUNT_EMAIL.
Keep the bucket private. Clarion expects signed URLs instead of public storage.googleapis.com links.

Post-deploy validation:

Submit a reconstruction or report job until it reaches completed.
Call the polling or report endpoint and confirm the returned artifact URL is HTTPS and includes X-Goog-Algorithm, X-Goog-Credential, and X-Goog-Signature.
Fetch that URL from your browser or curl outside GCP and confirm the object loads without making the bucket public.

Notes

The checked-in Cloud Run env files currently contain a real Google API key value. That should be rotated and ideally moved into Secret Manager.
For local configuration, start from backend/.env.example.
For deeper schema details, see backend/app/models/schema.py and backend/app/models/report_schema.py.

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
backend		backend
docs		docs
experience		experience
mock-evidence		mock-evidence
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Clarion

Quick Start

System Overview

Tech

Google Cloud Deployment

Current production shape

Config files

Async queues and worker endpoints

Backend deploy flow

Report Workflow

Private GCS Artifact Delivery

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Clarion

Quick Start

System Overview

Tech

Google Cloud Deployment

Current production shape

Config files

Async queues and worker endpoints

Backend deploy flow

Report Workflow

Private GCS Artifact Delivery

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages