Skip to content
View stevenchouai's full-sized avatar

Block or report stevenchouai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
stevenchouai/README.md

Steven Chou — Personal AI Operating System

Steven Chou

I build Personal AI Operating Systems: memory → agents → evaluation → public proof.

If you are deciding whether to follow or work with me, the short version is:

I turn AI-agent ideas into inspectable systems: captured knowledge, explicit identity, real tool use, eval gates, and public proof that compounds over time.

Why Stay Here

Visitor question What this profile should prove
Can he build? Public repos, working demos, diagrams, tests, and writeups instead of claims.
Does he understand agents deeply? Agent architecture research, runtime plumbing, and trace-first evaluation.
Is there a coherent direction? Every project rolls up into one Personal AI OS thesis, not random side projects.
Should I follow? Follow if you care about AI agents becoming reliable personal/work infrastructure, not just chat UI demos.

The Shape of the Work

From points to line to surface to body: StevenOS proof chain

point:   individual tools and experiments
line:    memory → identity → agents → evals → proof
surface: repos, writing, demos, resume, and operating loops reinforce each other
body:    Steven as one inspectable AI-native builder entity

StevenOS Stack

StevenOS repository stack

Layer Repository / Surface Role in the system Public conversion asset
Memory knowledge-harness (local hardening before public release) Routes agents through an Obsidian-backed knowledge base without mixing content and runtime state. Architecture note + public-safe demo pending.
Identity digital-twin File-first operating layer for making agents inherit style, judgment, memory, and reusable workflows. README, docs site, operating-layer essay.
Agent runtime Hermes / OpenClaw contributions Real assistant plumbing: messaging, Feishu threads, gateway/runtime debugging, OAuth, CLI backends, tools. Issue/PR writeups and debugging notes pending.
Model + tool infra CLIProxyAPI (public surface pending cleanup) Normalizes model access and CLI-compatible API infrastructure. Cleanup checklist + launch note pending.
Evaluation agent-scorecard Trace-first quality gate for deciding whether agents deserve more tokens, permissions, and autonomy. Runnable examples + reports.
Public proof stevenchouai.github.io · resume system Converts repos, essays, demos, and job-market artifacts into one navigable proof chain. Homepage, blog, proof-chain page.

Featured Proof

Agent systems

  • Agent Scorecard — deterministic checks for tool use, verification, durable artifacts, side-effect safety, and anti-busywork behavior.
  • Claude Code Sourcemap — source-map-based architecture guide for AI coding agents.
  • Hermes / OpenClaw work — practical runtime and gateway fixes across real assistant stacks, not toy demos.

Open Source Contributions

Project Contribution Status Why it matters
NousResearch/hermes-agent #21254fix(update): migrate config in non-interactive updates (salvaged from my original #19221) Merged · merge commit 8cef149 Makes detached / gateway / non-interactive update flows safer by migrating config before restart.
NousResearch/hermes-agent #17895fix(feishu): preserve threaded replies Open Preserves Feishu/Lark threaded reply routing for real agent gateway conversations.
openclaw/openclaw #75024fix(feishu): preserve threads without root_id Open · CI green-ish Handles Feishu/Lark thread fallback behavior in another production-style agent runtime.

Personal AI infrastructure

  • Digital Twin — a personal agent operating layer built around explicit files, skills, memory, and durable outputs.
  • knowledge-harness (local hardening before public release) — CLI/runtime wrapper around an Obsidian LLM wiki.
  • Input Copilot iOS (local proof) — capture → profile signal → radar → Obsidian export loop.

Career and communication systems

  • Resume system (public writeup pending) — dual-track PM / Engineer resume workflow with local JD matching, AI tailoring, ATS review, and reproducible PDF output.
  • ManageUp (archive / visibility pending) — MCP server + skill library for manager-facing reporting.

Follower Growth Loop

I do not expect followers to come from a prettier README alone. The loop has to be:

  1. Build useful primitives — agent memory, identity, runtime, eval, and proof-chain tools.
  2. Publish one concrete artifact per week — a repo improvement, demo, architecture note, benchmark, debugging case, or before/after workflow.
  3. Package each artifact into a small distribution unit — GitHub README update + blog note + X/LinkedIn thread + one clear screenshot/diagram.
  4. Route readers back to the proof chain — every post should answer: “what can I inspect or reuse now?”
  5. Measure retention signals — stars, follows, profile clicks, repo clones, comments, inbound DMs, and which pages cause people to continue reading.

30-Day Public Build Plan

Week Ship Why it should help conversion
1 Add clear visitor CTA, proof map, and follower thesis to this profile. People understand the category and why to follow within 10 seconds.
2 Harden one local proof repo into a public artifact or public writeup. Converts “interesting private system” into inspectable evidence.
3 Publish one agent-eval case study using Agent Scorecard. Shows judgment: not just building agents, but deciding when they deserve trust.
4 Turn one runtime/debugging win into a practical architecture note. Attracts expert builders who follow for hard-earned implementation detail.

Success metric: each shipped artifact should create a visible next click — repo → demo/report → essay → follow/contact. If it cannot create a next click, it is probably internal value, not public conversion value yet.

Operating Principles

  • Trace over vibes — logs, tests, files, reports, screenshots, and commits beat confident claims.
  • Coherence over volume — every project should explain its upstream and downstream role.
  • Small tools before dashboards — build the proof loop before polishing the surface.
  • AI as leverage, not theater — an agent earns autonomy only by producing verified artifacts.

Selected Writing

Follow / Contact

Pinned Loading

  1. claude-code-sourcemap claude-code-sourcemap Public

    Forked from ChinaSiro/claude-code-sourcemap

    Claude Code source-map research and AI coding agent architecture tutorial

    TypeScript 1

  2. digital-twin digital-twin Public

    File-first blueprint for a personal agent operating layer with identity, style, memory, and workflows

    1

  3. manage-up manage-up Public

    你是一个P6工程师,每天都在研究怎么给老板汇报才能让老板高兴,从而陷入了自我质疑。但当你下载了这个skill,你发现汇报变得如此简单

    TypeScript 1

  4. stevenchouai.github.io stevenchouai.github.io Public

    HTML 1

  5. resume resume Public

    Forked from sb2nov/resume

    Software developer resume in Latex

    TeX