Introduction

OpenKoi is a standalone, CLI-first AI agent platform written in Rust. Unlike traditional AI tools that generate a single response and stop, OpenKoi follows a rigorous Plan-Execute-Evaluate-Refine cycle — iterating on its own output until results meet your quality standards.

It ships as a single static binary with zero runtime dependencies. No Node.js, no Python, no Docker containers.

```bash
cargo install openkoi
openkoi "Refactor the auth module to use JWT tokens"
```

That's it. OpenKoi detects your API keys from the environment, picks the best available model, infers the task type, and starts iterating.
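
The detection logic can be pictured as a priority scan over well-known API-key environment variables. This is an illustrative sketch, not OpenKoi's actual code: the priority order and the `detect_provider`/`lookup` names are assumptions, while the variable names follow the providers' own conventions.

```rust
/// Illustrative zero-config provider detection: probe well-known
/// API-key environment variables in priority order and report the
/// first provider that is configured. The ordering here is an
/// assumption; `lookup` abstracts `std::env::var` so the logic is
/// easy to test.
fn detect_provider(lookup: impl Fn(&str) -> Option<String>) -> Option<&'static str> {
    const CANDIDATES: [(&str, &str); 3] = [
        ("ANTHROPIC_API_KEY", "anthropic"),
        ("OPENAI_API_KEY", "openai"),
        ("GEMINI_API_KEY", "google"),
    ];
    CANDIDATES
        .iter()
        .find(|(var, _)| lookup(var).is_some()) // first configured key wins
        .map(|&(_, name)| name)
}
```

In the real binary the lookup would read the process environment; injecting it as a closure keeps the sketch deterministic.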

Core Design Principles

| Principle | What It Means |
| --- | --- |
| Single binary | `cargo install openkoi` or download one file. ~20MB static binary, zero runtime dependencies. |
| Token-frugal | Context compression, evaluation caching, diff-patch instead of full regeneration. Saves cost without sacrificing quality. |
| Zero-config | `openkoi "task"` works immediately. Detects API keys from env vars and existing CLI tools. |
| Local-first | All data stays on your machine — SQLite database, filesystem skills, local config. No cloud requirement. |
| Model-agnostic | Anthropic, OpenAI, Google, Ollama, AWS Bedrock, or any OpenAI-compatible endpoint. Assign different models per role. |
| Learn from use | Observes your daily patterns, extracts recurring workflows, and proposes new skills to automate them. |
| Iterate to quality | The agent is its own reviewer. It only stops when the task passes evaluation — but knows when to stop early. |
| Extensible | WASM plugins for isolation, Rhai scripts for quick customization, MCP for external tools. |

Why Rust?

OpenKoi is written in Rust to deliver a premium CLI experience that respects your time and resources:

| Metric | OpenKoi (Rust) | Typical TS/Python CLI |
| --- | --- | --- |
| Startup time | < 10ms | 200–500ms |
| Idle memory | ~5MB | 50–100MB |
| Binary size | ~15–25MB | 100MB+ with runtime |
| Concurrency | Tokio async, zero-cost | Event loop or GIL-bound |
| Safety | Memory-safe, strict typing | Runtime exceptions |

These numbers matter when the agent runs as a background daemon, processes long-running tasks, or operates on resource-constrained machines.

Architecture Overview

OpenKoi's architecture centers on an Orchestrator that coordinates all subsystems:

```
                   ┌──────────────┐
                   │ Orchestrator │
                   └──────┬───────┘
      ┌──────────┬────────┼────────┬──────────┐
      ▼          ▼        ▼        ▼          ▼
 ┌─────────┐ ┌─────────┐ ┌───────┐ ┌────────┐ ┌─────────────┐
 │Executor │ │Evaluator│ │Learner│ │Pattern │ │App          │
 │         │ │         │ │       │ │Miner   │ │Integrations │
 └────┬────┘ └─────────┘ └───────┘ └────────┘ └─────────────┘
      │
 ┌────▼────┐
 │  Tools  │ ← MCP subprocesses, WASM, Rhai
 └─────────┘
```

Executor carries out tasks by calling model providers and invoking tools. Evaluator judges the output against rubrics and decides whether another iteration is needed. Learner persists successful patterns into memory. Pattern Miner analyzes usage over time and proposes new skills. App Integrations connect to Slack, GitHub, Jira, and 7 other services.
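
The division of labor between Executor and Evaluator can be sketched as a pair of traits coordinated by an orchestrator. This is a minimal illustration of the design, not OpenKoi's real internal API; all type and method names below are hypothetical.

```rust
// Hypothetical sketch of the orchestrator/subsystem split.
// Executor produces output; Evaluator scores it; the Orchestrator
// wires them together. Real subsystems would call model providers.

trait Executor {
    fn execute(&self, task: &str) -> String;
}

trait Evaluator {
    /// Returns a quality score in [0.0, 1.0].
    fn score(&self, output: &str) -> f64;
}

struct Orchestrator<X: Executor, E: Evaluator> {
    executor: X,
    evaluator: E,
}

impl<X: Executor, E: Evaluator> Orchestrator<X, E> {
    /// Run the task once and return the output with its score.
    fn run(&self, task: &str) -> (String, f64) {
        let output = self.executor.execute(task);
        let score = self.evaluator.score(&output);
        (output, score)
    }
}

// Toy implementations so the sketch is self-contained.
struct EchoExecutor;
impl Executor for EchoExecutor {
    fn execute(&self, task: &str) -> String {
        format!("done: {task}")
    }
}

struct LengthEvaluator;
impl Evaluator for LengthEvaluator {
    fn score(&self, output: &str) -> f64 {
        if output.len() > 5 { 1.0 } else { 0.0 }
    }
}
```

Generic trait bounds keep the orchestrator independent of any particular model provider, which is what makes per-role model assignment possible.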

Underneath, OpenKoi uses:

  • SQLite for persistent memory, sessions, and learnings
  • sqlite-vec for vector similarity search on embeddings
  • MCP (Model Context Protocol) for safe tool execution
  • Skill Registry for loading .SKILL.md files at multiple precedence levels
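
Precedence-level loading can be illustrated as a simple merge: skills from a higher-precedence level (say, project-local) shadow same-named skills from a lower level (say, user-global). The two-level scheme and the function name below are assumptions for illustration; the real registry parses .SKILL.md files.

```rust
use std::collections::HashMap;

/// Merge skill sets from lowest to highest precedence; later levels
/// shadow earlier ones when skill names collide. (Illustrative only —
/// keys are skill names, values stand in for parsed skill bodies.)
fn merge_skills(levels: &[HashMap<String, String>]) -> HashMap<String, String> {
    let mut merged = HashMap::new();
    for level in levels {
        for (name, body) in level {
            merged.insert(name.clone(), body.clone()); // later level wins
        }
    }
    merged
}
```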

The Iteration Engine

The core loop that makes OpenKoi different from one-shot agents:

  1. Plan — Analyze the task, select relevant skills and tools, estimate iteration count
  2. Execute — Run the plan using the assigned Executor model and available tools
  3. Evaluate — Score the output against bundled and custom rubrics
  4. Refine — If the score is below threshold, feed evaluation feedback back and iterate

The engine is token-aware: it compresses context between iterations, caches evaluation results, and uses diff-patch logic to avoid regenerating entire outputs. It also knows when to stop early — if the evaluator is confident the output is good, it won't waste tokens on unnecessary iterations.

The default iteration limit is 3; it can be configured per task or globally.
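
The four steps can be condensed into a small loop. This is a minimal sketch with the execute and evaluate steps mocked out as closures; the threshold semantics, the feedback mechanism, and all names are assumptions, not OpenKoi's actual implementation.

```rust
/// Minimal Plan-Execute-Evaluate-Refine loop (illustrative sketch).
/// `execute` and `evaluate` stand in for model calls. Evaluation
/// feedback is folded into the next attempt's context, and the loop
/// stops early as soon as the score clears the threshold.
fn iterate_to_quality(
    task: &str,
    threshold: f64,
    max_iterations: u32,
    execute: impl Fn(&str, &str) -> String,   // (task, feedback) -> output
    evaluate: impl Fn(&str) -> (f64, String), // output -> (score, feedback)
) -> (String, u32) {
    let mut feedback = String::new();
    let mut output = String::new();
    for i in 1..=max_iterations {
        output = execute(task, &feedback);      // Execute
        let (score, notes) = evaluate(&output); // Evaluate
        if score >= threshold {
            return (output, i);                 // good enough: stop early
        }
        feedback = notes;                       // Refine: feed evaluation back
    }
    (output, max_iterations)                    // hit the iteration limit
}
```

The real engine adds context compression and evaluation caching between iterations, but the control flow is the same shape.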

The Soul System

OpenKoi includes an optional Soul System that tracks personality traits, preferences, and interaction styles. Over time, the agent adapts to your communication patterns:

  • Personality axes — Formality, verbosity, emoji usage, technical depth
  • Evolution — The soul evolves based on your feedback and interaction patterns
  • Customization — Override any axis in config.toml or per-session
  • Persistence — Soul state is stored locally and never leaves your machine

The soul is entirely optional — OpenKoi works fine without it. But for users who want a more personalized experience, it provides a consistent agent personality that improves over time.
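
One way to picture the evolution of a personality axis is as a bounded value nudged toward an observed target at some learning rate. The field names below mirror the axes listed above, but the float representation, the `evolve_axis` helper, and the rate parameter are all hypothetical.

```rust
/// Hypothetical soul state: each personality axis is a value in [0.0, 1.0].
#[derive(Debug, Clone, PartialEq)]
struct Soul {
    formality: f64,
    verbosity: f64,
    emoji_usage: f64,
    technical_depth: f64,
}

/// Nudge one axis toward a target inferred from user feedback,
/// clamping to the valid range. `rate` controls how fast the
/// personality drifts. (Illustrative sketch, not OpenKoi's code.)
fn evolve_axis(axis: &mut f64, target: f64, rate: f64) {
    *axis += (target - *axis) * rate;
    *axis = axis.clamp(0.0, 1.0);
}
```

A small rate keeps the personality stable across sessions while still letting it adapt; a config override would simply set an axis directly instead of nudging it.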

What OpenKoi Is Not

  • Not a chatbot — It's a task-execution engine that happens to have a chat mode
  • Not cloud-dependent — All data is local. Cloud providers are used only for model inference
  • Not locked to one model — Switch providers freely; assign different models to different roles
  • Not a framework — It's a complete, ready-to-use tool. No code required to get started

Next Steps

Released under the MIT License.