Omegon

Terminal-native AI agent harness — for operators who build. Single binary. Multiple providers. Zero runtime dependencies.

Use the version selector on the home page to switch between stable and nightly install channels.

What is Omegon?

Omegon is ~16MB Rust binary that provides a complete AI-assisted development environment. It connects to 16 inference providers through native Rust clients (no subprocess shims, no Node.js at runtime), manages persistent project memory across sessions, compiles project OpenAPI specs into agent tools, decomposes work into parallel git worktree children, and tracks design decisions in a knowledge graph — all from your terminal.

It is not a wrapper around curl. It is not an IDE plugin. It is not a chatbot with a file picker. It is a systems engineering harness that happens to use LLMs as one of its subsystems.

Three-Axis Routing Model

Omegon organizes inference along three independent axes, giving operators fine-grained control over model capability, reasoning budget, and context:

Axis	Values	Controls
Model Grade Intent	F → D → C → B → A → S (provider-neutral model grades)	Which model family to use
Thinking Level	off → minimal → low → medium → high	Extended reasoning budget
Context Class	Compact (128k) → Standard (272k) → Extended (400k) → Massive (1M+)	Context window capacity

The agent adjusts these axes autonomously based on task complexity, or operators can override with /model, /think, and /context. See Three-Axis Model for details.

Key Features

16 Inference Providers — Anthropic/Claude, OpenAI API, OpenAI/Codex, GitHub Copilot, OpenRouter, Groq, xAI (Grok), Mistral AI, Cerebras, Google Gemini, Google Antigravity, OpenCode Go, Perplexity AI, DwarfStar Local, Ollama (Local), Ollama Cloud. Native Rust clients with automatic failover.
Parallel Execution — Cleave tasks into parallel worktree children, or use delegate for bounded scout/patch/verify subagents. Workbench keeps child operations visible until reconciled.
Project Memory — Persistent facts, episodes, semantic recall, Git-tracked JSONL federation status, and knowledge graphs across sessions.
OpenAPI Tools — Turn project REST APIs into structured tools from .omegon/openapi.toml.
Design Tree — knowledge graph tracking architecture decisions, dependencies, and implementation status.
OpenSpec Lifecycle — Spec-driven development: propose, spec, fast-forward, cleave, assess, archive.
Sentry Automation — Long-running task execution with triggers, budgets, run logs, and optional auto model routing.
Live Dashboard — Terminal TUI with a semantic wrapped multiline editor, Workbench operation rows, segment rendering with model tags, context and memory telemetry, plus a web dashboard over WebSocket.
Plugins — Personas with organic memory growth, tones, and memory layers. Not prompt templates — living cognitive profiles.
Extensions — First-party and community extensions for communication (Vox), image generation (Scry), and more. Process-isolated, crash-safe, installed with a single command.
Sessions — Auto-save and resume, plus a session log tool and narrative history for orientation across runs.
Codebase Search — Ranked concept search across code and project knowledge via codebase_search.
Signed Releases — Cosign signatures, CycloneDX SBOM, GitHub Attestations, and installer verification before update.

Get Working Fast

If your goal is a first successful run rather than a deep tour of Omegon internals, start with the provider happy paths:

Choose Your Path — Claude subscription, API key, or local Ollama
Claude Subscription Path — fastest “it works” route for many operators
API Key Path — direct setup for key-based providers
Local Ollama Path — on-device inference without external API keys

See Installation for all methods, andQuick Start if you want a broader first-session walkthrough.

At a Glance

Runtime	Single ~16 MB download (11 crates)
Providers	16 inference + 3 search, native Rust clients
Agent tools	24+ structured tools (file ops, memory, OpenAPI APIs, design, search, codebase retrieval, and inference control)
License	BSL 1.1 (converts to MIT on 2031-03-19)