Architecture Cadence Interfaces Control Infrastructure snapshot

Hybrid cloud brain + device-local hands

A persistent strategist, not another chat wrapper.

Klein is an always-on autonomous strategist system designed to notice, synthesize, and propose high-leverage next moves before the user explicitly asks. It combines a low-overhead VPS brain, local execution hands, narrow MCP tooling, durable memory, and human approval by default.

See the system design Review autonomy controls

AWS Lightsail brain

Markdown + SQLite memory

Propose-by-default actions

claude.ai + MCP

Deliberately narrow interface with exactly two Claude-facing tools to reduce routing noise.

Thinking canvas

Mac hands daemon

Executes local actions only when available: shell, AppleScript, files, clipboard, browser control.

Local execution

Pushover reflex layer

Dynamic approval, clarification, and handoff notifications with no exposed tool syntax.

Low-friction interface

Memory backbone

Git-tracked markdown, SQLite logs/state, and raw source cache. Human-readable and cheap to operate.

Persistent context

Action registry

Bounded autonomy with approval gates, bucketed actions, graceful degradation, and staged trust.

Control layer

Klein brain

A strategist engine with initiative

Always-on VPS runtime coordinating retrieval, synthesis, ventures, research, memory updates, and decision proposals. Built to do something useful even when nothing is actively asking it to act.

Architecture

Purpose-built for persistent execution

The system is intentionally lean: a cloud brain on a Lightsail VPS, a local daemon for device-side actions, a narrow MCP surface for Claude, and a memory layer optimized for portability over infrastructure bloat.

01 · Cloud brain

Always-on, low-overhead runtime

Klein runs continuously on a small Ubuntu VPS, keeping canonical state, orchestrating synthesis passes, and serving the MCP layer without requiring a heavyweight cluster.

Designed around persistence, not session reset
Optimized for practical ownership cost

02 · Narrow interface

Minimal Claude-facing surface

Claude sees only the context fetch and write-back tools. Routing, judgment, and internal action logic stay inside Klein rather than being pushed into a noisy tool layer.

Two tools, not a fragile command zoo
Context packets sized for real thinking sessions

03 · Durable memory

Readable storage over novelty infrastructure

v1 uses markdown, SQLite, and raw source caching instead of a vector-heavy stack. That keeps memory transparent, portable, cheap, and easy to audit.

Git-trackable notes and summaries
Simple schema that can evolve without migrations

04 · Bounded autonomy

Initiative with explicit constraints

Klein is meant to act like a strategist, but with approval-gated side effects, confidence thresholds, quiet hours, action buckets, and graceful fallback when local hands are unavailable.

Default mode is propose, not unrestricted execution
Trust can graduate by category over time

Synthesis cadence

Three operating tempos

Layer 1

Continuous light monitoring

Watches for deltas throughout the day: a meeting ended, a deadline moved closer, a free block opened up. Most ticks avoid model calls; only threshold-crossing moments escalate.

Fast, selective, low-cost reflex layer

Layer 2

Daily selective synthesis

Reviews recent changes, refreshes active summaries, produces a small number of nudges or memos, and runs the venture loop that lets Klein initiate strategic work on its own.

End-of-day tactical reasoning

Layer 3

Weekly strategic consolidation

Cross-project review, opportunity detection, pruning, and narrative correction. This is where the system checks whether it is actually surfacing things that manual chat usage would have missed.

Strategic layer with periodic full-brain rereads

Infrastructure snapshot

Lean stack, serious behavior

The architecture is intentionally small-footprint while still supporting persistence, initiative, control, and graceful failure handling.

Hybrid topology Cloud-resident brain plus device-local hands. The brain keeps thinking even when the Mac is asleep.

Stateful by design Canonical memory lives in readable stores so the system can accumulate judgment rather than restart from zero every session.

Human-in-the-loop control Approval remains the default for side effects, while lower-risk strategic work can happen autonomously inside the brain.

Claude-facing MCP tools in the public interface

Synthesis layers spanning reflex, tactical, and strategic time horizons

Ships without vector infrastructure, prioritizing transparency and operational simplicity

$65–85

Approximate monthly cadence cost target for continuous operation

Interaction design

Three surfaces, each with a job

Klein separates reflexive nudges, deep reasoning, and lightweight control so the interface never collapses into a single overloaded chat box.

Phone notifications

Reflex

Natural-language prompts with dynamic response actions such as approve, edit, reject, or timeline clarifications. Replies write directly to memory without requiring a fresh model roundtrip.

Approve Edit Reject Open in Claude

Claude thinking canvas

Deep work

Used for debriefs, writing, tradeoffs, and ambiguity. Klein preloads the right context packet instead of forcing the user to rebuild state manually at the start of every session.

get_klein_context(...) → exact context packet

save_klein_update(...) → decision captured

Terminal control

Lightweight ops

One string entrypoint absorbs questions, admin, drafts, approvals, synthesis triggers, and device-side actions. Short output is a feature; longer work escalates into Claude with context already loaded.

klein "summarize my next three risks"

klein status

klein "consolidate memory"

Autonomy controls

Trust is staged, not assumed

Klein is designed to feel agentic without becoming reckless. The default operating mode is propose. Side-effecting actions are approval-gated, confidence-tiered, and separated into brain-only versus device-dependent buckets.

Confidence-tiered output Low-confidence observations stay internal. Medium confidence asks one concise question. High confidence proposes a move with explicit approval pathways.

Action bucketing Brain-only actions are always available. Mac-dependent actions queue, degrade gracefully, or wait for local hands instead of failing noisily.

Quiet hours and suppression Hard caps, spacing, redundancy suppression, and exception-only interrupts keep autonomy from turning into notification spam.

Why this matters

What a reviewer should notice

Klein is not presented as a consumer chatbot with polished screenshots. It is an infrastructure-backed agent system with a clear control model, cost-aware operation, and a concrete answer to the hardest part of AI products: persistent judgment across time.

Persistent by default The product value comes from accumulation: memory, synthesis, ventures, and review loops.

Operationally efficient Small-footprint infrastructure, bounded model usage, and a credible path from prototype to disciplined always-on operation.

Human-legible control Approval states, action categories, and graceful failure behavior are visible design choices, not afterthoughts.