Running Otherix on macOS for development¶

Otherix does not have a native macOS agent. For local development on macOS, run the standard Linux agent inside a Lima VM.

The control plane (otherix-api) is plain Go and runs natively on macOS without help from Lima. Only the agent needs Linux. The dev pipeline below automates VM creation, cross-compilation, and agent service management. Cluster CA + CP server cert auto-generate inside the CP on first boot; the agent picks up its mTLS material via the join-token bootstrap protocol orchestrated by make seed-dev.

Prerequisites¶

macOS 13 (Ventura) or later. Older versions have limited Lima support.
Apple Silicon (M1/M2/M3+) or Intel Mac with virtualization extensions. Apple Silicon is recommended.
Homebrew.
Lima: brew install lima.

Nested KVM and CPU choice¶

For the agent's QEMU to use KVM (instead of slow TCG software emulation), /dev/kvm must be present inside the Lima VM, which requires nested virtualization. On Apple Silicon that is a hardware capability gated by both the chip and the OS:

Apple M3 or later AND macOS 15 (Sequoia) or later: nested virtualization is available, but it is not automatic. Lima must be told to enable it via nestedVirtualization: true (vz mode only). The Makefile (lima-ensure-one) detects a capable host (chip generation >= M3 and macOS major >= 15) and injects --set .nestedVirtualization=true at VM create time, so on these machines /dev/kvm appears in the guest and the agent runs VMs under KVM.
M1 / M2: the silicon has no nested virtualization at all. The agent runs its VMs under TCG (software emulation) - functional but slow. Switching vmType does not help: vz and qemu+HVF hit the same hardware wall, and qemu+TCG could only emulate virtualization extensions (software on top of software, slower than just using TCG for the workload). For real KVM-accelerated VMs you need an M3+/macOS15 Mac or a native Linux/KVM host.
Intel Macs: same - no nested virtualization; TCG fallback applies.

Why nestedVirtualization is not in the Lima template

nestedVirtualization: true hard-fails Lima start on M1/M2 (and any host that does not support it). That is why it is injected conditionally by the Makefile rather than baked into dev/lima/otherix-dev.yaml. Do not add it to the static template - it would break the dev setup on M1/M2.

The provision script in dev/lima/otherix-dev.yaml checks /dev/kvm and prints whether KVM or the TCG fallback is in effect. See Lima's VM type documentation for background.

Setup¶

# 1. (no external dependencies) The control plane runs an embedded etcd
#    member - there is no Postgres to start and no migrations
#    to apply. For a clean-slate run, wipe any prior dev state:
make etcd-reset

# 2. Stage both Lima VMs (Ubuntu 24.04, native arch): provision qemu /
#    dirs / systemd unit, cross-build agent, copy binary + config
#    into each VM. The agents are NOT started — the join-token bootstrap
#    flow (Step 5) needs to provision bootstrap.env + token first.
make bootstrap-dev

# 3. Run the control plane natively on macOS. On first boot it
#    auto-generates the cluster CA + a per-replica CP server cert.
#    The dev config flips agent_client.enabled=true so the
#    in-process workers can dispatch to the agent over mTLS.
#
#    First-time only: provide bootstrap admin credentials so the CP
#    seeds the admin user row on first boot.
#    (Shortcut: `make local-dev-start` runs steps 3+4 — api + agent +
#    CLI — in one command. The explicit steps below are the manual path.)
export OTHERIX_BOOTSTRAP_ADMIN_USERNAME=admin
export OTHERIX_BOOTSTRAP_ADMIN_PASSWORD='correct-horse-battery-staple'
make build-api && ./bin/otherix-api --config dev/config/api.yaml

# 4. (separate terminal) bootstrap the agent — mints a join token via
#    the CLI, provisions bootstrap.env + token plaintext to the Lima
#    VM, starts the agent, and waits for the CP-side `nodes` row to
#    appear after the bootstrap protocol commits. Idempotent — re-
#    runs revoke the previous CLI token and mint a fresh one.
#
#    Re-export the admin credentials if running in a fresh shell:
export OTHERIX_BOOTSTRAP_ADMIN_USERNAME=admin
export OTHERIX_BOOTSTRAP_ADMIN_PASSWORD='correct-horse-battery-staple'
make seed-dev

# 5. Verify the agent is reachable. The heartbeat path is the
#    canonical reachability proof — once seed-dev finishes the node
#    flips to `ready` within a heartbeat cycle (15s in dev).
./bin/otherix node list
# NAME    ARCHITECTURE  STATUS  CORDONED  AGE
# node-1  arm64         ready   no        20s
# node-2  arm64         ready   no        20s

# 6. Daily redeploy after agent code changes (cross-build + copy + restart).
#    No cert material needs to be re-provisioned — bootstrap is a
#    one-time event and the cert material survives restarts.
make deploy-dev

# 7. Tail agent logs from inside a VM (otherix-dev-2 for node-2).
limactl shell otherix-dev-1 sudo journalctl -u otherix-agent -f

# 8. Tear down (stops + deletes both Lima VMs; CP cert + cluster CA
#    persist in the embedded-etcd data dir until `make etcd-reset`).
make clean-dev

The macOS dev stack runs two Lima VMs, otherix-dev-1 (node-1) and otherix-dev-2 (node-2), forming a real cross-host WireGuard mesh. Inside each VM the agent reads its config from /etc/otherix/agent.yaml and persists its cert material to /var/lib/otherix/certs/ (filesystem convention — /var/lib/otherix/ for runtime state, /etc/otherix/ for operator-provided config).

Verifying CP↔agent connectivity¶

The heartbeat path is the canonical reachability proof — otherix node list shows the live state of every registered node. Once make seed-dev finishes, the agent's first heartbeat lands within a cycle (15s in dev) and the row flips to ready:

./bin/otherix node list
# NAME    ARCHITECTURE  STATUS  CORDONED  AGE
# node-1  arm64         ready   no        20s
# node-2  arm64         ready   no        20s

If a node lingers in pending past ~30s, inspect that node's agent journal (use otherix-dev-2 for node-2):

limactl shell otherix-dev-1 sudo journalctl -u otherix-agent --no-pager | tail -50

Common boot patterns to look for:

agent: bootstrap complete — first-boot bootstrap landed successfully.
agent: using existing cert material — subsequent boot, reusing /var/lib/otherix/certs/agent.{crt,key}.
bootstrap protocol: — bootstrap-side error. Token may be expired, CA fingerprint may not match, or CP unreachable.
partial bootstrap state — fatal. Cert OR key file missing; delete the orphan and mint a fresh token.

A direct mTLS dial is also possible via a CP-issued cert, but requires extracting the per-replica CP cert from the local cache (when enabled) OR an openssl s_client dump. The forthcoming CP-mediated otherix node ping <name> command replaces this need — a direct ping-agent operator-workstation flow was removed in Step 4 because the inter-step CP cert lifecycle (per-replica certs signed by the cluster CA, kept replica-local) made out-of-band distribution to the operator workstation impractical.

Iteration 3 Phase C — `otherix vm` subcommands¶

After Phase C lands, operator-driven VM management via CLI is available. The CLI talks to the Control Plane's /v1/vms surface over HTTP with a bearer token.

Prerequisites:

The Control Plane is running and has at least one admin user. The bootstrap path seeds an admin when OTHERIX_BOOTSTRAP_ADMIN_USERNAME + OTHERIX_BOOTSTRAP_ADMIN_PASSWORD are set on first start.
A node is registered and has the auto-provisioned default storage pool. No image pre-staging is needed - the agent fetches the image URL on first vm create. (Phase D documents the seed sequence end-to-end.)
An access token in hand. Two options:

# Option A — JWT access token (15-min default TTL).
TOKEN=$(curl -s -X POST http://localhost:8080/v1/auth/login \
  -H 'Content-Type: application/json' \
  -d '{"username":"admin","password":"…"}' \
  | jq -r .access_token)

# Option B — long-lived API token (no expiry by default).
# Requires a JWT first to authenticate the create call.
JWT=$(... see Option A ...)
TOKEN=$(curl -s -X POST http://localhost:8080/v1/users/me/api-tokens \
  -H "Authorization: Bearer $JWT" \
  -H 'Content-Type: application/json' \
  -d '{"name":"cli","description":"operator workstation"}' \
  | jq -r .plaintext_token)

export OTHERIX_API_TOKEN="$TOKEN"

Subcommands:

# Create a VM (async; returns task id).
./bin/otherix vm create demo-vm \
  --image-url https://cloud-images.ubuntu.com/minimal/releases/noble/release/ubuntu-24.04-minimal-cloudimg-arm64.img \
  --arch arm64 \
  --vcpus 2 --memory-mib 2048 \
  --wait
# created task=<task-uuid> status=pending
# .....
# vm running task=<task-uuid>

# List VMs.
./bin/otherix vm list
# ID                                    NAME      STATUS   POOL                                  IMAGE
# <vm-uuid>                             demo-vm   running  <pool-uuid>                           ubuntu-24.04-minimal-cloudimg-arm64.img

# Get a single VM.
./bin/otherix vm get <vm-uuid>
# id: <vm-uuid>
# name: demo-vm
# status: running
# ...

# Delete (with confirmation prompt unless --force).
./bin/otherix vm delete <vm-uuid> --wait --force
# deleted task=<task-uuid> status=pending
# .....
# vm deleted task=<task-uuid>

Output formats: --output json (get, list) emits the raw envelope for programmatic consumers; default is multi-line key=value for get and aligned table for list.

Authentication: --token flag overrides $OTHERIX_API_TOKEN. Either source MUST be set; the CLI exits 1 with a usage error otherwise. Both JWT and otx_* API tokens are accepted by the CP's Authn middleware. The kubectl-style alternative otherix config add cluster (see "CLI configuration" below) stores a long-lived token on disk so subsequent invocations need neither --token nor OTHERIX_API_TOKEN.

Error classification (parseable from shell): api_error: <code>: <message> for CP-side failures (vm_not_found, node_not_ready, pool_not_found, qemu_spawn_failed, …), request_timeout, connection_refused, request_failed for transport failures.

CLI configuration¶

otherix config manages a kubectl-style credential store at ~/.otherix/config (or $OTHERIX_CONFIG). After a one-time otherix config add cluster the operator can drop --token and --endpoint from every subsequent invocation — the stored (server, token) pair is the default.

Initial setup¶

$ otherix config add cluster \
    --name production \
    --server http://localhost:8080 \
    --login admin \
    --password 'correct-horse-battery-staple'
cluster added: name=production server=http://localhost:8080 current=true

Missing flags trigger interactive prompts when stdin is a TTY (the password prompt is masked via golang.org/x/term). In non-TTY contexts (CI, scripts) every required value must come from a flag or env var (OTHERIX_SERVER, OTHERIX_LOGIN, OTHERIX_PASSWORD).

Daily usage¶

# Uses the current cluster automatically.
$ otherix vm list
ID                                    NAME     STATUS   POOL                                  IMAGE
…

# Override per-invocation (no persistent state change).
$ otherix --cluster localdev vm list

Multi-cluster management¶

$ otherix config list
NAME        SERVER                       CURRENT
production  https://otherix.example.com  *
localdev    http://localhost:8080  

$ otherix config use localdev
current cluster: localdev

$ otherix config show              # current cluster, token masked
$ otherix config show production --show-token   # reveal plaintext

$ otherix config remove localdev --force         # skips confirmation

config remove with stdin attached to a TTY prompts for y/N; --force is required for non-interactive removal.

Config file location¶

Resolved in this order — the first match wins:

--config <path> flag
$OTHERIX_CONFIG
~/.otherix/config (default)

The file is a small YAML document (apiVersion / kind / clusters / current-cluster) written with 0600 file and 0700 parent-directory perms. Atomic write (sibling temp + fsync + rename) means a crash mid-config add cluster cannot leave a half-written credential store.

XDG_CONFIG_HOME is intentionally NOT consulted — kubectl and docker both pick a tool-specific home-relative path and we follow, keeping "where is my config?" a one-answer question.

Authentication precedence¶

--endpoint and --token are resolved independently — each follows its own chain:

Layer	endpoint	token
1	`--endpoint`	`--token`
2	`$OTHERIX_SERVER`	`$OTHERIX_API_TOKEN`
3	named cluster (`--cluster`)	named cluster (`--cluster`)
4	current-cluster	current-cluster
5	fail with "no endpoint configured"	fail with "no token configured"

The two layers can come from different sources — a typical CI flow sets OTHERIX_API_TOKEN from a secret and --endpoint from a config repo.

Backward compat: Phase C scripts that set only OTHERIX_API_TOKEN against a local CP now need a matching OTHERIX_SERVER=http://localhost:8080 (or a config use of a pre-seeded cluster). The implicit "default to localhost" of the old vm-level --endpoint flag was dropped — the new model refuses to guess.

CLI resource discovery¶

Phase 2 of the name-based references work introduced read-only discovery commands so operators can resolve names before submitting vm create. Each command supports --output table|json|text (defaults vary), cursor pagination via --limit / --cursor, and --show-ids to surface the UUIDs the table normally hides.

Images¶

There is no template entity and no otherix template command group. A VM is created directly from an image URL (otherix vm create --image-url <url> --arch <arch> ...). The image bytes are not a control-plane resource: the agent owns a per-pool, basename-keyed image cache that materializes the URL on first use. Inspect the cache for a pool through otherix pool get <name>, which surfaces an images: list (name, sha, size) reported by the agent through heartbeat.

Storage pools¶

Phase 1.5 multi-instance reality: the same pool name may live on multiple nodes, and pool list exposes one row per per-node instance. The DEFAULT column flips on when the pool name matches cluster_settings.default_pool_name.

Full CRUD surface:

$ otherix pool list
NAME      NODE      TYPE       PATH                           AVAILABLE  DEFAULT  AGE
default   node-a    local_dir  /var/lib/otherix/pools/default     80.0GiB    yes      5d
default   node-b    local_dir  /var/lib/otherix/pools/default     95.0GiB    yes      5d
fast-ssd  node-a    local_dir  /var/lib/otherix/pools/fast        400.0GiB            3d

# Aggregated cluster-wide view by name
$ otherix pool get default
name: default
type: local_dir
is_cluster_default: true
instances:
  - node: node-a
    id: <uuid>
    path: /var/lib/otherix/pools/default
    available: 80.0GiB
  - node: node-b
    id: <uuid>
    path: /var/lib/otherix/pools/default
    available: 95.0GiB

# Flat instance view by UUID
$ otherix pool get <pool-uuid>
id: <pool-uuid>
name: default
node: node-a
type: local_dir
path: /var/lib/otherix/pools/default
available_bytes: 80.0GiB
is_cluster_default: true
...

# Register a pool on a node (admin-only)
$ otherix pool create fast-ssd --node node-a --path /var/lib/otherix/pools/fast
pool fast-ssd created on node node-a
type: local_dir
path: /var/lib/otherix/pools/fast

# Delete a pool (refuses when vm_disks reference it)
$ otherix pool delete fast-ssd --force
pool fast-ssd deleted

One pool create invocation registers one (name, node) row — re-run the command per node to fan a pool name out across the cluster. Deletion has no --force-cascade; the operator must remove dependent VM disks first. The agent-owned image cache is not a delete blocker.

Note: seed-dev.sh registers the initial pool through otherix pool create (no direct store access - the control plane is the sole writer, now backed by embedded etcd). The agent's pool registry is name-keyed and populated through reconciliation from CP (the pools: block in agent.yaml is eliminated), so CLI-created pools work end-to-end including agent-side image cache materialization + vm-disk allocation.

Nodes¶

$ otherix node list
NAME      ARCHITECTURE  STATUS  CORDONED  AGE
node-a    arm64         ready   no        7d
node-b    arm64         ready   no        7d
node-c    amd64         ready   yes       5d

$ otherix node list --architecture arm64 --status ready

$ otherix node get node-a
name: node-a
architecture: arm64
status: ready
migration_host: 10.0.0.10
migration_port_range: 49152-49251
cpu_cores_total: 32
memory_total_mib: 65536
...

admin / operator callers see the full projection (above); developer / viewer callers see a reduced shape (no migration capability, no hardware inventory) — the CLI text renderer only prints fields the wire envelope populated, so the reduced shape is clean.

Cluster configuration¶

Phase 1.5 surfaced a cluster-wide default-pool reference (the pool name VM create falls back to when the request omits --pool). Three commands manage it. The mutating verbs require admin (cluster:manage); operators / others receive 403 permission_denied verbatim.

# Inspect the current default
$ otherix cluster get-default-pool
default-pool: default

# When unset, exit 0 with a parseable informational line
$ otherix cluster get-default-pool
no default pool configured (run 'otherix cluster set-default-pool <name>' to configure)

# Promote a different pool (server validates the name exists)
$ otherix cluster set-default-pool fast-ssd
default pool set to 'fast-ssd'

# Clear the default — interactive prompt unless --force
$ otherix cluster unset-default-pool --force
default pool cleared

VM operations with names¶

Every VM command takes the VM name in positional / flag inputs; UUID literals are rejected by the server with 400 validation_failed.

# Create — uses cluster default pool, scheduler picks node
$ otherix vm create demo-vm \
    --image-url https://cloud-images.ubuntu.com/minimal/releases/noble/release/ubuntu-24.04-minimal-cloudimg-arm64.img \
    --arch arm64 \
    --vcpus 2 --memory-mib 2048 \
    --wait

# Create — explicit pool + node placement hint
$ otherix vm create pinned-vm \
    --image-url https://cloud-images.ubuntu.com/minimal/releases/noble/release/ubuntu-24.04-minimal-cloudimg-arm64.img \
    --arch arm64 \
    --pool fast-ssd \
    --node node-a \
    --vcpus 4 --memory-mib 4096

# Inspect by name
$ otherix vm get demo-vm

# List — UUIDs hidden by default
$ otherix vm list
NAME       STATUS   POOL      IMAGE
demo-vm    running  default   ubuntu-24.04-minimal-cloudimg-arm64.img
pinned-vm  running  fast-ssd  ubuntu-24.04-minimal-cloudimg-arm64.img

# List with UUIDs (--show-ids)
$ otherix vm list --show-ids
ID                                    NAME       STATUS   POOL      IMAGE
<uuid>                                demo-vm    running  default   ubuntu-24.04-minimal-cloudimg-arm64.img
<uuid>                                pinned-vm  running  fast-ssd  ubuntu-24.04-minimal-cloudimg-arm64.img

# Delete by name (interactive prompt without --force)
$ otherix vm delete demo-vm --wait --force

Declarative manifests¶

Instead of imperative flags you can apply resources from YAML manifests. otherix create -f reads one or more multi-document files (kinds Network, StoragePool, VM), orders them Network -> StoragePool -> VM so name references resolve, and creates each resource. otherix delete -f removes the same set in reverse order (VM -> StoragePool -> Network).

# cluster.yaml: a managed bridge plus a VM attached to it
$ cat cluster.yaml
apiVersion: otherix/v1
kind: Network
metadata:
  name: demo-net
spec:
  type: bridge
  managed: true
  bridgeName: otdemo0
  mtu: 1500
---
apiVersion: otherix/v1
kind: VM
metadata:
  name: demo-vm
spec:
  imageURL: https://cloud-images.ubuntu.com/minimal/releases/noble/release/ubuntu-24.04-minimal-cloudimg-arm64.img
  arch: arm64
  network: demo-net
  vcpus: 2
  memoryMib: 2048
  # inline cloud-config is sent as user_data at create time:
  userData: |
    #cloud-config
    package_update: true

# Apply, waiting for the VM task to finish (--wait blocks on VM tasks and
# pool reconciliation; networks are created synchronously, no wait)
$ otherix create -f cluster.yaml --wait --wait-timeout 300s

# Round-trip: project a live resource back to a manifest
$ otherix vm get demo-vm -o yaml
$ otherix network get demo-net -o yaml

# Tear down (reverse order, no confirmation prompt)
$ otherix delete -f cluster.yaml --force

Caveat: some fields do NOT round-trip through get -o yaml, because the API view does not surface them or the server derives them. Keep the source manifest as the record of what you applied.

VM: userData (user_data), networkConfig (network_config), cloudInitDisabled, firmware/firmwareID, and diskGiB are consumed at create time and not in the view, so the projected manifest omits them and re-applying reverts those to server defaults. Only the first NIC is projected (the manifest schema attaches a single network); a VM with more than one NIC loses the extras.
Network: bridge networks round-trip in full. An overlay network projects only type + subnet (the create API forbids the server-derived bridgeName/mtu/vlan), so re-applying allocates a fresh VNI rather than preserving the original.
StoragePool: round-trips except the operator-settable config blob. A multi-node pool projects as a single nodeList document when every instance shares a path, or as one document per instance when their paths differ, so each node keeps its own path on re-apply.

The config blob on both Network and StoragePool is not yet manifest-expressible (no config field in the v1 schema) and is dropped on projection; a resource created out-of-band with a non-empty config does not round-trip it.

VM placement scheduler¶

otherix vm create reaches the api-server's vm.create handler, which runs internal/scheduler.SchedulePlacement to pick the (node, pool instance) target. Two algorithms ship:

resource_aware (default) — kubernetes-style LeastAllocated scoring across cpu_cores_available and memory_available_mib from heartbeat. Lower post-placement utilization wins.
least_vm_count — Phase 1.5 fallback. Picks the node with the fewest pinned VMs. Operator opt-out.

Both algorithms break ties by node name lowercase lexicographic.

Switch via api config:

# dev/config/api.yaml or deploy/config/api.example.yaml
placement:
  algorithm: "resource_aware"   # or "least_vm_count"

Invalid values fail at startup. Operator visibility into placement inputs:

# Heartbeat metrics underpin the fit check / scoring.
$ otherix node list
NAME    ARCH    STATUS  CORDONED  LAST_HEARTBEAT  AGE
node-1  arm64   ready   false     8s ago          1h
node-2  arm64   ready   false     7s ago          1h

# `node get` shows per-node CPU / memory totals and live availability —
# the same view the scheduler sees.
$ otherix node get node-1
...
hardware:
  cpu_cores: used 2/4 cores
  memory:    used 6144/16384 MiB
agent:
  last_heartbeat_at: 2026-05-12T19:42:11Z (8s ago)

When no node has sufficient resources, vm create returns 409 with a structured payload (details.reason="insufficient_resources") listing each candidate's utilization by name — actionable for capacity diagnostics.

Disk-aware filtering shipped as three sub-iterations on 2026-05-12: A (periodic pool-scan worker; default 15 min, dev compressed to 1 min); B (pool_effective_capacity view with pending-disk subtraction); C (scheduler integration). Placement decisions now consider CPU + memory + disk all with effective accounting. otherix vm create against a pool that lacks free disk space returns 409 no_eligible_nodes with the per-pool (pool) name and disk_used_bytes / disk_total_bytes populated in details.node_utilization. On-demand otherix pool scan <name> is still available for immediate refresh; Tunable via workers.storage_pool_scan in the api config. otherix pool {get,list} continues to render a "(effective N free)" suffix on the available column when pending VM disks have not yet been observed by a scan, parallel to the node CLI's "(effective N free)" rendering.

Per-resource placement settings (cpu / memory / disk × enabled + overcommit_ratio) sit under placement.resources in api config — strict no-overcommit defaults in dev, same shape annotated with safety notes in deploy/config/api.example.yaml. Memory overcommit risks, host configuration prerequisites (vm.overcommit_memory sysctl, swap sizing), recommended ratios per use case, and disk overcommit considerations are documented in docs/scheduler-configuration.md. The api binary emits slog Warn lines at startup for each overcommit- enabled resource (placement.resources.memory.overcommit_ratio=1.50 — overcommit enabled (OOM kill risk under memory pressure); see docs/scheduler-configuration.md) and for the all-disabled fallback case — surfaced on each restart as a reminder of the trade-off.

Node-pressure detection layers on top of the capacity / overcommit settings: pressured nodes (or pools, for disk pressure) are excluded from placement entirely. Three pressure types operational:

memory — per-node, heartbeat-driven (10%, 3 heartbeats default).
system_disk — per-node, heartbeat-driven (10%, 3 heartbeats default). Agent reads root filesystem via syscall.Statfs("/").
disk — per-pool, scan-driven (15%, single scan default).

Tunable via placement.pressure.{memory,system_disk,disk}.{enabled, threshold_percent,consecutive_required}. The api binary emits slog lines on set / clear transitions for all three (Warn on set, Info on clear). CLI surfaces conditions in three places:

otherix node list STATUS column combines raw status with node-scoped pressure (memory + system_disk) into ready / under_pressure / cordoned, under_pressure / unreachable.
otherix node get <node> renders a pressure: section with per- condition state (memory + system_disk).
otherix pool list adds an independent pool STATUS column; otherix pool get <pool> renders a pressure: section with the disk: condition.

Shared filesystem (typical homelab — pool on same FS as /) causes both system_disk_pressure and pool disk_pressure to fire on the same condition. Both surface — this is accurate, not duplication. Operator guide at docs/scheduler-configuration.md.

Iteration 3 Phase D — end-to-end smoke test¶

Phase D ships the operator workflow that lets you create a real Ubuntu VM end-to-end on real hardware: Lima VM (the agent host) → otherix vm create → qemu boots → SSH login. This section walks through the full sequence.

Prerequisites¶

Steps 0+1+2 of this doc completed (Lima VM running, agent built, mTLS certs generated).
No external store to provision: the api-server runs an embedded etcd member. For a clean-slate run, make etcd-reset.
Bootstrap admin seeded — set the env vars BEFORE the first api start, then start the api with the dev config:
```
export OTHERIX_BOOTSTRAP_ADMIN_USERNAME=admin
export OTHERIX_BOOTSTRAP_ADMIN_PASSWORD='correct-horse-battery-staple'
make build-api && ./bin/otherix-api --config dev/config/api.yaml
```
(make local-dev-start automates this whole flow.) The dev config dev/config/api.yaml — the workers (vm.create / vm.delete / storage_pool.scan) dispatch end-to-end against the real agent (mTLS material auto-managed via the api config's cp_cert block). The production deploy/config/api.example.yaml config keeps agent_client.enabled: false by default and refuses to start with workers.enabled: true until operators provision real mTLS material (Phase 2 lock).

The admin user lives in the users table with role='admin' so subsequent API calls authenticate.

Step 1 — run seed-dev¶

make seed-dev

The target executes dev/scripts/seed-dev.sh, which orchestrates the bootstrap flow end-to-end:

Configures the CLI cluster — calls otherix config add cluster --force using the bootstrap admin credentials (same env vars the CP boot hook consumes). Persists a long-lived API token into ~/.otherix/config.
Mints a join token per node via otherix node join-token create --node-name node-1 --ttl 10m --output json (and again for node-2). Captures each token plaintext + active cluster CA fingerprint.
Provisions the agent host: writes /etc/otherix/bootstrap-token (mode 0600) + /etc/otherix/bootstrap.env (mode 0644, contains OTHERIX_BOOTSTRAP__* koanf env-var overrides). Linux native user-mode lays these out under ~/.config/otherix/ instead.
Starts the agent — systemctl restart otherix-agent (Lima) or systemctl --user start otherix-agent (Linux native).
Polls for nodes.id WHERE name='node-1' (and node-2) for up to 60s. Each row appears once that node's CSR redemption commits at the CP side.
Does NOT create a storage pool: the CP auto-provisions the cluster default pool (default, from default_pool_name in code defaults) at /var/lib/otherix/pools/default on every node as it reaches ready, and that is the cluster default. So vm create resolves without --pool, and seed-dev no longer runs pool create or cluster set-default-pool. After seed-dev finishes, the dev cluster has the nodes registered, the CLI cluster configured, and the default pool auto-provisioned. There is no template to register: operators create VMs directly from an image URL with otherix vm create --image-url <url> --arch <arch>. The agent materializes the image onto its target pool inline on first use, so the first create just pays the one-time download.

The node row arrives in pending status and flips to ready once the first heartbeat lands. The dev heartbeat cadence (15s interval / 45s stale threshold) makes the transition operator-friendly. Watch with ./bin/otherix node list.

Sample output:

>> seed-dev complete
   nodes    : node-1 (id=<uuid>) / node-2 (id=<uuid>)
   pool     : default (cluster default, CP-auto-provisioned on ready nodes)
   images   : none staged (vm create fetches the image URL on first use)

Step 2 — create a VM¶

A VM is created directly from an image URL: --image-url and --arch are required, the rest optional. --pool is optional - when omitted, the server resolves the cluster default-pool reference held in cluster_settings. The CP auto-provisions default as the cluster default pool on every ready node, so the command below works without --pool:

./bin/otherix vm create demo-vm \
    --image-url https://cloud-images.ubuntu.com/minimal/releases/noble/release/ubuntu-24.04-minimal-cloudimg-arm64.img \
    --arch arm64 \
    --vcpus 2 --memory-mib 2048 --wait
# created task=<task-uuid> status=pending
# .....
# vm running task=<task-uuid>

To target a specific node explicitly:

./bin/otherix vm create demo-vm \
    --image-url https://cloud-images.ubuntu.com/minimal/releases/noble/release/ubuntu-24.04-minimal-cloudimg-arm64.img \
    --arch arm64 \
    --pool default \
    --node node-1 \
    --vcpus 2 --memory-mib 2048 --wait

The scheduler picks among ready, uncordoned nodes hosting the pool name and uses Least-VM-count tie-breaking. A --node hint pins placement to exactly that node; mismatch (pool not on hinted node) surfaces as 409 pool_not_on_node.

Behind the scenes: handler enqueues vm.create task → worker loads the VM (image URL + pool + node) → agentclient.PostVMCreate over mTLS → agent materializes the image into its per-pool cache (download on first use) → agent spawns qemu → agent's task surface flips to terminal-success → CP polls → CP projects vm_runtime row with phase=running.

Step 4 — observe the VM¶

./bin/otherix vm list
# NAME      STATUS   POOL       IMAGE
# demo-vm   running  default    ubuntu-24.04-minimal-cloudimg-arm64.img

./bin/otherix vm get demo-vm
# id: <vm-uuid>
# name: demo-vm
# owner_id: <user-uuid>
# image_url: https://cloud-images.ubuntu.com/minimal/releases/noble/release/ubuntu-24.04-minimal-cloudimg-arm64.img
# image_format: qcow2
# pool: default
# node: node-1
# architecture: arm64
# vcpus: 2
# memory_mib: 2048
# status: running
# desired_phase: running
# created_at: ...
# updated_at: ...

Pass --show-ids to vm list if the UUIDs are useful for scripting.

Step 5 — console access¶

The control plane exposes the VM serial console over a WebSocket that the agent bridges to the qemu serial socket. Attach with the CLI:

./bin/otherix vm console demo-vm

This issues a single-use console token, opens the stream in your raw terminal, and detaches on Ctrl+] (0x1D). The Ubuntu cloud image boots; press Enter to get a login prompt.

As a low-level fallback you can still read the qemu serial socket directly inside the Lima VM hosting the guest (otherix-dev-1 for node-1):

limactl shell otherix-dev-1 sudo socat - UNIX-CONNECT:/var/lib/otherix/vms/<vm-uuid>/console.sock

(Cloud-init authentication setup arrives in Iteration 5 — for this smoke test, the boot reaching the login prompt confirms the chain works.)

Step 6 — cleanup¶

./bin/otherix vm delete demo-vm --wait --force
# deleted task=<task-uuid> status=pending
# .....
# vm deleted task=<task-uuid>

Verify:

limactl shell otherix-dev-1 pgrep -af qemu-system   # nothing
limactl shell otherix-dev-1 ls /var/lib/otherix/vms/    # empty

Cluster default-pool configuration¶

Phase 1.5 introduced a cluster-wide default-pool reference held in the cluster_settings singleton — VM create requests without --pool resolve through it. The CP seeds it from default_pool_name on boot (code default default) and auto-provisions that pool on every node, so the dev cluster's default is default (the seed script no longer sets it). The otherix cluster subcommand group exposes inspect / set / unset verbs (Phase 2 added these — see "Cluster configuration" above for the broader walkthrough).

# Inspect current default
./bin/otherix cluster get-default-pool
# default-pool: default

# Promote a different pool (admin only)
./bin/otherix cluster set-default-pool fast-ssd

# Clear the default — subsequent vm create without --pool returns
# 400 default_pool_not_set until a new default is configured
./bin/otherix cluster unset-default-pool --force

Pool selection determines node placement — each pool instance lives on exactly one node, so promoting fast-ssd to default targets whichever node hosts the fast-ssd instance. To list available instances, run ./bin/otherix pool list --node <name> and pick the desired identifier.

Failure modes¶

Symptom	Likely cause
`vm get` shows `status: creating` indefinitely	agent endpoint unreachable; check Lima port-forward
`task.error.code = qemu_spawn_failed`	KVM unavailable inside the Lima VM (Apple Silicon vz quirk); the Iteration 1 agent automatically falls back to TCG, but a misconfigured cmdline can still fail
`task.error.code = image_unavailable`	the agent could not materialize the image URL onto the chosen pool during `vm create` (download or cache write failed). Agent-side fetch failures surface under their own code (e.g. `checksum_mismatch`, `download_failed`, or `agent_unreachable`). Read the task error message for the cause, then retry
`task.error.code = node_not_ready`	a fresh `make clean-dev` flipped the node row to pending; rerun `seed-dev.sh`
`api_error: unauthenticated` from the CLI	`OTHERIX_API_TOKEN` expired (JWTs are 15-min by default); re-login or use a long-lived `otx_*` API token
`default_pool_not_set` on `vm create` without `--pool`	cluster default-pool unset; configure via `PUT /v1/cluster/default-pool` or pass `--pool` explicitly
`pool_not_on_node` on `vm create --node X`	the requested pool name has no instance on node `X`; pick a different node or remove the `--node` hint
`no_eligible_nodes` on `vm create`	the pool exists but every hosting node is cordoned, unreachable, or not yet `ready`

End-to-end target achieved¶

Reaching the login prompt in Step 5 means the full chain works end-to- end: CLI → cpclient → CP HTTP → worker dispatcher (etcd-backed) → agentclient mTLS → agent → qemu → Ubuntu boot. Phase D's integration tests cover the machine-checkable invariants (handler envelopes, RBAC, task projection, idempotency, resumption); this smoke test covers the human-facing UX and the real-hardware surfaces the integration tests cannot reach (real qemu, real KVM/TCG fallback, real serial console).

Later iterations have since layered in: - cloud-init with a NoCloud seed ISO; - the full VM lifecycle (start / stop / poweroff / reboot / pause / resume / reset); - the WebSocket console (otherix vm console, replacing the socat tunnel); - declarative YAML manifests (otherix create -f / delete -f).

Still ahead: live migration, snapshots, and multi-disk VMs.

Running the agent against a remote control plane¶

If the control plane runs somewhere other than the macOS host (a remote dev box, a staging cluster), override the bootstrap CP URL when running seed-dev.sh. The agent picks it up via the koanf OTHERIX_BOOTSTRAP__CP_URL env var that seed-dev writes to bootstrap.env on the Lima VM:

# In your shell before running seed-dev:
export OTHERIX_CP_URL="https://<reachable-cp-host>:8443"
make seed-dev

The CP server cert SAN must include the hostname or IP the agent dials. The per-replica cert auto-detects localhost, 127.0.0.1, os.Hostname(), and the non-wildcard listen address; operators extend the set via cp_cert.additional_sans in dev/config/api.yaml (or the production config). The dev config ships host.lima.internal pre-registered.

Limitations¶

Networking. Lima's networking model differs from native Linux in subtle ways. Bridges and VLANs work inside the Lima VM, but exposing them to the macOS host or to other LAN hosts requires additional Lima configuration (vmnet networks). Consult Lima's networking documentation.
Performance. VMs running under Otherix-under-Lima are three virtualization layers deep (macOS hypervisor → Lima VM → guest VM). This is fine for development; do not expect bare-metal-equivalent numbers.
Cross-platform live migration. Migrating VMs between a Lima-hosted agent and a bare-metal Linux agent is technically possible (both sides speak the same agent protocol) but is not officially supported and may have edge cases that the test matrix does not cover.

Alternatives¶

If Lima does not fit your workflow, two other ways to work with Otherix from a Mac:

Run only the control plane locally; agents remotely. The control plane runs natively on macOS (it is a standard REST API server that embeds etcd in-process - no external database); only the agent needs Linux. Connect to agents running on remote test hosts over the network.
Run everything in a remote dev environment. Tools like a Linux dev box over SSH, GitHub Codespaces, or any Linux cloud VM remove the question entirely.

Troubleshooting¶

Bootstrap failures¶

Symptoms surface in journalctl -u otherix-agent after make seed-dev.

Symptom	Cause	Recovery
`bootstrap: CA fingerprint mismatch (expected sha256:… got sha256:…)`	Operator typo OR active MITM	Re-check the fingerprint in `~/.otherix/config` cluster CA against `bootstrap.env`. If correct, escalate to network team.
`bootstrap: CSR submission rejected by CP: HTTP 401 token_expired`	Token TTL elapsed (default 10m)	Re-run `make seed-dev` — mints a fresh token.
`bootstrap: CSR submission rejected by CP: HTTP 401 token_exhausted`	Multi-use token cap reached	Re-run `make seed-dev` — mints a fresh token.
`bootstrap: fetch /v1/ca: dial tcp …: connection refused`	CP not running OR unreachable from Lima VM	Verify the dev api-server is still alive (`tail -f .local/run/otherix-api.log`); inside Lima, `curl -k https://host.lima.internal:8443/healthz`.
`cert <path> exists but key <path> missing` (or vice-versa)	Partial-state bootstrap (mid-flight crash, manual file deletion)	Manual cleanup — delete cert + key + CA (`/var/lib/otherix/certs/agent.{crt,key}` + `ca.crt` on Lima, or `~/.config/otherix/certs/agent.*` + `ca.crt` on Linux native), then `make seed-dev` again. Agent identity is derived from the cert CN — no node-id sidecar to clean up.
Node lingers in `pending` past 60s	Agent reachable but heartbeat not arriving	`limactl shell otherix-dev-1 sudo journalctl -u otherix-agent -f` (use `otherix-dev-2` for node-2) — look for `heartbeat` lines OR mTLS handshake failures.

Lifecycle failures (after bootstrap)¶

Symptom	Likely cause
`vm get` shows `status: creating` indefinitely	agent endpoint unreachable; check Lima port-forward
`task.error.code = qemu_spawn_failed`	KVM unavailable inside the Lima VM (Apple Silicon vz quirk); the agent automatically falls back to TCG, but a misconfigured cmdline can still fail
`task.error.code = image_unavailable`	the agent could not materialize the image URL onto the chosen pool during `vm create` (download or cache write failed). No pre-staging is required; agent-side fetch failures appear under their own code (`checksum_mismatch`, `download_failed`, `agent_unreachable`). Read the task error and retry
`task.error.code = node_not_ready`	a fresh `make clean-dev` removed the agent; re-run `make bootstrap-dev` + `make seed-dev`
`api_error: unauthenticated` from the CLI	stored API token revoked OR cluster CA rotated; re-run `make seed-dev` to refresh the cluster credential
`default_pool_not_set` on `vm create` without `--pool`	cluster default-pool unset; `otherix cluster set-default-pool <name>` or pass `--pool` explicitly

Running Otherix on macOS for development¶

Prerequisites¶

Nested KVM and CPU choice¶

Setup¶

Verifying CP↔agent connectivity¶

Iteration 3 Phase C — otherix vm subcommands¶

CLI configuration¶

Initial setup¶

Daily usage¶

Multi-cluster management¶

Config file location¶

Authentication precedence¶

CLI resource discovery¶

Images¶

Storage pools¶

Nodes¶

Cluster configuration¶

VM operations with names¶

Declarative manifests¶

VM placement scheduler¶

Iteration 3 Phase D — end-to-end smoke test¶

Prerequisites¶

Step 1 — run seed-dev¶

Step 2 — create a VM¶

Step 4 — observe the VM¶

Step 5 — console access¶

Step 6 — cleanup¶

Cluster default-pool configuration¶

Failure modes¶

End-to-end target achieved¶

Running the agent against a remote control plane¶

Limitations¶

Alternatives¶

Troubleshooting¶

Bootstrap failures¶

Lifecycle failures (after bootstrap)¶

Iteration 3 Phase C — `otherix vm` subcommands¶