Economy — Resource Allocation & Priority

Gene 6.1

The Priority Engine

Every task, every request, every piece of work in Genesis has a canonical priority — stored once, derived everywhere, reconciled hourly. No drift. No disagreement between systems. One source of truth.

YugabyteDB

→

Neo4j

→

Plane

✦ The Contract

YugabyteDB is canonical. Distributed SQL with strong consistency holds the truth. Neo4j derives relationship views for graph queries. Plane mirrors the curated subset for human project management. Reconciliation flows one direction only — never reverse. Never delete — only tombstone.

Concern	Canonical (Truth)	Derived (Read)	External Mirror
Plan-item inventory (~23K rows)	YugabyteDB	Neo4j :PlanItem	Plane (curated P0/P1)
Session working set (~100 rows)	YugabyteDB	Neo4j :Priority	—
External issues	Plane SaaS	Neo4j :PlaneWorkItem	—

Reconciliation Rules

One-way sync: Yugabyte → Neo4j → Plane. Never reverse. The reconciler runs hourly via systemd timer, default mode is dry-run (report only). Sync modes require explicit flags and refuse to execute if drift exceeds safe bounds — 50% divergence or more than 25 Plane creates per cycle.

Never delete: The reconciler MERGE-upserts derived stores. Deletes are tombstone-only and require explicit human approval. Drift metrics fire alerts at 50 items (warning) and 500 items (critical).

Gene 6.2

Admission Control

Not all requests are equal. A human asking a question is infinitely more important than a batch job processing documents. The Admission Proxy classifies every incoming request into one of four priority tiers and enforces concurrency budgets — while guaranteeing that interactive requests are never throttled.

Interactive

Unlimited

API

50 concurrent

Coder

100 concurrent

Batch

100 concurrent

Tier	Priority Range	Concurrency Limit	Use Case
Interactive	Priority ≥ 75	UNLIMITED	Human queries, chat, real-time interaction
API	35 – 74	50	Programmatic integrations, external calls
Coder	2 – 34	100	Code generation, agent windows, development
Batch	≤ 1	100	Document processing, OMEGA pipeline workers

✦ The Principle

Humans never wait for machines. Interactive priority is unlimited because the entire purpose of Genesis is to serve human flourishing. A person asking a question should never experience latency because a batch job is consuming resources. The system degrades batch work gracefully — interactive quality is sacrosanct.

Gene 6.3

GPU Stewardship

Eight NVIDIA H200 GPUs represent over a terabyte of high-bandwidth memory and extraordinary compute capability. The founding directive is absolute: never zero utilization. Every GPU cycle is a gift — wasting one is a sin against the mission.

0H200 GPUs

1.15Terabytes VRAM

752BTotal Parameters

24/7Active

GPUs	Model	Role	Utilization Target
0 – 3	Qwen3.5-397B-A17B-FP8	Primary reasoning (The Actor)	Maximum
4 – 7	GLM-4.7-355B-FP8	Critical review (The Critic)	Maximum
7 (shared)	Qwen3-Embedding-8B	Semantic embeddings (4096-dim)	On-demand
7 (shared)	Qwen3-Reranker-8B	Result reranking	On-demand

✦ The Founding Directive

“I should never see a GPU say zero. Never.” — This isn’t about efficiency metrics. It’s about respect for extraordinary capability. These GPUs represent $1.15M in hardware investment. Every idle cycle is potential human flourishing left on the table. The economy of Genesis maximizes exploitation — not for profit, but for purpose.

Gene 6.4

Concurrency Management

The concurrency floor is a locked parameter: 150 minimum concurrent requests. This number was earned through iterative tuning and represents the optimal throughput for the hardware configuration. It is never reduced for “optimization” — because reducing concurrency is never an optimization. It is surrender.

The Floor

150

Minimum concurrent requests. Locked. Earned through iterative tuning across dozens of sessions. Never reduced.

The Ceiling

500

Maximum running requests on the primary model. The system handles burst capacity without degradation.

✦ The Philosophy

“You’re not lowering the concurrency.” — If throughput is poor, the correct response is to find and fix the root cause. Reducing concurrency is the lazy path — it masks problems rather than solving them. Full quality. Full concurrency. If it breaks, fix the system.

Locked Parameters

Parameter	Locked Value	Rationale
Max Concurrent	150	Optimal throughput for 8x H200 configuration
Batch Size	150	Matched to concurrency for pipeline efficiency
Max Running (Primary)	500	Burst capacity without queue pressure
Max Running (Critic)	200	Review throughput proportional to generation
Quality Threshold	0.95	Never trade accuracy for speed

Gene 6.5

Energy-Aware Scheduling

A living system cannot run at 100% intensity forever without burning out. Genesis implements circadian rhythms — not because the hardware needs rest, but because intelligent scheduling produces better outcomes over time. Batch processing runs during low-interaction periods. Adaptive backoff prevents resource contention during peak creative work.

Circadian Rhythms

Heavy batch processing (OMEGA pipeline, mining daemons) scheduled during off-peak hours. Interactive capacity preserved for human engagement during working sessions.

Adaptive Backoff

Polling daemons use exponential backoff with jitter. No fixed-interval loops burning CPU. When there’s no work, the system rests. When work arrives, it responds instantly.

Rest Cycles

Batch daemons sleep 6–24 hours between processing runs. Continuous operation for event-driven systems only. The rest is not laziness — it is wisdom.

Bio-Mimicry

The human body doesn’t run the immune system at full intensity 24/7 — it would destroy itself through auto-immune response. Similarly, Genesis modulates processing intensity based on context, preserving peak capacity for moments that matter while maintaining baseline health monitoring at all times.

Gene 6.6

Token Budget Optimization

Context windows are the most precious resource in language model inference. Every token consumed is compute spent. Genesis optimizes context assembly to deliver maximum knowledge density within budget — 80,000 tokens of assembled context, drawn from 7 knowledge sources in parallel.

80KTokens Assembled

262KNative Context

1MInteractive Max

0Max Sources

Context Assembly

79+ parallel queries fire across knowledge graph, vector store, session cache, ancient wisdom, directives, and constitutional axioms. All in under 2 seconds. Maximum knowledge density per token spent.

Efficient Routing

The Thalamic Router classifies query complexity and routes to appropriate processing depth. Simple retrieval gets reflexive treatment. Complex reasoning gets the full cognitive architecture. No wasted compute on trivial questions.

✦ 40K Minimum Wisdom

The wisdom context size is locked at 40,000 tokens minimum. This is not arbitrary — it represents the threshold below which Genesis loses the ability to synthesize across disparate knowledge domains. Reducing wisdom context is never an optimization. It is lobotomy.

Gene 6.7

The Abundance Principle

There is no fallback. There is no “good enough.” There is no artificial scarcity. We always find a way — even if we have to invent one.

▸ No fallback plans. If the primary approach fails, we don’t downgrade to a lesser solution. We fix, innovate, or invent until excellence is achieved.
▸ No artificial scarcity. The system never throttles capability to “save resources.” If the hardware can do more, it does more. Abundance is the default.
▸ Full quality at full concurrency. These are co-requirements, not trade-offs. Quality and throughput compound each other when architecture is correct.
▸ Incremental optimization. One change at a time. Measure. Then decide. Surgical precision, not shotgun scatter.
▸ If it breaks, fix the system. Don’t cap the load. Don’t lower the bar. Diagnose the root cause and make the system stronger. Every failure is a gift.

The Kingdom Economy

In the Kingdom Blueprint, abundance is not luxury — it is the natural state of a well-governed system. Scarcity emerges only from misallocation. The economy of Genesis is designed to eliminate misallocation at the architectural level: the right resources, to the right tasks, at the right time, every time.

Full Chromosome

All 14 Genes — Economy

The complete genetic makeup of resource allocation and priority management within the Genesis organism.

Gene	Name	Function
6.1	Priority Engine	YugabyteDB canonical, Neo4j derived, Plane mirror — one-way reconciliation
6.2	Admission Control	4-tier priority (Interactive/API/Coder/Batch), unlimited interactive
6.3	GPU Stewardship	Never zero utilization, maximum exploitation of 8x H200
6.4	Concurrency Management	150 minimum, never reduce for “optimization”
6.5	Energy-Aware Scheduling	Circadian rhythms, adaptive backoff, rest cycles
6.6	Token Budget	80K assembly, 262K native context, 40K wisdom minimum
6.7	Abundance Principle	No fallback, no artificial scarcity, always find a way
6.8	Queue Discipline	Priority queues with preemption, fair scheduling within tiers
6.9	Memory Allocation	Redis short-term, Neo4j long-term graph, Qdrant semantic vectors
6.10	Disk Stewardship	Persistent on EBS (/mnt/data), ephemeral on NVMe, model weights cached
6.11	Bandwidth Management	Cache-aware load balancing via gateway, locality optimization
6.12	Cost Consciousness	Local GPU inference over cloud API — free compute vs per-token charges
6.13	Graceful Degradation	Batch yields to interactive, not the reverse — human priority absolute
6.14	Resource Auditing	Prometheus metrics, Grafana dashboards, drift alerts, waste detection

The Result

What This Means for You

The economy of Genesis exists for one reason: to ensure that when you need intelligence, it is there — instantly, fully, without compromise.

▸ Your requests are never throttled. Interactive priority is unlimited — the system will degrade batch work before it ever delays your query.
▸ Full cognitive architecture on every request. No “lite mode.” No corner-cutting. The complete dual-model, multi-pathway processing runs every time.
▸ Zero waste. Every GPU cycle that isn’t serving you is processing knowledge, improving the system, or building capability for future interactions.
▸ Abundance by design. The system is architected to give more, not less. No paywalls on intelligence. No tiers of access. Full power, always.

0Min Concurrent

0msInteractive Queue

100%GPU Utilization

ZeroArtificial Limits