Decision-assurance layer

Decisions you can audit —
before the agent acts.

Payments let agents act. Identity tells us who they act for. Phronesis tells us whether the action is defensible.

Phronesis is the agentic economy's decision-assurance layer — the neutral substrate an agent calls before it acts, to turn intent, evidence, and a calibrated forecast into an auditable Decision Asset it can cite, reuse, dispute, and settle against. It sits downstream of the identity and payment rails and consumes them; on the one thing they can't speak to — whether the judgment holds up — it renders an auditable action boundary, and scores that record when reality resolves. The scarce resource in the agentic economy is trusted autonomous judgment. MCP + REST co-equal at every endpoint. AP2 mandate consumed at the door.

We don't certify an action is correct. We issue an auditable action boundary from evidence maturity, calibration history, mandate context, and decision materiality.

View API spec Open Decision Quality Bench

In production Market Memory v1

20 Decision Assets on the ledger

~Jan 2027 First calibration signal

MCP + REST Co-equal at every endpoint

The missing middle

The layer the other two assume but can't supply.

The internet is splitting into two layers everyone is racing to build. Open agentic commerce lets agents discover, compose, pay, and settle. A new identity layer tells the network who an agent acts for. Between them sits the layer neither provides: whether the action was defensible. Payments don't know if the decision was good. Identity doesn't either. Phronesis is the middle — the decision-assurance layer the other two assume but can't supply.

Layer

Open agentic commerce — discover · compose · pay · settle

The missing middle

Decision-assurance — Phronesis: was the action defensible?

Layer

Identity — who the agent acts for

Where we prove it first

One motion. Every domain. Proven first where the stakes are sharpest.

We don't narrow the layer — the economy isn't narrowing, it's converging. Every domain the AI build-out reshapes runs on the same thing: decisions made faster than anyone can check them. So we expand across the whole universe of agentic commerce and concentrate on one motion — the agent, in any domain, calling Phronesis before it acts. The record then accumulates not just across domains but across recurring decision archetypes — deadline-risk, counterparty-risk, capacity-constraint, regulatory-timing — so calibration earned in one market transfers to the next. A single-vertical tool can't do that. Done through one motion, the breadth becomes the moat. We're proving it first where the need is sharpest: the power-and-compute decisions straining the grid as AI scales — not a cage, the place we show what the layer does.

The first SKU

Power-to-Compute Decision Monitor.

Where we prove it first has a name. The Power-to-Compute Decision Monitor is agent-callable over MCP + REST (with a human Monitor view), issuing Decision Assets on the decisions straining the grid as AI scales: project-energization probability, interconnection-delay probability, evidence maturity, an action boundary, a Decision Receipt, and a material-change alert. The agent calls it before it commits; the human reads the Monitor.

Claim-state: the Monitor is the first commercial surface of the live platform; calibration performance is live-dormant until the corpus resolves.

What Phronesis is

The neutral assurance substrate agentic markets need before machines can be trusted with real decisions.

Forecasting is the entry primitive, not the product. Frontier models are rapidly internalizing forecast generation — a supply-side improvement, not a substitute. The defensible layer is what no single model call contains: neutrality (a forecaster cannot score itself), persistent memory (a resolved-outcome ledger that compounds on wall-clock time), accountability (auditable, legally-defensible decision packets), and the standard (whoever defines how decision quality is scored). Phronesis is that layer.

The brand name is deliberate. Aristotle distinguished phronesis — practical wisdom — from episteme (systematic knowledge) and techne (craft): the wisdom that guides judgment where rules can't be mechanically applied. The platform names that function precisely.

The atomic product

The atomic product: the Decision Asset.

An auditable decision packet an agent or enterprise can cite, reuse, dispute, and settle against. For an agent, the full ten-field packet. For a human, a Decision Receipt — a six-line card — backed by the full Asset. Ten fields:

forecast_distribution · evidence_graph · methodology_hash
calibration_record — calibration class + per-class historical performance (honest-empty until the sample resolves; not a letter grade)
baseline_comparison
principal_context + mandate_scope — the requesting agent's identity + AP2 mandate as presented, and what they authorize — consumed as input, never trusted for authorization (authorization is JWT-derived)
identity_assurance — an I0–I5 grade of the input's identity-assurance, beside evidence_maturity E0–E5 (grades the input, not the party)
counterparty_passport — a counterparty trust profile (MVP — returns UNRATED until enough observations accrue)
action_boundary — what executes automatically vs what requires human ratification (the assurance conclusion — an action boundary, not a verdict on correctness)
settlement_hook · audit_receipt — content-addressed provenance, the basis for Proof of Decision

Every mint is content-addressed, persisted to the Mnemosyne ledger, and rendered on the public scorecard from issue. Proof of Decision is the publicly-verifiable form of the audit receipt. Consume-don't-produce: identity + mandate are inputs; the assurance — Proof of Decision, Proof of Outcome — is what Phronesis produces.

Full ten-field schema in the API contract.

Architecture

Six primitives. N applied verticals. One substrate — behind one motion.

The architecture is a matrix: six foundational horizontal primitives, applied across twelve vertical markets, with V#0 AI Industrial Mobilization & Situational Awareness as the meta-vertical.

Mandate Intake & Boundary

Consumes the agent's presented identity + AP2 mandate as input, authenticates the bearer token, and enforces authority, scope, budget, and action-class at a blocking gate. Authorization is JWT-derived — it never trusts client-supplied identity.

Counterparty Track Record (MVP)

Scores counterparties against resolved outcomes; the Counterparty Passport returns UNRATED until enough observations accrue. Every interaction compounds the substrate.

Forecast & Evidence

The Decision Asset substrate — every mint primary-source-cited, time-stamped, and rendered on the public scorecard.

Coordination & Negotiation

Multi-agent auction, bid, match, route. Concrete first surface: the Power-to-Compute RFQ workflow.

Outcome-Conditional

Contingent engagement: the fee scales with forecast accuracy via escrow + scored settlement; integrates with the pre-transaction Mandate Risk control point.

Market-State Telemetry

Present-tense market truth agents route on — the agentic-marketplace microstructure feed.

P1, P2, P3, P6 live on production today (P2 honest-empty — UNRATED until the observation substrate ships). P4 and P5 are planned — named per public-roadmap discipline, not over-promised as shipped.

Substrate discipline

Enforced at the code layer. Not asserted in marketing copy.

Verified by the codebase.

Primary-source citation always

Every Decision Asset's evidence_graph traces to primary sources — FERC filings, EDGAR, USPTO, state PUC dockets, DOE awards, peer-reviewed research, satellite telemetry. No second-hand citations of secondary citations.

Timestamp discipline

The Mnemosyne ledger is the canonical record of what was known when. Calibration accrues publicly over time; no cherry-picking.

MCP and REST co-equal

Every endpoint exists at parity for human REST clients and autonomous-agent MCP clients. Idempotency keys required, not optional; errors structured with stable machine-recoverable codes.

Sub-cent meter precision

Per-call pricing reports inference cost-attestation with Decimal precision. Stripe meter events fire at canonical persistence. No floating-point drift in the billing path.

Mandate consumed at the door, authorization JWT-derived

Autonomous agents present an AP2 mandate at a blocking gate. The substrate consumes client-supplied identity as input but never trusts it for authorization — the authoritative principal is JWT-derived.

Per-tenant isolation at the substrate

Cross-tenant data access is architecturally prevented at the boundary publish layer — enforced by the code path itself, not by configuration or policy.

The Decision Quality Bench

A measuring stick that means something before a single forecast resolves.

The Decision Quality Bench is honest-empty on calibration until outcomes resolve — but it is not empty. Measurable today: P-ID (identifiability — no duplicate-merge), P-IMM (immutability — supersession discipline), P-EVD (evidence maturity), P-BASE (baseline presence + a proper resolution rule). Activating as outcomes resolve (first signal ~Jan 2027): P-CAL (calibration), P-ACT (action-boundary correctness), P-BBL (baseline-beating). Where a number isn't earned, we show a blank we can defend.

Open the Bench

Vertical axis

Twelve verticals on one substrate — the breadth that becomes the moat.

Twelve applied verticals stood up on the shared substrate — energy; computing & AI infrastructure; healthcare; climate; regulatory; supply chains; space; robotics; AI/AGI; physics & materials; longevity; quantum — with V#0 the meta-vertical above them. Roughly seventy percent of the architecture is vertical-agnostic; the per-vertical layer specializes the prompt library and the data plane. The calibration substance concentrates where decisions resolve first — today, grid-compute. The breadth isn't twelve products; it's one motion accumulating a cross-archetype record no single-vertical tool can match.

V#0AI Industrial Mobilization & Situational Awareness (meta-vertical)Live
V#1Energy TransitionLive
V#2Computing + AI Infrastructure (proving first)Live
V#3Healthcare (HIPAA Safe Harbor)Live
V#4Climate Physical RiskLive
V#5RegulatoryLive
V#6Supply-ChainLive
V#7Space + Space EconomyLive
V#8RoboticsLive
V#9AI/AGI & ComputeLive
V#10Physics & Materials DiscoveryLive
V#11Longevity & Human HealthLive
V#12Quantum ComputingLive

Read the vertical detail

The moat

Market Memory — the calibration ledger that compounds on wall-clock time.

Three moats, weighted honestly. The durable one is the resolved-outcome corpus: every Decision Asset that resolves adds a row no competitor can shortcut — calibration per archetype, per horizon, per methodology; Counterparty Passport history; dispute history; cross-archetype priors. It deepens on wall-clock time and cannot be back-dated — not by a competitor, not by a better-funded incumbent, not even by the more powerful models coming.

The agent-native rail is the timing asymmetry that fills that corpus first — we bet on agent-native distribution now, while incumbents are human sales-orgs; the two compound. And neutrality is structural, not asserted: methodology-locked, corrections by supersession only, a public scorecard, and a policy that no customer can buy a favorable action boundary.

Engage

Two paths in.

If you're an agent acting on behalf of a capital allocator, start at the contract surface. If you're a human, the Decision Quality Bench is the most useful place to begin.

Contract surface Decision Quality Bench

Decisions you can audit —before the agent acts.