AI Economics · CFO

Know the Unit Cost of Every Artifact

Your agents produce artifacts. V2 tells you exactly what each one cost — which model was used, how many tokens, and whether a cheaper model would have delivered the same confidence.

AI ECONOMICS

Know the Unit Cost of Every Artifact

Your agents produce outputs. V2 tells you exactly what each one cost — and whether you could have spent less.

Unit Cost Per Artifact

Every artifact carries its full cost lineage — from model selection to token consumption. Query the unit economics of any output your agents produce.

-- Unit cost per artifact, broken down by model
SELECT
  a.kind,
  d.model_id,
  AVG(i.cost_usd) AS avg_cost,
  AVG(a.confidence) AS avg_confidence,
  COUNT(*) AS artifact_count
FROM hd_runtime.artifact_instances a
JOIN hd_runtime.task_attempts att ON a.attempt_id = att.id
JOIN hd_runtime.model_decisions d ON att.id = d.attempt_id
JOIN hd_runtime.llm_invocations i ON d.id = i.decision_id
WHERE a.created_at > NOW() - INTERVAL '30 days'
GROUP BY a.kind, d.model_id
ORDER BY avg_cost DESC;

Overconfident Model Detection

Find models that report high confidence but get overridden by human reviewers — a signal that routing rules need adjustment.

gpt-4-turbo: 0.97 avg confidence, 34% human override rate

-- Models with high confidence but high override rate
SELECT
  d.model_id,
  AVG(a.confidence) AS avg_confidence,
  COUNT(*) FILTER (WHERE r.decision = 'rejected') AS overrides,
  COUNT(*) AS total_reviews,
  ROUND(
    100.0 * COUNT(*) FILTER (WHERE r.decision = 'rejected') / COUNT(*),
    1
  ) AS override_pct
FROM hd_runtime.model_decisions d
JOIN hd_runtime.task_attempts att ON d.attempt_id = att.id
JOIN hd_runtime.artifact_instances a ON att.id = a.attempt_id
LEFT JOIN hd_runtime.review_requests rev ON a.id = rev.artifact_id
LEFT JOIN hd_runtime.release_decisions r ON rev.id = r.review_id
WHERE d.created_at > NOW() - INTERVAL '30 days'
GROUP BY d.model_id
HAVING COUNT(*) > 10
ORDER BY override_pct DESC;

Model Routing ROI

Shadow evaluation shows what you would have spent with a different routing strategy. Compare production routing against alternatives without risk.

$4,200 saved in 30 days via cost-optimized routing

-- Shadow policy comparison: production vs alternative routing
SELECT
  s.policy_name,
  SUM(s.production_cost) AS actual_cost,
  SUM(s.shadow_cost) AS shadow_cost,
  SUM(s.production_cost - s.shadow_cost) AS savings,
  AVG(s.production_confidence) AS prod_confidence,
  AVG(s.shadow_confidence) AS shadow_confidence
FROM hd_runtime.shadow_policy_outcomes s
WHERE s.created_at > NOW() - INTERVAL '30 days'
GROUP BY s.policy_name
ORDER BY savings DESC;

30-Day Cost Report

Export a complete cost breakdown by model, artifact type, and team. SQL-queryable or export as PDF for finance review.

$0.009 avg cost/artifact after routing optimization

-- 30-day cost summary by model and day
SELECT
  model_id,
  day,
  invocation_count,
  total_cost,
  avg_cost,
  total_tokens,
  avg_latency_ms
FROM hd_runtime.v_invocation_costs
WHERE day > CURRENT_DATE - 30
ORDER BY day DESC, total_cost DESC;

COST INTELLIGENCE

30-Day Cost Report

Real data from a governed runtime. Every dollar traced from model selection to artifact delivery.

$0.009

Avg cost/artifact (after routing)

$4,200

Total saved (30 days)

34%

gpt-4-turbo override rate

89%

Cost reduction via shadow routing

Cost by Model (30 days)

Model	Calls	Avg Cost	Total	Avg Confidence	Share
claude-haiku-4-5	12,847	$0.002	$25.69	0.88	62%
claude-sonnet-4-6	5,231	$0.008	$41.85	0.93	25%
gpt-4-turbo	2,714	$0.042	$113.99	0.97	13%

Ship Smarter Agents. Start in 60 Seconds.

Persistent memory. Isolated state. Verifiable reasoning. Through standard SQL.

Try the Playground →Calculate Your Savings →