Profit agents, post-AGI economics, and mechanism design
Study bounded toy economies for profit agents under explicit assumptions, then climb from platform incentives and demand closure to coordination and mechanism design.
Every result below is a bounded theorem inside a toy model, not a general claim about the real economy. The point is to make the logic precise enough that counterexamples, edge cases, and mechanism changes become visible.
Assumption tree
The right way to use this tutorial is as a decision tree over assumptions, not as one hidden story about the future.
The top-level axes are:
- capability
- current AI
- AGI-level agent
- control shell
- legal-world shell
- crypto-native shell
- hybrid shell
- final sink structure
- human-dominant
- mixed human-agent
- crypto or external-dominant
- discovery and execution rails
- open routing
- platform bottleneck
- governed protocol
- ownership pattern
- active principals
- passive claimants
- mixed ownership
Each theorem below should be read as:
on path B of the assumption tree, theorem T survives
That is better than pretending one starting model already matches future reality.
Assumption tree
How to read the game theory and logic
This tutorial uses three model shapes.
- finite choice games
- a platform, household, or firm chooses from explicit actions
- payoffs determine which actions survive as best responses or private optima
- mechanism-design games
- a rule such as a price or quota is chosen first
- households then respond
- the theorem asks which interior regimes are implementable
- scalar closure models
- the main question is not strategic deviation
- it is whether aggregate demand can absorb aggregate output under the stated cap
The common template is:
M = (players, types, actions, payoffs, constraints, timing)
and each theorem should be read as:
under assumptions A in model M, property P holds
The properties in this tutorial are mostly:
- activation, which types or firms stay active
- private optimum, which regime maximizes platform or controller revenue
- individual stability, whether a chosen action is a best response
- closure, whether demand is at least supply
- implementability, whether a price, quota, or composed rule can induce a clearing interior regime
The notation is consistent across the experiments:
- lowercase
piorUis an individual payoff - uppercase
PiorRis a platform, firm, or total-profit object iffmeans “if and only if”argmaxmeans the action with highest payoff- a
witnessis a small concrete parameter choice that makes a law visible, not a claim of proof by anecdote
The experiments also fall into four game-theoretic families:
- Experiments 1-3: two-stage platform-subscriber games with complement types and admissibility gates
- Experiments 4-7: ownership and coordination models under a one-unit household demand cap
- Experiments 8-10: price, quota, and price-plus-quota mechanism-design games
- Experiments 11-13: sink and routing models for entry ceilings and bottleneck rents
The strategic timing is also explicit:
- Platform-profit games (Experiments 1-3)
- complement structure and gates are fixed
- the platform chooses a regime or extraction level
- subscriber types choose inactive, passive, or active behavior and, when needed, effort
- the result is an activation law or a private-optimum law
- Closure and ownership games (Experiments 4-7)
- ownership counts or household preferences determine the active set
- aggregate supply and demand are computed
- the result is a closure law or a coordination law
- Mechanism-design games (Experiments 8-10)
- a designer chooses a price, a quota, or both
- household types best-respond
- the result is an implementability law for a clearing interior regime
- Sink and routing models (Experiments 11-13)
- the final sink bundle and cost structure are fixed
- firms share sink access symmetrically or through slots
- the result is an entry ceiling law or a redistribution law
These are bounded theorems, not a full general-equilibrium theory. That limitation is part of the method. It keeps the assumptions visible enough that counterexamples and new cases can still be added honestly.
Thought experiment: a zero-employee software company
Start with a concrete picture.
Assume a software company with:
- no human employees
- one or more strong profit-seeking agents
- a control shell that can hold rights, assets, or code authority
- compute, storage, and platform access
- customers who buy software or services from the company
The agent does the operational work:
- market research
- product design
- coding
- testing
- deployment
- customer support
- billing
- iteration on the next product
In the strongest version of the thought experiment, the company can keep spawning more software products with almost no extra human labor.
That sounds like unlimited entrepreneurship, but the real question is narrower:
if software production becomes cheap, what remains scarce?
The first candidates are:
- customer attention
- distribution
- trust
- settlement rails
- legal permission, if the company touches the legal world
- compute priority
- platform ranking
But that first picture is only one case.
The analysis below now separates at least three regimes:
- legal-world regime
- contracts, legal shells, and regulated payment rails matter
- crypto-native regime
- code and settlement logic can replace some legal plumbing
- blockspace, custody, governance, and settlement assets become scarce
- hybrid human-agent regime
- human buyers, agent buyers, and mixed organizations all contribute to final demand
So the correct opening correction is:
- human attention is one possible sink, not the only one
- legal contracts are one possible control shell, not the only one
- payment matters, but that can mean fiat rails, crypto settlement, or other code-native transfer layers
The question is not whether the economy is “really software”. The question is which parts of it are:
- intermediate agent-to-agent flow
- final sink demand
- and governance or settlement bottlenecks
So the analysis below does not assume that zero-human firms automatically become rich.
It studies the harder question:
under explicit assumptions, when does a profit-agent economy clear,
when does it concentrate, and what mechanisms can keep it stable?
That is why the next steps look mathematical. The point is to turn the thought experiment into bounded models where:
- the profit regions are explicit
- the failure modes are explicit
- and the coordination mechanisms can be compared exactly
Experiment 1: complement-threshold and governed-profit law
The next rabbit hole was not another software atlas refinement. It was the first bounded profit-agent game.
Assume:
- the same model capability for all subscribers
- two complement classes:
lowhigh
- three platform extraction levels:
openextractiveclosed
- and one
MPRDgate that can forbid the top illicit strategy
The bounded subscriber payoff is:
\[\pi_i = v_i + t_i - p - \tau_i - r_i - k_i\]and the platform payoff is:
\[\Pi_P = N p + \sum_i \tau_i + A + E - C_{\text{compute}} - C_{\text{capex}} - C_{\text{safety}}\]What survives is already nontrivial.
Equal model capability does not imply equal profit thresholds.
In the checked toy economy:
low -> {open}high -> {open, extractive}
So complement heterogeneity alone produces unequal activation regions.
The platform side also has an exact region law:
\[R_{\text{extractive}} \geq R_{\text{open}} \iff n_{\text{low}} \leq 4 \cdot n_{\text{high}}\]So a platform can rationally choose a regime that keeps only high-complement users active.
The MPRD gate changes the game without forcing profit to zero.
The bounded law is:
\[\max_{a \in \text{Allowed}} \text{profit}(a) > 0 \quad \text{even when} \quad \arg\max_a \text{profit}(a) \notin \text{Allowed}\]So the first bounded profit-agent object already says:
- same intelligence does not force equal income
- platform incentives can exclude low-complement users
- governed admissibility changes the feasible profit set, not only the labels
Experiment 2: hold-up and passive-ownership threshold law
The next deeper question is whether a platform should simply capture everything and turn users into passive claimants.
That is a two-stage game.
The platform chooses one expropriation level:
openmoderatehighmaximal
The user then chooses among:
- inactive
- passive ownership
- active profit-agent use
The bounded active payoff is:
\[U_{\text{active}}(q, e) = (1 - q)\, V(e) - C(e) - s\]and the user chooses:
\[U_{\text{user}}(q) = \max\!\big(0,\; d,\; \max_e U_{\text{active}}(q, e)\big)\]where d is the passive dividend.
Platform revenue is:
\[R_P(q) = \text{subscription} + q\, V\!\big(e^*(q)\big)\]when the user stays active, and 0 otherwise.
In the checked bounded game:
- best active effort drops:
open -> e3moderate -> e2high -> e1maximal -> e0
- mode choice is:
open -> activemoderate -> activehigh -> passivemaximal -> passive
- platform revenue is:
open -> 4moderate -> 12high -> 0maximal -> 0
So the unique platform optimum is not maximal extraction. It is the interior regime:
moderate
That is the first bounded formal hold-up law in this line of analysis.
It says:
- higher extraction reduces productive effort
- passive ownership can become rational before full extraction
- full platform capture is not automatically the revenue-maximizing equilibrium
That is a much stronger result than the earlier prose intuition.
Experiment 3: heterogeneous-complement passive-ownership region law
The next bounded question combines the last two layers:
- heterogeneous complements
- passive ownership
- platform extraction
Assume:
- all subscribers have the same model capability
- subscribers differ only by complements:
lowhigh
- passive ownership is available with a bounded dividend
- the platform chooses one expropriation level:
openmoderatehighmaximal
The active payoff is:
\[U_{\text{active}}(c, q, e) = (1 - q)\, V(c, e) - C(e) - s\]and the subscriber chooses:
\[U(c, q) = \max\!\big(0,\; d,\; \max_e U_{\text{active}}(c, q, e)\big)\]Platform revenue is:
\[R(q;\, n_{\text{low}}, n_{\text{high}}) = n_{\text{low}}\, r_{\text{low}}(q) + n_{\text{high}}\, r_{\text{high}}(q)\]The checked bounded result is very sharp.
Mode choice by complement class:
low:open -> activemoderate -> passivehigh -> passivemaximal -> passive
high:open -> activemoderate -> activehigh -> passivemaximal -> passive
So the platform collapses to exactly two viable regimes:
open, which includes both complement classesmoderate, which keeps only high-complement users active
The exact region law is:
\[R_{\text{moderate}} \geq R_{\text{open}} \iff 2\, n_{\text{low}} \leq 9\, n_{\text{high}}\]Strict witnesses:
openbeatsmoderateat(n_low, n_high) = (5, 1)moderatebeatsopenat(n_low, n_high) = (4, 1)
and both high and maximal are dominated because even high-complement users
switch to passive ownership there.
This is the first bounded object in this line of analysis where complement heterogeneity, passive ownership, and platform extraction all interact at once.
It says:
- equal intelligence does not imply equal active participation
- passive ownership changes the platform’s optimal extraction region
- the relevant equilibrium is controlled by complement mix, not only by model capability
Closure and ownership (Experiments 4-7)
Experiments 1-3 established that complement heterogeneity and platform extraction create unequal activation regions even under equal capability. The next group zooms out to the macro question: if output grows faster than direct consumption, who buys the surplus?
Experiment 4: demand-closure ownership law
The next deeper question is the macro one:
if labor income falls and output rises, who buys the output?
The first clean answer is an arithmetic theorem under an explicit one-unit household demand model.
Assume:
nactive owner households- each active owner household produces
m + 1units - each active owner household consumes at most one unit
bpassive-beneficiary households can each consume one unit
Then:
\[S = (m + 1)\, n, \qquad D = n + b\]and the exact closure law is:
\[D \geq S \iff b \geq m \cdot n\]So demand clears supply iff passive beneficiaries are broad enough to absorb
the extra m * n units beyond owner self-consumption.
This has two sharp corollaries.
First, if there are no passive beneficiaries:
\[b = 0 \;\Rightarrow\; \text{closure requires } m \cdot n = 0\]So once each active owner household produces more than one unit, concentrated claims alone fail demand closure in this model.
Second, in the double-output case:
\[m = 1 \;\Rightarrow\; \text{closure} \iff b \geq n\]So if each active owner household produces two units, the system needs at least one passive beneficiary for each active owner household.
This is the first generic arithmetic theorem in this post-AGI economics sequence.
It does not prove a whole macroeconomy. But it does prove something important:
- if output grows faster than direct owner consumption
- and claims stay concentrated
then broad passive claims or transfers are not optional in this model. They are mathematically required for demand closure.
Experiment 5: private-optimum versus closure phase diagram
The next bounded step integrated the v134 platform game with the new
demand-closure theorem.
Assume:
openregime:- low-complement households stay active and produce
1unit - high-complement households stay active and produce
2units
- low-complement households stay active and produce
moderateregime:- only high-complement households stay active
- each active high-complement household produces
2units
- each household consumes at most one unit
n_high > 0
Then the two clearance laws are:
\[\text{open clears} \iff n_{\text{low}} + n_{\text{high}} \geq n_{\text{low}} + 2\, n_{\text{high}} \iff n_{\text{high}} = 0\] \[\text{moderate clears} \iff n_{\text{low}} + n_{\text{high}} \geq 2\, n_{\text{high}} \iff n_{\text{high}} \leq n_{\text{low}}\]So once any positive high-complement population exists:
opennever clearsmoderateclears exactly when low-complement households are numerous enough
The private-optimum boundary from v134 still holds:
Putting those together gives the first integrated post-AGI platform phase diagram in the repo:
\[\text{For } n_{\text{high}} > 0: \quad \text{viable regime exists} \iff n_{\text{high}} \leq n_{\text{low}} \;\land\; 2\, n_{\text{low}} \leq 9\, n_{\text{high}}\]When it exists, it is the moderate regime.
Three witness regions make the structure clear:
- private-optimum / closure conflict:
(n_low, n_high) = (5, 1)openhas higher private revenue,24 > 22openfails closuremoderateclears
- viable moderate band:
(4, 1)moderateis private-optimalmoderateclears
- no viable regime:
(0, 1)moderateis privately optimal- neither
opennormoderateclears
Platform phase diagram
This is the deepest economics-side result so far because it formally separates:
- what the platform wants
- what keeps the economy demand-clearing
Those are not the same object.
Experiment 6: active-owner share ceiling
The next bounded step extracted the cleaner macro theorem hidden inside
v135 and v136.
Assume:
- there are
htotal households nhouseholds remain active owner-principals- each active owner household produces
2units - each household consumes at most
1unit - passive or inactive households consume
1unit and produce0
Then demand closure is:
\[h \geq 2n\]which is exactly equivalent to:
\[n \leq \frac{h}{2}\]So with two-unit active owners, at most half of households can remain active owners if the economy is to clear output under this one-unit demand cap.
This gives a sharper impossibility result:
\[\text{For } h > 0: \quad \lnot\,(h \geq 2h)\]So no positive-household economy in this model can keep every household as an active owner once each active owner produces two units.
Three small witnesses make the structure concrete:
- clearing case:
h = 4n = 2- active-owner share
1/2 - closure holds
- overfull case:
h = 3n = 2- active-owner share
2/3 - closure fails
- all-active failure:
h = 3n = 3- active-owner share
1 - closure fails
This is the first exact theorem in this profit-agent sequence that speaks directly to the universal-principal question.
Experiment 7: symmetric coordination law
The next bounded step turned that active-owner ceiling into a real active-versus-passive coordination game.
Assume:
hidentical householdsnactive owner households- each active owner household produces
2units - each household consumes at most
1unit
Mode semantics:
- if active payoff is greater than passive payoff, every household chooses
active, so
n = h - if active payoff is less than passive payoff, every household chooses
passive, so
n = 0 - if active payoff equals passive payoff, any
nis individually stable
Then three cases survive.
Strict active preference gives no positive-household clearing equilibrium.
For h > 0, if n = h, then:
Strict passive preference gives only the zero-production equilibrium:
n = 0
A nontrivial clearing equilibrium first appears only in the indifference case, where:
\[h \geq 2n \iff n \leq \frac{h}{2}\]So the first stable nontrivial regime is a coordinated interior split, not:
- everyone active
- or everyone passive
This is the first actual coordination theorem in this profit-agent sequence.
Mechanism design (Experiments 8-10)
Experiments 4-7 showed that a coordinated interior split is needed for demand closure, but did not say how to implement it. The next group turns that coordination problem into a mechanism-design problem: can a price, a quota, or both induce the right active-passive split?
Mechanism design progression
Experiment 8: uniform-price impossibility and quota implementability
The next bounded step turned the coordination question into a mechanism-design question.
Assume the same symmetric double-output economy, but now compare two coordination devices.
Uniform-price mode semantics:
- if
delta > 0, all households strictly prefer active, son = h - if
delta < 0, all households strictly prefer passive, son = 0 - if
delta = 0, households are indifferent, so anynis individually stable
Quota mode semantics:
- under
delta > 0, all households want active - a hard quota
qcaps active slots, son = min(h, q)
The first exact law is a uniform-price impossibility result:
\[0 < n < h \;\land\; \text{individual stability} \;\Rightarrow\; \delta = 0\]So prices alone cannot implement a positive interior individually stable regime unless they create exact indifference.
The second exact law is a quota implementability result:
\[h \geq 2q \iff q \leq \frac{h}{2}\]So under strict active preference, a hard quota can implement an interior regime, and it clears exactly when the quota stays below the same half-share ceiling.
Three witnesses show the split:
- uniform-price failure:
h = 4n = 2delta = 1- not individually stable
- quota clearing:
h = 5q = 2delta = 1n = 2- closure holds
- quota overfull:
h = 5q = 3delta = 1n = 3- closure fails
This is the first bounded mechanism theorem in this profit-agent sequence:
- prices alone do not solve the symmetric coordination problem
- quotas can
Experiment 9: heterogeneous price-selection law
The next bounded step asked whether the pricing impossibility from v139
survives once households differ.
Assume:
Llow-complement householdsHhigh-complement households- each active household produces
2units - each household consumes at most
1unit - strict surplus order:
Uniform-price semantics:
p < a_low, both types choose activea_low < p < a_high, only the high type chooses activep > a_high, both types choose passive
So in the strict middle region:
\[a_{\text{low}} < p < a_{\text{high}} \;\Rightarrow\; n_{\text{low}} = 0 \;\land\; n_{\text{high}} = H\]That is the first clean contrast with v139.
A single uniform price can now implement a nontrivial interior regime because
the household types are different.
The exact clearing law for that middle region is:
\[L + H \geq 2H \iff H \leq L\]So pricing works, but only under a sharp composition condition:
- the high-complement active group must not be a majority
This is the first exact result in this mechanism sequence showing that heterogeneous complements change the logic of the mechanism problem.
Experiment 10: price-plus-quota composition law
The next bounded step composed the two mechanisms.
Keep the same two-type economy and stay in the strict middle price region:
a_low < p < a_high
So price solves type selection by making only the high type want active.
Then add a second stage:
- a hard quota
qon active high-type slots
The exact composed clearing law is:
\[L + H \geq 2q \iff q \leq \frac{L + H}{2}\]This gives the sharp interpretation:
- price solves selection
- quota solves allocation
The witnesses make the contrast concrete:
- price-only failure:
L = 1H = 4q = 4- closure fails
- composed clearing:
L = 1H = 4q = 2- closure holds
- overfull quota:
L = 1H = 4q = 3- closure fails
So the deeper object is now a composed mechanism. Price alone or quota alone is not the whole story. The mechanism sequence now has:
- a homogeneous impossibility theorem
- a heterogeneous price-selection theorem
- and a price-plus-quota composition theorem
That is the clearest mechanism-design progression in this economics sequence so far.
Sinks, slots, and entry ceilings (Experiments 11-13)
Experiments 8-10 solved the coordination problem for a fixed demand model. The next group corrects a hidden assumption: human attention is not the only possible final sink. Agent-to-agent trade, crypto settlement, and external demand all contribute. These experiments ask what limits firm entry and who captures the rents.
Experiment 11: intermediate-demand multiplier law
This experiment corrects a hidden assumption in the zero-employee software-company thought experiment.
Human attention is not the only possible final sink.
Assume a scalar software-only economy with:
- gross output
Y - intermediate agent-to-agent demand share:
alpha = a / b
with 0 <= a < b
- final sink bundle:
F = H + A_term + C + X
where:
His human final demandA_termis terminal agent demandCis crypto or code-native settlement demandXis external demand
The gross-output law is:
\[Y = \alpha Y + F\]or in scaled integer form:
\[bY = aY + bF\]The exact laws are:
\[a < b \;\land\; F = 0 \;\Rightarrow\; Y = 0\] \[a < b \;\land\; F > 0 \;\Rightarrow\; Y > 0\]So intermediate agent demand amplifies final sinks, but cannot replace them.
This is the important correction:
- human attention is not uniquely necessary
- but circular agent trade without any positive final sink does not sustain output in this model
Two witnesses make the point concrete:
- zero final sink:
a = 1b = 2F = 0Y = 0
- crypto sink only:
a = 1b = 2H = 0A_term = 0C = 3X = 0F = 3Y = 6
So the right way to think about the zero-employee firm is no longer:
- “does it ultimately need human buyers”
It is:
- “what positive final sinks exist”
- “what intermediate flows amplify them”
- “what settlement and governance layer makes the profits real”
Experiment 12: zero-employee company entry ceiling law
Take one explicit path through the assumption tree.
Assume:
- legal shell creation is not the binding bottleneck
- the firm can use a legal, crypto-native, or hybrid control shell
- the mixed final sink bundle is still:
F = H + A_term + C + X
Nactive zero-employee firms share that bundle symmetrically- each active firm pays operating cost
c
Then total profit is:
\[\Pi_{\text{total}} = F - Nc\]The exact laws are:
\[\Pi_{\text{total}} \geq 0 \iff F \geq Nc, \qquad \Pi_{\text{total}} > 0 \iff F > Nc\]So even if agents can create whole firms and legal formation is treated as nonbinding, unlimited technical firm creation does not imply unlimited sustainable firms.
The hard entry ceiling is set by:
- final sink size
- active firm count
- per-firm operating cost
Three witnesses make the boundary concrete:
- positive margin:
F = 13N = 3c = 4Pi_total = 1
- break-even:
F = 12N = 3c = 4Pi_total = 0
- overcrowded:
F = 11N = 3c = 4Pi_total = -1
This is the first direct zero-employee-company ceiling theorem in this analysis. The next important splits are:
- open routing versus platform discovery bottlenecks
- symmetric access versus slot rents
- legal enforcement versus code-native enforcement
Experiment 13: discovery-slot redistribution law
What happens once zero-employee firms are easy to create but sink access is bottlenecked?
Assume:
- fixed final sink bundle:
F = H + A_term + C + X
Nactive firmsqdiscovery slots with0 < q <= N- slot holders split the sink symmetrically
- each firm pays operating cost
c
Then total profit remains:
Pi_total = F - N * c
regardless of q.
What changes is the slot-holder margin. Its scaled numerator is:
M_slot = F - q * c
with exact laws:
M_slot > 0 iff F > q * c
Pi_undiscovered = -c
So discovery bottlenecks redistribute profit, but do not create new system surplus.
The witness makes the point concrete:
F = 12N = 6c = 1- bottleneck case:
q = 2Pi_total = 6M_slot = 10
- open routing case:
q = 6Pi_total = 6M_slot = 6
The bottleneck changes who captures the gains, not the size of the pie.
That is the clean formal version of the platform-power question for zero-employee companies.
Slot sales, admissibility, and frontier labs (Experiments 14-16)
Experiments 11-13 showed that final sinks set hard entry ceilings and that discovery bottlenecks redistribute profit without creating it. The next group adds mechanisms on top: governed slot fees, MPRD admissibility caps, and the strategic choice between closed and open deployment for frontier labs.
Experiment 14: governed execution-slot sale law
Add an explicit mechanism to the bottleneck case.
Assume:
- fixed final sink bundle:
F = H + A_term + C + X
Ncandidate firmsqgoverned execution slots with0 < q <= N- only slot holders can reach the sink
- slot holders split the sink symmetrically
- every candidate firm pays operating cost
c - each winning slot holder pays fee
s_i >= 0
Let:
S = sum_i s_i
be total slot-fee revenue.
Then:
Pi_system = F - N * c
Pi_winners = F - q * c - S
Pi_controller = S
So slot sales redistribute rents between winners and the controller, but do not create new system surplus.
In the symmetric-fee case, where each winner pays the same fee s, the scaled
winner margin is:
M_win = F - q * c - q * s
with exact law:
M_win >= 0 iff F >= q * (c + s)
The witnesses make the split concrete:
F = 16N = 6q = 2c = 2- low fee:
s = 1Pi_system = 4Pi_winners = 10Pi_controller = 2
- high fee:
s = 3Pi_system = 4Pi_winners = 6Pi_controller = 6
So once execution access is sold, the hard question is not only who gets a slot. It is how the slot mechanism transfers rent.
Experiment 15: MPRD-governed slot-cap law
What does MPRD do in that same slot-sale economy?
Assume:
- the same governed-slot case
- the controller chooses total extraction
S - the admissibility layer imposes:
0 <= S <= G
- winners stay only if aggregate winner profit is nonnegative:
F - q * c - S >= 0
Stay in the viable case:
F >= q * c
Then the governed-optimal extraction is:
S_star = min(G, F - q * c)
with positivity law:
S_star > 0 iff G > 0 and F > q * c
So MPRD does not only block the unconstrained maximum.
It carves out a governed extraction region.
The two binding cases are:
- admissibility binding:
F = 16q = 2c = 2G = 5S_star = 5
- viability binding:
F = 10q = 2c = 2G = 9S_star = 6
That is the cleanest current toy answer to the question:
what does MPRD allow in a bottleneck economy?
Experiment 16: deployment-surface divergence law
This experiment switches perspective from platforms to frontier labs.
Assume a lab chooses one deployment surface:
closedopen
Let:
Fbe the base direct revenue opportunityDbe the extra ecosystem output created only by the open casek_closed,k_openbe lab-side operating or safety costsa / mbe the share of open-case output the lab capturese_closed,e_openbe externality or governance costs counted by the social planner
Then the exact case conditions are:
lab_prefers_open iff a * (F + D) >= m * (F - k_closed + k_open)
social_prefers_open iff D >= e_open - e_closed
So the private case and the social case can diverge.
The clean divergence region is:
D >= e_open - e_closed
and
a * (F + D) < m * (F - k_closed + k_open)
Witness:
F = 10D = 8a = 1m = 2k_closed = 1k_open = 1e_closed = 5e_open = 7
So one bounded answer to “what are a frontier lab’s options?” is:
- the deployment surface is a strategic choice
- and the privately optimal option need not be the socially optimal option
Household participation and routing (Experiments 17-19)
Experiments 14-16 studied the supply side: slot sellers, admissibility caps, and lab deployment. The next group takes the household and routing perspective. When agent firms are abundant, the relevant household question is not “can I own an agent?” but “does entering the active race beat passive claims?” And the relevant routing question is whether equalized protocols expand the viability region.
Experiment 17: household slot-lottery participation law
Take the household perspective on the slot economy.
Assume:
Nsymmetric householdsqgoverned execution slotsnhouseholds enter the active slot lottery- stay in the congested case:
n >= q > 0
- each active winner gets gross revenue
R - each active winner pays operating cost
c - each active winner pays slot fee
s - each household can stay passive and receive outside option
d > 0
Under symmetric congestion:
U_active(n) = (q / n) * (R - c - s)
U_passive = d
The exact entry law is:
U_active(n) >= d iff q * (R - c - s) >= n * d
So the slot-and-sink system has a sharp household-side active-entry ceiling.
Witnesses with q = 2, R = 9, c = 2, s = 1, d = 2:
- active entry survives at
n = 5 - knife-edge at
n = 6 - over-entry failure at
n = 7
This means that once agent firms are abundant, the relevant household question is not only whether a household can own an agent. It is whether entering the active race beats passive claims.
Experiment 18: asymmetric-routing class-viability law
Extend the sink-access model from slot counts to route weights.
Assume:
- fixed final sink bundle
F - route class
Ahas weighta - route class
Bhas weightb a > b > 0n_A > 0activeAfirmsn_B > 0activeBfirms- each active firm pays operating cost
c
Then total profit is still:
Pi_total = F - (n_A + n_B) * c
But class viability is controlled by:
M_A = a * F - c * (a * n_A + b * n_B)
M_B = b * F - c * (a * n_A + b * n_B)
with dominance law:
M_A - M_B = (a - b) * F > 0
So:
Pi_B >= 0 implies Pi_A >= 0
but not conversely.
The witness at F = 14, a = 3, b = 1, n_A = 2, n_B = 2, c = 2
gives:
M_A = 26M_B = -2
So routing asymmetry can make one route class viable while another fails, even when the total system surplus formula has not changed.
Experiment 19: protocol equalization expansion law
Compare favored routing with equalized routing directly.
Assume:
n_A > 0advantaged-route firmsn_B > 0disadvantaged-route firms- each firm pays operating cost
c - fixed sink bundle
F
Platform regime:
- class
Agets multipliera >= 1 - class
Bgets multiplier1
Protocol-equalized regime:
- both classes get multiplier
1
For class B, define:
M_B_platform = F - c * (a * n_A + n_B)
M_B_protocol = F - c * (n_A + n_B)
The exact inclusion law is:
M_B_platform >= 0 implies M_B_protocol >= 0
and the inclusion is strict whenever a > 1 and n_A > 0.
Witness:
F = 10a = 3n_A = 2n_B = 2c = 2M_B_protocol = 2M_B_platform = -6
So the platform-versus-protocol question is not hand-wavy dominance. It is whether equalized routing expands the disadvantaged class’s viability region.
Machine control and trust dynamics (Experiments 20-27)
Experiments 17-19 studied routing and participation for agent firms that already exist. The next group asks a different question: when should a company shift from human to machine control in the first place? The answer involves trust lag, incumbent rents, dynamic learning, and assurance packages.
Trust-adoption timeline
Experiment 20: machine-control dominance threshold law
When should a company becoming agentic be read as an actual control upgrade rather than just a narrative?
Compare:
- human-managed shell
H - machine-managed shell
M
Let:
k_H,k_Mbe operating costse_H,e_Mbe expected failure-loss burdenst_H,t_Mbe trust frictionsa_H,a_Mbe auditability premiums
Then:
\[\Pi_H = V - k_H - e_H - t_H + a_H, \qquad \Pi_M = V - k_M - e_M - t_M + a_M\]The exact dominance law is:
\[\Pi_M \geq \Pi_H \iff (e_H - e_M) + (a_M - a_H) \geq (k_M - k_H) + (t_M - t_H)\]So machine control dominates exactly when reliability gain plus auditability gain covers machine cost premium plus machine trust penalty.
This is the first bounded theorem in this line of analysis that turns the slogan
machines must be more trusted and reliable than people
into an exact threshold inequality.
Experiment 21: trust-lag divergence law
True reliability and observed trust are not the same. This experiment separates them.
Compare two evaluators:
- a private chooser, who pays machine trust discount
tau >= 0 - a social evaluator, who does not count
tauas a real welfare loss
Then the machine conditions are:
\[\text{social prefers machine} \iff e_H - e_M \geq k_M - k_H\] \[\text{private prefers machine} \iff e_H - e_M \geq k_M - k_H + \tau\]So the divergence region is:
\[k_M - k_H \leq e_H - e_M < k_M - k_H + \tau\]This means machines can be truly safer and still under-adopted.
The witness with V = 30, k_H = 8, k_M = 9, e_H = 10, e_M = 4,
tau = 7 gives:
- social evaluator prefers machine
- private chooser still prefers human
So reliability advantage and social optimality can arrive before private trust fully catches up.
Experiment 22: trust-learning experimentation law
The machine-control model becomes a two-period game.
Normalize the two-period human baseline to 0.
Let:
Abe the machine’s per-period structural advantage over human control, excluding trust discounttau1 >= 0be the period-1 machine trust discountlam >= 0be the trust-learning gain from one successful machine period
If the machine is used successfully in period 1, period-2 trust becomes:
\[\tau_2 = \tau_1 - \lambda\]Assume $\tau_2 \geq 0$.
Then:
-
period-1 machine premium: $A - \tau_1$
-
two-period committed machine-path premium:
The exact dynamic adoption law is:
\[\text{two-period machine path beats always-human} \iff 2A + \lambda \geq 2\tau_1\]So there is a wedge region where:
\[A < \tau_1 \quad \text{and} \quad 2A + \lambda \geq 2\tau_1\]In that region:
- the machine is myopically rejected in period 1
- but the two-period experimentation path is privately better
The witness with A = 5, tau1 = 6, lam = 4 gives:
- period-1 machine premium:
-1
- period-2 premium after one successful machine period:
3
- two-period machine-path premium:
2
So even when machines are better in structure, adoption can still require a trust-learning path rather than one-shot comparison.
Experiment 23: routing lock-in persistence law
The routing model becomes a two-period survival game.
Assume:
- class
Bis present at the start of period 1 - if class
Bfails in period 1, it cannot re-enter in period 2 - routing parameters stay fixed across periods
Compare:
- platform-favored routing, with disadvantaged-class margin:
M_B_platform = F - c * (a * n_A + n_B)
- protocol-equalized routing, with disadvantaged-class margin:
M_B_protocol = F - c * (n_A + n_B)
Under no re-entry:
platform persistent-B region iff M_B_platform >= 0
protocol persistent-B region iff M_B_protocol >= 0
Since:
M_B_platform >= 0 implies M_B_protocol >= 0
equalized routing weakly expands the two-period persistence region for class
B.
The strict witness at F = 10, a = 3, n_A = 2, n_B = 2, c = 2
gives:
M_B_platform = -6M_B_protocol = 2
So the period paths are:
- platform:
both -> A-only -> A-only
- protocol:
both -> both -> both
This is the first dynamic lock-in theorem in the routing model.
Experiment 24: incumbent-rent machine lockout law
Combine the trust-learning model with incumbent control rents.
Take a two-period adoption game with no re-entry.
Normalize the always-human path to 0.
Let:
Abe the machine’s per-period structural advantage over human control, excluding trust discounttau1 >= 0be the period-1 machine trust discountlam >= 0be the trust-learning gain after one successful machine periodrho >= 0be the incumbent controller’s per-period private rent loss if control shifts to the machine
If the machine is used successfully in period 1, period-2 trust becomes:
tau2 = tau1 - lam
Assume tau2 >= 0.
Then:
- social two-period machine premium:
2 * A - 2 * tau1 + lam
- private incumbent-controller machine premium:
2 * A - 2 * tau1 + lam - 2 * rho
The exact conditions are:
\[\text{social adopts} \iff 2A + \lambda \geq 2\tau_1\] \[\text{private incumbent adopts} \iff 2A + \lambda \geq 2\tau_1 + 2\rho\]So the strict lockout wedge is:
\[2A + \lambda \geq 2\tau_1 \quad \text{and} \quad 2A + \lambda < 2\tau_1 + 2\rho\]In that region:
- the machine path is socially better
- the incumbent still rejects it
- no re-entry turns that rejection into persistent human lock-in
The witness with A = 5, tau1 = 6, lam = 4, rho = 2 gives:
- social two-period machine premium:
2
- private incumbent-controller machine premium:
-2
- period-1 private machine premium:
-3
- period-2 private machine premium after success:
1
So even if machine control is genuinely better and becomes easier to trust after one good period, adoption can still fail when the relevant private chooser would lose control rents.
Experiment 25: assurance-package adoption law
Turn the lockout wedge into a software-design theorem.
Keep the same two-period no-reentry adoption game, then add one assurance package.
Let:
d >= 0be a per-period trust lift from audit, replay, or confinementg >= 0be extra period-2 trust learning from the same packagek >= 0be a one-time package cost
Then the package changes trust as follows:
tau1_pkg = tau1 - d
tau2_pkg = tau1 - d - lam - g
Assume tau1_pkg >= 0 and tau2_pkg >= 0.
The incumbent’s two-period packaged machine premium is:
2 * A - 2 * tau1 + lam + 2 * d + g - 2 * rho - k
So the exact package-adoption condition is:
\[2A + \lambda + 2d + g \geq 2\tau_1 + 2\rho + k\]If baseline adoption is blocked:
\[2A + \lambda < 2\tau_1 + 2\rho\]then the package flips rejection into adoption exactly when:
\[2d + g - k \geq (2\tau_1 + 2\rho) - (2A + \lambda)\]This is the exact assurance-surplus law.
The witness with A = 5, tau1 = 6, lam = 4, rho = 2, d = 1, g = 1,
k = 0 gives:
- baseline private machine premium:
-2
- packaged private machine premium:
1
- assurance surplus:
3
- baseline shortfall:
2
So the package closes the shortfall and flips private rejection into private adoption.
This is the cleanest current answer to the software-design question. If machine control is actually better but blocked, then the package has to:
- lower the initial trust penalty
- speed trust learning after one good deployment
- do so cheaply enough to close the exact adoption gap
Experiment 26: assurance-lever coefficient law
Separate the assurance package into distinct levers.
Keep the same two-period no-reentry adoption game.
Let:
d >= 0be trust lift from audit, replay, or confinementg >= 0be extra period-2 learning after one successful machine periodell >= 0be liability or rent offset that lowers the incumbent’s effective rent loss
Let package cost be linear:
k = c_d * d + c_g * g + c_ell * ell
Then the exact private-adoption condition becomes:
2 * A + lam + (2 - c_d) * d + (1 - c_g) * g + (2 - c_ell) * ell
>=
2 * tau1 + 2 * rho
So the private coefficients are:
- trust lift
d:2 - c_d
- extra learning
g:1 - c_g
- liability offset
ell:2 - c_ell
This is the coefficient law.
The equal-cost corollary is the important clean case.
If:
c_d = c_g = c_ell = 1
then the condition collapses to:
2 * A + lam + d + ell >= 2 * tau1 + 2 * rho
and delayed learning g drops out completely.
That means:
- one unit of trust lift buys more private adoption than one unit of delayed learning when costs are equal
- one unit of liability offset has the same raw private weight as one unit of trust lift
The witness at A = 5, tau1 = 6, lam = 4, rho = 2 starts from baseline
private premium -2.
Under equal unit costs:
- trust-lift-only package
d = 2,g = 0,ell = 0gives packaged premium0 - learning-only package
d = 0,g = 2,ell = 0gives packaged premium-2 - liability-only package
d = 0,g = 0,ell = 2gives packaged premium0
So predeployment trust lift and liability structure can flip adoption where delayed learning alone cannot.
Experiment 27: assurance-subsidy implementation law
Compare social versus private package choice directly.
Keep the same two-period no-reentry assurance model.
Let:
d >= 0be trust lift from the packageg >= 0be extra period-2 learningk >= 0be package costrho >= 0be incumbent rent loss from machine controls >= 0be an adoption subsidy paid only if the incumbent adopts the machine path with the package
Then:
- social packaged machine premium is:
P_social = 2 * A - 2 * tau1 + lam + 2 * d + g - k
- private packaged machine premium is:
P_private = 2 * A - 2 * tau1 + lam + 2 * d + g - k - 2 * rho
So the exact conditions are:
social package choice iff 2 * A + lam + 2 * d + g >= 2 * tau1 + k
private package choice iff 2 * A + lam + 2 * d + g >= 2 * tau1 + 2 * rho + k
The minimal implementing subsidy is therefore:
\[s^* = \max\!\big(0,\; 2\tau_1 + 2\rho + k - (2A + \lambda + 2d + g)\big)\]On the strict divergence wedge:
2 * tau1 + k <= 2 * A + lam + 2 * d + g < 2 * tau1 + 2 * rho + k
the package is socially preferred but privately rejected, and:
\[0 < s^* \leq 2\rho\]This is the exact implementation law.
The witness with A = 5, tau1 = 6, lam = 4, rho = 2, d = 1, g = 0,
k = 1 gives:
P_social = 3P_private = -1s_star = 1
So the package is socially worth deploying, privately blocked, and one unit of subsidy is exactly enough to implement it.