Tutorial 49

Neuro-symbolic Boolean algebras in Tau Language

A Tau Language enhancement experiment: qns8, a finite Boolean-algebra carrier for exact symbolic filtering after neural candidate generation.

Built 2026-06-29

This tutorial describes a small Tau Language experiment for neuro-symbolic programming: a model proposes candidates, and Tau performs an exact symbolic filter over a finite audited menu.

The experiment adds a feature-gated Boolean-algebra carrier:

qns8

qns8 is the powerset Boolean algebra over eight audited atoms, represented as an 8-bit mask. The same design could be widened to more atoms, but this demo stays at eight.

Scope

This is a community research experiment, not an official Tau Language feature. It adds one finite carrier (qns8) for exact symbolic filtering over an audited candidate universe. Neural scoring stays in the host program. The demo makes no correctness claim about atom extraction.

Reference notes for the experiment live in:

The public Tau-facing reproduction path is:

git clone https://github.com/TheDarkLightX/TauLang-Experiments.git
cd TauLang-Experiments
./scripts/run_public_demos.sh --accept-tau-license

That script downloads Tau Language from the official IDNI repository, applies the community research patches locally, builds Tau, and runs the safe-table, qelim, qNS, and EML/qNS demos.

This site repo also includes a local artifact-generation script:

scripts/run_qns_semantic_ba_demos.sh

Part I: the qNS split

The neuro-symbolic filtering equation is:

\[q_{\mathrm{NS}}(c \mid x) = \frac{ q_{\mathrm{N}}(c \mid x)\,\chi_{\mathrm{S}}(c,x) }{ \sum_{d\in C_x} q_{\mathrm{N}}(d \mid x)\,\chi_{\mathrm{S}}(d,x) }.\]

Standard reading. For candidate $c$ and input $x$, the neuro-symbolic probability of $c$ given $x$ is the neural probability of $c$ given $x$, multiplied by the symbolic indicator for $c$ at $x$, divided by the total neural mass of all candidates in $C_x$ whose symbolic indicator is true.

Plain English. The model ranks candidates. The symbolic layer deletes candidates that fail the rules. The host renormalizes the scores of the candidates that remain.

Trap. Tau is not being asked to produce the neural score. Tau checks the exact Boolean filtering condition. The host program handles numerical scoring and renormalization.

This is the central design boundary:

\[\text{neural proposal} \quad\longrightarrow\quad \text{finite symbolic filter} \quad\longrightarrow\quad \text{renormalized survivors}.\]
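The host-side half of that boundary can be sketched in a few lines of Python. This is a minimal illustration, not code from the experiment: the function name `qns_renormalize`, the list-of-eight-scores layout, and the convention that bit `i` of the mask stands for atom `i` are all assumptions made for the example.

```python
def qns_renormalize(neural_scores, survivor_mask):
    """Host-side q_NS: zero out filtered atoms, then renormalize.

    neural_scores: eight non-negative q_N scores, one per audited atom.
    survivor_mask: 8-bit mask; bit i set means atom i passed the symbolic filter.
    Both conventions are assumptions for this sketch.
    """
    if len(neural_scores) != 8:
        raise ValueError("expected one score per audited atom")
    # Multiply each score by the symbolic indicator chi_S (1 if the bit is set).
    kept = [s if (survivor_mask >> i) & 1 else 0.0
            for i, s in enumerate(neural_scores)]
    total = sum(kept)
    if total == 0.0:
        return None  # no survivor carries neural mass, so q_NS is undefined
    return [k / total for k in kept]


scores = [0.40, 0.25, 0.15, 0.10, 0.05, 0.03, 0.01, 0.01]
survivors = 0b00000110  # atoms 1 and 2 survive the filter
q_ns = qns_renormalize(scores, survivors)
```

Here the top-scored atom 0 is deleted by the filter, and the remaining mass is split between atoms 1 and 2 in proportion to their neural scores. The empty-survivor case is returned as `None` because the denominator of the qNS equation is zero there.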

Part II: the Boolean algebra

For a given input $x$, choose a finite audited candidate set $C_x$. In the rest of the tutorial, write $U$ for this audited universe, so $U=C_x$.

The carrier is:

\[B_x = \mathcal{P}(U).\]

Standard reading. The carrier symbol $B_x$ is defined to be the power set of $U$, meaning the set whose elements are all subsets of the audited universe $U$.

Plain English. Each Boolean-algebra value is a set of candidates.

The Boolean operations are:

\[0=\varnothing,\qquad 1=U,\qquad A\wedge B=A\cap B,\qquad A\vee B=A\cup B,\qquad A'=U\setminus A.\]

Standard reading. The bottom element is the empty subset of $U$. The top element is $U$ itself. The meet of $A$ and $B$ is their intersection. The join of $A$ and $B$ is their union. The prime of $A$ is the subset of $U$ containing exactly the elements of $U$ that are not in $A$.

Plain English. Meet keeps candidates in both sets. Join keeps candidates in either set. Prime keeps everything in the audited universe that is not in the set.

Trap. The prime is relative to the finite audited universe $U$. It is not a probability complement and not a model judgment.

In the Tau experiment, the subsets are stored as masks:

qns8  : 8 audited atoms

This is why the implementation is useful. A symbolic candidate filter becomes ordinary Boolean algebra over a finite audited menu.
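As a rough host-side picture of the same algebra, the operations above reduce to bitwise instructions on an 8-bit integer. The sketch below is plain Python, not Tau syntax, and the helper names `meet`, `join`, and `prime` are chosen only for this illustration. It assumes every mask is already in the range 0 to 255.

```python
TOP = 0xFF     # 1 = U: all eight audited atoms
BOTTOM = 0x00  # 0 = empty set


def meet(a, b):
    """A ∧ B: atoms present in both masks."""
    return a & b


def join(a, b):
    """A ∨ B: atoms present in either mask."""
    return a | b


def prime(a):
    """A': complement relative to the audited universe, as XOR with top."""
    return a ^ TOP


a = 0b00001111
b = 0b00111100

both = meet(a, b)      # 0b00001100
either = join(a, b)    # 0b00111111
outside_a = prime(a)   # 0b11110000
```

Because `prime` is taken against `TOP` rather than against an unbounded integer complement, the result always stays inside the eight audited atoms.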

Part III: the candidate filter

The demo uses five regions:

Symbol	Meaning
$U$	audited universe
$N$	candidates proposed by the neural layer
$A$	candidates allowed by symbolic rules
$R$	candidates requiring human review
$H$	candidates hard-rejected by symbolic rules

First define the proposed region:

\[P := U\wedge N.\]

Standard reading. The proposed-region symbol $P$ is defined as the meet of $U$ and $N$, so an element belongs to $P$ exactly when it belongs both to the audited universe and to the neural-proposal set.

Plain English. Only candidates inside the audited universe can be treated as proposed.

Then define the eligible region:

\[E := U\wedge N\wedge A\wedge H'.\]

Standard reading. The eligible-region symbol $E$ is defined as the meet of $U$, $N$, $A$, and $H'$. Therefore an element belongs to $E$ exactly when it is audited, proposed, allowed, and outside the hard-reject set.

Plain English. A candidate is eligible exactly when it is audited, proposed, allowed, and not hard-rejected.

Now split the proposed region into three cases:

\[\mathrm{Auto} := E\wedge R', \qquad \mathrm{Review} := E\wedge R, \qquad \mathrm{Reject} := P\wedge(A'\vee H).\]

Standard reading. The auto-accept region is the part of $E$ outside the review set. The review region is the part of $E$ inside the review set. The reject region is the part of the proposed region $P$ whose candidates are either outside the allowed set or inside the hard-reject set.

Plain English. Eligible candidates either auto-accept or go to review. Proposed candidates that are disallowed or hard-rejected go to symbolic rejection.

Trap. This is not a soft preference ranking. These are exact set partitions over the audited candidate universe.
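The definitions of $P$, $E$, Auto, Review, and Reject can be mirrored on the host as a reference model. The following Python sketch is an illustration under assumptions: the function name `route_masks`, the dictionary keys, and the example mask values are invented for this example and are not taken from the demo artifacts.

```python
TOP = 0xFF  # eight audited atoms


def route_masks(u, n, a, r, h):
    """Compute the proposed, eligible, and three route regions as 8-bit masks."""
    p = u & n                          # P := U ∧ N
    e = u & n & a & (h ^ TOP)          # E := U ∧ N ∧ A ∧ H'
    return {
        "P": p,
        "E": e,
        "Auto": e & (r ^ TOP),         # E ∧ R'
        "Review": e & r,               # E ∧ R
        "Reject": p & ((a ^ TOP) | h), # P ∧ (A' ∨ H)
    }


# Hypothetical input masks, bit i = atom i.
masks = route_masks(
    u=0b11111111,  # every atom is audited
    n=0b00111111,  # atoms 0-5 proposed
    a=0b00011111,  # atoms 0-4 allowed
    r=0b00001100,  # atoms 2 and 3 need review
    h=0b00010001,  # atoms 0 and 4 hard-rejected
)
```

In this example atom 1 is auto-accepted, atoms 2 and 3 go to review, and atoms 0, 4, and 5 are rejected: atoms 0 and 4 because they are hard-rejected, atom 5 because it is not allowed.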

Part IV: the checked laws

The first checked law is the no-leak law:

\[\mathrm{Auto}\wedge H = 0.\]

Standard reading. The meet of the auto-accept region and the hard-reject region is the bottom element, so no candidate belongs to both regions.

Plain English. Nothing that is hard-rejected can also be auto-accepted.

The second checked law is the partition law:

\[\mathrm{Auto}\vee\mathrm{Review}\vee\mathrm{Reject}=P.\]

Standard reading. The join of the auto-accept region, the review region, and the reject region is equal to the proposed region $P$.

Plain English. Every proposed candidate lands somewhere on the supported decision surface: auto-accept, human review, or symbolic rejection.

The proof packet also checks that these regions do not overlap:

\[\mathrm{Auto}\wedge\mathrm{Review}=0,\qquad \mathrm{Auto}\wedge\mathrm{Reject}=0,\qquad \mathrm{Review}\wedge\mathrm{Reject}=0.\]

Standard reading. Each pair among the auto-accept, review, and reject regions has bottom meet, so the three regions are pairwise disjoint.

Plain English. No candidate is classified into two decision regions at the same time.

Trap. The law is about the proposed region $P$, not about all possible actions in the world. The finite carrier only knows the audited atom set it was given.
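These laws can also be sanity-checked by brute force on the host. The sketch below is independent of the Lean proof packet and does not reproduce it. It relies on one observation: bitwise operations act on each bit separately, so checking all 32 membership combinations for a single atom covers every atom of an 8-bit mask. The function name `count_law_failures` is an assumption for this example.

```python
from itertools import product


def count_law_failures():
    """Check no-leak, partition, and disjointness for one atom, all 32 cases."""
    failures = 0
    for u, n, a, r, h in product((0, 1), repeat=5):
        p = u & n
        e = p & a & (h ^ 1)        # prime on a one-atom carrier is XOR with 1
        auto = e & (r ^ 1)
        review = e & r
        reject = p & ((a ^ 1) | h)
        ok = (
            (auto & h) == 0                        # no-leak law
            and (auto | review | reject) == p      # partition law
            and (auto & review) == 0               # pairwise disjointness
            and (auto & reject) == 0
            and (review & reject) == 0
        )
        if not ok:
            failures += 1
    return failures


law_failures = count_law_failures()
```

A zero count here is a quick regression signal for a host reference model. It is a weaker kind of evidence than a machine-checked proof.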

Part V: what Tau runs

The reproduction command is:

./scripts/run_qns_semantic_ba_demos.sh

The script downloads the official Tau Language repository, applies the local research patches, builds Tau, and runs the qNS demo.

The demo checks:

native qns8 meet and join,
prime-as-XOR-with-top behavior,
exact symbolic filtering for candidate masks,
concept-set filtering for controlled audited labels,
bounded trace-class filtering,
rejection of qns8 syntax when TAU_ENABLE_QNS_BA=1 is absent.

The current result has:

ok: true
mismatch_count: 0

That is a runnable evidence claim, not a full semantic claim about arbitrary natural language.

Part VI: why this is not `nlang`

Upstream Tau’s nlang carrier is a natural-language concept carrier. It composes strings such as:

(A) and (B)
(A) or (B)
not (A)

The semantic question for nlang is delegated to an external oracle.

The qNS finite carrier is different:

Carrier	Stored value	Semantic discipline
`nlang`	natural-language concept string	oracle-backed interpretation
`qns8`	finite audited atom mask	exact finite powerset semantics

So nlang is better for exploratory natural-language interfaces. The qNS carrier is better when the candidate menu has already been audited and the filtering step must be exact.

Part VII: what this gives Tau

This experiment gives Tau a concrete role in a neuro-symbolic loop:

Model proposes.
Tau filters.
Host renormalizes.
Proof artifact checks the finite set laws.

The proof artifact is small but useful. It proves the no-leak and partition laws at the finite powerset level:

auto_accept_no_hard_reject:
  Auto ∧ H = 0

partition_eq_proposed:
  Auto ∨ Review ∨ Reject = P

The proof packet is checked in the Lean file:

experiments/neuro_symbolic_math_v001/Proofs.lean

The implementation artifact shows that Tau can run the exact symbolic carrier natively, under a feature flag, without claiming that the neural model itself is formally verified.

That is the point of this carrier: not to replace the neural model, and not to replace upstream nlang, but to give the neuro-symbolic loop one exact Boolean-algebraic checkpoint.

Part VIII: practical use cases

The practical gain is that Tau can now check a finite audited candidate menu natively. That enables workflows that were awkward before this carrier.

Use case	What the model proposes	What Tau checks
Agent tool-call gating	candidate tool calls	allow, deny, review, and hard-reject masks
DeFi risk triage	collateral or liquidation actions	symbolic risk atoms before execution
Governance routing	proposal labels	exact admit, reject, or human-review regions
Protocol trace triage	bounded trace classes	safe, forbidden, and unclassified behavior
Explanation menus	possible reasons	surviving and rejected reason atoms
Proof-task routing	candidate proof obligations	which obligations are allowed into an automated prover

For example, an agent can propose eight possible actions with soft scores. Tau can then compute:

\[\mathrm{Survivors} = \mathrm{Proposed}\wedge\mathrm{Allowed}\wedge\mathrm{HardReject}'.\]

Standard reading. The survivors region is defined as the meet of the proposed region, the allowed region, and the prime of the hard-reject region.

Plain English. Keep only candidates that were proposed, allowed, and not hard-rejected.

The host can then renormalize the neural scores over $\mathrm{Survivors}$. This is the concrete new ability:

soft model output becomes an audited finite decision surface.

Before this experiment, the host could still perform that filtering in ordinary application code. The difference is that the symbolic checkpoint now lives in Tau’s Boolean-algebraic world, where it can be composed with other Tau specifications and audited with the same proof discipline as the table and qelim demos.

Part IX: reason-coded routing

The next derived artifact is:

assets/data/qns_reason_manifest.json

It is generated from the Tau-checked trace artifact:

python3 scripts/generate_qns_reason_manifest.py

The manifest does not change the Boolean algebra. It compiles the existing mask outputs into per-atom explanations. For one action atom, a row has this shape:

{
  "name": "tax_extractor",
  "route": "symbolic_reject",
  "reasons": [
    "hard reject mask contains candidate",
    "symbolic allow mask does not contain candidate"
  ]
}

The exact partition check is:

\[\mathrm{Auto}\vee\mathrm{Review}\vee\mathrm{Reject}=P.\]

Standard reading. The join of the auto-accept region, the human-review region, and the symbolic reject region is equal to the proposed region $P$.

Plain English. Every proposed candidate gets a supported decision route, and the pairwise-disjointness checks from Part IV make that route unique.

The reason manifest checks this in the bounded demo:

candidate_proposed_partition_failures = 0
candidate_universe_partition_failures = 0
unsafe_leak_failures = 0

Standard reading. The generated reason manifest has zero candidate proposed-partition failures, zero candidate universe-partition failures, and zero unsafe-leak failures.

Plain English. The explanation rows did not drift away from the Tau mask outputs.

Trap. The reasons are not free-form LLM explanations. They are deterministic labels computed from the same masks Tau already checked.

This is the more practical interface:

mask result
  -> per-atom route
  -> per-atom reason
  -> qNS survivor probability when applicable
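The mask-to-reason step can be sketched as a deterministic function of the checked masks. In the Python below, the route name `symbolic_reject` and the two rejection reason strings come from the row shown above. Everything else is an assumption for illustration: the helper names, the other route names, the other reason strings, and the bit position given to `tax_extractor`. The `proposed` argument is assumed to be already confined to the audited universe.

```python
def in_mask(mask, bit):
    """True when the atom at position `bit` is a member of `mask`."""
    return bool((mask >> bit) & 1)


def reason_row(name, bit, proposed, allowed, review, hard_reject):
    """Render one atom's route and reasons from already-checked masks."""
    if not in_mask(proposed, bit):
        return {"name": name, "route": "not_proposed",
                "reasons": ["proposed mask does not contain candidate"]}
    reasons = []
    if in_mask(hard_reject, bit):
        reasons.append("hard reject mask contains candidate")
    if not in_mask(allowed, bit):
        reasons.append("symbolic allow mask does not contain candidate")
    if reasons:
        return {"name": name, "route": "symbolic_reject", "reasons": reasons}
    if in_mask(review, bit):
        return {"name": name, "route": "human_review",
                "reasons": ["review mask contains candidate"]}
    return {"name": name, "route": "auto_accept",
            "reasons": ["allowed, not hard-rejected, no review required"]}


# Hypothetical placement: tax_extractor at bit 3, proposed and hard-rejected.
row = reason_row("tax_extractor", 3,
                 proposed=0b00001000, allowed=0b00000000,
                 review=0b00000000, hard_reject=0b00001000)
```

Since the function reads only the masks, two runs over the same Tau outputs produce the same rows, which is what keeps the explanations from drifting.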

The current manifest covers:

$24$ action-candidate entries,
$24$ controlled-concept entries,
$24$ bounded trace-class entries.

Boundary. Reason-coded routing still does not prove that the external model chose the right candidates, extracted the right concepts, or recognized the right trace classes. It proves that once those finite atom masks are supplied, Tau’s exact Boolean outputs can be rendered as an auditable route-and-reason manifest.

Part X: a multi-feature Tau demo

The next demo file is:

examples/tau/qns_multifeature_decision_surface_v1.tau

The generated trace artifact is:

assets/data/qns_multifeature_demo_traces.json

It is produced by:

python3 scripts/generate_qns_multifeature_demo_artifacts.py

This demo adds a toy micro-proposer outside Tau. It maps prompt keywords to a finite proposed-candidate mask. A real LLM could replace that toy proposer, provided it emits the same finite mask interface.

The Tau side then checks:

proposed mask
  -> auto/review/reject masks
  -> unsafe-leak check
  -> pointwise revision-style policy memory update

The revision expression is:

\[\operatorname{Rev}(O,G,A) = (G\wedge A)\vee(G'\wedge O).\]

Standard reading. The revision expression denotes the value obtained by taking $A$ on the part selected by guard $G$, taking the old value $O$ on the part outside $G$, and joining those two disjoint contributions.

Plain English. Inside the guard, use the replacement value. Outside the guard, preserve the old value.
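On 8-bit masks the revision expression is a bitwise select. The following Python sketch is a host-side illustration, with the names `rev` and `rev_is_idempotent` and the example values chosen for this example rather than taken from the demo.

```python
TOP = 0xFF  # eight audited atoms


def rev(old, guard, replacement):
    """Rev(O, G, A) = (G ∧ A) ∨ (G' ∧ O) on 8-bit masks."""
    return (guard & replacement) | ((guard ^ TOP) & old)


def rev_is_idempotent(old, guard, replacement):
    """Applying the same revision pass twice gives the same memory."""
    once = rev(old, guard, replacement)
    return rev(once, guard, replacement) == once


old = 0b10100101
guard = 0b00001111        # revise only the low four atoms
replacement = 0b00000110
updated = rev(old, guard, replacement)  # 0b10100110

# Spot-check idempotence on a coarse grid of masks.
grid = range(0, 256, 5)
idempotent_on_grid = all(
    rev_is_idempotent(o, g, a) for o in grid for g in grid for a in grid
)
```

The high four bits of `updated` come from `old`, and the low four bits come from `replacement`. A second pass with the same guard and replacement changes nothing, because the guarded part already equals the replacement and the unguarded part is preserved.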

The multi-feature artifact checks:

scenario_count = 3
tau_mismatch_count = 0
partition_failure_count = 0
unsafe_leak_failure_count = 0
revision_idempotence_failure_count = 0

Standard reading. Across the three bounded scenarios, Tau’s qNS outputs match the host reference outputs, the partition check has no failures, the unsafe leak check has no failures, and applying the same revision pass twice has no idempotence failure.

Plain English. The demo runs through actual Tau qNS expressions and produces the same result as the host-side reference model.

There are two execution shapes:

Shape	Meaning	Status
Fast staged	Check route masks first, then feed checked masks into revision.	Recommended path.
Slow monolithic	Send the fully expanded revision expression to Tau in one expression.	Performance-boundary demo.

The slow lane is intentionally included. In the current artifact, the fully expanded revision expression exceeds the bounded timeout, while the staged lane passes.

Trap. The timeout is not a semantic failure. It shows why the qNS pipeline should compile into staged masks instead of blindly expanding every formula into one large expression.

This is where qNS can be more useful than raw nlang for formal workflows:

nlang: flexible text carrier, oracle interprets meaning
qNS: finite audited atoms, Tau computes exact routes, reasons, and revisions

The two designs serve different purposes. nlang is broader as an interface. qNS is stronger as an auditable execution checkpoint.

Part XI: an audited ontology compiler

The next qNS artifact is:

assets/data/qns_ontology_compiler_traces.json

It is generated by:

python3 scripts/compile_qns_ontology_masks.py

The compiler takes bounded policy text and maps audited phrases to qNS atom masks. It also quarantines two kinds of unsafe input: ambiguous phrases and unknown terms.

The compiler surface is:

\[C(t)= \bigl( M_{\mathrm{observed}}(t), M_{\mathrm{ambiguous}}(t), M_{\mathrm{exact}}(t), M_{\mathrm{review}}(t), Q_{\mathrm{unknown}}(t) \bigr).\]

Standard reading. The compiler output for text $t$ is the ordered tuple whose entries are the observed mask of $t$, the ambiguous mask of $t$, the exact mask of $t$, the review mask of $t$, and the unknown-term set of $t$.

Plain English. The compiler turns audited phrases into exact qNS bits, and it keeps ambiguity or unknown language out of the exact route.
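A reduced sketch of that idea is shown below. It covers only three parts of the tuple (the exact mask, the ambiguous quarantine, and the unknown quarantine) and is not the logic of `compile_qns_ontology_masks.py`. The atom names are the ones that appear in this tutorial's example reports, but their bit positions, the phrase table, the ambiguous-term list, and the normalization rule are all hypothetical. The input is assumed to be a list of pre-split phrases rather than free text.

```python
# Hypothetical bit layout: atom i gets bit i.
ATOM_BITS = {name: 1 << i for i, name in enumerate([
    "registry_verified", "liquidity_deep", "token_old_enough",
    "provenance_clean", "governance_separated", "oracle_stable",
    "sanction_risk", "human_review_required",
])}

# Hypothetical governed phrase table: normalized phrase -> atom.
EXACT_PHRASES = {
    "registry verified": "registry_verified",
    "deep liquidity": "liquidity_deep",
    "oracle stable": "oracle_stable",
}

# Hypothetical terms that are recognized but too vague for the exact route.
AMBIGUOUS_TERMS = {"risk", "review"}


def compile_phrases(phrases):
    """Return (exact_mask, ambiguous_terms, unknown_terms) for a phrase list."""
    exact_mask = 0
    ambiguous, unknown = set(), set()
    for raw in phrases:
        phrase = " ".join(raw.lower().split())  # lowercase, collapse spaces
        if phrase in EXACT_PHRASES:
            exact_mask |= ATOM_BITS[EXACT_PHRASES[phrase]]
        elif phrase in AMBIGUOUS_TERMS:
            ambiguous.add(phrase)   # quarantined, contributes no exact bit
        else:
            unknown.add(phrase)     # quarantined, contributes no exact bit
    return exact_mask, sorted(ambiguous), sorted(unknown)


exact_mask, ambiguous_terms, unknown_terms = compile_phrases(
    ["Registry  Verified", "risk", "vibe", "oracle stable"]
)
```

Only the two governed phrases set bits. The ambiguous and unknown terms are reported separately, so a downstream consumer can route them to review without them ever influencing the exact mask.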

The clean case has no quarantine:

clean_collateral_report:
  observed = registry_verified, liquidity_deep, token_old_enough,
             provenance_clean, governance_separated, oracle_stable
  ambiguous = none
  unknown = none

The ambiguous case is not silently accepted:

ambiguous_risk_report:
  ambiguous = review, risk
  review atoms = sanction_risk, human_review_required
  unknown = none

The unknown case is quarantined:

unknown_term_report:
  unknown = momentum, quantum, sentiment, vibe

The checked summary is:

case_count = 3
ambiguous_case_count = 1
unknown_quarantine_count = 1
total_unknown_terms = 4
exact_mask_nonzero_count = 3

Standard reading. The generated artifact contains three cases, exactly one case with a nonzero ambiguous mask, exactly one case with unknown-term quarantine, four total unknown terms, and three cases with a nonzero exact mask.

Plain English. The compiler separates the intended clean, ambiguous, and unknown examples.

Trap. This is not the same object as upstream nlang. nlang is a broad natural-language concept carrier whose meaning is interpreted by an external oracle. This qNS compiler is narrower: it turns a governed phrase table into exact finite masks and sends ambiguity to review before Tau reasoning.

This is why the qNS lane can be stronger than nlang for audited workflows:

text phrase
  -> governed ontology match
  -> qNS mask
  -> Tau route/reason/revision expression

The gain is not broader language coverage. The gain is that a downstream Tau spec can receive a finite, checked mask instead of an unconstrained sentence.

Part XII: ontology masks running through Tau

The ontology compiler is useful only if the compiled masks can enter Tau. The bridge artifact is:

assets/data/qns_ontology_tau_bridge_traces.json

It is generated by:

python3 scripts/generate_qns_ontology_tau_bridge_artifacts.py \
  --tau-bin external/tau-lang-qns-ba/build-Release/tau

The bridge sends the compiled qNS masks into Tau expressions for required-atom coverage, blocker detection, and pointwise revision-style memory update.

The blocker surface is:

\[B(t) = M_{\mathrm{missing}}(t) \vee M_{\mathrm{risk}}(t) \vee M_{\mathrm{review}}(t).\]

Standard reading. The blocker mask for text $t$ denotes the join of three masks: the required atoms that are missing, the risk atoms that are present, and the atoms that require review.

Plain English. Anything missing, risky, ambiguous, or unknown becomes a blocker mask before the decision proceeds.
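On the host, a reference version of the blocker surface might look like the sketch below. It reads "required atoms that are missing" as `required ∧ observed'` and "risk atoms that are present" as `risk_atoms ∧ observed`. That reading, the function name `blocker_mask`, and the example bit assignments are assumptions for illustration.

```python
TOP = 0xFF  # eight audited atoms


def blocker_mask(required, risk_atoms, observed, review):
    """B = M_missing ∨ M_risk ∨ M_review on 8-bit masks."""
    missing = required & (observed ^ TOP)  # required but not observed
    risk = risk_atoms & observed           # risk atoms that were observed
    return missing | risk | review


REQUIRED = 0b00111111    # hypothetical: atoms 0-5 are required
RISK_ATOMS = 0b01000000  # hypothetical: atom 6 is a risk atom

# Clean report: every required atom observed, no risk, nothing in review.
clean = blocker_mask(REQUIRED, RISK_ATOMS,
                     observed=0b00111111, review=0b00000000)

# Non-clean report: atoms 4 and 5 missing, atom 6 observed, atom 7 in review.
blocked = blocker_mask(REQUIRED, RISK_ATOMS,
                       observed=0b01001111, review=0b10000000)
```

A zero blocker mask is the only case in which the decision proceeds without intervention. Any nonzero bit names the atom that blocked it.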

The memory update reuses the same pointwise revision law:

\[\operatorname{Rev}(O,B,B)=(B\wedge B)\vee(B'\wedge O).\]

Standard reading. The memory-revision expression revises old memory $O$ by guard $B$ using replacement value $B$ itself: inside $B$ the result records $B$, and outside $B$ the result preserves $O$.

Plain English. Record the blocker region, and preserve the old memory outside that region.
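Because the guard and the replacement are the same mask, the expression simplifies. The short derivation below uses only idempotence of meet, distributivity, and the complement law:

\[\operatorname{Rev}(O,B,B) = (B\wedge B)\vee(B'\wedge O) = B\vee(B'\wedge O) = (B\vee B')\wedge(B\vee O) = 1\wedge(B\vee O) = B\vee O.\]

Standard reading. Revising $O$ by guard $B$ with replacement $B$ equals the join of $B$ and $O$.

Plain English. The update can only add blocker atoms to memory. It never clears an atom that was already recorded.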

The checked bridge summary is:

case_count = 3
tau_mismatch_count = 0
clean_blocker_failure_count = 0
nonclean_blocker_failure_count = 0
revision_idempotence_failure_count = 0

Standard reading. The generated artifact contains three cases, zero Tau-versus-host mismatches, zero failures of the clean no-blocker check, zero failures of the non-clean blocker check, and zero failures of revision idempotence.

Plain English. The clean report stays clear, the ambiguous and unknown reports produce blockers, and Tau agrees with the host reference on every checked mask.

Trap. The bridge uses staged masks. It does not expand the whole ontology, blocker, and revision computation into one giant expression. The earlier slow lane shows why staged qNS compilation matters.

Part XIII: certificate-carrying proposer output

The strongest qNS interface so far is not a bare text string. It is a small certificate object:

span, atom, confidence, reason

The generated artifact is:

assets/data/qns_certificate_proposer_traces.json

It is produced by:

python3 scripts/generate_qns_certificate_proposer_artifacts.py \
  --tau-bin external/tau-lang-qns-ba/build-Release/tau

The acceptance rule is:

\[\operatorname{Accept}(s,a) \Longleftrightarrow a\in A \wedge \operatorname{norm}(s)\in P_a.\]

Standard reading. The acceptance predicate holds of span $s$ and atom $a$ exactly when $a$ is an atom in the audited atom set $A$ and the normalized span $s$ is a member of the audited phrase set $P_a$ for atom $a$.

Plain English. A proposed claim is accepted only when it names a known atom and its evidence span is one of that atom’s governed phrases.

The accepted mask is:

\[M_{\mathrm{accepted}}(C) = \bigvee_{\operatorname{Accept}(s,a)} m(a).\]

Standard reading. The accepted-mask expression for certificate $C$ is the join of the atom mask $m(a)$ over all claims in $C$ whose span and atom satisfy the acceptance predicate.

Plain English. Only accepted certificate claims contribute bits to the qNS mask.
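The acceptance rule and the accepted mask can be sketched as a small deterministic validator. The claim fields `span`, `atom`, `confidence`, and `reason` follow the certificate shape described above. The atom masks, the governed phrase sets, the normalization rule inside `norm`, and the example claims are hypothetical and are not taken from the generated artifact.

```python
# Hypothetical audited atoms and their single-bit masks m(a).
ATOM_MASKS = {"registry_verified": 0b00000001, "oracle_stable": 0b00100000}

# Hypothetical governed phrase sets P_a, stored in normalized form.
GOVERNED_PHRASES = {
    "registry_verified": {"registry verified"},
    "oracle_stable": {"oracle stable", "stable oracle"},
}


def norm(span):
    """Assumed normalization: lowercase and collapse whitespace."""
    return " ".join(span.lower().split())


def accept(span, atom):
    """Accept(s, a) holds iff a is audited and norm(s) is in P_a."""
    return atom in ATOM_MASKS and norm(span) in GOVERNED_PHRASES.get(atom, set())


def accepted_mask(claims):
    """Join m(a) over accepted claims; confidence is never consulted."""
    mask = 0
    for claim in claims:
        if accept(claim["span"], claim["atom"]):
            mask |= ATOM_MASKS[claim["atom"]]
    return mask


claims = [
    {"span": "Registry Verified", "atom": "registry_verified",
     "confidence": 0.70, "reason": "registry lookup"},
    {"span": "great vibes", "atom": "market_vibe",          # unknown atom
     "confidence": 0.99, "reason": "model intuition"},
    {"span": "probably stable", "atom": "oracle_stable",    # unsupported span
     "confidence": 0.95, "reason": "model intuition"},
]
mask = accepted_mask(claims)
```

Only the first claim sets a bit. The other two carry higher confidence but fail the acceptance predicate, so they contribute nothing to the mask that Tau receives.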

The checked summary is:

certificate_count = 3
total_claim_count = 12
accepted_claim_count = 8
rejected_claim_count = 4
ambiguous_claim_count = 2
unknown_atom_claim_count = 1
unsupported_span_claim_count = 1
tau_mismatch_count = 0

Standard reading. The artifact contains three certificate objects, twelve total claims, eight accepted claims, four rejected claims, two rejected ambiguous claims, one rejected unknown-atom claim, one rejected unsupported-span claim, and zero Tau-versus-host mismatches.

Plain English. The external proposer can attach confidence and reasons, but Tau only receives the finite mask produced by the deterministic certificate validator.

Trap. The confidence field is recorded evidence, not authority. A high-confidence claim with an unknown atom or unsupported span is still rejected before Tau reasoning.
