Tutorial 3

Tau Language (invariants first, then execution)

Learn to read and write small executable stream specifications in Tau, starting from invariants and the common stepwise fragment.

View source Built 2026-04-01

This tutorial is about a shift in how to think about writing programs. Once the idea clicks, Tau starts to feel much less alien.

Tau is not imperative code with odd punctuation. It is a declarative language for typed input and output streams.

Instead of giving a step-by-step recipe (“do this, then do that”), a Tau spec says how current and earlier stream values may relate over time. If the spec is satisfiable in Tau’s execution sense, Tau can run it step by step: read the current inputs, solve for the current outputs without peeking at future inputs, advance time, repeat.

One careful point matters from the start. A Tau specification usually denotes a set of possible programs, not just one program. When that set is non-empty, Tau executes one deterministic representative from it.

That can sound a bit slippery on first contact, so this tutorial starts with a concrete habit: before writing any code, write a list of sentences that are meant to stay true forever. Those sentences are your invariants. Everything else, such as syntax, types, and stream declarations, is just scaffolding to make the invariants executable.

Mental pictures to keep

A timeline of streams: inputs and outputs indexed by time, like frames in a film
An invariant as a rail: the system can move, but not off the rails
A solver as a witness generator: it finds concrete current outputs that keep the constraints true
Three lenses for the same system: state machine, recurrence relation, logic specification

All examples in this tutorial are runnable from files under examples/tau/. Each file is a self-contained transcript that runs end-to-end without manual typing.

Scope of this tutorial

This page teaches the common beginner fragment: typed input and output streams, initial conditions like o1[0] = ..., and bounded-lookback constraints such as o1[t] = ... or o1[t] = f(o1[t-1], i1[t]). Full Tau also has explicit temporal operators such as always ([]) and sometimes (<>), quantifiers, and the special tau type for talking about Tau specs as values.

Part I: inside your head (invariants before syntax)

Imagine designing a turnstile at a subway entrance.

You do not start by writing code. You start by stating what must always be true:

The turnstile is always in one of two states: Locked or Unlocked.
Inserting a coin when locked unlocks it.
Pushing through when unlocked locks it again.
Pushing through when locked triggers an alarm.
No alarm when unlocked.

These are your invariants. They do most of the real design work before a keyboard even enters the story. They say what the state is, what inputs exist, and what must never be violated.

This is the habit that makes specifications readable: invariants first, then syntax.

A tiny turnstile state machine

The same system can be viewed as a state machine (states and arrows) or as logical constraints relating current and next values. Both views are correct; they emphasize different things.

Why start with invariants?

When the starting point is syntax, it is easy to get trapped in details: “What type is this? How is that declared? Why is the compiler unhappy already?”

When the starting point is invariants, orientation comes first. The invariants are the spec. Everything else exists to make them checkable.

Part II: reading Tau (the practical version)

Tau is easiest to read if it is pictured as a row of time-stamped boxes. That picture may feel almost too simple, but it really does carry a lot of the tutorial.

A stream is just a sequence of values indexed by time. Think of it like frames in a film, numbered 0, 1, 2, 3, …

Input streams (i1, i2, …) receive values from the outside world.
Output streams (o1, o2, …) produce values computed by the spec.

At each time step t, the spec relates inputs and outputs. In Tau, t is not “the current loop counter.” It is a placeholder that means “for every step like this one.”

t is not a loop variable

In imperative code, a loop counter t advances as the program runs. In Tau, t is a variable that ranges over time steps all at once. A line like o1[t] = o1[t-1] + i1[t] is a relationship that must hold at every step, not an instruction executed at one step.

Also, a Tau spec with no explicit always or sometimes is implicitly read as always. That is why a bare line like o1[t] = ... is understood as a standing rule.

Time-compatible means no retroactive cheating

The theory behind Tau uses the idea of a time-compatible (or prefix-preserving) behavior. If two input histories are the same up to step n, the produced output histories must also be the same up to step n. New future inputs are not allowed to reach back and rewrite earlier outputs. That is the clean formal version of "no time travel tricks."

Declaring streams

i1 : bv[8] := in console   # input stream, 8-bit values
o1 : bv[8] := out console  # output stream, 8-bit values

Types used in these examples

bv[8]: an 8-bit bitvector (values 0 to 255). Arithmetic wraps around; 255 + 1 = 0.
sbf: a Boolean-algebra type that we use here like a 0/1 flag for "valid?", "alarm?", and "solved?" signals.
tau: a Tau specification treated as a value. It matters in full Tau, but this page sticks to bv[8] and sbf examples.

Why Boolean algebra keeps showing up

Tau is not built around plain yes-or-no booleans alone. Its stream values live in typed Boolean-algebra settings. In one example that means bitvectors and bitwise operations. In another it means symbolic Boolean-function values. That shared algebraic backbone is why the language can reuse the same basic operators across several kinds of objects.

Writing constraints

In the fragment used on this page, a top-level Tau spec is a formula. The value side is a term, for example o1[t-1] + i1[t]. The full statement is a formula, for example o1[t] = o1[t-1] + i1[t].

A common beginner pattern is a formula that relates values at the current time step, and sometimes earlier steps too:

o1[t] = o1[t-1] + i1[t]

Plain English: “take the old value, add the new input, and that gives the new value.” That is just a running sum. This is a good example of a Tau line that looks more forbidding than it really is.

Time indices as a sliding stencil

A term like t-1 means "one step earlier." A constraint with t is like a stencil that is applied at every time step.

How to read `t-1` and `t-2` (unroll it)

The safest way to read Tau is to stop being abstract for a moment and plug in actual numbers. This is worth doing often. Tau gets friendlier very quickly once the symbols are forced to commit to one concrete step.

If a spec contains:

o1[t] = o1[t-1] + i1[t]

then the first few instances look like:

o1[1] = o1[0] + i1[1]
o1[2] = o1[1] + i1[2]
o1[3] = o1[2] + i1[3]

That is all t-1 means: “the previous frame.” Similarly, t-2 means “two frames back.” If a constraint uses t-2, unrolling it will show dependencies that skip one step.

Most runnable examples on this page have bounded lookback. That just means the rule for step t only looks at a fixed finite window, such as t and t-1, or t, t-1, and t-2. This is the recurrence-shaped fragment that feels most like “state update,” even though Tau itself is phrased in streams rather than mutable variables.

If the state-machine picture is easier to hold in mind, read the indices like this:

o1[t-1]: the state before the transition
i1[t]: the event/input at this step (the arrow label)
o1[t]: the state after the transition

Tau writes the transition rule as an equation over these time-indexed values.

A small but important detail: t-1

When a constraint mentions t-1, it only makes sense starting at t = 1. That is why examples also include initial conditions like o1[0] = ... to define the base case.

Fixed indices vs offsets

o1[0] means "the value at time 0" (a fixed index).
o1[t] means "the value at time t" (a universally-scoped time variable).
o1[t-1] means "one step earlier than t" (a relative offset).

Writing t-0 is unnecessary, because it is the same as t. In specs, the meaningful contrast is usually between [0] (an anchor) and [t-1] (a delay).

Why "state" often shows up as an output stream

Tau allows current outputs to depend on earlier outputs. So memory is already built into the language. In beginner examples like the turnstile, the running count, or the toggle board, the thing called "state" is usually just an output stream whose previous value is read at the next step.

The mindset shift

This is the biggest mindset shift in the page:

An imperative program computes outputs by executing instructions step by step.
A Tau specification constrains outputs by stating relationships that must hold.

During execution, Tau receives the current inputs and solves for the current outputs in a way that keeps the spec true now and still compatible with continuing later. One good mental model is “fill in the blanks for this step, but only in ways that keep the run extendable.” It is less like writing instructions for a machine to follow, and more like laying down rails and asking for the next legal move.

Tau satisfiability is stronger than one-shot satisfiability

Roughly, Tau is not asking only "is there one trace that works?" A satisfiable Tau spec must admit indefinite execution for every input stream, with outputs chosen step by step, without dependence on future inputs, and in a way that stays time-compatible as the run grows.

Scope check: what Tau does and does not do

Tau does: enforce relationships between typed streams, and solve current outputs from the current history
Tau does not: parse messy inputs, own the environment loop, or replace the host system that feeds outputs back into future inputs
Practical split: host code handles orchestration, Tau handles the logic kernel

Two operator families (a common source of mistakes)

& | ' ^ are term operators (bit-level operations on values)
&& || ! <-> are formula operators (logical connectives combining constraints)

A comparison like x = y is a formula, so it should be combined with &&, not &.

In these examples, bit-flips are written with XOR against a mask (for example, x ^ { 1 }:bv[8]), because it makes the intended bit-level change explicit.

A practical note: charvar

Tau restricts variable names by default. To use longer names like delta_is_valid, every example starts with set charvar off.

Part III: the same system in three lenses

The turnstile can be described in three equivalent ways. Nothing about the system changes. Only the notation changes.

Lens 1: state machine (the picture)

Draw circles for states (Locked, Unlocked) and arrows for transitions (coin, push). This is the visual view: good for intuition, but hard to execute directly.

Lens 2: recurrence relation (the function)

Define a step function that takes the current state and event, and returns the next state:

\[\mathrm{state}_{t+1} = \mathrm{step}(\mathrm{state}_t, \mathrm{event}_t)\]

This is the functional view. The whole behavior unfolds from an initial state and repeated application of step.

Lens 3: Tau specification (the constraints)

Encode the state as an output stream, encode the event as an input stream, and write constraints:

o1[0] = 0 (start locked)
o1[t] = 1 if coin and not push (unlock)
o1[t] = 0 if push (lock)
o2[t] = 1 if push while locked (alarm)

This is the declarative view. For this tiny example, the constraints pin down one behavior. In general Tau can leave several behaviors admissible, then execution picks one deterministic representative. Same turnstile, different accent.

One turnstile step through all three lenses

Take one concrete moment:

at time t-1, the turnstile is Locked
at time t, a coin arrives
at time t, there is no push
at time t, the next state should be Unlocked

Here is the same event in three forms:

State machine: Locked --coin--> Unlocked
Recurrence instance: state_t = step(Locked, coin) = Unlocked
Tau instance: if o1[t-1] = 0 && coin[t] = 1 && push[t] = 0, then o1[t] = 1

One event, three equivalent lenses

The picture, the recurrence, and the Tau clause are doing the same job. They differ in notation, not in meaning.

Why three lenses?

Different tools want different representations:

Humans often prefer state machines (visual, local).
Recursive code uses recurrence relations.
Constraint solvers use logic specifications.

The key insight from Tutorial 2 (Isomorphism) applies here: these are not three different systems. They are three equivalent descriptions of the same system. Picking the right lens for your task is a practical skill.

Programs as infinite decision trees

The turnstile rules form a case split: given the current state and event, pick the next state. That is a decision tree.

For code without loops, the tree is finite. For code with loops, the tree is conceptually infinite. It keeps branching forever.

State machines and stream specifications are compact ways to describe infinite unfoldings.

Part IV: cards in Tau (an approximate state tracker)

Tutorial 1 built a precise mental picture of a card counter’s running score. Now let’s see that same system in Tau.

The runnable example is:

examples/tau/card_counting_hilo_state_tracker.tau

From the repo root:

./scripts/run_tau_policy.sh examples/tau/card_counting_hilo_state_tracker.tau

What the spec does

The input stream provides a pre-classified delta: +1, 0, or -1 (encoded as 8-bit values: 1, 0, or 255). The output stream maintains a biased running count.

The core constraint is simple:

\[\text{count}_t = \text{count}_{t-1} + \text{delta}_t\]

That is it. No hidden cleverness. The spec just enforces the update rule.

A useful convention: push parsing out of Tau

Notice that Tau does not classify cards. It receives the already-classified delta. This is a design pattern: keep complex parsing in the host system, let Tau enforce simple invariants.

Why? Tau is good at constraints, not string manipulation. The split keeps both sides clean, and it keeps the spec focused on the part that is actually worth checking.

Part V: a toggle puzzle in Tau (and why XOR is linear algebra)

Now connect back to Tutorial 2’s “puzzle becomes linear algebra” example.

Consider a row of lights that are either on or off. Pressing a button toggles some subset of lights. The puzzle is simple to state and strangely satisfying to solve: turn all lights off.

The insight is that toggling is XOR. A board of lights is a bitvector. Applying a move is:

\[\text{board}_t = \text{board}_{t-1} \oplus \text{move}_t\]

where $\oplus$ is bitwise XOR.

The runnable example is:

examples/tau/toggle_puzzle_xor_state.tau

Why this matters

XOR over bits is the same as addition in $\mathbb{F}_2$ (the field with two elements). So “toggle puzzle” and “linear algebra over a finite field” are the same structure.

This is the leverage move from Tutorial 2: recognize an isomorphism, then use tools designed for the target domain. A puzzle that looked fiddly by hand becomes crisp once it is seen in the right language.

Part VI: Q-learning as a lookup table (and what Tau can check)

What a lookup table really is

A lookup table is a function on a finite domain, stored as a list of values. There is nothing mystical about it. It is just a very explicit function representation.

For tabular Q-learning, the table maps (state, action) pairs to scores:

\[Q : S \times A \to \mathbb{R}\]

If $S$ and $A$ are small and finite, you can literally store every entry. That is a Q-table: a grid of numbers, one per (state, action) pair.

The Q-learning update

The classic update rule is:

\[Q(s,a) \leftarrow (1-\alpha) Q(s,a) + \alpha \left(r + \gamma \max_{a'} Q(s',a')\right)\]

In words: blend the old value with a target computed from the reward and the best predicted future value.

For this tutorial, we use a much smaller executable toy. The table entries are bv[8] values, and the update is simplified to target = r + q_next. If learn = 1, the selected entry becomes target.

A runnable Tau example: 2x2 table update

The runnable spec is:

examples/tau/q_learning_tabular_update.tau

Each step provides:

(s, a): which entry to update (two 0/1 bits)
r: the reward
q_next: a stand-in for $\max_{a’} Q(s’, a’)$
learn: whether to update (0 = no, 1 = yes)
Q00, Q01, Q10, Q11: the current table entries

The spec computes target = r + q_next and enforces:

If learn = 1: exactly one entry changes to target; the others stay the same.
If learn = 0: all entries stay the same.

What Tau gives you

This is the “Tau + lookup table” pattern:

The host system chooses which entry to update and what the reward is.
Tau enforces that the update has the right shape.

The combination is powerful because it separates concerns:

The table is a concrete artifact you can log, diff, and replay.
The constraints are guardrails: “only this entry updates,” “the update uses this formula.”
When something breaks, there is a counterexample trace to inspect instead of a vague sense that “training went oddly.”

A note on closing the loop

This spec models a single update step. It takes the current table as input and produces the updated table as output.

To run multi-step learning, the host system must “close the loop”: take the output table and feed it back as the next step’s input. Tau specifies the shape of each step; the host orchestrates the sequence.

Part VII: tables vs weights vs latent space

Lookup tables versus neural network weights

A Q-table represents a function on a small discrete domain. Each entry is one input-output pair.

A neural network also represents a function, but parameterized differently: by weights.

Two questions that sound similar but are not:

Expressiveness: Can both representations compute the same input-output behavior?
Isomorphism: Is there a 1-to-1, structure-preserving map between them?

For tables and neural networks, (1) can be true. A small network can compute the same function as a table. But (2) is generally false. The map “weights → behavior” is many-to-one: different weight settings can compute the same function.

That is an equivalence relation (same behavior), not an isomorphism (same structure). Same outcome, different internal bookkeeping.

Table, weights, meaning

A table has a direct, lossless correspondence with a function on a finite domain. A weight vector often has many distinct settings that compute the same function.

Latent space as a learned abstraction

In deep learning, a latent vector is the model’s internal representation of an input. It is an abstraction: it forgets details while preserving distinctions the model needs.

From the “spaces” perspective:

State space: all configurations the environment can be in.
Table space: all possible Q-tables (one coordinate per entry).
Weight space: all possible neural network parameters.
Latent space: the internal representation space the model learns.

“Learning” and “verification” are different kinds of search through these spaces:

Learning searches parameter space for weights that score well on data.
Verification searches state space for counterexamples that break an invariant.

State spaces and parameter spaces

Formal methods and machine learning both involve search. The difference is which space you traverse and what counts as evidence.

Running the examples

All examples live in examples/tau/. To run them:

Build Tau (once):
```
./scripts/update_tau_lang.sh
```

Run an example:

./scripts/run_tau_policy.sh examples/tau/turnstile_fsm_alarm.tau

Each .tau file is a self-contained REPL transcript: it declares streams, runs the spec, feeds demo inputs, and quits. You can read the file to see both the spec and the expected behavior.

Where this tutorial goes next

This tutorial introduced Tau as a way to write executable specifications. The key habit: invariants first, syntax second.

Next directions:

Use Tau as a small “logic kernel” inside a larger system: the host computes complex checks, Tau enforces guardrails.
Connect Tau execution with counterexamples and synthesis loops (CEGIS).
Build more complex state machines with multiple interacting streams.

If you want to see how constraints connect to learning systems, read Tutorial 4: World Models.