v2 Pass Contracts¶

Status: frozen specification — binding for v2 development
Audience: implementation agents and maintainers
Closes: #50 (M1.E — pass fixtures and invariant enforcement)

Summary¶

This document defines the required pass contracts for the canonical v2 compiler. It is intentionally operational: each pass has one job, one input shape, one output shape, and explicit non-goals. The objective is to stop backend emitters and tooling layers from re-deriving semantics ad hoc.

Global rules¶

Every major pass must satisfy all of the following:

one canonical owner module group
one explicit input contract
one explicit output contract
validation tests at the pass boundary
documentation updates in the same change when the contract changes

No pass is allowed to silently absorb responsibilities that belong to another phase just because the current implementation makes that convenient.

Pass graph¶

The canonical v2 graph is:

Compiler.Syntax.Lexer
Compiler.Syntax.Parser
Compiler.Frontend.Elaborator
Compiler.Types
Compiler.Typed
Compiler.Infer
Compiler.Lower
Compiler.Backend.<Target>
Compiler.Project.Manifest
Compiler.Project.Loader
Compiler.Cli

Project loading and CLI orchestration wrap the language pipeline but still need their own contracts because they determine how the compiler is used in practice.

Phase contracts¶

1. `Compiler.Syntax.Lexer`¶

Input: raw source text
Output: token stream with source positions and layout tokens
Must own:

lexical classification
comment stripping
layout token synthesis
literal tokenization
source position tracking at token granularity

Must not own:

declaration parsing
precedence
type checking
backend-specific behavior

2. `Compiler.Syntax.Parser`¶

Input: token stream
Output: surface AST plus position map or equivalent source-location mapping
Must own:

declaration parsing
expression parsing
precedence and associativity
pattern parsing
syntactic sugar expansion only when it is purely syntactic

Must not own:

name resolution
type inference
target-specific rewriting

3. `Compiler.Frontend.Elaborator`¶

Input: surface AST
Output: validated surface AST plus symbol/declaration environment and frontend diagnostics
Must own:

name resolution
declared-type consistency checks
constructor/binder environment construction
exhaustiveness and frontend structural checks

Must not own:

general HM inference
backend lowering
project graph logic

4. `Compiler.Types`¶

Input: type-level declarations and frontend requirements
Output: canonical type representations, substitutions, schemes, utility operations
Must own:

type representation
substitution machinery
free-variable calculations
generalization/instantiation support

Must not own:

AST walking
project loading
backend emission

5. `Compiler.Typed`¶

Input: typed-core expression and declaration shapes required by inference and lowering
Output: canonical typed IR definitions
Must own:

typed expression forms
typed declaration forms
expression IDs or equivalent stable identity hooks
metadata needed by lowering/backends

Must not own:

inference algorithm
backend rendering

6. `Compiler.Infer`¶

Input: elaborated surface AST and type environment
Output: typed IR plus inference diagnostics and evidence tables where needed
Must own:

HM inference / unification
typed-core construction
operator typing enforcement
principal-type behavior where intended

Must not own:

match lowering
backend-specific representations
project-system concerns

7. `Compiler.Lower`¶

Input: typed IR
Output: backend-neutral lowered IR
Must own:

explicit lowering of pattern matching
canonicalization of syntactic conveniences
preparation for backend emission

Must not own:

source parsing
environment construction
target-specific pretty-printing

8. `Compiler.Backend.<Target>`¶

Input: lowered IR
Output: target artifact text or target-specific intermediate form
Must own:

target syntax rendering
target runtime/helper mapping
target naming/stability conventions

Must not own:

re-typechecking source constructs
redoing language-level lowering decisions that should already be explicit in Compiler.Lower

9. `Compiler.Project.Manifest`¶

Input: manifest text
Output: canonical manifest model
Must own:

manifest parsing
manifest validation
dependency declaration decoding

Must not own:

filesystem graph walking
backend codegen

10. `Compiler.Project.Loader`¶

Input: project root + manifest model
Output: resolved project graph in deterministic load/build order
Must own:

source discovery
import-to-file resolution
dependency graph construction
cycle detection
topo ordering

Must not own:

type inference
backend emission
CLI UX policy

11. `Compiler.Cli`¶

Input: user command line / MCP request / tool invocation
Output: orchestrated compiler action and stable diagnostics/artifacts
Must own:

command dispatch
entrypoint selection
command-mode policy (build, check, run, install, mod *, MCP)
user-facing output shaping

Must not own:

language semantics
internal backend logic
ad hoc parsing/typechecking shortcuts outside the canonical pipeline

Pass fixture shapes (M1.E)¶

Each pass has a minimal smoke fixture that can be used to validate the boundary in isolation.

Phase	Fixture shape	Validation command
`Compiler.Syntax.Lexer`	a `.lll` source file with at least one of each token class	`lllc self check <file>` succeeds; token stream matches snapshot
`Compiler.Syntax.Parser`	`spec/examples/valid/01-minimal.lll` through `spec/examples/valid/19-*.lll`	xUnit test: parse each valid corpus example without error
`Compiler.Frontend.Elaborator`	all `spec/examples/valid/*.lll` corpus examples	`dotnet test` elaboration tests pass
`Compiler.Types`	property: `applyType (generalize env t) [] == t` for closed types	unit tests on substitution identity, composition, ftv
`Compiler.Typed`	any `TypedModule` produced by inference must round-trip through `exprId` lookup	xUnit: `TypedModule.Dispatch` keys are all `ExprId` values in the typed tree
`Compiler.Infer`	`spec/examples/valid/20-bootstrap-compiler.lll` infers without error	`lllc self check spec/examples/valid/20-bootstrap-compiler.lll`
`Compiler.Lower`	a program with nested match and lambda must lower to a form with no `EMatch` or `ELam` in the IR	unit test on lowered IR structure
`Compiler.Backend.<Target>`	`spec/examples/valid/*.lll` must emit + build successfully for each stable target	`dotnet test` codegen tests; `lllc build --target <t>` exits 0
`Compiler.Project.Manifest`	a valid `lll.toml` and a malformed one	unit test: valid parses without error; malformed returns structured error
`Compiler.Project.Loader`	a two-module project with one import	integration test: topo order is `[dep, importer]`
`Compiler.Cli`	`lllc build`, `lllc self check`, `lllc run` on a minimal project	end-to-end: exit 0 and expected output

Invariants checked at each boundary:

After lexing: every token has line > 0 and col > 0.
After parsing: PosMap contains an entry for every expression node.
After elaboration: TypeEnv is non-empty; no unresolved TyVar in declared positions.
After inference: every ExprId in TypedModule has an entry in Dispatch or a direct type annotation.
After lowering: no EMatch remains in the IR output.
After backend: emitted file passes target-language static analysis (e.g., dotnet build for F#, tsc --noEmit for TS).

Required migration discipline¶

When moving a subsystem from stage0 to canonical ll-lang ownership:

define the owner module group
define the input/output contract
identify current stage0-only assumptions
add or update direct tests for the ll-lang-owned path
update all docs that name the old owner

If any step is missing, the migration is incomplete.

Validation targets¶

Milestone 1 is not done until:

each phase above has one documented owner
docs consistently reference the same phase graph
tests exist at the main pass boundaries
no major subsystem is “canonical by implication only”

v2 Pass Contracts¶

Summary¶

Global rules¶

Pass graph¶

Phase contracts¶

1. Compiler.Syntax.Lexer¶

2. Compiler.Syntax.Parser¶

3. Compiler.Frontend.Elaborator¶

4. Compiler.Types¶

5. Compiler.Typed¶

6. Compiler.Infer¶

7. Compiler.Lower¶

8. Compiler.Backend.<Target>¶

9. Compiler.Project.Manifest¶

10. Compiler.Project.Loader¶

11. Compiler.Cli¶

Pass fixture shapes (M1.E)¶

Required migration discipline¶

Validation targets¶

1. `Compiler.Syntax.Lexer`¶

2. `Compiler.Syntax.Parser`¶

3. `Compiler.Frontend.Elaborator`¶

4. `Compiler.Types`¶

5. `Compiler.Typed`¶

6. `Compiler.Infer`¶

7. `Compiler.Lower`¶

8. `Compiler.Backend.<Target>`¶

9. `Compiler.Project.Manifest`¶

10. `Compiler.Project.Loader`¶

11. `Compiler.Cli`¶