Sutra — a geometrically compiled language

Sutra is a geometrically compiled language where logical operations over vector spaces are resolved at compile time into matrix multiplications.

Write a program in a TypeScript-shaped language. The compiler turns the entire program — control flow included — into one straight-line sequence of tensor operations. What comes out is, at the same time, a logic program you can read and a neural network you can train.

Why Sutra

Write logic, get a network. A Sutra program is simultaneously a readable symbolic program and a differentiable model. Train it with ordinary PyTorch autograd — the rule graph never changes; gradient descent only moves the embeddings it reasons over. A symbolic fuzzy-rule classifier trains from chance to 95% accuracy without touching a line of the program.
One tensor-op graph, no glue. The whole program — conditionals, loops, string I/O — compiles to a single tensor expression. No interpreter, no host-side if/while on data, no Python in the hot path. The program is the computation graph.
Substrate-agnostic. Values live in a frozen embedding space. The same source recompiles against a different model — a text encoder, a protein language model, any dense encoder — and the binding algebra stays exact where textbook vector-symbolic operators fall apart.
Symbolic and sub-symbolic without a bridge. Fuzzy three-valued logic, role binding, rotation hash-maps, recurrent loops — all native, all differentiable end to end. No separate neural front-end stitched to a symbolic back-end.
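The first point above, a fixed rule whose embeddings are the only trainable part, can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the rule ("pick the class whose prototype embedding is most similar to the input"), the two-dimensional data, and the names `rule`, `train_step`, and `prototypes` are hypothetical and not taken from the Sutra codebase. The gradient is also derived by hand to keep the sketch dependency-free, whereas Sutra itself is described as training with PyTorch autograd.

```python
import math

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def rule(prototypes, x):
    """Fixed rule: softmax over dot-product similarity to each class prototype."""
    logits = [sum(p_i * x_i for p_i, x_i in zip(p, x)) for p in prototypes]
    return softmax(logits)

def accuracy(prototypes, data):
    hits = 0
    for x, label in data:
        probs = rule(prototypes, x)
        hits += max(range(len(probs)), key=probs.__getitem__) == label
    return hits / len(data)

def train_step(prototypes, data, lr):
    """One full-batch gradient step on cross-entropy. Only the prototypes move."""
    grads = [[0.0] * len(p) for p in prototypes]
    for x, label in data:
        probs = rule(prototypes, x)
        for k, prob in enumerate(probs):
            err = prob - (1.0 if k == label else 0.0)  # d(loss) / d(logit_k)
            for i, x_i in enumerate(x):
                grads[k][i] += err * x_i / len(data)
    return [[p_i - lr * g_i for p_i, g_i in zip(p, g)]
            for p, g in zip(prototypes, grads)]

# Hypothetical toy data: (input vector, class label).
data = [([1.0, 0.0], 0), ([0.9, 0.1], 0), ([0.8, -0.1], 0),
        ([0.0, 1.0], 1), ([0.1, 0.9], 1), ([-0.1, 0.8], 1)]

prototypes = [[0.0, 0.0], [0.0, 0.0]]  # untrained embeddings
acc_before = accuracy(prototypes, data)
for _ in range(50):
    prototypes = train_step(prototypes, data, lr=1.0)
acc_after = accuracy(prototypes, data)
```

The function `rule` is never edited during training; only the numbers in `prototypes` change, which is the sense in which the program stays fixed while the embeddings move.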

How it works

Every value is a vector; every operation — bundle, bind, unbind, similarity, select, loop — is a tensor op on that shape. Because the shape never changes, the compiler reads a whole program as one tensor expression: chains of bind/unbind/bundle collapse into chains of matrix multiplies, the simplifier folds those into cached matrices at compile time, and the runtime executes the result as one sequence of tensor ops.
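A minimal sketch of the folding idea, under stated assumptions: binding is modelled here as multiplication by a 2x2 rotation matrix and unbinding as multiplication by its transpose. The text only says that bind and unbind chains become matrix multiplies, so the choice of rotations, the tiny dimension, and the helper names (`rotation`, `matmul`, `matvec`) are illustrative rather than Sutra's actual operators. Bundling is not shown.

```python
import math

def rotation(theta):
    """2x2 rotation matrix, standing in for a role's binding matrix."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def matvec(m, v):
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]

def transpose(m):
    return [list(col) for col in zip(*m)]

role_a = rotation(0.7)   # hypothetical role matrices
role_b = rotation(1.9)
filler = [0.6, 0.8]

# Unfolded: apply the two binds one after the other at run time.
step_by_step = matvec(role_b, matvec(role_a, filler))

# Folded: multiply the role matrices once, ahead of time, and cache the result.
folded = matmul(role_b, role_a)
at_once = matvec(folded, filler)

# Unbinding the whole chain is one multiply by the transpose of the cached matrix.
recovered = matvec(transpose(folded), at_once)
```

Because rotations are orthogonal, the transpose is an exact inverse, so `recovered` matches `filler` up to floating-point error, and `at_once` matches `step_by_step` with one matrix-vector multiply instead of two.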

A Sutra value is a vector in a frozen LLM embedding space (default substrate: nomic-embed-text, 768-d). Strings auto-embed in vector contexts: vector v = "cat" embeds the string through the substrate. Conditionals are softmax-weighted sums. Loops are recurrent cells that unroll to a fixed-length tensor-op chain with a soft-halt mask; the loop counter is an angular position on a helix in the substrate rather than a host variable.
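The conditionals and loops described above can be sketched in plain Python. This is an illustration under assumptions, not what the compiler emits: `soft_select` and `soft_loop` are hypothetical names, the halting score is a steep sigmoid chosen for the demo, and the loop state is an ordinary float rather than an angular position on a helix. The Python `for` runs a fixed number of iterations regardless of the data, which is what unrolling to a fixed-length chain amounts to.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def soft_select(scores, branches):
    """Conditional as a weighted sum: every branch is computed, none is skipped."""
    weights = softmax(scores)
    dim = len(branches[0])
    return [sum(w * b[i] for w, b in zip(weights, branches)) for i in range(dim)]

def soft_loop(state, step, halt_score, max_steps):
    """Fixed-length unrolled loop; a soft mask freezes the state after halting."""
    running = 1.0  # 1.0 means still running, 0.0 means halted
    for _ in range(max_steps):              # trip count does not depend on the data
        running *= 1.0 - halt_score(state)  # once halted, the mask stays near zero
        proposed = step(state)
        state = [running * p + (1.0 - running) * s
                 for p, s in zip(proposed, state)]
    return state

# Equal scores blend the branches evenly; a large gap picks one almost entirely.
blended = soft_select([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
mostly_first = soft_select([10.0, -10.0], [[1.0, 0.0], [0.0, 1.0]])

# Count upward by one per step and halt softly once the value passes 2.5.
final = soft_loop(
    [0.0],
    step=lambda s: [s[0] + 1.0],
    halt_score=lambda s: sigmoid(50.0 * (s[0] - 2.5)),
    max_steps=8,
)
```

All eight iterations execute, but after the halting score saturates the mask multiplies further updates by a value close to zero, so `final` settles near 3.0 instead of 8.0. Both functions are made of sums, products, and smooth activations only, so they stay differentiable.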

Hardware

Sutra compiles to self-contained PyTorch and runs on an NVIDIA GPU (CUDA, selected automatically at module init) or on CPU — the same emitted module, no code change. Because the entire program is one tensor-op graph with no host-side control flow, it maps straight onto GPU execution: the program is the kernel sequence, not a script that calls into one. Requirements are Python and PyTorch, plus Ollama to serve the default embedding substrate.

Get started

git clone https://github.com/EmmaLeonhart/Sutra
cd Sutra
python examples/_smoke_test.py

Read examples/*.su for the language itself. The compiler, runtime, IntelliJ plugin, and VS Code extension all live in the repository.

Explore

The paper

Tensor-Op RNNs as a compilation target for VSAs — full text, readable here.

What is Sutra?

The short version: a typed language whose compiled forward pass is a neural net.

The graph-to-vector leap

Why embedding spaces look like graphs but behave like geometry.

Paradigms — Sutra is not Java

What programming paradigms Sutra is in conversation with.

The ontology

The type system and the role of OWL-style classes.

Primitive classes

Built-in primitive types and their geometric semantics.

Operators

The operator set and what each one compiles to.

Logical operations

&&, ||, ! over fuzzy three-valued truth.

Numeric math

How integers, floats, and complex numbers live in the substrate.

Memory without control flow

bind, unbind, bundle — the role-filler model.

Loops

First-class loop functions as substrate-pure RNN cells.

Promises and async/await

Promises and async/await, geometrically.

TypeScript → Sutra mapping

How TypeScript source maps onto Sutra.

Compilation: sugar to polynomial

The five-stage pipeline from source to fused tensor graph.

Demos

Every program in the smoke test.

History

How the language got to its current shape.

What Sutra implements

Every keyword, operator, primitive, runtime method, and stdlib class, with the training status of each.

Papers

Tutorials

Hands-on: hello Sutra, bind/unbind, snap-to-nearest.

NeurIPS 2026 archive

The frozen submission record + paper / anonymized / zip downloads.
