SPECTR

Structural Primitive Extraction & Compositional Topology for Replay

A pattern-finding engine for any kind of data.

replay(ingest(X)) == X

In plain words: whatever goes in comes back out, exactly.

Feed it text, an image, a sensor signal, anything. It learns the building blocks that show up over and over. When the same pattern appears again, it recognises it instead of relearning it. No training, no schema, no guessing.

↓ See the evidence

Live demo

See it in action

Three short, self-contained walkthroughs. Pick a tab, click Run, watch what happens. Nothing leaves your browser.

Read a tiny image, find its building blocks, rebuild it byte for byte.

Input1024 bytes
Scanned: 0 / 1,024 bytes
EngineBuilding blocks found
0 primitives
OutputMatch: 0%
ready
Rebuilding: 0%

The Promise

What SPECTR guarantees

Three plain promises. They hold for every input, every time. Not approximate. Not probabilistic.

replay(ingest(X)) == X

What goes in comes back out, exactly.

SAME ANSWER

Same input, same answer, every time. On any machine, any run. No randomness, no rounding, no guessing.

NOTHING LOST

Nothing is thrown away or summarised. Every bit that went in comes back out. The original is fully rebuildable from the record alone.

STRUCTURE FOUND

Finds the building blocks that repeat. Records what they are, in what order, and how often. That structural record is what makes everything else possible.

How It Works

Three steps. One promise.

What SPECTR actually does, in plain language.

1

Read your data

Hand SPECTR any file, stream, or signal. It reads the bits exactly as they are. No conversion, no schema, no assumptions about what the data means.

2

Spot the building blocks

SPECTR notices the patterns that repeat. The first time it sees one it learns it. The next time, it recognises it. The record of what repeated, in what order, is the structural identity of the data.

3

Rebuild it perfectly

From that structural record alone, SPECTR can rebuild the original data byte for byte. Nothing is lost. Nothing is approximated. The same record that detects patterns also reconstructs the input.

Common misconceptions

What SPECTR Is Not

There are many benefits to SPECTR, but it's important to understand which are byproducts of the engine, not its primary purpose.

Not compression

SPECTR naturally deduplicates recurring primitives, which can reduce size, but that is a byproduct of how it records structure, not a design goal. It makes no guarantees about size reduction, and the ledger grows with novel data.

Not a hash function

The original stream is fully reconstructible from the ledger. Nothing is one-way. The seed is a structural identifier, not a content fingerprint.

Not AI / ML

No AI is used at any point. No machine learning, no training data, no weights, no neural networks, no gradient descent. SPECTR is a deterministic algorithm: same input and configuration always produces identical output on every machine.

Not encryption

SPECTR makes no confidentiality guarantees. The ledger encodes structure, not secrets. It is a substrate layer; what you build on top handles access control.

What it is: A deterministic, append-only substrate that decomposes any binary stream into a primitive vocabulary, records the arrangement as a replayable timeline, and reconstructs the original stream exactly. replay(ingest(X)) == X, always.

Built for Research

Every result is reproducible.

The engine is Rust. The bindings are Python.

REST API

All engine capabilities available over HTTP. Ingest, replay, structural profiles, batch analysis, and noise-class normalisation through one authenticated endpoint.

curl -X POST /v1/engine \
  -H "X-Api-Key: $KEY" \
  -d '{"data": "<base64>", "mode": "auto"}'
API reference →

Python Wheel

The Rust engine ships as a native Python wheel. Run experiments locally without any network round-trips. Full API parity with the hosted service.

import ufm

engine = ufm.InvariantIdentityEngine(
    bit_depth=21, storage_path="ledger.bin"
)
seed, status = engine.process(data)
original = [bytes(seq) for seq in engine.replay(seed)]
Local install guide →

Open Claim Registry

Every quantitative result on this site maps to a claim ID in a public test registry. Tests are deterministic, seeded, and re-runnable. Claims that turn out wrong are listed honestly with their root causes, not removed.

Browse the full registry →
91 verified claims
Core engine guarantees
Discovery findings
Method-layer governance
Novel-frontier research