SPECTR

Structural Primitive Extraction & Compositional Topology for Replay

A pattern-finding engine for any kind of data.

replay(ingest(X)) == X

In plain words: whatever goes in comes back out, exactly.

Feed it text, an image, a sensor signal, anything. It learns the building blocks that show up over and over. When the same pattern appears again, it recognises it instead of relearning it. No training, no schema, no guessing.

See it in action Get API Access

↓ See the evidence

Live demo

See it in action

Three short, self-contained walkthroughs. Pick a tab, click Run, watch what happens. Nothing leaves your browser.

Read a tiny image, find its building blocks, rebuild it byte for byte.

Input1024 bytes

Scanned: 0 / 1,024 bytes

EngineBuilding blocks found

0 primitives

OutputMatch: 0%

ready

Rebuilding: 0%

The Promise

What SPECTR guarantees

Three plain promises. They hold for every input, every time. Not approximate. Not probabilistic.

replay(ingest(X)) == X

What goes in comes back out, exactly.

SAME ANSWER

Same input, same answer, every time. On any machine, any run. No randomness, no rounding, no guessing.

NOTHING LOST

Nothing is thrown away or summarised. Every bit that went in comes back out. The original is fully rebuildable from the record alone.

STRUCTURE FOUND

Finds the building blocks that repeat. Records what they are, in what order, and how often. That structural record is what makes everything else possible.

How It Works

Three steps. One promise.

What SPECTR actually does, in plain language.

Read your data

Hand SPECTR any file, stream, or signal. It reads the bits exactly as they are. No conversion, no schema, no assumptions about what the data means.

Spot the building blocks

SPECTR notices the patterns that repeat. The first time it sees one it learns it. The next time, it recognises it. The record of what repeated, in what order, is the structural identity of the data.

Rebuild it perfectly

From that structural record alone, SPECTR can rebuild the original data byte for byte. Nothing is lost. Nothing is approximated. The same record that detects patterns also reconstructs the input.

Common misconceptions

What SPECTR Is Not

There are many benefits to SPECTR, but it's important to understand which are byproducts of the engine, not its primary purpose.

Not compression

SPECTR naturally deduplicates recurring primitives, which can reduce size, but that is a byproduct of how it records structure, not a design goal. It makes no guarantees about size reduction, and the ledger grows with novel data.

Not a hash function

The original stream is fully reconstructible from the ledger. Nothing is one-way. The seed is a structural identifier, not a content fingerprint.

Not AI / ML

No AI is used at any point. No machine learning, no training data, no weights, no neural networks, no gradient descent. SPECTR is a deterministic algorithm: same input and configuration always produces identical output on every machine.

Not encryption

SPECTR makes no confidentiality guarantees. The ledger encodes structure, not secrets. It is a substrate layer; what you build on top handles access control.

What it is: A deterministic, append-only substrate that decomposes any binary stream into a primitive vocabulary, records the arrangement as a replayable timeline, and reconstructs the original stream exactly. replay(ingest(X)) == X, always.

What we have actually shown

Three things SPECTR found
that other tools missed.

See all case studies →

A hidden pattern that gzip cannot see

We hid a repeating pattern inside one file and shuffled the bytes of another. Both files had identical byte statistics, so they looked the same to a normal compressor. Only the engine could tell them apart.

98 times out of 100, SPECTR picked the right file. Gzip got it right half the time, no better than a coin flip.

Same letters, different order. The engine noticed.

Two families of human DNA sequences share 100% of their structural building blocks. Only the order in which those blocks appear is different. Standard tools that look at what appears (not how it is arranged) cannot tell the families apart.

Two DNA types share 100% of their building blocks. Only the order differs. SPECTR told them apart 96 times out of 100.

The standard randomness test says they are identical. They are not.

We made 1,000 pairs of files. Each pair was carefully built so that both files had the exact same randomness score. One had a hidden repeating pattern, the other was shuffled. We then asked two methods to tell them apart.

Identical randomness scores. SPECTR was right 99 times out of 100. Shannon entropy was right 50 times out of 100.

Get API Access Full Case Studies

Built for Research

Every result is reproducible.

The engine is Rust. The bindings are Python.

REST API

All engine capabilities available over HTTP. Ingest, replay, structural profiles, batch analysis, and noise-class normalisation through one authenticated endpoint.

curl -X POST /v1/engine \
  -H "X-Api-Key: $KEY" \
  -d '{"data": "<base64>", "mode": "auto"}'

API reference →

Python Wheel

The Rust engine ships as a native Python wheel. Run experiments locally without any network round-trips. Full API parity with the hosted service.

import ufm

engine = ufm.InvariantIdentityEngine(
    bit_depth=21, storage_path="ledger.bin"
)
seed, status = engine.process(data)
original = [bytes(seq) for seq in engine.replay(seed)]

Local install guide →

Open Claim Registry

Every quantitative result on this site maps to a claim ID in a public test registry. Tests are deterministic, seeded, and re-runnable. Claims that turn out wrong are listed honestly with their root causes, not removed.

Browse the full registry →

91 verified claims

Core engine guarantees

Discovery findings

Method-layer governance

Novel-frontier research