helix · controlled evidence
agη

helix

Lossless genomic compression. Sealed core, public evidence.

A genome is reduced to a compact amortized footprint, not stored as raw sequence. On real public genomic data the codec is lossless at over 1,200× across a cohort, with each genome reduced to roughly 10 KB. Recovery is byte-identical and SHA-256 verified — a reviewer can reproduce it on public reference data and confirm the restored sequence hash-for-hash. General-purpose compressors reach 3–4× on the same data. The encoding mechanism stays sealed. Research-grade — no clinical claim.

evidence ledger

measured density, sealed machinery.

Bounded, stated-boundary claims on public reference data. The mechanism stays sealed; the density is verifiable.

measured density
>1,200× · real public genomic data
baseline
3–4× · gzip / zstd, same data
per genome
~10 KB · amortized footprint
throughput
>100M bases / second
recovery
lossless · byte-identical · SHA-256
model
Self-Referential Amortization
stage
research-grade · no clinical claim
public layer

Bounded, not hyped.

The result is stated with its context: an amortized density figure on real public genomic data, where each genome reduces to a ~10 KB footprint. Recovery is byte-identical and SHA-256 chained — a reviewer can reproduce it on public reference data and verify the restored sequence hash-for-hash. General-purpose compressors reach 3–4× on the same inputs. The mechanism stays sealed; the density does not depend on trusting it.

stated openly

Boundaries.

Research-grade — no clinical claim. The headline figure is amortized across a cohort; a single genome, compressed on its own, compresses less. All inputs are preserved losslessly, byte-for-byte. The core projection mechanism stays sealed pending patent.