tau · wave 3 · sealed core

agτ

tau

Structure-aware storage for Ethereum-class transaction data — compact, individually addressable, readable while still compressed.

Tau stores blockchain structure, not just bytes. On the public demo corpus (5,670 Ethereum RPC JSON transactions, 12.41 MB) it reaches 5.53× lossless compression (2.24 MB compressed), answers in-place queries in under 2 µs, and passes a byte-exact round trip on 5,670 of 5,670 transactions across platforms. It exploits the structure of RPC JSON workloads; on unstructured noise it falls back to ≈1× and never inflates the output. The data stays byte-exact recoverable, independently verifiable, and queryable while still compressed, without exposing the engine. The core is sealed. The evidence is not.

one engine · target markets

one engine. multiple target markets.

Tau is a structure-aware storage engine for blockchain data. It makes a narrow, testable claim: blockchain transaction data contains stable structure, and storage engines should use that structure directly. The same engine serves two markets: indexers, analytics platforms, and data warehouses (such as Dune, Allium, and The Graph), and archive-node, RPC, and L1/L2 infrastructure. Both markets handle signed transaction history (canonical RLP) and RPC representations (RPC JSON), and both are served by the same sealed, structure-aware engine.

Parquet must store the field. Tau stores the rule that reconstructs it.

Parquet works with data · Tau works with the meaning of data · run it on your own data, full figures under NDA

Blobs changed the tariff layer. Tau attacks the payload layer.

inclusion is not representation · availability is not compression · hashing is not payload economics

superior density · structure-aware storage

superior density — using a sealed structure-aware codec.

On real RPC JSON transaction data, Tau reaches 5.53× lossless density on the verified 5,670-transaction corpus, with O(1) query-in-place in under 2 µs. Tau beats general-purpose codecs on the same corpus; exact comparator figures are shared under NDA and are regenerated on your corpus during bring-your-own-corpus (BYOC) evaluation.

Per-tx archivers trade density for access. Tau keeps both — read a field while the archive stays packed.
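The trade-off that per-tx archivers face can be reproduced with any general-purpose codec. The following is a minimal sketch and not Tau's method: it uses zlib from the Python standard library as a stand-in, and `synthetic_records` is a hypothetical generator of RPC-JSON-like records. Compressing each transaction on its own keeps every record individually readable but costs density; compressing the corpus as one stream is denser, but the stream must be unpacked before any single record can be read.

```python
import json
import zlib


def synthetic_records(n: int = 1000) -> list[bytes]:
    """Hypothetical RPC-JSON-like records; illustrative only, not real chain data."""
    return [
        json.dumps({
            "blockNumber": hex(19_000_000 + i // 100),
            "gas": hex(21_000),
            "nonce": hex(i),
            "type": "0x2",
            "value": hex(i * 10**15),
        }).encode()
        for i in range(n)
    ]


def per_record_ratio(records: list[bytes]) -> float:
    """Each record compressed alone: any record can be read without the rest."""
    raw = sum(len(r) for r in records)
    packed = sum(len(zlib.compress(r, 9)) for r in records)
    return raw / packed


def solid_ratio(records: list[bytes]) -> float:
    """All records compressed as one stream: denser, but no per-record access."""
    blob = b"\n".join(records)
    return len(blob) / len(zlib.compress(blob, 9))


if __name__ == "__main__":
    recs = synthetic_records()
    print(f"per-record: {per_record_ratio(recs):.2f}x")
    print(f"solid:      {solid_ratio(recs):.2f}x")
```

The gap between the two printed ratios is the cost a per-record archiver pays for addressability with a byte-oriented codec.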

for the skeptic · verify it yourself

“just delta-compression,” says the skeptic. the code says otherwise.

General-purpose compressors win by hunting for repeated bytes. Tau works a layer up, on the structure of the transactions themselves, so its density is reached by a different mechanism. Don’t take the claim on trust: run the benchmark on your own corpus and read the table it prints.

built-in benchmark

The competitor table is part of the engine, not a slide: Zstd, Brotli, LZ4, and Snappy run in-process on every evaluation.

independence

Target density is met without those codecs in the compression path; they exist only to be measured against.

different mechanism

Not byte-repetition search. Tau models transaction structure, which is why it can answer queries while the data stays compressed.

buyer-side proof

Bring your own corpus. The table regenerates on your data, on your hardware.
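Before an evaluation, a buyer can prepare an independent baseline table on the same corpus. This is a minimal sketch under stated assumptions: it uses only codecs from the Python standard library (zlib, bz2, lzma) as stand-ins, because the Zstd, Brotli, LZ4, and Snappy codecs named above need third-party packages. `baseline_table` and `synthetic_corpus` are hypothetical names, and none of this touches the sealed engine.

```python
import bz2
import json
import lzma
import zlib

# Stand-in codecs from the standard library. They are NOT the codecs the
# engine's built-in table uses; swap in your own baselines as needed.
CODECS = {
    "zlib-9": lambda b: zlib.compress(b, 9),
    "bz2-9": lambda b: bz2.compress(b, 9),
    "lzma": lambda b: lzma.compress(b),
}


def baseline_table(corpus: bytes) -> dict[str, float]:
    """Return the compression ratio (raw size / compressed size) per codec."""
    if not corpus:
        raise ValueError("corpus is empty")
    return {name: len(corpus) / len(fn(corpus)) for name, fn in CODECS.items()}


def synthetic_corpus(n: int = 500) -> bytes:
    """Hypothetical RPC-JSON-like records, one per line; not real chain data."""
    rows = [
        {"blockNumber": hex(19_000_000 + i // 100), "nonce": hex(i),
         "gas": hex(21_000), "value": hex(i * 10**15), "type": "0x2"}
        for i in range(n)
    ]
    return "\n".join(json.dumps(r, sort_keys=True) for r in rows).encode()


if __name__ == "__main__":
    for name, ratio in baseline_table(synthetic_corpus()).items():
        print(f"{name:8s} {ratio:6.2f}x")
```

Replace `synthetic_corpus()` with the bytes of your own export to get baseline ratios you can compare against the table the engine prints.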

evidence brief

a verification challenge, not a compression claim.

Every figure below was produced by running the engine on 2026-06-24 (Mac arm64), not quoted from a doc. Run logs are retained. Tau does not ask the market to believe — it asks the market to reproduce.

real corpus · lossless

5,670 tx · Ethereum RPC JSON · 12.41 MB

head-to-head · verified results

5.53× lossless (2.24 MB) · 7.37× analytics (1.68 MB)
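The published sizes and ratios can be checked against each other with a few lines of arithmetic. This sketch assumes, which the page does not state, that the sizes in MB were rounded to two decimals and the ratios were computed from unrounded byte counts; `ratio_bounds` and `consistent` are illustrative names.

```python
def ratio_bounds(raw_mb: float, packed_mb: float,
                 half_step: float = 0.005) -> tuple[float, float]:
    """Range of true ratios compatible with sizes rounded to two decimals."""
    low = (raw_mb - half_step) / (packed_mb + half_step)
    high = (raw_mb + half_step) / (packed_mb - half_step)
    return low, high


def consistent(raw_mb: float, packed_mb: float, reported_ratio: float) -> bool:
    """True when the reported ratio fits inside the rounding interval."""
    low, high = ratio_bounds(raw_mb, packed_mb)
    return low <= reported_ratio <= high


if __name__ == "__main__":
    for label, packed, ratio in [("lossless", 2.24, 5.53), ("analytics", 1.68, 7.37)]:
        low, high = ratio_bounds(12.41, packed)
        print(f"{label}: reported {ratio}x, compatible range {low:.3f} to {high:.3f}")
```

Under that rounding assumption, both published ratios fall inside the range implied by the published sizes.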

out-of-sample validation

density holds on the unseen 10% of a 90/10 split · figures under NDA
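An out-of-sample check of this kind can be sketched with standard tools. The example below is not Tau's procedure: it fits a zlib preset dictionary on 90% of a set of records and measures density only on the held-out 10%, so anything learned from the training split is tested on data it has not seen. `make_records`, `split_90_10`, `fit_dictionary`, and `held_out_ratio` are hypothetical names, and the records are synthetic.

```python
import json
import random
import zlib


def make_records(n: int = 1000) -> list[bytes]:
    """Hypothetical RPC-JSON-like records; illustrative only."""
    return [
        json.dumps({"blockNumber": hex(19_000_000 + i // 100), "gas": hex(21_000),
                    "nonce": hex(i), "type": "0x2", "value": hex(i * 10**15)}).encode()
        for i in range(n)
    ]


def split_90_10(records: list[bytes], seed: int = 0) -> tuple[list[bytes], list[bytes]]:
    """Shuffle deterministically, then keep 90% for fitting and 10% held out."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 9 // 10
    return shuffled[:cut], shuffled[cut:]


def fit_dictionary(train: list[bytes], limit: int = 32 * 1024) -> bytes:
    """Use the tail of the training data as a zlib preset dictionary (32 KiB window)."""
    return b"".join(train)[-limit:]


def held_out_ratio(test: list[bytes], zdict: bytes) -> float:
    """Compress each unseen record with the fitted dictionary and verify it decodes."""
    raw = packed = 0
    for rec in test:
        comp = zlib.compressobj(level=9, zdict=zdict)
        blob = comp.compress(rec) + comp.flush()
        decomp = zlib.decompressobj(zdict=zdict)
        if decomp.decompress(blob) + decomp.flush() != rec:
            raise ValueError("round trip failed on a held-out record")
        raw += len(rec)
        packed += len(blob)
    return raw / packed


if __name__ == "__main__":
    train, test = split_90_10(make_records())
    print(f"held-out ratio: {held_out_ratio(test, fit_dictionary(train)):.2f}x")
```

The point of the split is that the reported ratio comes only from records that played no part in fitting, which is the property a buyer should insist on for any structure-learning codec.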

query-in-place

sub-2µs · O(1) random-access

analytics density

7.37× · analytics profile · scope D

density band

5.53–7.37× · lossless-to-analytics · labelled separately

conformance

byte-exact round-trip · 5,670/5,670 passed
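Byte-exact is a stricter bar than "decodes to the same JSON". The sketch below shows one way a buyer could check it per transaction, using zlib as a placeholder codec because the engine's API is not public. `round_trip_report` and `reserialize` are hypothetical names. The second codec parses and re-serializes the JSON, which preserves meaning but changes the bytes, so it fails the check.

```python
import hashlib
import json
import zlib
from typing import Callable

Codec = Callable[[bytes], bytes]


def round_trip_report(records: list[bytes], encode: Codec,
                      decode: Codec) -> tuple[int, list[int]]:
    """Return (number of byte-exact records, indexes of records that differ)."""
    failures = []
    for i, original in enumerate(records):
        restored = decode(encode(original))
        if hashlib.sha256(restored).digest() != hashlib.sha256(original).digest():
            failures.append(i)
    return len(records) - len(failures), failures


def reserialize(raw: bytes) -> bytes:
    """Semantically lossless but not byte-exact: json.dumps picks its own spacing."""
    return json.dumps(json.loads(raw)).encode()


if __name__ == "__main__":
    # Hypothetical sample records written with compact separators.
    records = [b'{"nonce":"0x1","gas":"0x5208"}', b'{"nonce":"0x2","gas":"0x5208"}']
    print(round_trip_report(records, zlib.compress, zlib.decompress))
    print(round_trip_report(records, reserialize, lambda b: b))
```

A report of the form "N of N passed" is only meaningful when the comparison is made on the original bytes, as it is here, and not on parsed values.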

cross-platform

verified across 6 machines / 4 CPU families

what tau is

A sealed transaction-representation engine.
A bit-perfect recovery path.
A buyer-side integrity verification model.
Infrastructure-oriented throughput.

what tau is not

Not a blob replacement.
Not a Merkle replacement.
Not a generic compressor.
Not a public-source release.
Not a universal cost-reduction claim.
Not externally crypto-audited yet — the construction is novel; independent audit is scheduled and available to evaluation partners under NDA.

If Tau fails, your corpus proves it. If it holds, your pipeline has a representation problem worth quantifying.

methodology separation

numbers, kept honest.

Figures from different runs and datasets are not interchangeable. They are labelled, not blended.

public ranges

Published figures are given as exact verified metrics on the 5,670-transaction corpus.

public demo regimes

Public demo scope D only: 5,670 Ethereum RPC JSON transactions. Repo-only wire KPIs stay under NDA.

not blended

Figures from different runs and datasets are labelled, never mixed.

BYOC evaluation

Buyer corpus, buyer baseline, buyer-side verification. The decisive number.

cross-platform verification

same corpus. cross-platform verified.

The same corpus was replayed on 6 machines across 4 CPU families (AMD Ryzen 9 and Ryzen 7, AMD EPYC, Apple M2, and Intel). The compression ratio is identical (5.53× lossless) on every host, and round-trip verification passes on all of them.

cross-platform validation

Verified on 6 machines / 4 CPU families, identical ratio everywhere.

execution-view read

Sub-2µs query-in-place random access across all platforms.

performance ranges

Encode time of 70–113 ms on the 12.41 MB corpus (throughput under NDA).

NDA details

Detailed per-host logs shared under NDA.

Execution-view throughput is an in-place read rate on the tested corpus — not a compression-ratio claim and not a network line-rate claim. Exact per-host figures are shared under NDA.

One corpus, 6 machines, one result. Reproduce it on yours.
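One way to reproduce a cross-host comparison is to have every machine emit a small manifest and then compare the manifests. This is a sketch under assumptions: `host_manifest` and `manifests_agree` are hypothetical names, zlib stands in for the codec under test, and the comparison covers only what is claimed above (same corpus, same sizes and therefore the same ratio, passing round trip).

```python
import hashlib
import platform
import zlib
from typing import Callable

Codec = Callable[[bytes], bytes]

# Fields that must match across hosts; "machine" is informational only.
COMPARED = ("corpus_sha256", "raw_bytes", "packed_bytes", "round_trip_ok")


def host_manifest(corpus: bytes, encode: Codec, decode: Codec) -> dict:
    """Summarise one host's run so it can be compared with other hosts."""
    packed = encode(corpus)
    return {
        "machine": platform.machine(),
        "corpus_sha256": hashlib.sha256(corpus).hexdigest(),
        "raw_bytes": len(corpus),
        "packed_bytes": len(packed),
        "round_trip_ok": decode(packed) == corpus,
    }


def manifests_agree(manifests: list[dict]) -> bool:
    """True when every host saw the same corpus, the same sizes, and a passing round trip."""
    if not manifests:
        return False
    first = manifests[0]
    return all(
        m["round_trip_ok"] and all(m[key] == first[key] for key in COMPARED)
        for m in manifests
    )


if __name__ == "__main__":
    corpus = b'{"nonce":"0x1","gas":"0x5208"}\n' * 100
    print(host_manifest(corpus, zlib.compress, zlib.decompress))
```

Collect one manifest per host, for example as JSON files, and pass them all to `manifests_agree`; a single differing digest or size is enough to fail the comparison.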

boundaries

what it is — and what it is not.

Live demo processes data server-side — never upload private transactions.

repository documents

standards & licenses.

Normative files, specs, and conformance guarantees from the official repository.

README.md

tau · SEALED CORE

tau

Structure-aware storage engine for blockchain data that answers while still compressed.

→ 5.53× RPC JSON · 5,670 tx corpus
→ Lossless, byte-exact recovery verified on 5,670/5,670 transactions
→ Query-in-place: O(1), sub-2µs
→ Protocol-aware profiles in scoped eval (details under NDA)

SEALED CORE · PUBLIC EVIDENCE

BENCHMARKS.md

tau · performance

Verifiable performance metrics compared to industry standards.

tau lossless 5.53×

LICENSE

RESTRICTED

PROPRIETARY

Proprietary commercial license. Use, copying, or reverse engineering of the sealed engine core is strictly prohibited. Evaluators must sign the bilateral NDA.