From 4f7ab8e4f0ac3457ead8be1e80a024e4152e6ecb Mon Sep 17 00:00:00 2001
From: ruv
Date: Sat, 30 May 2026 17:15:08 -0400
Subject: [PATCH] =?UTF-8?q?docs(aether-arena):=20v0=20infrastructure=20com?=
 =?UTF-8?q?plete=20=E2=80=94=20Space=20live,=20harness=20gate=20passing=20?=
 =?UTF-8?q?(M8)?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 aether-arena/STATUS.md | 6 +++++-
 1 file changed, 5 insertions(+), 1 deletion(-)

diff --git a/aether-arena/STATUS.md b/aether-arena/STATUS.md
index 1566983a..8d97519a 100644
--- a/aether-arena/STATUS.md
+++ b/aether-arena/STATUS.md
@@ -14,7 +14,11 @@ ADR-079 camera-ground-truth collection — *not* an infra-completion blocker.
 | M5 | Public smoke split (committed) + private MM-Fi held-out split prep | 🟡 smoke split done (`fixtures/smoke_*.json`); private MM-Fi prep pending |
 | M6 | HF Space (Gradio) — leaderboard + ledger integrity + submit/verify/about | ✅ deployed → https://huggingface.co/spaces/ruvnet/aether-arena (sandboxed scorer container = later hardening) |
 | M7 | **Witness ledger chain** — append-only, hash-chained, tamper-evident | ✅ done — `ledger/ledger_tools.py` (seed/append/verify); tamper test fails as designed |
-| M8 | Public launch | ⏳ — **board starts EMPTY; no seeded numbers** (benchmark-first: only real harness scores) |
+| M8 | Public launch | ✅ Space **LIVE** (gradio 5.9.1, serving 200) — **board empty, awaiting first real harness score** (benchmark-first: no seeded numbers) |
+
+## v0 infrastructure: COMPLETE
+Implement ✅ · Test ✅ · Deploy to HF ✅ (https://huggingface.co/spaces/ruvnet/aether-arena) · Instructions+Verification ✅ · PR runs the harness ✅ (PR #874, AA harness gate **passed**).
+Remaining = data + hardening, not infra: private MM-Fi held-out split (M5), sandboxed scorer container (M6), privacy-leakage attacker (gated category), and **model SOTA** (separate ML effort, blocked on ADR-079 — explicitly not an infra exit).

 ## Benchmark-first posture (per user direction)
 - **No placeholder numbers on the board.** The ledger seeds to genesis only; every result is a real scoring-pipeline witness. RuView gets no seeded baseline.