PROOF — reproduce every claim, or find the one we can't yet

This project (RuView / wifi-densepose) has been publicly called "AI slop" and "fake." This document is the answer: a skeptic can clone the repo, run one script, and have every headline claim either verified on their own machine or shown — explicitly — as "CLAIMED, not yet reproduced (here's exactly what it needs)." Nothing below is asserted without a command you can run.

git clone https://github.com/ruvnet/RuView && cd RuView
bash scripts/prove.sh          # core gate + the anti-slop assertion tests
bash scripts/prove.sh --full   # also attempt the feature-gated subset

prove.sh exits 0 only if every non-gated claim passes. Gated claims never fail the run; they print the prerequisite (a GPU, a dataset, real hardware, a trained checkpoint) so you can reproduce them yourself.
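
The exit rule above can be sketched in a few lines. This is an illustration of the stated semantics only, not the contents of scripts/prove.sh; `ClaimResult` and `exit_code` are hypothetical names introduced here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ClaimResult:
    """Hypothetical record for one claim; not a type from the repo."""
    name: str
    passed: Optional[bool]   # None means the claim is gated and was not run
    prerequisite: str = ""   # what a gated claim needs (GPU, dataset, ...)


def exit_code(results):
    """Return 0 only if every non-gated claim passed; gated claims never fail."""
    for r in results:
        if r.passed is None:
            print(f"GATED  {r.name}: needs {r.prerequisite}")
        elif r.passed:
            print(f"PASS   {r.name}")
        else:
            print(f"FAIL   {r.name}")
    # Only claims that actually ran take part in the verdict.
    ran = [r.passed for r in results if r.passed is not None]
    return 0 if all(ran) else 1
```

A gated claim contributes a printed prerequisite and nothing else, so a machine without a GPU can still get exit status 0 from the core gate.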

Grading

  • MEASURED — reproduced on our hardware, with the exact command recorded, and pinned by a test that fails on the pre-fix code. prove.sh re-runs these.
  • CLAIMED — cited from a source, or measured by the source, but not reproduced in this repo's automated harness.
  • DATA-GATED / HARDWARE-GATED — the code path is real and tested, but the accuracy/throughput claim needs data or hardware we don't ship. We never fabricate the number; the code carries a typed error or a weights_trained/ provenance flag instead.
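
As a rough sketch of the "typed error or provenance flag" idea in the last grade: the output carries a `weights_trained` flag, and any accuracy report refuses to proceed when the flag is false. Only the flag name comes from this document; `Prediction`, `WeightsNotTrainedError`, and `reported_accuracy` are hypothetical, and the real code is Rust, not Python.

```python
from dataclasses import dataclass


class WeightsNotTrainedError(RuntimeError):
    """Hypothetical typed error: raised instead of reporting an unbacked number."""


@dataclass(frozen=True)
class Prediction:
    values: tuple
    weights_trained: bool   # provenance flag travels with the output


def reported_accuracy(pred: Prediction, measured: float) -> float:
    """Return a measured accuracy only when the weights behind it were trained."""
    if not pred.weights_trained:
        raise WeightsNotTrainedError(
            "accuracy is DATA-GATED: no trained checkpoint loaded"
        )
    return measured
```

The design choice is that an untrained model can still run and be tested, but no caller can obtain an accuracy figure from it without the error surfacing.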

The hard gate (run on any machine with Rust + Python)

| Claim | Grade | Reproduce |
| --- | --- | --- |
| Rust workspace: 3,128 tests, 0 failed | MEASURED | `cd v2 && cargo test --workspace --no-default-features` |
| Deterministic CSI pipeline proof (bit-exact SHA-256) | MEASURED | `python archive/v1/data/proof/verify.py` → `VERDICT: PASS` |
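
To make "bit-exact SHA-256" concrete, here is a minimal sketch of the general technique: run a deterministic step, hash the exact IEEE-754 bytes of its output, and compare against a pinned digest. The `pipeline` function is a stand-in (a 3-tap moving average), not the CSI pipeline, and nothing here is taken from verify.py.

```python
import hashlib
import struct


def pipeline(samples):
    """Stand-in deterministic step (hypothetical): a 3-tap moving average."""
    return [
        (samples[i] + samples[i + 1] + samples[i + 2]) / 3.0
        for i in range(len(samples) - 2)
    ]


def digest(values):
    """Hash the exact bit patterns (little-endian doubles), not rounded text."""
    h = hashlib.sha256()
    for v in values:
        h.update(struct.pack("<d", v))
    return h.hexdigest()


def verdict(samples, expected_hex):
    """PASS only if the output is bit-for-bit identical to the pinned digest."""
    return "PASS" if digest(pipeline(samples)) == expected_hex else "FAIL"
```

Hashing packed bytes rather than printed numbers means a change in the last bit of any output value flips the verdict.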

Anti-slop assertion tests (each fails on the pre-fix code)

| Claim | Grade | Test (run via `cargo test -p <crate> <name>`) |
| --- | --- | --- |
| Fusion crafted-input DoS panics are closed (ADR-156 §2.2) | MEASURED | `wifi-densepose-ruvector` :: `triangulation_out_of_range_index_returns_none_no_panic` |
| The "Soul Signature" identity claim, honestly bounded: on WiFi-only cardiac+respiratory channels two people are not separable (gap ≈ 0.0005) | MEASURED | `wifi-densepose-bfld` :: `cardiac_alone_cannot_separate_identity_matches_audit` |
| OccWorld `predict()` is real (input-dependent), not random noise | MEASURED | `wifi-densepose-occworld-candle` :: `predict_is_deterministic_for_same_input` |
| Pose runtime emits frames under its own default config (ADR-159 A1) | MEASURED | `cog-pose-estimation` :: `default_config_emits_frames_with_real_model` |
| Person-count flags untrained classes; no count inflation (ADR-159 A2) | MEASURED | `cog-person-count` :: `untrained_class_argmax_is_flagged_low_confidence` |
| Medical edge skills carry a "not a medical device" disclaimer (ADR-160 A1) | MEASURED | `wifi-densepose-wasm-edge` :: `a1_med_modules_have_clinical_disclaimer` (`--features std`) |
| Survivor dedup 3→1, count-inflation killed (ADR-158 §2) | MEASURED | `wifi-densepose-mat` :: `test_identical_vitals_no_location_dedup_to_one` (`--features mat`) |
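
What "fails on the pre-fix code" means can be shown with a toy version of the first row. The sketch assumes the fix is a bounds check that returns nothing instead of crashing; `lookup_prefix`, `lookup_fixed`, and `pins_fix` are hypothetical names, and the real test lives in the Rust crate listed above.

```python
def lookup_prefix(positions, idx):
    """Pre-fix behavior (illustrative): a crafted index crashes the caller."""
    return positions[idx]


def lookup_fixed(positions, idx):
    """Post-fix behavior (illustrative): an out-of-range index yields None."""
    if not 0 <= idx < len(positions):
        return None
    return positions[idx]


def pins_fix(lookup):
    """The pinning assertion: a crafted out-of-range index must return None."""
    try:
        return lookup([(0.0, 0.0)], 7) is None
    except IndexError:
        return False
```

A test of this shape is useful as evidence because it distinguishes the two versions: it passes on `lookup_fixed` and fails on `lookup_prefix`, so it cannot have been written to pass vacuously.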

Measured performance (criterion; reproduce on your machine)

| Claim | Grade | Reproduce |
| --- | --- | --- |
| PSD FFT-planner cache 2.0–3.1×, DTW band 2.4–4.1× (ADR-154) | MEASURED | `cd v2 && cargo bench -p wifi-densepose-signal` |
| `fuse()` double-clone removed ~2.17× marshalling (ADR-156) | MEASURED | `cd v2 && cargo bench -p wifi-densepose-ruvector --bench fusion_bench` |
| zero-copy ORT input ~1.48× (ADR-155) | MEASURED | `cd v2 && cargo bench -p wifi-densepose-nn --features onnx --bench onnx_bench` |
| pointcloud splats 9→2 passes ~1.24× (ADR-160 research) | MEASURED | `cd v2 && cargo bench -p wifi-densepose-pointcloud --bench splats_bench` |
| native wlanapi multi-BSSID scan 9.74 Hz (vs netsh ~2 Hz) | MEASURED (Windows) | `cd v2 && cargo test -p wifi-densepose-wifiscan -- --ignored measure_native_scan_rate` |
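
For readers unfamiliar with the "DTW band" row, the sketch below shows a banded dynamic time warping distance, assuming the band is a Sakoe-Chiba window (the document does not say which variant ADR-154 uses). It illustrates why a band saves work (fewer cells of the cost matrix are filled); it does not reproduce the Rust implementation or its measured speedup.

```python
import math


def dtw(a, b, band=None):
    """DTW distance with |x - y| cost; `band` limits |i - j| (Sakoe-Chiba)."""
    n, m = len(a), len(b)
    if band is None:
        band = max(n, m)               # no constraint: fill the full matrix
    band = max(band, abs(n - m))       # the band must reach the final cell
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        # Only cells within the band around the diagonal are visited.
        for j in range(max(1, i - band), min(m, i + band) + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]
```

A narrower band can only raise the distance or leave it unchanged, because it removes candidate warping paths. Band width is therefore a trade between speed and alignment freedom.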

What we do NOT claim (the honest negatives — the strongest anti-slop signal)

| Capability | Status |
| --- | --- |
| Named person-identity from WiFi | NOT achieved, and measured why. The §3.6 matcher is real, but identity does not lock on WiFi-only channels (gap 0.0005). DATA-GATED on a real enrollment feeding the AETHER/body-resonance channel, which has never been done. No named-identity claim is made. |
| WiFlow-STD ~96% PCK@20 | CLAIMED-reproduced on our RTX 5080 (`benchmarks/wiflow-std/RESULTS.md`); HARDWARE-GATED for you (needs an NVIDIA GPU + the MM-Fi dataset). The upstream shipped checkpoint was REFUTED (0.08% PCK), and we publish that. |
| OccWorld trajectory accuracy | DATA-GATED on a trained checkpoint; `predict()` carries `weights_trained=false` until one is loaded, so the output is never silently faked. |
| Edge-skill detection accuracy (seizure, weapon, affect, …) | UNVALIDATED: every such module is now disclaimer-gated as experimental/research; the DSP is real, the accuracy is not claimed. |
| 802.11bf-2025 OTA conformance | No commodity silicon ships a conformant interface as of 2026; ours is a simulation-tested forward-compat protocol model, not a certified implementation. |

Provenance

Every claim above traces to a committed ADR (docs/adr/, ADR-154 to ADR-160), a test, a criterion bench, or benchmarks/wiflow-std/RESULTS.md. The history includes published retractions (the 92.9% PCK retraction; the WiFlow-STD shipped-checkpoint refutation; the NV-diamond BOM reality check). A faker hides failures; we commit them.