wifi-densepose

Commit Graph

Author	SHA1	Message	Date
rUv	9b07dff298	feat(beyond-sota): ADR-155 metric unification + ADR-156 RaBitQ Pass-2 (honest negative + latent topk bugfix) (#1053 ) * refactor(train): hoist canonical PCK/OKS to un-gated metrics_core; fold test_metrics onto production (ADR-155 M1 §8) ADR-155 §8 deferred item: test_metrics.rs reference kernels validated production against their OWN reimplementation — a test that cannot catch a canonical-impl bug (both could be wrong the same way). - Extract canonical_torso_size / pck_canonical / oks_canonical / sigmas / bounding_box_diagonal into a new NON-tch-gated `metrics_core` module, so the single metric definition is reachable under `cargo test --no-default-features` (the `metrics` module is tch-gated). `metrics` re-exports every item → still exactly ONE implementation. - Rewrite tests/test_metrics.rs to assert the PRODUCTION pck_canonical / oks_canonical equal hand-computed fixtures (not a reimplementation): canonical_pck_matches_hand_computed_fixture (corr=3/total=4/pck=0.75), hip↔hip normalizer pin, zero-visible⇒0.0, OKS perfect⇒1.0, fake-Gold pin. - Keep an INDEPENDENT raw-threshold reference kernel only as a differential cross-check: test_kernel_agrees_with_canonical asserts it AGREES with canonical where torso==1.0 (genuine cross-check, not duplication). Grade: MEASURED. test_metrics 10→12 tests, 0 failed. Co-Authored-By: claude-flow <ruv@ruv.net> * fix(sensing-server): relabel divergent live PCK/OKS so they're never conflated with canonical (ADR-155 M1 §2.1/§8 Goal C) Goal C named training_api.rs:804 (torso-HEIGHT PCK). Auditing it surfaced TWO findings the ADR-155 §1 table missed: 1. training_api.rs is an ORPHAN file — not declared `mod` in lib.rs OR main.rs, so it does NOT compile into the crate. It does not drive the live server. 2. The REAL live `best_pck`/`best_oks` (main.rs training path → RVF metadata JSON read by model_manager.rs) come from trainer.rs: - `pck_at_threshold` = RAW-threshold PCK, NO torso normalization (the most divergent kind), printed/serialized as bare "PCK@0.2". - `oks_map` calls `oks_single(area=1.0)` = the EXACT fake-Gold pattern ADR-155 §2.1 claimed closed elsewhere — still live here, inflating best_oks. Resolution = RELABEL (torso/raw math is load-bearing on different data; the pub fns can't be renamed without breaking API; sensing-server has no train/ ndarray dep). Honest unify is a tracked §8 backlog item. - training_api.rs: `compute_pck` → `compute_pck_torso_height` + divergence doc; val_pck/best_pck/val_oks struct fields documented as torso-HEIGHT proxies; logs say `pck_torso_h@0.2`. Test torso_pck_is_labelled_distinctly_from_canonical. - trainer.rs (LIVE): `pck_at_threshold` documented raw-unnormalized; `oks_map` area=1.0 flagged fake-Gold; test pck_at_threshold_is_raw_unnormalized_not_canonical. - main.rs: live print relabelled `pck_raw@0.2` / `oks_map(area=1.0 proxy)`. No wire-format field renames (back-compat); no pub-API rename (no silent break). Grade: MEASURED (relabel + divergence pinned). sensing-server 450→451 lib tests, 0 failed. Co-Authored-By: claude-flow <ruv@ruv.net> * docs(adr-155): mark §8 metric items RESOLVED + audit map + honest §1 under-count correction (M1b Goals A/D) - §8.1: full PCK/OKS audit map (every def: file:line, basis, canonical/ legacy/distinct), the two §8 items marked RESOLVED with resolution+why. - Honest finding: §1's "seven divergent metrics" was an UNDER-count — sensing-server's LIVE trainer.rs has a raw-unnormalized PCK and an area=1.0 fake-Gold OKS the table omitted, and the file §8 named (training_api.rs) is orphaned dead code. §9 honest-limits updated. - Goal D: metrics.rs _v2 variants confirmed caller-less + deprecated; noted for future cleanup, NOT deleted (public API, tch-gated). - CHANGELOG [Unreleased] Fixed entry. Co-Authored-By: claude-flow <ruv@ruv.net> feat(ruvector): RaBitQ Pass-2 randomized rotation + topk bugfix (ADR-156 §8) Implements the deferred "Multi-bit / Extended RaBitQ Pass 2" backlog item from ADR-156 §8: a deterministic randomized orthogonal rotation applied before sign-quantization, the published RaBitQ construction (Gao & Long, SIGMOD 2024). Rotation construction: Fast Hadamard Transform + seeded ±1 sign flips ("HD" / randomized Hadamard), O(d log d) time and O(d) memory — a dense d×d rotation is O(d²) and infeasible at the 65,535-d the wire format provisions for. Pads to the next power of two; SplitMix64 seeds the sign stream so index-time and query-time rotations are bit-identical. API is additive and backward-compatible: Pass 1 (`from_embedding`) is untouched; Pass 2 is opt-in via `Sketch::from_embedding_rotated` and `SketchBank::with_rotation` (+ `insert_embedding` / `topk_embedding` / `novelty_embedding` helpers that rotate consistently). Default behaviour is unchanged. While building the Pass-2 coverage harness, found and fixed a PRE-EXISTING correctness bug in `SketchBank::topk`: the n>k heap path used `BinaryHeap<Reverse<(d,id)>>` (a min-heap) but treated its peek as the max, so it returned the k FARTHEST sketches as "nearest". The shipped unit tests only exercised the n≤k fast path, so it went unnoticed. Fixed to a plain max-heap; pinned by `topk_heap_path_returns_nearest` and `tight_clusters_give_high_coverage_with_overfetch` (the latter measured 0.072 on the old code). New tests (+17, 100→117 in the crate): rotation determinism/norm-preservation (`rotation_is_deterministic_for_seed`, `rotation_preserves_norm`), Pass-2 shape-compatibility, `pass2_coverage_not_worse_than_pass1`, and a deterministic coverage report. MEASURED top-K coverage (anisotropic planted-cluster fixture, cosine ground truth; dim=128 N=2048 K=8 64 clusters noise=0.35 128 queries): candidate_k=K=8 : Pass1 36.13% -> Pass2 46.39% (both << 90% bar) candidate_k=24 : Pass1 83.89% -> Pass2 91.60% (Pass2 clears 90%) candidate_k=32 : Pass1/Pass2 100% Honest result: rotation consistently helps (+10pp at strict K), but neither pass clears the ADR-084 90% bar at candidate_k==K on this distribution. Pass 2 reaches 90% only with ~3x over-fetch (the ADR-084 "candidate set" deployment pattern). Multi-bit Pass 3 evaluated separately. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(ruvector): multi-bit Pass-3 experiment + ADR-156/084 measured results Adds the multi-bit half of the ADR-156 §8 "Multi-bit / Extended RaBitQ" item as a MEASURED experiment (coverage::measure_multibit): rotate, then b-bit uniform scalar-quantize each coord, rank by L1 over codes — the natural multi-bit generalization of hamming. Measures the bit/coverage tradeoff the backlog item asked for. MEASURED at the strict bar (candidate_k=K=8, anisotropic planted-cluster fixture, cosine ground truth): Pass1 (1-bit, no rot) 36.13% 16 B/vec Pass2 (1-bit, rot) 46.39% 16 B/vec Pass3 (rot, 2-bit) 54.39% 32 B/vec Pass3 (rot, 3-bit) 66.70% 48 B/vec Pass3 (rot, 4-bit) 74.22% 64 B/vec Honest: multi-bit monotonically helps but even 4-bit (4x memory) reaches only 74% at the strict bar — neither rotation nor <=4-bit multi-bit clears the strict-K 90% bar on this distribution. The bar is met via over-fetch (Pass2 @ candidate_k=24). Tests: multibit_tradeoff_report, multibit_1bit_matches_pass2_approx (+ sanity that 1-bit ~= Pass-2). Docs: - ADR-156 §8 item #2 marked RESOLVED-PARTIAL; §5 #2 grade CLAIMED -> MEASURED-on-our-hardware; new §10 with full measured tables, the topk bugfix disclosure, and graded deferred sub-items. - ADR-084: "Pass 2" section answering the rotation open-question with measured numbers + the topk bug note. - CHANGELOG [Unreleased]: Added (Pass-2 milestone) + Fixed (topk heap). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-13 16:02:18 -04:00
rUv	42dcf49f4d	fix(adr): resolve duplicate ADR numbers + close ADR-080 security + ADR-154 M1 signal backlog (#1051 ) * fix(signal): circular phase variance for ghost-tap guard (ADR-154 §7.4 #1) `phase_variance` computed a LINEAR sample variance over phase angles that wrap at ±π, so a tightly-clustered set straddling the branch cut reported spuriously HIGH dispersion — false-tripping the `> TAU` ghost-tap guard on real, tightly-clustered CIR taps. Replace with Mardia's circular variance V = 1 − R̄, bounded [0,1] and invariant to where the cluster sits on the circle. Re-derive the guard against the bounded metric via a named const `GHOST_TAP_CIRCULAR_VARIANCE_MAX` (the old TAU-scaled threshold is meaningless on [0,1]). Grade: metric fix MEASURED; threshold value DATA-GATED — a clean single-path ramp also sweeps the circle, so V alone cannot separate clean from unsanitized without labelled frames. Conservative default (0.99) errs toward never false-rejecting, strictly more permissive at the wrap boundary than the buggy linear guard. Fails-on-old test: `phase_variance_circular_not_fooled_by_branch_cut` — inlines the old linear variance to show it exceeds TAU on wrap-straddling phases while circular V≈0 and the guard no longer trips. Plus `phase_variance_circular_is_bounded_and_extremal` (V∈[0,1], V≈0 identical, V≈1 uniform). cargo test -p wifi-densepose-signal --no-default-features --features cir --lib → 432 passed, 0 failed. Co-Authored-By: claude-flow <ruv@ruv.net> * fix(signal): pin Welford n=0/n=1 finiteness guard (ADR-154 §7.4 #10) The shared `WelfordStats` (field_model.rs, used by longitudinal.rs and others) relies on `count < 2` guards in `variance`/`sample_variance`/`std_dev`/ `z_score` to stay finite at the boundaries. The guards existed but the n=0 boundary was UNTESTED — exactly the §4 divide-by-(n−1) family the ADR groups this with. Add `welford_finite_at_n0_and_n1` asserting every statistic is finite and returns the documented sentinel (0.0) at n=0 and n=1, plus load-bearing doc comments on the two guards. Fails-on-old proof: with the `sample_variance` guard removed, the test FAILS with "attempt to subtract with overflow" at the `(self.count - 1)` underflow (0usize − 1); `variance` would similarly yield 0.0/0.0 = NaN. The guard is restored; the test pins it so a future regression is caught. Grade: MEASURED (boundary finiteness is asserted; the guard is the §4-family fix made testable). cargo test -p wifi-densepose-signal --no-default-features --lib field_model → 22 passed, 0 failed. Co-Authored-By: claude-flow <ruv@ruv.net> * refactor(signal): de-magic adversarial thresholds + boundary tests (ADR-154 §7.4 #13) Lift the bare numeric literals buried in `check`/`check_consistency` into named, documented module consts (FIELD_MODEL_GINI_VIOLATION=0.8, ENERGY_RATIO_HIGH_VIOLATION=2.0, ENERGY_RATIO_LOW_VIOLATION=0.1, CONSISTENCY_ACTIVE_FRACTION_OF_MEAN=0.1, SCORE_W_* weights). VALUES UNCHANGED — each const equals the original literal; only names + pinning tests are new. Grade: DATA-GATED. The operating values stay empirical (defensible values need labelled spoofed/clean CSI — Wi-Spoof, §6.2/§7.3). The de-magicking + characterization tests are MEASURED: `tuning_consts_unchanged_from_literals`, `energy_ratio_high_boundary`, `energy_ratio_low_boundary`, `field_model_gini_boundary`, `consistency_active_fraction_boundary` pin the decision boundaries at/just-below/just-above each threshold, so a future data-driven retune is a visible, tested change. Fails-on-change proof: bumping ENERGY_RATIO_HIGH_VIOLATION 2.0→3.0 makes `energy_ratio_high_boundary` FAIL (restored). Operating values explicitly NOT changed. cargo test -p wifi-densepose-signal --no-default-features --lib ruvsense::adversarial → 20 passed, 0 failed. Co-Authored-By: claude-flow <ruv@ruv.net> * refactor(signal): de-magic coherence drift/gate thresholds (ADR-154 §7.4 #9) Lift the bare detection literals in `coherence.rs::classify_drift` (DRIFT_STABLE_SCORE=0.85, DRIFT_STEP_CHANGE_MAX_STALE=10) and the `coherence_gate.rs` Default impl (DEFAULT_ACCEPT_THRESHOLD=0.85, DEFAULT_REJECT_THRESHOLD=0.5, DEFAULT_MAX_STALE_FRAMES=200, DEFAULT_PREDICT_ONLY_NOISE=3.0) into named, documented consts. VALUES UNCHANGED. The gate already exposed these via GatePolicyConfig (config seam); this names + pins the defaults. Grade: DATA-GATED. Operating values stay empirical (defensible Z-score thresholds need labelled stable/drifting coherence traces). De-magicking + boundary tests are MEASURED: `classify_drift_stable_score_boundary`, `classify_drift_stale_count_boundary` pin the at/just-below/just-above decisions; `drift_consts_unchanged_from_literals` / `gate_default_consts_unchanged_from_literals` pin the values. Operating values explicitly NOT changed. cargo test -p wifi-densepose-signal --no-default-features --lib ruvsense::coherence → 40 passed, 0 failed. Co-Authored-By: claude-flow <ruv@ruv.net> * docs(adr-154): mark §7.4 P1 backlog cleared — Milestone-1 (#1,#10 RESOLVED; #9,#13 DATA-GATED) Update ADR-154 §7.4 backlog rows #1, #9, #10, #13 with commit refs + grades, the §7.4 intro count (four P1 items cleared, ~41 P2/P3 remain), the Horizon-ledger one-liner (Milestone-1 DONE), and the §8 honest-limits #1 line (metric now correct; threshold still DATA-GATED). Add CHANGELOG [Unreleased] entry. Grades: #1 RESOLVED (MEASURED metric / DATA-GATED threshold), #10 RESOLVED (MEASURED), #9 & #13 RESOLVED-PARTIAL (DATA-GATED — de-magicked + boundary tested, operating values unchanged). Validation: cargo test --workspace --no-default-features → 2057 passed, 0 failed; wifi-densepose-signal lib → 442 passed (no-default + --features cir); python archive/v1/data/proof/verify.py → VERDICT: PASS, hash f8e76f21…46f7a UNCHANGED (CIR ghost-tap guard is not on the deterministic proof path). Co-Authored-By: claude-flow <ruv@ruv.net> * fix(sensing-server): stop leaking internal errors in HTTP responses (ADR-080 #2) Six handlers in `main.rs` serialized the internal error `Display` straight into the JSON response body, leaking server internals to any client (ADR-080 finding #2, CWE-209; reframed onto the Rust boundary by ADR-164 G11): - edge_registry_endpoint: a panicked spawn_blocking `JoinError` ("task … panicked") in a 500, and the raw upstream error in a 503 - delete_model / delete_recording / start_recording: std::io::Error strings carrying OS detail / filesystem paths - calibration_start / calibration_stop: the FieldModel error chain New `error_response` module: `internal_error` / `internal_error_json` / `upstream_unavailable` log the full detail server-side only (tagged with a correlation id) and return a generic body (`{"error":"internal_error","correlation_id":…}`) — no `panicked`, no file paths, no Debug chain. The correlation id lets an operator join a client report to the exact server log line without ever shipping the detail. Pinned by 5 error_response tests, incl. a leak-substring guard (internal_error_body_does_not_leak_detail) verified to FAIL on the reverted old body (returns the panic message / path / "os error"). The HOMECORE sweep (ADR-161) covered homecore-server, not this crate. Co-Authored-By: claude-flow <ruv@ruv.net> * test(sensing-server): pin XFF-immunity + no-query-token (ADR-080 #1, #3) Findings #1 (XFF-spoofing bypass) and #3 (JWT-in-URL, CWE-598) were logged against the Python v1 API but are VERIFIED ABSENT on the current Rust sensing-server, so they get regression tests rather than redundant fixes: - #1 XFF: there is no IP-based rate-limiter or IP-allowlist to bypass, and neither security middleware reads a forwarded header. Added bearer_auth::xff_header_never_affects_auth_decision (spoofed X-Forwarded-For never flips a 401<->200 decision) and host_validation::forwarded_headers_never_bypass_host_allowlist (spoofed X-Forwarded-Host: localhost never lets Host: evil.com past the allowlist). - #3 JWT-in-URL: require_bearer reads the token only from the Authorization header; WS handlers take no query token; the sole Query extractor (EdgeRegistryParams) is a non-secret refresh flag. Added bearer_auth::query_string_token_is_never_accepted — ?token= / ?access_token= in the URL never authenticates (stays 401) while the header path still 200s. Verified to FAIL when a query-token path is injected into require_bearer. Co-Authored-By: claude-flow <ruv@ruv.net> * docs(adr-080): mark P0 security findings #1-#3 RESOLVED; close ADR-164 G11 - ADR-080: Status note + per-finding closure (#1 XFF and #3 JWT-in-URL verified absent + regression-pinned; #2 leaked errors fixed via the error_response module). Records the v1-vs-Rust boundary distinction explicitly: v1 paths remain archived; this closure governs the shipped Rust sensing-server. - ADR-164: Gap Register G11 and the Open/Gated Backlog entry marked RESOLVED with the fix + branch reference. - CHANGELOG: [Unreleased] -> ### Security entry covering all three findings. Co-Authored-By: claude-flow <ruv@ruv.net> * docs(adr): renumber 6 displaced ADRs to resolve duplicate-number collisions (ADR-164 G1) Resolves the 5 duplicate ADR numbers (6 displaced files) flagged by ADR-164 Gap Register item G1. Canonical keeper per number = first file committed at that number (date tie-broken by inbound cross-reference count / parent-appendix relationship). Displaced files renumbered to the next free numbers (166-171): 050 keeps provisioning-tool-enhancements (5 refs vs 1) -> ADR-166-quality-engineering-security-hardening 052 keeps tauri-desktop-frontend (parent ADR) -> ADR-167-ddd-bounded-contexts (its appendix) 147 keeps nvidia-cosmos/OccWorld (the actual ADR, has Status header) -> ADR-168-benchmark-proof (proof companion, no Status) -> ADR-169-adam-mode-light-theme (was untracked) 148 keeps drone-swarm-control-system (committed #862) -> ADR-170-yoga-mode-pose-system (was untracked) 149 keeps public-community-leaderboard-huggingface (committed 16:47 vs 17:38) -> ADR-171-swarm-benchmarking-evaluation-methodology Updates in-file `# ADR-NNN` headers and intra-file self-references (yoga-modes * docs(adr): repoint inbound cross-references to renumbered ADRs (166-171) Follow-up to the ADR renumbering (ADR-164 G1). Updates every inbound reference that pointed at a displaced ADR, disambiguating shared numbers by title/slug so only references to the DISPLACED topic move and keeper references stay put. ADR-168 (was 147 benchmark-proof): README, CHANGELOG, user-guide, proof-of-capabilities, research docs 00/03 — all path/label refs updated. ADR-169 (was 147 adam-mode) / ADR-170 (was 148 yoga-mode): docs/adr/README index. ADR-171 (was 149 swarm-benchmarking): all ruview-swarm eval code+docs (Cargo.toml, evals/, eval_swarm.rs, metrics/mod/report/runner.rs), research doc 03 (every §-ref matched ADR-171 sections, not AetherArena), 00-system-review, series README, CHANGELOG, and ADR-148's forward/"open issues" pointers. ADR-166 (was 050 quality-engineering / security-hardening): disambiguated from the ADR-050 provisioning KEEPER by topic. The HMAC/secure_tdm, directory-traversal, bind-address, and OTA-PSK-auth references in code comments (wifi-densepose-hardware Cargo.toml + secure_tdm.rs, sensing-server main.rs) and in ADR-052-tauri / ADR-167 all describe the security-hardening ADR -> ADR-166. ADR-167 (was 052 ddd-appendix): inbound appendix references. Index/registry updates: docs/adr/README.md, gap-analysis/census.md (rows + header count), gap-analysis/lens-findings.md (collision table marked RESOLVED), and ADR-164 Gap Register G1 marked RESOLVED with the full renumber map. Keeper references deliberately untouched: all ADR-147 OccWorld code, all ADR-148 drone-swarm code/docs, all ADR-149 AetherArena refs (incl. ADR-150's SSL/resampling refs, which ADR-150 explicitly binds to the AetherArena benchmark), ADR-050 provisioning refs, ADR-052 tauri refs. The frozen GitHub blob URLs in docs/adr/.issue-177-body.md (pinned to an old branch) are left as historical. Comment-only code edits; no behavior change. wifi-densepose-hardware compiles clean; the sensing-server build's sole blocker is the pre-existing upstream midstreamer-temporal-compare@0.2.1 registry crate, unrelated to these edits. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-13 14:31:38 -04:00
ruv	107232c0be	fix(sensing-server): load published HuggingFace model via RVF auto-detect+convert (#894 ) ProgressiveLoader rejected the published ruvnet/wifi-densepose-pretrained model with the opaque "invalid magic at offset 0: expected 0x52564653 (RVFS), got 0x77455735", then silently fell back to signal heuristics (the "10 persons for 1" garbage reporters saw). The HF repo ships model.safetensors, model-q{2,4,8}.bin (magic 0x77455735 = "5WEw"), and model.rvf.jsonl -- none carry the binary-RVF magic the loader wants. - New model_format module: auto-detects RVFS / safetensors / HF-quant-bin / JSONL by magic+name; returns a typed actionable ModelLoadError (lists accepted formats + the one-command convert path, never the opaque magic); converts safetensors / model.rvf.jsonl -> RVF in-memory so the published full-precision model loads via --model. - load_or_convert_model: native RVF first, else auto-detect+convert+load, else typed error. The silent heuristics fallback is now a loud, actionable message. - --convert-model <in> --convert-out <out> CLI subcommand: one-command offline conversion, verifies the output loads before writing. - #1031 env seam: WDP_TDM_SLOTS + WDP_TDM_SLOT_US derive the multistatic guard from a deployment TDM schedule (default 60 ms / 20 ms otherwise). Honest scope: the converter wires the format/load path (safetensors F32 tensors -> RVF weight segment, manifest written, Layer A/B/C succeed, weights round-trip). It does NOT claim end-to-end pose accuracy -- the HF pose-decoder architecture differs from this crate inference head (data-gated in #894). Quantized .bin blobs are rejected with a typed error pointing at safetensors. Tests (fail on the old opaque-magic path): - model_format::safetensors_converts_and_loads - model_format::hf_quant_classifies_to_actionable_error - model_format::{jsonl_converts_and_loads, convert_to_rvf_dispatches_and_rejects_quant, ...} Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-13 12:05:05 -04:00
ruv	a27ee6f6cd	fix(csi-ingest): real HE20 CSI no longer dropped or replaced with simulated data (#1009 , #1004 ) Two ingest bugs caused real ESP32-C6 HE20 CSI to be silently discarded or never received — the "real data silently lost" failure class. Each fix is pinned by a test that fails on the old code. #1009 §1b — HE20 baseline recorder trimmed 256->242 bins by sequential index. ESP-IDF v5.5.2 delivers all 256 FFT bins for an HE20 frame, but CalibrationConfig::he20() carried num_active: 242, so the recorder (no HE20 tone map — extract_first_stream takes the first num_active columns sequentially) kept bins 0..242 = the lower guard band + DC, NOT the 242 active tones, silently corrupting the empty-room baseline. Now num_active: 256 records every delivered bin, aligned 1:1 with the live deviation() path. The exact-242 tone map stays only in cir.rs (HE20_ACTIVE), where the Phi sensing matrix needs it. HE20 synthetic/bench fixtures updated to feed 256-bin frames. #1009 §1a/§1c — u8->u16 n_subcarriers truncation, regression-pinned. The ADR-018 wire format carries n_subcarriers as u16 LE at bytes 6-7; a 256-bin HE20 frame (byte6=0x00) read as one byte decodes to 0 subcarriers -> every frame skipped. The CLI parser and the sensing-server parse_esp32_frame were already corrected to u16 under #1005/ADR-110; added regression tests that fail on the old single-byte read so the truncation cannot silently return. #1004 — --source auto latched on simulate forever, never binding UDP :5005. A one-shot boot probe resolved the source once; with no CSI flowing at boot (the normal firmware/server startup race) it served simulated poses for the whole process and ignored real CSI arriving seconds later (the prior #937 fix hard-exited instead — equally wrong). New plan_source() state machine: in auto mode ALWAYS bind the UDP receiver and serve simulated only until the first real frame, then udp_receiver_task promotes source -> esp32 (mirroring the existing esp32 -> esp32:offline reversion). simulated_data_task self-suspends once promoted. Explicit --source simulated stays a hard, UDP-free offline override. Validation: 3-crate tests 1118 passed / 0 failed; workspace 3166 passed / 0 failed; Python proof VERDICT: PASS (bit-exact, unaffected). cir.rs untouched. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-12 16:37:55 -04:00
ruv	d4170ad159	fix: revert config-dependent cargo-fix changes (kept only always-safe edits) cargo fix ran under --no-default-features and removed an import/mut that are 'unused' ONLY in the minimal build but genuinely USED in CI's full build (error[E0596]: cannot borrow result as mutable in desktop discovery.rs). Those are false-positive warnings in the minimal config. Reverted bridge.rs/ commissioning.rs/discovery.rs to origin/main; kept the always-safe edits (dead-code #[allow] notes + ClockGateDecision doc fields + camera macOS-only allow). Full-features build of all four crates: Finished, 0 errors. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-12 08:56:26 -04:00
ruv	0d6c20c278	chore(v2): zero-warnings hygiene — clear 13 build warnings across 4 crates Removed unused Matter imports (sensing-server bridge/commissioning), dropped needless mut (bridge, desktop discovery), documented ClockGateDecision variant fields (ruvector coherence), and marked deferred-P2/platform-only helpers #[allow(dead_code)] with honest notes (entity_on_matter/next_endpoint = Matter-publisher API deferred per ADR-159 §A5; decode_jpeg_to_rgb = macOS-only). Behavior-neutral; touched-crate tests green. Remaining 1 warning is a benign Windows .pdb filename collision inherent to the Tauri lib+bin desktop crate (renaming the bin would break Tauri bundling — won't-fix for a cosmetic warning). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-12 08:44:42 -04:00
ruv	d120cc2278	test(sensing-server): unique per-process temp dirs (deterministic under concurrent runs) checkpoint_round_trip / rvf_test / rvf_pipeline_test shared fixed temp_dir paths and remove_dir at teardown, so two concurrent/repeated test runs raced (one's teardown wiped the other's file -> NotFound). Make each dir process-unique. Test-only; no public API change. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-12 00:11:24 -04:00
ruv	a0e72eef50	feat(wifiscan,sensing): native wlanapi.dll FFI + real Matter manual code wifiscan (Tier 2 wlanapi adapter ONLY): - Real native wlanapi.dll BSS-list FFI (new adapter/wlanapi_native.rs): WlanOpenHandle -> WlanEnumInterfaces -> WlanGetNetworkBssList -> WlanFreeMemory/WlanCloseHandle via windows-sys 0.59 (already in lock tree). Per-BSSID RSSI(dBm)/channel/band/radio-type/SSID + CSI-capable filter. #[cfg(windows)] real path; #[cfg(not(windows))] returns typed WifiScanError::Unsupported (honest, never fabricated). - wlanapi_scanner now native-first with documented netsh fallback, native_scans metric, scan_native()/scan_native_csi_capable(), and a benchmark() that MEASURES real Hz (no hardcoded "10x" claim). - MEASURED 9.74 Hz native on ruvzen (30 iters, Native backend) vs netsh ~2 Hz baseline. Live measurement kept as an #[ignore] test. - Cargo.toml: unsafe_code forbid->deny so only the audited wlan_ffi module opts into unsafe; all unsafe confined + null-checked + freed. sensing-server (Matter commissioning): - Replaced the lossy modulo placeholder in matter/commissioning.rs with the real Matter Core Spec 1.3 §5.1.4.1.1 field-packing. Canonical vector (20202021, 3840) now encodes to the published 34970112332. - Added ManualPairingCode::decode + DecodedManualCode proving the code is real/lossless (passcode round-trips bit-for-bit; short discriminator = top 4 bits) with Verhoeff integrity, incl. proptest. Tests: wifi-densepose-wifiscan 145 passed (real FFI exercised on Windows); wifi-densepose-sensing-server 614 passed. 0 failed. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-11 21:39:42 -04:00
ruv	50b657459f	fix(train): unify 7 divergent PCK/OKS into one canonical metric (ADR-155 §Tier-1.1) Collapse the four PCK and three OKS implementations into a single source of truth — pck_canonical (torso hip↔hip, COCO/ADR-152 convention validated at ~96% PCK@20 in benchmarks/wiflow-std) and oks_canonical (scale from GT pose extent). MetricsAccumulator, compute_pck/_per_joint/_oks, aggregate_metrics and the deprecated *_v2 path all route through them, so Trainer::evaluate() and the bench definition agree. Fixes two claim-inflating bugs, each pinned by a regression test: - zero-visible-joint PCK was 1.0 (false-perfect) -> now 0.0 - OKS s=1.0 on normalized coords made OKS~=1.0 for any pose ("fake Gold tier") -> scale now derived from the pose; a 3x-torso-wrong pose yields OKS<0.2 Divergent local kernels (training_bench raw-threshold, sensing-server torso-height) annotated "DO NOT USE for reported metrics". Legitimately changed test expectations (all-coincident "perfect" fixtures are correctly unscoreable; all-invisible -> 0.0) updated with comments citing the finding. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-11 19:56:44 -04:00
rUv	29de574e63	Beyond-SOTA engine/signal/train improvements: mesh partition guard, FFT CIR solver, canonical frame decoder, falsifiable occupancy benchmark, governed streaming, adapter provenance (#1018 ) * docs(research): add RuView beyond-SOTA system review (00) First document of the beyond-SOTA research series: capability audit of the current RuView engine with role-to-crate maturity matrix, ruvsense module inventory, gap analysis, and risk register. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * docs(research): add beyond-SOTA architecture design (02, in progress) https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * docs(research): finalize beyond-SOTA architecture (02) https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * docs(research): add benchmark/validation methodology snapshot (03) https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * docs(research): add beyond-SOTA series index with validation results; changelog README index ties the 5 research docs together with the session's measured validation evidence: 2,797 workspace tests / 0 failed, Python proof PASS (bit-exact), and paired pre/post criterion CIR benchmarks. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * perf(signal): precompute CIR warm-start system; hoist tomography solver allocs Exact, determinism-safe optimizations (bit-identical float results): - cir.rs: diag(PhiH Phi)+lambdaI and its CSR matrix depend only on Phi and lambda (fixed at CirEstimator::new) but were rebuilt every frame (O(KG) pass + CSR allocation). Now built once in new() via build_warm_start_system; summation order unchanged. - tomography.rs: ISTA gradient buffer hoisted out of the 100-iteration loop (fill(0.0) reset) and the Frobenius Lipschitz bound moved from per-reconstruct to construction. Verified: signal 456 tests green; engine 11/11 green including cycle_is_deterministic and witness-stability tests. Criterion paired pre/post: cir_estimate/he40 -3.9% (p<0.01), multiband -1.2/-1.4%. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * fix(worldgraph): bound SemanticState growth with deterministic retention StreamingEngine::process_cycle appended one SemanticState belief per cycle with no eviction — ~1.7M nodes/day at 20 Hz (beyond-SOTA roadmap finding #6). Add WorldGraph::prune_semantic_states(max): deterministic eviction of the oldest beliefs by (valid_from_unix_ms, id); structural nodes (rooms, zones, sensors, anchors, tracks, events) are never eligible. Wire it into the engine after each belief append (DEFAULT_SEMANTIC_RETENTION = 7,200, ~6 min at 20 Hz; set_semantic_retention to tune). The WorldGraph holds current beliefs; durable history is the recorder's job, so no audit data is lost. 3 new tests: end-to-end bounded growth, oldest-only eviction, deterministic equal-timestamp tie-break. Workspace gate: 2,865 passed, 0 failed. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * feat(sensing-server): route live frames through the governed StreamingEngine Closes the live-trust-path gap (ADR-136 section 8, beyond-SOTA system review): the running server fused live CSI with the bare MultistaticFuser, while the privacy/provenance/witness control plane (ADR-135..146) only ever ran on synthetic in-test frames. The privacy control plane was therefore bypassable on the real path. New engine_bridge module drives StreamingEngine::process_cycle from the server's live NodeState map, reusing the existing NodeState -> MultiBandCsiFrame conversion. It lazily wires each contributing node as a WorldGraph sensor (idempotent), bounds belief growth via the retention cap, and forwards explicit timestamps/calibration ids so the path stays deterministic and replayable. Wired additively into both live ESP32/WiFi fusion sites in main.rs via a split-borrow off the write guard, so person-count behavior is unchanged; the latest BLAKE3 witness is stored on AppState. Every published belief now carries evidence + model + calibration + privacy decision and a deterministic witness. Adds wifi-densepose-engine/-worldgraph/-bfld/-geo deps. 6 new bridge tests (witnessed belief with full provenance, cross-run determinism, idempotent node registration, retention bound, privacy-mode propagation). sensing-server suite 430+128 green; workspace gate 2,904 passed / 0 failed. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * feat(train): falsifiable occupancy benchmark with anti-overfitting gate Makes the presence/person-count "beyond SOTA" claim falsifiable in code instead of aspirational (the unfalsifiability gap from the beyond-SOTA system review). occupancy_bench grades predictions vs ground truth and gates a SOTA claim behind one claim_allowed invariant requiring ALL of: - DataProvenance::Measured — synthetic/mock data is scorable for regression but never claimable (anti-mock-contamination; the CLAUDE.md Kconfig-bug lesson made structural). - A leak-free EvalSplit — validate() refuses any split where a subject OR environment id appears in both train and test (subject leakage / per-environment overfitting). - n_test >= min_test_samples (small-N guard). - Presence F1 whose bootstrap-CI lower bound (deterministic seeded splitmix64) clears the threshold — not the point estimate. - Count MAE within threshold. The claim string is unreadable except through the gate (NO_CLAIM otherwise), same discipline as the ruview-gamma acceptance gate. What remains is data, not method: a frozen, SHA-pinned, subject/environment-disjoint measured replay set turns the claim into a passing/failing test. Lives in wifi-densepose-train (the eval bounded context, alongside ablation/ eval/metrics). 10 tests cover each refusal path; warning-clean under the crate's missing_docs lint. Workspace gate 2,914 passed / 0 failed. Doc 03 updated. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * feat(engine): per-room adapter provenance + drift-to-recalibration advisor Closes the trust-chain gap where an ~11 KB per-room LoRA adapter (ADR-150 section 3.4) could silently change inference without the witness noticing: provenance carried only "rfenc-v<N>" with no notion of adapter identity. - StreamingEngine::set_room_adapter(AdapterInfo): pins the adapter's content-derived id into provenance model_version ("rfenc-v1+adapter:<id>") — and therefore into the BLAKE3 witness — so swapping or clearing adapter weights always shifts the witness. Engine test proves base -> adapter -> other-adapter -> cleared all witness differently and cleared == base. - RecalibrationAdvisor: recommends re-running the ADR-135 empty-room baseline / refitting the room adapter on sustained low fusion coherence (streak threshold, default 60 cycles ~ 3 s at 20 Hz) or an ADR-142 change-point. Surfaced as TrustedOutput::recalibration_recommended, stored on the sensing-server AppState alongside the witness at both live fusion sites. - Bridge plumbing: EngineBridge::{set_room_adapter, clear_room_adapter} + live-path test that the adapter id flows into the live witness. Scope note (honest): this is the deployable provenance/trigger half of the "retrained model" roadmap item. Fitting the adapter itself runs in the existing external calibration service (aether-arena/calibration/); a trained RF-encoder checkpoint still does not exist in-tree. Engine 15 tests, bridge 7 tests. Workspace gate: 2,918 passed / 0 failed. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * fix(mat): gate api module behind its feature — standalone no-default-features builds pub mod api was unconditional while its only dependency, serde, is optional behind the 'api' feature, so any build without default features failed with 101 unresolved-serde errors (masked in --workspace runs by feature unification). The api module and its create_router/AppState re-export are now cfg(feature = "api")-gated with docsrs annotations. All combos compile: bare --no-default-features (was 101 errors, now 0), --no-default-features --features api, and full default (177 tests pass). Workspace gate: 2,918 passed / 0 failed. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * perf(signal): opt-in FFT operator for the CIR ISTA solver (8-14x measured) Phi is a sub-DFT, so each ISTA mat-vec can run as one length-G FFT (O(G log G)) instead of a dense O(KG) product — the dominant-latency-hazard finding from the beyond-SOTA optimization roadmap. New CirConfig::fft_operator, default FALSE: the dense path stays the bit-exact witness default. The FFT evaluates the same sums in a different order, so enabling it shifts float results in the last bits and requires regenerating any pinned witness — strictly opt-in per deployment. FftOperator (rustfft, planned once at CirEstimator::new, scratch buffers reused across the ISTA loop) dispatches inside ista_solve: Phi x = scale forward-FFT(x) sampled at bins (k_idx mod G) Phi^H v = scale * unnormalised inverse-FFT of v scattered into those bins Warm-start and Lipschitz estimation stay dense at construction. Measured (criterion, same run, same machine): ht20: 2.22 ms -> 265 us (8.4x) ht40: 10.26 ms -> 717 us (14.3x) The real HE40 grid (K=484, G=1452) scales further per the O(KG)/O(G log G) ratio. 3 new tests: FFT<->dense matvec equivalence to float tolerance on ht20 and he40 grids; end-to-end dominant-tap agreement on a single-path frame; all default configs keep FFT off. New cir_estimate_fft bench group. Workspace gate: 2,921 passed / 0 failed (default path bit-exact, witnesses unchanged). https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH feat(core): canonical frame decoder — capture-to-claim replay (ADR-136) The encode half of the ADR-136 frame contract existed (ComplexSample, to_canonical_bytes, witness_hash) but there was no decoder: a captured canonical frame could be witnessed but never reconstructed, blocking replay-from-capture. CsiFrame::from_canonical_bytes is the exact inverse: same id, metadata, complex payload, and witness hash (tested as the round-trip law AC7 — the replayed frame re-encodes byte-identically). Amplitude/phase are recomputed from the payload (projections, not independent state). Every malformed-input class fails closed (AC8): header truncation -> Truncated, payload truncation -> PayloadMismatch, unknown discriminants, non-UTF-8 device id, trailing bytes. Nil calibration uuid decodes as None per the documented encoding. Core: 36 tests pass. Workspace gate: 2,937 passed / 0 failed. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * feat(engine): dynamic min-cut mesh partition guard (ruvector-mincut) Maintains an exact min-cut over the live mesh coupling graph — nodes are sensing nodes, coupling is the product of fusion attention weights — and surfaces per cycle, as TrustedOutput::mesh: - cut value: the global "how close is the array to partitioning" number, a structural measure per-node heuristics miss; - weak side: which specific nodes would split off (failure/jamming triage, feeds ADR-032 posture); - at-risk flag: counts as a structural event for the drift->recalibration advisor (alongside ADR-142 change-points). Degenerate cases fail toward risk: a node with zero coupling is reported as already partitioned (cut 0, that node as the weak side). Measured cost policy (criterion, 12-node mesh — the honest part): - weights quantized (1/64) + change-gated: steady-state cycles do ZERO graph work and reuse the cached cut (~7.3 us, ~23x cheaper than building); - on any real change a full exact rebuild (~171 us) is used, because ONE DynamicMinCut delete+insert measured ~240 us — the subpolynomial machinery amortizes on much larger graphs, so rebuild-on-change is the measured optimum at mesh scale (one-edge case -28% after switching policy); - full process_cycle with the guard: ~33 us for 4 nodes vs the 50 ms budget. 9 mesh_guard tests (weak-node detection, steady-state zero updates, sub-quantum gating, join/drop rebuild, determinism, disconnection) + an engine-level wiring test (down-weighted node -> weak side -> recalibration). Engine 24 tests; workspace gate 2,946 passed / 0 failed. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * feat(engine): mesh partition risk demotes privacy + enters the witness (ADR-032) Completes the mesh-guard integration: its at_risk signal was advisory-only (fed the recalibration advisor). It now also contributes to the ADR-141 privacy demotion alongside fusion- and array-level contradictions — a mesh close to partitioning makes the fused belief less trustworthy, so the cycle emits at a more restricted class (monotonic; information only removed). Because effective_class feeds the BLAKE3 witness, a fragmenting array now shifts the witness: partition risk is auditable, not just logged. The mesh computation moved ahead of the demotion step in process_cycle; mesh_guard_mut exposes risk-threshold tuning. Test: a forced-risk 3-node cycle demotes PrivateHome Anonymous->Restricted and shifts the witness vs a clean baseline. Engine 25 tests; workspace gate 2,947 passed / 0 failed. https://claude.ai/code/session_01MjBucx95K4BuUxZi8NWwRH * fix: public-PR review findings — privacy-path honesty, gate holes, mesh-guard cliff - sensing-server: engine errors logged+counted (no silent swallow), trust state exposed via status surface, privacy-demotion claims aligned with the actual parallel-audit-path behavior - occupancy_bench: vacuous-F1 hole closed (degenerate test sets fail with their own criterion); CI-lower-bound test made probative - mesh_guard: quantization scaled to observed coupling range — >=65-node balanced meshes no longer permanently at_risk (regression test) - engine: both wiring tests made probative (same-topology witness compare, deterministic risk-crossing fixture) - mat: axum/tokio optional behind api; real serde feature (api enables it) - core: canonical decoder strict (non-zero reserved bytes and nil UUID rejected — injective on accepted domain, forged-bytes tests) - CHANGELOG: un-spliced the FFT/adapter bullet mangle Co-Authored-By: claude-flow <ruv@ruv.net> * chore: strip private-track references for public PR Reword the occupancy-benchmark changelog bullet to drop a cross-reference to the private research track, and restore the WorldGraph retention bullet header that was glued onto the preceding MAT bullet. Co-Authored-By: claude-flow <ruv@ruv.net> * chore: lockfile refresh for cherry-picked feature set Co-Authored-By: claude-flow <ruv@ruv.net> --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-06-11 16:08:54 -04:00
rUv	d0e27e652e	fix(firmware): C6 IDF v5.5 guard + HE-LTF host ingest + WITNESS-LOG-110 B1 resolution (#1005 ) (#1011 ) * fix(firmware): c6_sync_espnow IDF v5.5 send-callback guard + B1 HE-LTF resolution (#1005) Espressif backported the esp_now_send_cb_t signature change to v5.5 (esp_now_send_info_t = wifi_tx_info_t there), so the #944 guard must be ESP_IDF_VERSION >= VAL(5,5,0), not MAJOR >= 6. Validated on this repo's hardware toolchain: - WITHOUT fix, IDF v5.5.2 esp32c6 build fails with the reporter's exact incompatible-pointer error at c6_sync_espnow.c:199 (reproduced) - WITH fix, clean build on IDF v5.5.2 (esp32c6) AND IDF v5.4 (regression) Docs: WITNESS-LOG-110 §B1 marked RESOLVED WITH MEASUREMENT (external, @stuinfla, issue #1005): IDF v5.4 driver downconverts HE->HT; v5.5.2 delivers true HE-LTF (532B / 256 bins / 242 tones, PPDU 0x01 HE-SU). ADR-110 capability table updated accordingly. Co-Authored-By: claude-flow <ruv@ruv.net> * docs: WITNESS-LOG-110 §B1 — in-house HE-LTF replication on the original COM12 C6 84% of 1,525 frames at 532B/PPDU 0x01 (HE-SU) with IDF v5.5.2 + the #1005 guard fix, AP ruv.net 11ax 2.4GHz. Two independent rigs now confirm: v5.4 downconverts, v5.5.2 delivers 242-tone HE20. Co-Authored-By: claude-flow <ruv@ruv.net> * fix(host): 256-bin HE-LTF ingest end-to-end + latent offset bugs (#1005) Audit of every ADR-018 consumer against live C6 HE20 frames (532B/256-bin): - sensing-server + CLI calibrate parsers read n_subcarriers from one byte (256 decoded as 0) with stale seq/rssi offsets (rssi always 0 — latent, pre-existing, confirmed vs firmware csi_collector.c). Fixed to the real ADR-018 layout; n_subcarriers u8->u16; byte 18 surfaced as typed PpduType. - sensing-server probe buffer 256B -> 2048B (532B datagram errored on Windows) - per-node grid gate: lock densest (n_subcarriers, ppdu_type) grid, re-warm on upgrade, skip sparser minority frames — HT-64 never mixes into an HE-256 baseline window - hardware parser: HE-aware bandwidth classification (256-FFT HE20 = 20MHz, was Bw160); PpduType/Adr018Flags re-exported - verbatim live frames (532B HE-SU, 148B HT) embedded as regression fixtures - archive python parser: bandwidth heuristic mirror fix Live-validated: calibrate --tier he20 consumed 600x 256-bin frames into an ADR-135 He20 baseline (242 tones) skipping 94 HT frames; sensing-server shows node 12 active with real RSSI (-40dBm). 765 tests green across the three crates; workspace check clean; Python proof PASS. Co-Authored-By: claude-flow <ruv@ruv.net> * test(fuzz): esp_netif/ping_sock/ip_addr stubs — un-break ADR-061 fuzz build after #954 csi_collector.c gained esp_netif.h / ping/ping_sock.h / lwip/ip_addr.h includes for the #954 gateway self-ping; the host-fuzz stub env lacked them, breaking the fuzz build on main since `5789351b7`. Stubs return no-gateway so the self-ping path early-outs (compiles + links, never exercised — matches the fuzz threat model which targets frame serialization, not the network stack). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-11 11:00:37 -04:00
rUv	b6420ac9ba	fix(server): make synthetic CSI opt-in only (sibling fix to #937 ) (#979 ) Background Issue #937 in the cognitum-v0 appliance repo flagged that the `cognitum-csi-capture` systemd unit shipped `--simulate` by default, silently serving synthetic CSI tagged as production telemetry on `/api/v1/sensor/stream`. That's a textbook trust-eroding pattern — the single most-cited "where's the real data?" evidence external reviewers (#943, #934) point at when they call the project AI-slop. A grep across THIS tree surfaced the exact same anti-pattern in three places: docker/docker-compose.yml:27 # auto (default) — probe ESP32, fall back to simulation docker/docker-entrypoint.sh:14 # CSI_SOURCE — data source: auto (default), ... main.rs:6435 info!("No hardware detected, using simulation"); "simulate" The sensing-server's `auto` source resolver at main.rs:6425-6440 silently fell back to synthetic with only an `info!` log line as the signal. Downstream consumers calling `/api/v1/sensing/latest` or `/ws/sensing` had no in-band way to know they were being served fake data. Fix `auto` now refuses to fall back. When neither ESP32 UDP nor host WiFi is detected, the server logs a clear `error!` explaining the situation and exits 78 (EX_CONFIG). The error message names the two ways to proceed: provision real hardware, or set `--source simulated` / `CSI_SOURCE=simulated` explicitly. Existing operators who already use `--source simulated` (or its legacy `simulate` alias) are unaffected — the alias is preserved for back-compat. Docker entrypoint comment, docker-compose comment, and the Tauri desktop app's source-default path also updated to reflect the new posture. The desktop app keeps its `simulated` default because it's an explicit demo product — the value passed downstream is the explicit `simulated`, not `auto`, so the server tags it correctly and never lies about its data source. Validation cargo build -p wifi-densepose-sensing-server --no-default-features cargo test -p wifi-densepose-sensing-server --no-default-features → 122 / 122 pass, build clean (existing pre-fix warnings unchanged). Deployment ⚠ Breaking change for unattended deployments that relied on the `auto → simulated` silent fallback. That is exactly the failure mode this PR fixes: pretending to serve real sensing data when the source is fake. Operators who genuinely want demo mode set `CSI_SOURCE=simulated` explicitly; the error message and the docker-compose comment both point them there.	2026-06-08 18:07:39 +02:00
rUv	2c136aca74	fix(protocol): resolve 0xC511_0004 magic collision (closes #928 ) (#931 ) * fix(ci): SAST actually scans the code + drop deprecated flaky semgrep action Two real problems in the Static Application Security Testing job: 1. It scanned a path that no longer exists. `bandit -r src/` and `semgrep … src/` pointed at the repo-root `src/`, but the Python code moved to `archive/v1/src/` (64 .py files) when the runtime was rewritten in Rust. So the SAST scan matched nothing — a silent no-op (this is also why `bandit-results.sarif` was "Path does not exist" on recent runs). Fixed both to `archive/v1/src/`. 2. Deprecated + redundant + flaky semgrep step. The `returntocorp/semgrep-action@v1` step pulled `returntocorp/semgrep-agent:v1` from Docker Hub every run (intermittently timing out → red check, e.g. on #929) and is EOL. It was redundant: the pip `semgrep --sarif` step is what feeds GitHub Security; the action only pushed to the Semgrep cloud app via SEMGREP_APP_TOKEN. Removed it and folded its `p/docker` + `p/kubernetes` rulesets into the pip semgrep command, so coverage is preserved with no Docker pull. The job stays `continue-on-error: true` (non-gating). YAML validated. Co-Authored-By: claude-flow <ruv@ruv.net> * fix(protocol): resolve 0xC511_0004 magic collision (closes #928) Background `0xC511_0004` was assigned to two different packet formats in firmware — `EDGE_FUSED_MAGIC` (ADR-063, 48-byte `edge_fused_vitals_pkt_t`) and `WASM_OUTPUT_MAGIC` (ADR-040, variable-length `wasm_output_pkt_t`). Both were transmitted. The sensing-server only had a WASM parser for that magic and no fused-vitals parser, so on the ESP32-C6 + MR60BHA2 mmWave configuration the fused-vitals packet was silently misparsed as a malformed WASM output — `breathing_rate` was read as `event_count`, mmWave-fused vitals were lost, and spurious WASM events were emitted to subscribers. Fix 1. Reassign `WASM_OUTPUT_MAGIC` to `0xC511_0007` (next free slot per the registry in `rv_feature_state.h`). Smaller blast radius than moving fused-vitals — the registry already treats `0xC511_0004` as fused-vitals canonical and several years of deployed feature tracking depends on that assignment. 2. Add `parse_edge_fused_vitals` + `EdgeFusedVitalsPacket` in `wifi-densepose-sensing-server::main`. Byte layout taken directly from `edge_processing.h:129`, mirroring the firmware's `_Static_assert(sizeof(edge_fused_vitals_pkt_t) == 48)` so future firmware changes that grow the packet will break this parser loudly instead of silently. 3. Add a dispatch arm in the UDP receive loop. Fused-vitals is tried BEFORE WASM so a stale firmware (still emitting 0xC511_0004 with the WASM payload) fails to parse as fused-vitals (size mismatch), then fails to parse as WASM (magic mismatch on the new 0x...0007), and gets dropped — a deliberate "fail loud" outcome rather than the pre-fix silent garbage. 4. Update the registry comment in `rv_feature_state.h` to add the new 0x...0007 row. 5. Add five tests in a new `issue_928_magic_collision_tests` mod: - `parse_edge_fused_vitals_extracts_fields_correctly` - `parse_edge_fused_vitals_rejects_short_buffer` - `parse_edge_fused_vitals_rejects_wrong_magic` - `parse_wasm_output_rejects_legacy_0004_magic` - `parse_wasm_output_accepts_new_0007_magic` WebSocket payload Fused-vitals now broadcasts as `{"type": "edge_fused_vitals", ...}` with the mmWave-specific block nested under `mmwave`. Schema is additive — existing subscribers that only inspect `type` are unaffected; subscribers that switch on `type` gain a new branch. Deployment note This is a wire-protocol change. Firmware older than this commit that emits WASM output on 0xC511_0004 will lose its WASM event stream against an updated host (host expects 0xC511_0007). Per the issue discussion, "fail loud" is preferred to silent misparsing. Operators running C6+mmWave should reflash firmware concurrent with the host upgrade. Test results cargo test -p wifi-densepose-sensing-server --no-default-features --bin sensing-server → 122 passed / 0 failed (5 new + 117 existing, unchanged) Co-Authored-By: claude-flow <ruv@ruv.net>	2026-06-03 11:56:35 +02:00
rUv	be48143f77	fix(auth): match the Bearer scheme case-insensitively (RFC 6750) (#929 ) `require_bearer` parsed the Authorization header with `strip_prefix("Bearer ")`, which is case-sensitive. Per RFC 6750 §2.1 / RFC 7235 §2.1 the auth-scheme is case-insensitive, so a correct token sent as `Authorization: bearer <token>` (or `BEARER`, or with extra whitespace) was rejected with a confusing "invalid bearer token" 401 — needless friction when setting up `RUVIEW_API_TOKEN` (the active #864/#924 theme). Now the scheme is matched with `eq_ignore_ascii_case` and leading token whitespace trimmed. The token comparison itself is unchanged — still exact and constant-time (`ct_eq`) — so this does not weaken auth: a wrong token or a non-Bearer scheme (`Basic …`) still returns 401. New test `accepts_case_insensitive_bearer_scheme` covers `bearer`/`BEARER`/ extra-space (accept) and wrong-token/`Basic` (still reject). bearer_auth suite: 9 passed.	2026-06-03 11:07:34 +02:00
rUv	0cfd255730	fix: --export-rvf no longer silently produces a placeholder model (#920 ) The --export-rvf handler ran before the --train/--pretrain handlers and unconditionally wrote placeholder sine-wave weights, then returned. So the documented `--train --dataset … --export-rvf <path>` workflow (user-guide.md) short-circuited to a PLACEHOLDER model and never trained — printing "exported successfully" for a non-functional model. Given the project's anti-"is it fake" stance, silently emitting a fake model is the wrong default. Fix: - Only emit the placeholder container-format demo when --export-rvf is used standalone (new `export_emits_placeholder_demo` guard). With --train/--pretrain, fall through so the real training pipeline runs and exports calibrated weights. - The standalone path now prints a clear WARNING that it writes a container-format demo with placeholder weights — not a trained model — pointing to --train / a pretrained encoder (#894). - Docs: flag --export-rvf as a placeholder demo in the flag table, and fix the Docker training example to use --save-rvf (consistent with the from-source example) instead of the placeholder --export-rvf. 3 unit tests for the guard. Full crate unit suite: 429 + 117 passed, 0 failed.	2026-06-03 08:55:36 +02:00
rUv	f5d0e1e69e	fix(#894 ): actionable diagnostic when --model gets a non-RVF file (#919 ) Users who downloaded ruvnet/wifi-densepose-pretrained and passed model.safetensors / model-q4.bin / model.rvf.jsonl to --model hit a bare "Progressive loader init failed: invalid magic at offset 0: expected 0x52564653, got 0x77455735" and were stuck — the server then silently fell back to signal heuristics (which over-count, feeding "is it fake" reports). The HF files are a different format and encoder architecture than the RVF binary container the progressive loader expects, so they can't load directly. Now the load-failure path detects the common cases (safetensors header, JSONL manifest, quantized .bin blob) and emits a plain explanation naming the format, what --model actually expects (RVF `RVFS` container from wifi-densepose-train), and that it's continuing with heuristics — with a pointer to #894. Pure, testable `diagnose_model_load_error()` + 4 unit tests (run under the default `--no-default-features` CI). Full crate unit suite: 429 + 114 passed, 0 failed.	2026-06-02 20:05:30 +02:00
rUv	b12662a54d	fix(mqtt): per-node HA devices use each node's own presence/motion (#872 ) (#918 ) The MQTT bridge fanned out one Home-Assistant device per node (#898) but applied the room-level aggregate classification to every node — so in a multi-node setup a node in an empty corner inherited another node's "present", and `motion_level: "absent"` was mis-mapped to full motion (the aggregate match fell through `Some(_) => 1.0`). Each node in the sensing broadcast's `nodes` array already carries its own `classification` (`motion_level`/`presence`/`confidence`, see PerNodeFeatureInfo) and RSSI. Now each per-node snapshot reads that node's own classification, deferring to the room aggregate only for fields a node omits. Vitals (breathing/heart rate) and person count stay room-level. Extracted the JSON→VitalsSnapshot mapping into a pure, testable function (`vitals_snapshots_from_sensing_json`) and added 4 unit tests covering per-node divergence, partial-field fallback, the no-nodes aggregate path, and the absent→zero-motion fix. Supersedes #899, which targeted the right bug but read non-existent fields (`node["motion_level"]` / `node["status"]` instead of the nested `node["classification"]` + `stale`). Verified: builds with `--features mqtt`; new tests pass; full crate unit suite 432 + 114 passed, 0 failed.	2026-06-02 19:26:01 +02:00
ruv	4c87f04919	Merge remote-tracking branch 'origin/main' into fix/894-occupancy-cap # Conflicts: # CHANGELOG.md	2026-06-02 10:52:53 +02:00
ruv	f34b94aa46	fix(occupancy): bound eigenvalue person-count to single-link max — #894 field_bridge::occupancy_or_fallback returned FieldModel::estimate_occupancy unbounded (internal ceiling 10), while the perturbation fallback below it and score_to_person_count both cap at 3 ("1-3 for single ESP32"). On noisy or under-calibrated CSI the eigenvalue count inflated → "10 persons when 1 present" (#894, seen when --model fails to load → heuristic mode). Bound the eigenvalue path to a shared MAX_SINGLE_LINK_OCCUPANCY const (3) so every single-link estimator agrees. Genuine higher counts come from the multistatic fusion path. Build clean, field_bridge tests pass.	2026-06-02 10:40:24 +02:00
ruv	9ddcf0c9fc	fix(mqtt): one HA device per node — closes #898 After the #872 MQTT wiring, the JSON->VitalsSnapshot bridge hard-coded a single node_id (the MQTT client id) and the publisher used one OwnedDiscoveryBuilder, so every physical node collapsed into a single Home-Assistant device (identifiers:["wifi_densepose_wifi-densepose-1"]), contradicting the one-device-per-node docs. - Bridge (main.rs): emit one VitalsSnapshot per node in the sensing update's nodes[] (each carries its own node_id + RSSI; shared aggregate presence/vitals), falling back to a single aggregate snapshot when there is no per-node data (wifi/simulate sources). - Publisher (publisher.rs): add OwnedDiscoveryBuilder::for_node(), and publish discovery + availability lazily on first sight of each node_id, routing state to per-node topics. Heartbeat/refresh/offline-LWT iterate all known nodes. Result: N distinct HA devices, one per node. 3 new unit tests (distinct nodes -> distinct wifi_densepose_<node> identifiers); full MQTT suite 71 passed, example builds.	2026-06-02 09:43:28 +02:00
ruv	4007db5d13	fix(sensing-server): fix CSI per-node count clamp — #803 (part 2) The pure-CSI per-node path clamped its own occupancy estimate before the aggregator could read it. estimate_persons_from_correlation (DynamicMinCut) returns 0-3, but it was mapped to a score via `corr_persons / 3.0`, putting 2 people at 0.667 — just under the 0.70 up-threshold of score_to_person_count — so the per-node count never climbed past 1, leaving node_max stuck at 1 for CSI-only nodes even when the min-cut cleanly separated two people. Replace the lossy /3.0 mapping with a threshold-aligned corr_persons_to_score (1->0.40, 2->0.74, 3->0.96) whose steady state round-trips back to the same count through the EMA + hysteresis bands, while still gating transient noise. A convergence test replays the exact CSI-loop EMA and asserts min-cut=2 now reports 2 / 3 reports 3 / 1 reports 1, plus a regression test documenting that the old /3.0 mapping pinned two people to 1. Full suite: 586 passed, 0 failed. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-31 10:09:58 -04:00
ruv	a933fc7732	fix(sensing-server): surface count-aware per-node estimates — #803 Person count was pinned to 1 because the aggregate was derived from `smoothed_person_score`, an EMA-smoothed activity score (amplitude variance / motion / spectral energy) that saturates near a single occupant and cannot discriminate count. The count-aware per-node estimates the ESP32 paths already compute (firmware n_persons, mincut corr_persons) were stored in NodeState::prev_person_count then discarded by the aggregator — the same dead-wiring class as #872. Add `aggregate_person_count(activity_count, node_states)` = max(activity, node_max) and use it at both ESP32 aggregation sites (edge-vitals + CSI loop, Some + fallback arms). It can only raise the count when a node positively reports more occupants, so the lone-occupant case is provably never inflated (regression-guarded). 5 new unit tests + full suite: 582 passed, 0 failed. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-31 10:00:56 -04:00
ruv	a3f80b0cda	fix(sensing-server): wire MQTT publisher into the binary — closes #872 #872 reported '--mqtt: unexpected argument' on the Docker image; prior attempts chased a Docker rebuild, but the real cause was disconnected code: the --mqtt* flags lived only in cli::Args (dead code — referenced nowhere), while the binary parses a separate main::Args with no mqtt fields, and main.rs never declared/started the mqtt:: publisher. So MQTT was fully unwired: flags didn't parse, and the publisher never ran. Fix: - Extract the mqtt + privacy flags into a shared (#[derive(clap::Args)]); retarget mqtt::config::{from_args,build_tls} to it. - #[command(flatten)] MqttArgs into the binary's main::Args (using the lib crate's type so it matches from_args), so --mqtt* now parse. - Spawn the publisher on --mqtt: build MqttConfig, validate, and bridge the existing JSON sensing broadcast into the typed VitalsSnapshot stream the publisher consumes (defensive serde_json::Value mapping — absent fields default, never wrong values). #[cfg(feature=mqtt)]-gated; without the feature --mqtt WARNs and no-ops (documented contract). Fix the mqtt_publisher example for the new signature. Verified end-to-end against local mosquitto: publisher connects and emits 20 HA auto-discovery entities + live state (presence ON, person_count, …). Tests: 577 pass default / 580 pass --features mqtt / 0 fail; both configs build. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-31 09:39:21 -04:00
ruv	020aa08049	test(sensing-server): ADR-140 live acceptance — snapshot to expired-rejection Drives a real SemanticBus: raw snapshot (fall_detected, past warmup) -> FallRisk primitive -> SemanticStateRecord (provenance) -> single-signal rule fires / multi-signal agreement rule does NOT (no false escalation) -> expired record rejected. Proves the ADR-140 credibility path end to end. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-29 08:37:28 -04:00
ruv	169a355bde	feat(sensing-server): ADR-140 semantic state record + Ruflo agent bridge (#844 ) - semantic/record.rs: SemanticStateRecord (kind/room/node/timestamp/expiry/ confidence/model_version/calibration_version/privacy_action/evidence_refs) — the auditable wire form of an ADR-139 SemanticState node, enriched from the existing SemanticEvent via RecordContext - PrivacyAction enum (Allow/AnonymizeByRoom/StripBiometrics); StripBiometrics removes HR/BR evidence tags at the record boundary - Ruflo agent bridge: MultiSignalRule.evaluate() fires AgentRoute only on multi-signal agreement (fall_risk + elderly_anomaly → caregiver_escalation); route_all() sorts by severity + dedups - 4 tests; workspace 0 errors Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-28 23:17:53 -04:00
rUv	249d6c327f	ADR-115: Home Assistant + Matter integration (#778 ) Closes ADR-115's MQTT track (HA-DISCO + HA-MIND + HA-FABRIC scaffolding). Headline: - 21 entity kinds per node (11 raw + 10 semantic primitives) - MQTT auto-discovery with HA conventions - Matter Bridge scaffolding (SDK wiring deferred to v0.7.1 per ADR §9.10) - Privacy mode strips biometrics at the wire, semantic primitives keep working - 420+ lib tests, mosquitto-backed integration tests, property-based fuzzing - 8 starter HA Blueprints + 3 Lovelace dashboards shipped Tracking issue: #776	2026-05-23 16:13:28 -04:00
rUv	00a234eda8	ADR-110: ESP32-C6 firmware extension (#764 ) Closes the firmware-side ADR-110 design at v0.7.0-esp32 after a 38-iter /loop SOTA sprint. Headline (bench, COM9+COM12 ESP32-C6): - 99.56% cross-board RX, 104.1 µs smoothed offset stdev (≤100 µs §2.4 target met) - 3.95× EMA suppression, 1.4 ppm crystal skew preserved 4 firmware releases: v0.6.7 / v0.6.8 / v0.6.9 / v0.7.0-esp32. 42 ADR-110 unit tests, 1761 v2 workspace tests, full Firmware CI + QEMU green.	2026-05-23 15:34:48 -04:00
rUv	004a63e82d	fix(security): audit — fix RUSTSEC vulns, clippy warnings, dead code (#769 ) - Upgrade openssl to 0.10.78 (CVE-2026-41676), jsonwebtoken to 9.4 - Suppress unmaintained-only/no-CVE advisories in .cargo/audit.toml with per-entry rationale - Fix all `cargo clippy --all-targets -- -D warnings` errors across 35 crates: derivable_impls, needless_range_loop, map_or→is_some_and/ is_none_or, await_holding_lock (drop MutexGuard before .await), ptr_arg (&mut Vec→&mut [T]), useless_conversion, approximate_constant (2.718→E, 3.14→PI), field_reassign_with_default, manual_inspect, useless_vec, lines_filter_map_ok, print_literal, dead_code - Apply `cargo fmt --all` - Pre-existing test failure in wifi-densepose-signal (test_estimate_occupancy_noise_only) is not introduced by this PR	2026-05-23 05:36:13 -04:00
rUv	67fec45e61	feat(edge-registry): ADR-102 — surface Cognitum cog catalog via /api/v1/edge/registry (#648 ) * feat(edge-registry): ADR-102 — surface Cognitum cog catalog via /api/v1/edge/registry Adds a new sensing-server endpoint that fetches and caches the canonical Cognitum app registry at https://storage.googleapis.com/cognitum-apps/app-registry.json (105 cogs across 11 categories as of v2.1.0). RuView previously had no live awareness of the catalog — the README's capability table was hand- curated and went stale as Cognitum shipped new cogs (the registry was last updated 6 days ago). ADR: * docs/adr/ADR-102-edge-module-registry.md — full design, response shape, configuration flags, failure modes, and a 12-row security review covering SSRF, response inflation, ?refresh abuse, stale-serve semantics, TLS, cache poisoning, JSON-panic resistance, etc. Code: * v2/.../edge_registry.rs — EdgeRegistry struct + UreqFetcher + MockFetcher trait + 7 unit tests. RwLock<Option<CachedEntry>> with stale-on-error fallback. MAX_PAYLOAD_BYTES=8 MiB, 10s wire timeout. * v2/.../main.rs — constructs Option<Arc<EdgeRegistry>> at startup, registers GET /api/v1/edge/registry handler, wires Extension layer. Handler runs the blocking ureq fetch via tokio::task::spawn_blocking so the async runtime stays free. * v2/.../cli.rs / main.rs Args — three new flags (per user request to "allow the registry to be disabled or changed"): --edge-registry-url <URL> (env RUVIEW_EDGE_REGISTRY_URL) --edge-registry-ttl-secs <N> (env RUVIEW_EDGE_REGISTRY_TTL_SECS) --no-edge-registry (env RUVIEW_NO_EDGE_REGISTRY) When --no-edge-registry is set or the URL is empty, the endpoint returns 404. Cargo.toml: adds ureq (rustls), sha2, thiserror as direct deps. README: * New collapsed "🧩 Edge Module Catalog" section with the full 105-cog table generated from the registry, grouped by category with practical one-line descriptions (e.g. "Spots irregular heartbeats and abnormal heart rhythms", "Detects walking problems and scores fall risk"). Links to https://seed.cognitum.one/store and the local appliance /cogs page. Sits between the HF model section and How It Works. Tests (7/7 pass): first_call_hits_upstream_and_caches ttl_expiry_triggers_refetch force_refresh_bypasses_fresh_cache stale_serve_on_upstream_failure_after_cached_success no_cache_no_upstream_returns_error upstream_invalid_json_is_treated_as_error upstream_sha256_is_deterministic Security highlights (full review in ADR-102 §"Security review"): - The registry is metadata-only; per-cog binary signatures (ADR-100) remain the trust root for installs. A compromised registry can mislead a human reader but cannot ship malicious binaries. - 8 MiB cap + 10s timeout + Option<Arc<...>> via Extension layer means the endpoint can't be used to exhaust memory or pin tokio threads. - Stale-on-error responses carry an explicit `stale: true` field so upstream outages are visible to consumers rather than silently masked. - Endpoint sits behind the existing RUVIEW_API_TOKEN bearer gate when set, otherwise unauthenticated (registry contents are public anyway). * chore: refresh Cargo.lock for ureq/sha2/thiserror deps added by ADR-102	2026-05-19 18:08:43 -04:00
Rahul	c00f45e296	fix(sensing): finish #611 NaN-panic audit — 7 more sites missed by #613 (#624 ) #613 fixed adaptive_classifier.rs:94 (the IQR sort) and called the audit done, but the grep used `partial_cmp(b).unwrap()` as a literal and missed seven additional production sites that use comparator variants: adaptive_classifier.rs:205 AdaptiveModel::classify() argmax over softmax probs — same per-frame hot path as #611. NaN flows through normalise → logits → softmax and still reaches this site even after the IQR fix. adaptive_classifier.rs:480 train() argmax (training accuracy loop) adaptive_classifier.rs:500 train() per-class argmax main.rs:2446, 2449 count_persons_mincut variance source/sink select csi.rs:602, 605 count_persons_mincut variance source/sink select (duplicate of main.rs logic in csi.rs) For the variance-select sites, note that the outer `unwrap_or((0, &0))` only catches an empty iterator — it cannot rescue a panic raised inside the comparator. A single NaN in `variances[]` still aborts the process. Same fix as #613: swap `.unwrap()` for `.unwrap_or(std::cmp::Ordering::Equal)` inside the comparator closure. Pure behavioural change, no API surface. Re-audit of the remaining `partial_cmp(...).unwrap()` matches in v2/: they are all inside `#[cfg(test)]` / `#[test]` blocks (spectrogram.rs:269, depth.rs:234, connectivity.rs:477, vital_signs.rs:737) where inputs are controlled and panic-on-NaN is acceptable.	2026-05-19 10:02:08 -04:00
ruv	79cc2d7b22	Merge #491 : feat(sensing-server): adaptive person count — RollingP95 + dedup_factor runtime API Integrating @schwarztim's PR #491 into main on their behalf — their fork has fallen too far behind for a clean rebase (the PR's commit graph dropped silently during `git rebase origin/main`), so applying as a merge from the fork head to preserve the diff cleanly. What this lands: - `RollingP95` adaptive normaliser for the person-count feature scaling. Streaming P95 over a 600-sample / ~30 s sliding window. Cold-start (<60 samples) falls back to the legacy denominators (variance/300, motion_band_power/250, spectral_power/500) so day-0 behaviour is preserved on every deployment. - `RuntimeConfig` struct + `load_runtime_config` / `save_runtime_config` persisted to `data/config.json`. Exposes `dedup_factor` via REST so multi-node deployments can tune cluster-deduplication without a rebuild, including an auto-tune endpoint that derives optimal dedup from a known person count (calibration mode). - `compute_person_score()` now takes &AppStateInner alongside &FeatureInfo so the adaptive denominators are reachable. All 3 call sites updated. - New `AppStateInner` fields: `p95_variance`, `p95_motion_band_power`, `p95_spectral_power`, `dedup_factor`, `data_dir`. Closes #491. Directly addresses: - #499 (double skeletons, multi-node) — the slot-clustering problem this PR's adaptive normaliser was designed to fix - #519 Bug 1 (ghost person detection on edge-tier 1 & 2 multi-node) - #496 (person count over-reporting on single-room single-person) Verified locally: - cargo check -p wifi-densepose-sensing-server --no-default-features: 1.0s - cargo test -p wifi-densepose-sensing-server --no-default-features --lib: 233/233 passed in 25.0s Co-authored-by: @schwarztim Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-19 08:25:47 -04:00
rUv	b2e2e6d6fd	fix(sensing-server): WS broadcast emits effective_source() not hardcoded "esp32" (closes #618 ) (#621 ) Reported by @ArnonEnbar with a complete reproduction. broadcast_tick_task() re-emits the cached `latest_update` every tick so pose WS clients keep getting data even when ESP32 pauses between frames. The `source` field of that cached update was set to "esp32" at the moment a fresh ESP32 frame was last decoded (main.rs:3885, :4136). After the ESP32 loses power or network, no fresh frame is decoded — the cached `latest_update` is still re-broadcast every tick with the stale source: "esp32" baked in. UI's "Sensing" tab keeps showing "LIVE — ESP32 HARDWARE Connected" with frozen vitals/features/ classification re-broadcast indefinitely. REST `/health` correctly reports source: "esp32:offline" (via effective_source(), which checks last_esp32_frame elapsed time against ESP32_OFFLINE_TIMEOUT=5s) — but the WS broadcast path was the one consumer that didn't call it. Fix: clone the cached update per tick, overwrite source with s.effective_source(), then serialize and broadcast. UI now switches to "esp32:offline" on the same 5s budget as the REST surface. cargo build -p wifi-densepose-sensing-server --no-default-features: 17s, no errors (1 pre-existing unused-import warning unchanged).	2026-05-18 08:18:18 -04:00
rUv	72bbd256e7	fix(security): path-traversal guard on 5 sensing-server endpoints (closes #615 ) (#616 ) Reported by @bannned-bit. Five endpoints in v2/crates/wifi-densepose-sensing-server embedded user-controlled identifiers in format!() paths with no sanitization: recording.rs POST /api/v1/recording/start (session_name) recording.rs GET /api/v1/recording/download/:id (id) recording.rs DELETE /api/v1/recording/delete/:id (id) model_manager.rs POST /api/v1/models/load (model_id) training_api.rs load_recording_frames (dataset_ids[]) Each unauthenticated caller could: - READ arbitrary files via ../../etc/passwd, ../../.env, etc. - WRITE attacker-controlled JSONL via recording/start - LOAD attacker-controlled .rvf model files - DELETE arbitrary files the server process can touch New `path_safety` module exports `safe_id(&str) -> Result<&str, PathSafetyError>` that enforces the rejection envelope BEFORE any user input reaches a format!() that builds a path: - Allowed character set: [A-Za-z0-9._-] - Reject leading '.' (rules out '.', '..', '.env', hidden files) - Reject empty strings - Reject anything > 64 bytes - Reject all whitespace, path separators, null bytes, non-ASCII Applied at all 5 sites. Errors return 400 Bad Request (download) / status:"error" JSON (others) — not panics. 9 unit tests in path_safety::tests cover: - accepts simple alphanumeric / hyphen / underscore / dot - rejects empty, leading dot, path separators ('/', '\'), null byte, whitespace, shell specials, non-ASCII (including fullwidth slash U+FF0F), too-long, boundary at MAX_ID_LEN test result: ok. 9 passed; 0 failed cargo build -p wifi-densepose-sensing-server --no-default-features: 33s Fix-marker RuView#615 in scripts/fix-markers.json prevents removing the guard at any of the 5 call sites. CHANGELOG entry under [Unreleased] / Security documents the patched endpoints and the rejection envelope. Severity: critical per reporter — five remotely-reachable paths to read, write, or delete arbitrary files. Hot per-request paths, not edge cases.	2026-05-17 19:59:20 -04:00
rUv	3bd70f7910	fix(sensing): adaptive_classifier sorts with unwrap_or(Equal) — NaN panic (closes #611 ) (#613 ) Reported by @bannned-bit. v2/crates/wifi-densepose-sensing-server/src/ adaptive_classifier.rs:94 did: sorted.sort_by(\|a, b\| a.partial_cmp(b).unwrap()); f64::partial_cmp returns None on NaN, so `.unwrap()` panics. CSI data from real ESP32 hardware can produce NaN (silent DSP div-by-zero, empty buffer, etc.), and this code path runs on every frame in the classify() hot path — a single NaN frame kills the entire sensing server process. Fix swaps for unwrap_or(Ordering::Equal), matching the pattern the same file already uses at lines 149-150 and 155 (those sites were already NaN-safe; this site was an oversight). Scoped audit: greped the v2/ tree for `partial_cmp(b).unwrap()`. The other 3 hits are in #[cfg(test)] blocks (spectrogram.rs:269, depth.rs:234, connectivity.rs:477) where panic-on-NaN is acceptable because test inputs are controlled. Only adaptive_classifier.rs:94 was a production-path crash. Severity: critical per reporter — runtime panic on real-world data. Patch: 1-line behavioural change + comment.	2026-05-17 19:29:07 -04:00
Chaitanya Tata	f853c74563	v2: pin Rust 1.89 and fix sensing-server UI path when run from v2 (#523 ) * v2: pin Rust 1.89 for sensing-server dependency chain ruvector-core 2.0.5, hnsw_rs 0.3.4, and mmap-rs 0.7 require newer Cargo/rustc than 1.82 (edition2024 manifest, is_multiple_of, stable avx512f target_feature on x86_64). Add v2/rust-toolchain.toml so cargo build -p wifi-densepose-sensing-server picks a compatible toolchain. Signed-off-by: Chaitanya Tata <chaitanya@dotstarconsulting.com> Co-authored-by: Cursor <cursoragent@cursor.com> * sensing-server: default UI path for cwd v2/ and coalesce fallbacks The previous default ../../ui resolves to a non-existent directory when the binary is run from v2/ (common), so /ui/* returned 404 and the dashboard appeared broken. Default to ../ui and try ../ui, ./ui, ../../ui when the configured path is missing. Signed-off-by: Chaitanya Tata <chaitanya@dotstarconsulting.com> Co-authored-by: Cursor <cursoragent@cursor.com> --------- Signed-off-by: Chaitanya Tata <chaitanya@dotstarconsulting.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-17 18:00:36 -04:00
Timothy Schwarz	8b297dd706	fix(sensing-server): handle WebSocket Lagged + add ping keepalive (#484 ) Root cause: broadcast channel Lagged error caused instant disconnect when clients fell behind 256 frames (10Hz * 50-200KB = easy to lag). Client reconnects, immediately lags again, rapid cycling ensues. Sensing handler: Lagged error now continues (skips missed frames) instead of breaking. Added 30s ping interval for proxy keepalive. Pose handler: same Lagged handling + Pong match arm. CHANGELOG updated under Unreleased/Fixed. Co-authored-by: Deploy Bot <deploy@example.com>	2026-05-17 17:57:02 -04:00
@aaronjmars	3685d16a49	fix(security): host-header allowlist on sensing-server HTTP + WS — DNS rebinding (#580 ) The sensing-server binds to 127.0.0.1 by default with no `Host` header validation on either router. A foreign page can lower its DNS TTL, re-resolve to 127.0.0.1 after the browser has accepted the origin, and then read live pose + vital signs from /api/v1/* + /ws/sensing as same-origin against the attacker's hostname. When `RUVIEW_API_TOKEN` is unset (the documented LAN-mode default from #443/#547) the attacker can also drive state-mutating POSTs (recording/start, models/load, adaptive/train, calibration/start, sona/activate). Defense: a small `host_validation` axum middleware that pins the `Host` header to a configurable allowlist. The loopback names (`localhost`, `127.0.0.1`, `[::1]`, each with or without a port) are always in the set, so default 127.0.0.1 deployments keep working from the local browser without any configuration change. Operators who bind to a routable address extend the set with one or more `--allowed-host` flags or a comma-separated `SENSING_ALLOWED_HOSTS` env var. Reverse-proxy deployments that already canonicalise `Host` opt out with `--disable-host-validation`. The layer is wired into both the dedicated WebSocket router on `--ws-port` (8765) and the main HTTP router on `--http-port` (8080), so /ws/sensing on either listener is covered. Rejection responses are `421 Misdirected Request` (the correct status for a request that arrived at a server that does not consider the supplied `Host` authoritative); missing `Host` is `400 Bad Request`. CWE-346 (Origin Validation Error), CWE-350 (Reliance on Reverse DNS). Severity: high. Tests: 13 new unit tests on the middleware (loopback defaults, case-insensitivity, IPv6 bracketing, port stripping, env-var/CLI merge, foreign-host rejection on /health + /ws/*, disabled-allowlist escape hatch). Full suite: 220/220 pass under `cargo test -p wifi-densepose-sensing-server --no-default-features`. Co-authored-by: Aeon <aeon@aaronjmars.com>	2026-05-17 17:27:00 -04:00
Akhilesh Arora	10684972d7	fix(vital_signs): use circular variance for wrapped phases (#595 ) process_frame computed arithmetic mean + variance on phase values from atan2(), which are wrapped to (-pi, pi]. Phases close across the +/-pi discontinuity produced ~pi^2 variance instead of ~1e-6, feeding wrap noise into the heart-rate FFT buffer. Replace inline math with a standard circular variance helper (1 - mean resultant length). Add 4 unit tests, one through the production path of process_frame. Closes #593	2026-05-17 17:02:53 -04:00
ruv	ca97527646	feat(introspection): I6 — regime-changed signal + per-frame analyze + honest ADR-099 D8 amendment Three threads in this commit: 1) Per-frame attractor analysis (default analyze_every_n: 8 → 1). The I5 benchmark put per-frame update at 0.012 ms p99 — 83× under D4's 1 ms budget. The cost case for the every-8th-frame default doesn't hold; per-frame analysis is what makes regime_changed a viable early-detection trigger. 2) New `regime_changed: bool` field in IntrospectionSnapshot — flips on any frame whose attractor regime classification differs from the previous frame's. Pairs with top_k_similarity (full-shape match) to give downstream consumers two latencies with different robustness profiles. 3) Honest amendment of ADR-099 D8 to reflect empirical reality: - L1 stand-in achieves 3.20× ratio (5-frame shape match vs 16-frame event-path floor); the 10× aspirational bar is architecturally unreachable at 1-D scalar feature resolution. - regime_changed didn't fire in the 10-frame motion window — the 200-frame noise trajectory dominates the Lyapunov classification, and short perturbations don't shift the regime fast enough on a scalar feature. - Path to 10×: ADR-208 Phase 2 (Hailo NPU vec128 embeddings) — multi-dim partial matches discriminate from noise in 1-2 frames, not 5. - Side finding: midstream temporal-compare::DTW uses discrete equality cost (designed for LLM tokens), not numeric distance — swapping it in for f64 amplitude scoring would be strictly worse than the L1 stand-in. A numeric DTW is a separate concern (hand-roll or new crate). - Revised D8: ship behind --introspection (off by default) until multi- dim features land. Per-frame update budget IS met (0.041 ms p99 in this bench, ~24× under the 1 ms bar) — the feature is cheap enough to carry dark today. cargo test -p wifi-densepose-sensing-server --no-default-features: introspection (lib): 8 passed, 0 failed introspection_latency (test): 5 passed, 0 failed (incl. new regime_change_path_latency) clippy: clean on the introspection surface (pre-existing approx_constant lints in pose.rs / main.rs unchanged). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-13 23:29:37 -04:00
ruv	4a1f3a1e10	feat(sensing-server): wire ADR-099 introspection tap + /ws/introspection + /api/v1/introspection/snapshot I3 (per ADR-099). Three changes in main.rs: 1) AppStateInner: + intro: IntrospectionState + intro_tx: broadcast::Sender<String> (256-slot ring, same shape as the existing tx). 2) ESP32 frame path: after the global frame_history push, before the per-node mutable borrow of s.node_states, compute the per-frame derived feature (mean amplitude across subcarriers), call s.intro.update(ts_ns, feature), and broadcast the snapshot JSON to s.intro_tx. Placement is deliberate — between the global state's mutable touch and the per-node &mut so borrow-checking stays linear; ns is borrowed after the tap completes its s.intro / s.intro_tx access. 3) Routes: ws_introspection_handler → /ws/introspection api_introspection_snapshot → /api/v1/introspection/snapshot Same Axum + tokio::sync::broadcast pattern as ws_sensing_handler, subscribed against s.intro_tx. Wrapped by the bearer-auth middleware already on /api/v1/* — orchestrator probes and unauthenticated /ws/sensing reachers continue to land on the existing topic. Verified: cargo build -p wifi-densepose-sensing-server --no-default-features ✓ cargo test -p wifi-densepose-sensing-server --no-default-features lib: 207 passed, 0 failed (199 pre-tap + 8 introspection) integration suites: 70, 8, 16, 18 passed, 0 failed cargo clippy: clean on the introspection surface (pre-existing warnings on -core / -ruvector / -signal unchanged). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-13 23:00:31 -04:00
ruv	94ef125240	feat(sensing-server): introspection module skeleton (ADR-099 D1+D7+D8) Adds the per-frame introspection state that ADR-099 specifies, plus the two midstream dependencies. Pure addition — no other code touched. v2/crates/wifi-densepose-sensing-server/Cargo.toml + midstreamer-temporal-compare = "0.2" + midstreamer-attractor = "0.2" v2/crates/wifi-densepose-sensing-server/src/introspection.rs (new, 530 lines) pub struct IntrospectionState ├─ midstreamer-attractor's AttractorAnalyzer (regime + Lyapunov) ├─ SignatureLibrary (JSON-loaded labelled segments) ├─ VecDeque<f64> sliding amplitude buffer (default 128 points) └─ update(timestamp_ns, derived_feature) — never window-blocked + snapshot() -> IntrospectionSnapshot { timestamp_ns, frame_count, regime, lyapunov_exponent, attractor_dim, attractor_confidence, top_k_similarity } pub enum Regime { Idle, Periodic, Transient, Chaotic, Unknown } pub struct Signature { id, label, vectors, dtw, promotion_threshold } pub struct SimilarityMatch { signature_id, score, above_threshold } DTW path is currently a host-side stand-in (length-normalised L1 with the real DTW call deferred to I3/I5 once vec128 embeddings exist — ADR-099 P1). The attractor path is wired to midstream directly. The analyze() step only runs every N frames (default 8) to stay under the per-frame ms budget. 8 unit tests (snapshot defaults, frame-count + timestamp advance, empty library, scoring + ordering invariants, threshold gating, empty-signature fault-tolerance, regime classification after 200 frames). 199 → 207 lib tests, 0 failures. cargo build clean (only pre-existing warnings). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-13 22:50:58 -04:00
ruv	c641fc44ae	feat(docker+sensing-server): refresh Docker publish + opt-in bearer-token API auth Closes #520, #514, #443. ## #520 / #514 — stale Docker image, missing UI assets `ruvnet/wifi-densepose:latest` was published before `ui/observatory` and `ui/pose-fusion` were added; users see /app/ui missing those files and the v0.6+ packet format doesn't reach the server. Two fixes: 1. `docker/Dockerfile.rust` now `RUN`s a build-time guard after `COPY ui/` that fails the build if `index.html` / `observatory.html` / `pose-fusion.html` / `viz.html` (or the `observatory/` / `pose-fusion/` / `components/` / `services/` directories) are missing, plus an exec-bit check on `/app/sensing-server`. A stale image can never be silently produced again. 2. New `.github/workflows/sensing-server-docker.yml` rebuilds + pushes on every change to the Dockerfile, the server crate, the signal/vitals/ wifiscan crates, the workspace manifests, the `ui/` tree, or itself — plus `v` tags and manual dispatch. Pushes to both `docker.io/ruvnet/ wifi-densepose` AND `ghcr.io/ruvnet/wifi-densepose` with `latest` + `vX.Y.Z` + `sha-<short>` tags, then post-push smoke-tests the artifact: /health, /api/v1/info, the observatory + pose-fusion HTML, AND the bearer-auth path (no token → 401, wrong → 401, correct → 200). Uses the `DOCKERHUB_USERNAME`/`DOCKERHUB_TOKEN` repo secrets; ghcr.io rides on the workflow's GITHUB_TOKEN. ## #443 — sensing-server REST API auth model QE security audit raised that 40+ /api/v1/ routes have no auth layer with a default `0.0.0.0` bind. New `wifi_densepose_sensing_server::bearer_auth` module + middleware: - Env-var-gated: `RUVIEW_API_TOKEN` unset/empty ⇒ middleware is a no-op (current LAN-mode behaviour preserved — no default change); set ⇒ every `/api/v1/` request must carry `Authorization: Bearer <token>` or the server returns 401. - Constant-time byte compare via local `ct_eq` (no new dep). - `/health`, `/ws/sensing`, and `/ui/*` are intentionally never gated (orchestrator probes + local browsers). - Startup logs which mode is active and warns when auth is ON with a `0.0.0.0` bind. - 8 unit tests on the middleware via `tower::ServiceExt::oneshot` (sensing-server lib tests 191 → 199, 0 failures). Verified locally: `cargo build --workspace --no-default-features` ✓, `cargo test -p wifi-densepose-sensing-server --no-default-features` ✓. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-13 08:52:25 -04:00
Deploy Bot	ce7983eb43	feat(sensing-server): adaptive person count — RollingP95 + dedup_factor runtime API RollingP95 adaptive normalizer (ADR-044 §5.2): - Streaming P95 estimator (600-sample / ~30 s window) replaces fixed-scale denominators (variance/300, motion/250, spectral/500) that saturated against live ESP32 values, collapsing dynamic range to zero. - Cold-start (<60 samples) falls back to legacy denominators — day-0 behaviour is preserved. - Three new fields on AppStateInner: p95_variance, p95_motion_band_power, p95_spectral_power (all RollingP95::new(600, 60)). - compute_person_score() refactored to accept &AppStateInner; all three call sites (wifi, wifi-fallback, simulated) updated. - 5 unit tests in rolling_p95_tests module. dedup_factor runtime API (ADR-044 §5.3): - New field dedup_factor: f64 (default 3.0) on AppStateInner. - fuse_or_fallback() gains dedup_factor param; fallback switches from max() to sum/dedup_factor (ceiling), matching the fork's sum-based aggregation. - RuntimeConfig struct + load/save_runtime_config() for data/config.json persistence across restarts. - Three new REST endpoints: GET /api/v1/config/dedup-factor POST /api/v1/config/dedup-factor POST /api/v1/config/ground-truth (auto-tune from known person count) Explicitly NOT included: - lambda=5.0 (upstream keeps its 0.1 default — deployment-specific tuning) - CC intensity threshold 0.3 and min-cluster-size 4 hardcodes - max_cc_size filter removal	2026-04-28 15:32:34 -04:00
rUv	17509a2a41	feat(ruvector,signal,sensing-server): ADR-084 Passes 1/1.5/2/3 — RaBitQ similarity sensor implementation (#435 ) * feat(ruvector): ADR-084 Pass 1 — sketch module foundation Implements Pass 1 of ADR-084 (RaBitQ similarity sensor): a thin RuView-flavored API over `ruvector_core::quantization::BinaryQuantized`, exposed at `wifi_densepose_ruvector::{Sketch, SketchBank, SketchError}`. API surface: - `Sketch::from_embedding(&[f32], sketch_version: u16)` — sign-quantize a dense embedding into a 1-bit-per-dim packed sketch. - `Sketch::distance` — hamming distance with schema-mismatch error. - `Sketch::distance_unchecked` — hot-path variant for sketches already validated as same-schema. - `SketchBank::insert/topk/novelty` — bank with caller-assigned u32 IDs, schema locked at first insert, novelty = min_distance / embedding_dim. Schema versioning (`sketch_version: u16` + `embedding_dim: u16`) prevents silent comparisons across embedding-model generations. Bumping the model forces re-sketch of the candidate bank. Pass 1 establishes the API and unit-test foundation. Acceptance criteria (8x-30x compare-cost reduction, 90% top-K coverage, <1pp accuracy regression) are measured per-site in Passes 2-5. Validated: - 12 new tests pass (sketch construction, hamming, top-K ordering, schema lock, schema rejection, novelty) - cargo test --workspace --no-default-features → 1,551 passed, 0 failed, 8 ignored (was 1,539 before; +12 new tests) - ESP32-S3 on COM7 still streaming live CSI (cb #117300) Co-Authored-By: claude-flow <ruv@ruv.net> * bench(ruvector): ADR-084 acceptance — sketch-vs-float compare cost Adds sketch_bench measuring the first ADR-084 acceptance criterion (8x-30x compare cost reduction) at three dimensions and a realistic top-K@k=8 over 1024 sketches. Measured (Windows host, criterion --warm-up 1s --measurement 3s): compare_d512: float_l2: 197.03 ns/op float_cosine: 231.17 ns/op sketch_hamming: 4.56 ns/op → 43-51x speedup topk_d128_n1024_k8: float_l2_topk: 47.59 us sketch_hamming: 6.34 us → 7.5x speedup Pair-wise compare exceeds the 8-30x acceptance criterion by an order of magnitude. Top-K is at 7.5x — close to the threshold; the sort dominates at this bank size, which is a Pass 1.5 optimization opportunity (partial-sort heap for small K). Co-Authored-By: claude-flow <ruv@ruv.net> * perf(ruvector): ADR-084 Pass 1.5 — partial-sort heap in SketchBank::topk Replace `sort_by_key + truncate` (O(n log n)) with a fixed-size max-heap (O(n log k)) for top-K queries when n > k. Fast path when n ≤ k stays on the simple sort. Bench at d=128, n=1024, k=8 (Windows host, criterion 3s measurement): Before (sort + truncate): 6.34 µs/op After (heap): 3.83 µs/op -39.4% / +1.65× faster Combined with the 32× memory shrink and 47.6 µs → 3.83 µs total path saving: topk_d128_n1024_k8 vs float_l2_topk: Pass 1 sort_by_key: 47.59 µs / 6.34 µs = 7.5× speedup Pass 1.5 heap: 47.59 µs / 3.83 µs = 12.4× speedup Now over the ADR-084 acceptance criterion of 8× minimum. Heap pays off strictly more at larger n; benchmark at n=4096 is a Pass-2 follow-up. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(signal): ADR-084 Pass 2 — sketch-prefilter for EmbeddingHistory::search Adds `EmbeddingHistory::with_sketch(...)` and `search_prefilter(query, k, prefilter_factor)`. The prefilter sketches the query, hamming-ranks the parallel sketch array to take the top `k * prefilter_factor` candidates, then refines those with exact cosine and returns the top-K. `EmbeddingHistory::new(...)` is unchanged — sketches are opt-in via the new constructor. `search_prefilter` falls back to brute-force `search` when sketches are disabled, so callers never see incorrect results. ADR-084 acceptance criterion empirically validated: Synthetic 128-d AETHER-shape, n=256, 16 queries: k=8, prefilter_factor=4 → 78.9% top-K coverage (FAIL <90%) k=8, prefilter_factor=8 → ≥90% top-K coverage (PASS) k=16, prefilter_factor=8 → ≥90% top-K coverage (PASS) The factor=4 default that I'd planned in Pass 1 falls below the 90% bar on uniform-random synthetic data. Production callers should use 8 unless their embeddings carry enough structure (real AETHER traces likely will) to clear the bar at lower factors. Documented in the search_prefilter docstring and asserted in test_search_prefilter_topk_coverage_meets_adr_084. FIFO eviction now drains the parallel sketches array in lockstep — test_search_prefilter_evicts_sketches_on_fifo guards against the two arrays drifting (which would silently corrupt top-K via index mismatch). Validated: - cargo test --workspace --no-default-features → 1,554 passed, 0 failed, 8 ignored (was 1,551; +3 new prefilter tests) - ESP32-S3 on COM7 still streaming live CSI (cb #3200) Co-Authored-By: claude-flow <ruv@ruv.net> * bench(signal): ADR-084 Pass 2 — end-to-end search_prefilter speedup Measures EmbeddingHistory::search_prefilter (sketch + cosine refine) vs the brute-force EmbeddingHistory::search baseline at three realistic AETHER bank sizes, with the empirically validated prefilter_factor=8. Measured (Windows host, criterion --warm-up 1s --measurement 3s): d=128, k=8: n=256 brute_force_cosine = 31.98 us, prefilter = 13.78 us → 2.3x n=1024 brute_force_cosine = 110.4 us, prefilter = 16.64 us → 6.6x n=4096 brute_force_cosine = 507.4 us, prefilter = 66.37 us → 7.6x Speedup grows with bank size (sketch overhead is fixed; brute-force scales linearly with n). At n=4k the prefilter approaches the 8x ADR-084 acceptance criterion; at n=10k+ (realistic multi-day deployment banks) it crosses cleanly. Below n=512 the brute-force path is already cheap (sub-50 us) so the prefilter's narrower wins don't materially affect the hot path. Coverage acceptance (≥90% top-K agreement) is exercised in the unit-test suite, not the bench. The bench measures cost only. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(signal): ADR-084 Pass 3 — EmbeddingHistory::novelty primitive Adds the cluster-Pi novelty-sensor primitive: `EmbeddingHistory::novelty(query)` returns `Option<f32>` in [0.0, 1.0] where 0.0 = exact-match-in-bank and 1.0 = no-overlap. Returns None when sketches are disabled so callers can fall back gracefully (existing `EmbeddingHistory::new` constructor stays sketch-disabled). This is the building block of the cluster-Pi novelty gate described in ADR-084 §"cluster-Pi novelty sensor": each sensor node maintains a bank of recent feature vectors, the gate scores the incoming frame's novelty against the bank, and the heavy CNN / pose-model wake gate consumes the score. Wiring novelty into sensing-server's NodeState happens in a follow-up — that's a ~50-line surgical change touching main.rs that deserves its own commit. This patch lands the primitive + tests so the wiring is straightforward. Three regression tests added: - test_novelty_returns_none_without_sketches (graceful fallback when bank is sketch-less) - test_novelty_zero_for_exact_match_one_for_empty_bank (semantic boundaries) - test_novelty_decreases_as_bank_grows_around_query (gradient direction — guards against reversed comparator) Validated: - cargo test --workspace --no-default-features → 1,557 passed, 0 failed, 8 ignored (was 1,554; +3 new novelty tests) - ESP32-S3 on COM7 still streaming live CSI (cb #7600) Co-Authored-By: claude-flow <ruv@ruv.net> * feat(sensing-server): ADR-084 Pass 3 — wire novelty into NodeState Wires the EmbeddingHistory::novelty primitive (Pass 3 prior commit) into the per-node frame ingestion path on the cluster Pi. Each incoming CSI frame now updates a per-node sketch bank of the last 6.4 s of feature vectors and produces a novelty score in [0.0, 1.0] that downstream model-wake gates can consume. Two NodeState structs were touched (one in types.rs and a refactoring-leftover duplicate in main.rs that the call site uses); both gain feature_history + last_novelty_score fields and an update_novelty helper that: - truncates / zero-pads incoming amplitudes to NOVELTY_VECTOR_DIM (56) - scores novelty before inserting (so a frame doesn't see itself) - FIFO-evicts when the bank reaches NOVELTY_HISTORY_CAPACITY (64) Wired at the per-node ESP32 frame path in main.rs:3772 (immediately before frame_history.push_back). Existing call sites that operate on the singleton SensingState (not per-node) intentionally untouched — they will be wired in a follow-up alongside the WebSocket update envelope's novelty_score field. Two new unit tests in novelty_tests: - first_frame_yields_max_novelty_then_zero_on_repeat (semantic boundaries: empty bank = 1.0, exact repeat = 0.0) - handles_short_and_long_amplitude_vectors (truncate / zero-pad robustness across hardware variants) Validated: - cargo test --workspace --no-default-features → 1,559 passed, 0 failed, 8 ignored (was 1,557; +2 new novelty tests) - ESP32-S3 on COM7 still streaming live CSI (cb #3900) Co-Authored-By: claude-flow <ruv@ruv.net> * hardening(ruvector): L2 from PR #435 review — overflow on >u16::MAX dims Pass 1.6 hardening, addressing L2 finding from the security review on PR #435 (https://github.com/ruvnet/RuView/pull/435#issuecomment-4321285519): The original `Sketch::from_embedding` used `debug_assert!` for the `embedding.len() <= u16::MAX` invariant, which compiled out in release builds. A caller passing a 65,536+ -dim embedding would silently truncate the dimension count via `as u16` cast — two over-long inputs would then compare as same-dimensional rather than as 64k vs 70k, and the dimension confusion would not surface anywhere. Two-part fix: - `from_embedding` (infallible) now SATURATES `embedding_dim` to `u16::MAX` rather than truncating. Two over-long inputs still get packed bit-correctly by `BinaryQuantized` and the saturated dim is consistent across both, so they compare predictably (just with an upper-bounded distance). - `try_from_embedding` (new, fallible) returns `Err(SketchError::EmbeddingDimOverflow{got, max})` when the input exceeds `u16::MAX`. Use this when an over-long input should fail loudly rather than be silently saturated. - New error variant `SketchError::EmbeddingDimOverflow` with the observed `got` and the `max` (`u16::MAX as usize`). - New regression test `try_from_embedding_rejects_over_long_input` asserts both paths: try_ → Err, infallible → saturate. Validated: - 13 sketch unit tests pass (was 12; +1 for L2 boundary). - cargo test --workspace --no-default-features → 1,560 passed, 0 failed, 8 ignored (was 1,559; +1). - ESP32-S3 on COM7 streaming live CSI (cb #100, fresh boot RSSI -48 dBm). Co-Authored-By: claude-flow <ruv@ruv.net> * hardening(ruvector,signal): L1+L3 from PR #435 review Two follow-ups to the security review on PR #435: L1 — Defensive `if let Some(...)` for SketchBank::topk heap peek. The original `.expect("heap len == k > 0")` was mathematically unreachable (k > 0 enforced at function entry, heap.len() >= k branch guards), but a structural pattern makes the impossibility a type property rather than a runtime invariant. Same hot-path cost; zero panic risk in the production binary. L3 — Guard `embedding_dim == 0` in `EmbeddingHistory::novelty`. A 0-dim history is constructible via `with_sketch(0, ...)`; without the guard the function returned `NaN` (min_d as f32 / 0.0), silently poisoning every downstream gate (model-wake, anomaly-emit, etc). Now returns Some(1.0) — fail-loud at "no comparison possible → maximally novel," never NaN. New regression test `test_novelty_zero_dim_history_returns_one_not_nan` pins it down. Validated: - cargo test --workspace --no-default-features → 1,561 passed, 0 failed, 8 ignored (was 1,560; +1 for the L3 NaN guard test). - ESP32-S3 on COM7 streaming live CSI (cb #12400, RSSI fresh). L4 (f64→f32 cast) is documentation-only and lands in a follow-up patch; L8 (always-on novelty sensor) is an observation, not a fix. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(sensing-server): ADR-084 Pass 3.5 — novelty_score on PerNodeFeatureInfo Adds an optional `novelty_score: Option<f32>` field to PerNodeFeatureInfo, the per-node WebSocket envelope shape. Mirrored on both struct definitions (types.rs canonical + main.rs's refactoring-leftover duplicate) so the schema is consistent. `#[serde(skip_serializing_if = "Option::is_none")]` keeps existing WebSocket consumers unaffected — old clients see no extra field unless the server populates it. No PerNodeFeatureInfo literal construction sites exist today (all `node_features: None`), so this is a schema-only addition; live population from `NodeState::last_novelty_score` lands in a Pass 3.6 follow-up that also wires `node_features: Some(...)` at the per-node ESP32 frame emit path. Validated: - cargo test --workspace --no-default-features → 1,561 passed, 0 failed, 8 ignored (no change; schema-only). - ESP32-S3 on COM7 streaming live CSI (cb #2100, fresh boot). Co-Authored-By: claude-flow <ruv@ruv.net> * feat(sensing-server): ADR-084 Pass 3.6 — populate node_features with novelty_score Wires `node_features: Some(...)` at the two per-node ESP32 frame emit sites (formerly `node_features: None`). Adds a `build_node_features` helper that constructs `Vec<PerNodeFeatureInfo>` from `s.node_states`, including the per-node `last_novelty_score`. This completes the Pass 3.x track — novelty score now flows from NodeState → PerNodeFeatureInfo → SensingUpdate envelope → WebSocket clients. Cluster-Pi UI / model-wake / anomaly-emit gates can read it without round-tripping back to the server. Three other call sites (singleton paths at 1772, 1911, 4170) keep `node_features: None` for now — those are for the offline / simulated paths that don't have per-node ESP32 state. They'll get populated when their parent flows wire up real multi-node fanout. Stale flag uses `ESP32_OFFLINE_TIMEOUT` (5s) — same threshold the rest of the system uses to decide a node has dropped. Validated: - cargo test --workspace --no-default-features → 1,561 passed, 0 failed, 8 ignored (no change; integration test would be wire- format diff in a follow-up). - ESP32-S3 on COM7 streaming live CSI (cb #100, fresh boot, RSSI -49 dBm). Co-Authored-By: claude-flow <ruv@ruv.net> * feat(ruvector): ADR-084 Pass 4 — WireSketch wire-format primitive Adds `WireSketch::serialize` / `deserialize` for transmitting a sketch + novelty score over any byte-stream channel — cluster↔cluster mesh (ADR-066 swarm bridge when it exists), sensor→cluster-Pi UDP (ADR-086 edge gate complement), gateway→cloud QUIC. Channel-agnostic by design. Wire layout (12-byte header + ceil(dim/8) bytes payload, little-endian): [0..4] magic = 0xC5110084 [4..6] format_version = 1 [6..8] sketch_version (embedding-model schema) [8..10] embedding_dim [10..12] novelty_q15 (novelty * 32_767, saturated) [12..] packed sketch bits A 128-d AETHER sketch fits in exactly 28 bytes (12 header + 16 bits). Deserializer is paranoid by design — every untrusted byte buffer gets validated against: - length floor (>= header bytes) - length ceiling (WIRE_SKETCH_MAX_BYTES = 9 KiB; defends against memory-exhaustion attacks via claimed-but-impossible large dims) - magic match - format_version supported - embedding_dim → payload bytes consistency A malformed UDP packet from a non-RuView sender produces a typed `WireSketchError` (variant per failure class), never a panic. Re-exported from lib.rs alongside `Sketch` / `SketchBank`. Seven new tests: - wire_serialize_round_trip (correctness) - wire_rejects_short_buffer (length floor) - wire_rejects_oversized_buffer (length ceiling, DoS guard) - wire_rejects_bad_magic (cross-protocol confusion guard) - wire_rejects_unsupported_format_version (forward-compat) - wire_rejects_payload_size_mismatch (header/body consistency) - wire_envelope_size_for_aether_128d (sizing contract: 28 bytes) Validated: - cargo test --workspace --no-default-features → 1,568 passed, 0 failed, 8 ignored (was 1,561; +7 wire-format tests). - ESP32-S3 on COM7 streaming live CSI (cb #15100, RSSI -48 dBm). Pass 4's wire-format primitive ships first; the channel that carries it (ADR-066 swarm-bridge or ADR-086 sensor→Pi gate) is out-of-scope for this commit and tracked separately. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(ruvector): ADR-084 Pass 5 — privacy-preserving event log + L4 docstring Pass 5 — `PrivacyEventLog` and `NoveltyEvent` types in a new `wifi_densepose_ruvector::event_log` module. Each event stores `(timestamp, sketch_bytes, sketch_version, embedding_dim, novelty, witness_sha256)` — explicitly NOT the raw float embedding. The witness is SHA-256 of the WireSketch serialization (12-byte header + packed bits + q15 novelty), making events content-addressable: two pushes of the same `(sketch, novelty)` produce byte-identical witnesses, enabling dedup at the receiver and verifier. Privacy properties (ADR-084 §"Privacy-preserving event log"): 1. Non-invertibility — 1-bit sign quantization is lossy; an attacker with read access cannot reconstruct the source CSI / embedding. 2. Content addressing — `(sketch_version, witness)` is fully qualified. 3. Bounded memory — fixed capacity ring; misbehaving senders cannot exhaust receiver memory. Seven new tests: - push_grows_until_capacity_then_fifo_evicts - zero_capacity_log_silently_drops_pushes (no-op stub case) - witness_is_deterministic_for_same_sketch_and_novelty (witness must NOT depend on timestamp) - witness_differs_for_different_novelty_scores - find_by_witness_returns_most_recent_match - find_by_witness_returns_none_on_miss - event_does_not_carry_raw_embedding (structural privacy guarantee) L4 hardening (PR #435 security review) — the `f64 → f32` cast in NodeState::update_novelty now has a docstring noting the boundary behaviour: `f64::INFINITY` survives as `f32::INFINITY`, `f64::NAN` propagates as `f32::NAN`. Neither panics. CSI amplitudes from healthy firmware are well within f32 finite range. Validated: - cargo test --workspace --no-default-features → 1,575 passed, 0 failed, 8 ignored (was 1,568; +7 event-log tests). - ESP32-S3 on COM7 streaming live CSI (cb #2800, RSSI -52 dBm). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-04-26 02:21:35 -04:00
rUv	f49c722764	chore(repo): rename rust-port/wifi-densepose-rs → v2/ (flatten to one level) (#427 ) The Rust port lived two directories deep (rust-port/wifi-densepose-rs/) without any sibling under rust-port/ that warranted the extra level. Move the whole workspace up to v2/ to match v1/ (Python) at the same depth and shorten every cd / build command across the repo. git mv preserves history for all tracked files. 60 files updated for path references (CI workflows, ADRs, docs, scripts, READMEs, internal .claude-flow state). Two manual fixes for relative-cd paths in CLAUDE.md and ADR-043 that became wrong after the depth change (cd ../.. → cd ..). Validated: - cargo check --workspace --no-default-features → clean (after target/ nuke; the gitignored target/ was carried by the OS rename and had hard-coded old paths in build scripts) - cargo test --workspace --no-default-features → 1,539 passed, 0 failed, 8 ignored (same totals as pre-rename) - ESP32-S3 on COM7 → still streaming live CSI (cb #40300, RSSI -64 dBm) After-merge follow-up: contributors should `rm -rf v2/target` once and let cargo regenerate from the new path.	2026-04-25 21:28:13 -04:00

45 Commits