wifi-densepose

Commit Graph

Author	SHA1	Message	Date
rUv	67fec45e61	feat(edge-registry): ADR-102 — surface Cognitum cog catalog via /api/v1/edge/registry (#648 ) * feat(edge-registry): ADR-102 — surface Cognitum cog catalog via /api/v1/edge/registry Adds a new sensing-server endpoint that fetches and caches the canonical Cognitum app registry at https://storage.googleapis.com/cognitum-apps/app-registry.json (105 cogs across 11 categories as of v2.1.0). RuView previously had no live awareness of the catalog — the README's capability table was hand- curated and went stale as Cognitum shipped new cogs (the registry was last updated 6 days ago). ADR: * docs/adr/ADR-102-edge-module-registry.md — full design, response shape, configuration flags, failure modes, and a 12-row security review covering SSRF, response inflation, ?refresh abuse, stale-serve semantics, TLS, cache poisoning, JSON-panic resistance, etc. Code: * v2/.../edge_registry.rs — EdgeRegistry struct + UreqFetcher + MockFetcher trait + 7 unit tests. RwLock<Option<CachedEntry>> with stale-on-error fallback. MAX_PAYLOAD_BYTES=8 MiB, 10s wire timeout. * v2/.../main.rs — constructs Option<Arc<EdgeRegistry>> at startup, registers GET /api/v1/edge/registry handler, wires Extension layer. Handler runs the blocking ureq fetch via tokio::task::spawn_blocking so the async runtime stays free. * v2/.../cli.rs / main.rs Args — three new flags (per user request to "allow the registry to be disabled or changed"): --edge-registry-url <URL> (env RUVIEW_EDGE_REGISTRY_URL) --edge-registry-ttl-secs <N> (env RUVIEW_EDGE_REGISTRY_TTL_SECS) --no-edge-registry (env RUVIEW_NO_EDGE_REGISTRY) When --no-edge-registry is set or the URL is empty, the endpoint returns 404. Cargo.toml: adds ureq (rustls), sha2, thiserror as direct deps. README: * New collapsed "🧩 Edge Module Catalog" section with the full 105-cog table generated from the registry, grouped by category with practical one-line descriptions (e.g. 
"Spots irregular heartbeats and abnormal heart rhythms", "Detects walking problems and scores fall risk"). Links to https://seed.cognitum.one/store and the local appliance /cogs page. Sits between the HF model section and How It Works. Tests (7/7 pass): first_call_hits_upstream_and_caches ttl_expiry_triggers_refetch force_refresh_bypasses_fresh_cache stale_serve_on_upstream_failure_after_cached_success no_cache_no_upstream_returns_error upstream_invalid_json_is_treated_as_error upstream_sha256_is_deterministic Security highlights (full review in ADR-102 §"Security review"): - The registry is metadata-only; per-cog binary signatures (ADR-100) remain the trust root for installs. A compromised registry can mislead a human reader but cannot ship malicious binaries. - 8 MiB cap + 10s timeout + Option<Arc<...>> via Extension layer means the endpoint can't be used to exhaust memory or pin tokio threads. - Stale-on-error responses carry an explicit `stale: true` field so upstream outages are visible to consumers rather than silently masked. - Endpoint sits behind the existing RUVIEW_API_TOKEN bearer gate when set, otherwise unauthenticated (registry contents are public anyway). * chore: refresh Cargo.lock for ureq/sha2/thiserror deps added by ADR-102	2026-05-19 18:08:43 -04:00
Rahul	c00f45e296	fix(sensing): finish #611 NaN-panic audit — 7 more sites missed by #613 (#624) #613 fixed adaptive_classifier.rs:94 (the IQR sort) and called the audit done, but the grep used `partial_cmp(b).unwrap()` as a literal and missed seven additional production sites that use comparator variants: adaptive_classifier.rs:205 AdaptiveModel::classify() argmax over softmax probs — same per-frame hot path as #611. NaN flows through normalise → logits → softmax and still reaches this site even after the IQR fix; adaptive_classifier.rs:480 train() argmax (training accuracy loop); adaptive_classifier.rs:500 train() per-class argmax; main.rs:2446, 2449 count_persons_mincut variance source/sink select; csi.rs:602, 605 count_persons_mincut variance source/sink select (duplicate of main.rs logic in csi.rs). For the variance-select sites, note that the outer `unwrap_or((0, &0))` only catches an empty iterator — it cannot rescue a panic raised inside the comparator. A single NaN in `variances[]` still aborts the process. Same fix as #613: swap `.unwrap()` for `.unwrap_or(std::cmp::Ordering::Equal)` inside the comparator closure. Pure behavioural change, no API surface. Re-audit of the remaining `partial_cmp(...).unwrap()` matches in v2/: they are all inside `#[cfg(test)]` / `#[test]` blocks (spectrogram.rs:269, depth.rs:234, connectivity.rs:477, vital_signs.rs:737) where inputs are controlled and panic-on-NaN is acceptable.	2026-05-19 10:02:08 -04:00
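The failure mode behind c00f45e296 comes from `f64::partial_cmp` returning `None` when either operand is NaN, so an `.unwrap()` inside the comparator panics before any outer fallback runs. A minimal sketch of the NaN-tolerant argmax pattern the message describes follows; `argmax_nan_safe` is a hypothetical helper written for illustration, not a function from the repository.

```rust
use std::cmp::Ordering;

/// Hypothetical helper: index of the largest value in `values`.
/// A NaN comparison is treated as `Equal` instead of panicking.
fn argmax_nan_safe(values: &[f64]) -> Option<usize> {
    values
        .iter()
        .enumerate()
        // partial_cmp yields None for NaN; fall back to Equal inside the closure,
        // because a fallback outside max_by cannot catch a panic raised in here.
        .max_by(|(_, a), (_, b)| a.partial_cmp(b).unwrap_or(Ordering::Equal))
        .map(|(index, _)| index)
}
```

With a NaN in the input the call returns some index rather than aborting the process. Which index wins in that case is unspecified, since treating NaN as `Equal` does not give a total order.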
ruv	79cc2d7b22	Merge #491 : feat(sensing-server): adaptive person count — RollingP95 + dedup_factor runtime API Integrating @schwarztim's PR #491 into main on their behalf — their fork has fallen too far behind for a clean rebase (the PR's commit graph dropped silently during `git rebase origin/main`), so applying as a merge from the fork head to preserve the diff cleanly. What this lands: - `RollingP95` adaptive normaliser for the person-count feature scaling. Streaming P95 over a 600-sample / ~30 s sliding window. Cold-start (<60 samples) falls back to the legacy denominators (variance/300, motion_band_power/250, spectral_power/500) so day-0 behaviour is preserved on every deployment. - `RuntimeConfig` struct + `load_runtime_config` / `save_runtime_config` persisted to `data/config.json`. Exposes `dedup_factor` via REST so multi-node deployments can tune cluster-deduplication without a rebuild, including an auto-tune endpoint that derives optimal dedup from a known person count (calibration mode). - `compute_person_score()` now takes &AppStateInner alongside &FeatureInfo so the adaptive denominators are reachable. All 3 call sites updated. - New `AppStateInner` fields: `p95_variance`, `p95_motion_band_power`, `p95_spectral_power`, `dedup_factor`, `data_dir`. Closes #491. Directly addresses: - #499 (double skeletons, multi-node) — the slot-clustering problem this PR's adaptive normaliser was designed to fix - #519 Bug 1 (ghost person detection on edge-tier 1 & 2 multi-node) - #496 (person count over-reporting on single-room single-person) Verified locally: - cargo check -p wifi-densepose-sensing-server --no-default-features: 1.0s - cargo test -p wifi-densepose-sensing-server --no-default-features --lib: 233/233 passed in 25.0s Co-authored-by: @schwarztim Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-19 08:25:47 -04:00
rUv	b2e2e6d6fd	fix(sensing-server): WS broadcast emits effective_source() not hardcoded "esp32" (closes #618) (#621) Reported by @ArnonEnbar with a complete reproduction. broadcast_tick_task() re-emits the cached `latest_update` every tick so pose WS clients keep getting data even when ESP32 pauses between frames. The `source` field of that cached update was set to "esp32" at the moment a fresh ESP32 frame was last decoded (main.rs:3885, :4136). After the ESP32 loses power or network, no fresh frame is decoded — the cached `latest_update` is still re-broadcast every tick with the stale source: "esp32" baked in. The UI's "Sensing" tab keeps showing "LIVE — ESP32 HARDWARE Connected" with frozen vitals/features/classification re-broadcast indefinitely. REST `/health` correctly reports source: "esp32:offline" (via effective_source(), which checks last_esp32_frame elapsed time against ESP32_OFFLINE_TIMEOUT=5s) — but the WS broadcast path was the one consumer that didn't call it. Fix: clone the cached update per tick, overwrite source with s.effective_source(), then serialize and broadcast. The UI now switches to "esp32:offline" on the same 5s budget as the REST surface. cargo build -p wifi-densepose-sensing-server --no-default-features: 17s, no errors (1 pre-existing unused-import warning unchanged).	2026-05-18 08:18:18 -04:00
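The fix in b2e2e6d6fd (clone the cached update, overwrite `source`, then broadcast) can be sketched as below. Only `effective_source`, `ESP32_OFFLINE_TIMEOUT` and the two source strings come from the commit message; `SensingUpdate`, its fields, `update_for_broadcast`, and the choice to pass the elapsed time as a parameter are assumptions made so the example stays self-contained.

```rust
use std::time::Duration;

/// Offline budget quoted in the commit message (5 s).
const ESP32_OFFLINE_TIMEOUT: Duration = Duration::from_secs(5);

/// Hypothetical stand-in for the cached WebSocket payload.
#[derive(Clone, Debug, PartialEq)]
struct SensingUpdate {
    source: String,
    tick: u64,
}

/// Derive the source label from the time since the last decoded ESP32 frame.
/// Treating exactly 5 s as still online is an assumption.
fn effective_source(since_last_frame: Duration) -> &'static str {
    if since_last_frame > ESP32_OFFLINE_TIMEOUT {
        "esp32:offline"
    } else {
        "esp32"
    }
}

/// Clone the cached update and overwrite its source at broadcast time,
/// leaving the cached copy untouched.
fn update_for_broadcast(cached: &SensingUpdate, since_last_frame: Duration) -> SensingUpdate {
    let mut out = cached.clone();
    out.source = effective_source(since_last_frame).to_string();
    out
}
```

Computing the label per tick rather than storing it in the cache is what keeps the WebSocket path in agreement with `/health` once frames stop arriving.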
rUv	72bbd256e7	fix(security): path-traversal guard on 5 sensing-server endpoints (closes #615 ) (#616 ) Reported by @bannned-bit. Five endpoints in v2/crates/wifi-densepose-sensing-server embedded user-controlled identifiers in format!() paths with no sanitization: recording.rs POST /api/v1/recording/start (session_name) recording.rs GET /api/v1/recording/download/:id (id) recording.rs DELETE /api/v1/recording/delete/:id (id) model_manager.rs POST /api/v1/models/load (model_id) training_api.rs load_recording_frames (dataset_ids[]) Each unauthenticated caller could: - READ arbitrary files via ../../etc/passwd, ../../.env, etc. - WRITE attacker-controlled JSONL via recording/start - LOAD attacker-controlled .rvf model files - DELETE arbitrary files the server process can touch New `path_safety` module exports `safe_id(&str) -> Result<&str, PathSafetyError>` that enforces the rejection envelope BEFORE any user input reaches a format!() that builds a path: - Allowed character set: [A-Za-z0-9._-] - Reject leading '.' (rules out '.', '..', '.env', hidden files) - Reject empty strings - Reject anything > 64 bytes - Reject all whitespace, path separators, null bytes, non-ASCII Applied at all 5 sites. Errors return 400 Bad Request (download) / status:"error" JSON (others) — not panics. 9 unit tests in path_safety::tests cover: - accepts simple alphanumeric / hyphen / underscore / dot - rejects empty, leading dot, path separators ('/', '\'), null byte, whitespace, shell specials, non-ASCII (including fullwidth slash U+FF0F), too-long, boundary at MAX_ID_LEN test result: ok. 9 passed; 0 failed cargo build -p wifi-densepose-sensing-server --no-default-features: 33s Fix-marker RuView#615 in scripts/fix-markers.json prevents removing the guard at any of the 5 call sites. CHANGELOG entry under [Unreleased] / Security documents the patched endpoints and the rejection envelope. 
Severity: critical per reporter — five remotely-reachable paths to read, write, or delete arbitrary files. Hot per-request paths, not edge cases.	2026-05-17 19:59:20 -04:00
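The rejection envelope listed in 72bbd256e7 is small enough to sketch in full. The function name, signature, `MAX_ID_LEN` and the rules are taken from the commit message; the `PathSafetyError` variants and the order of the checks are assumptions, so treat this as an illustration rather than the module's actual source.

```rust
/// Upper bound on identifier length in bytes (the commit message gives 64).
const MAX_ID_LEN: usize = 64;

/// Variant names are hypothetical; the commit message only names the type.
#[derive(Debug, PartialEq, Eq)]
enum PathSafetyError {
    Empty,
    TooLong,
    LeadingDot,
    InvalidChar,
}

/// Validate a user-supplied identifier before it is interpolated into a path.
fn safe_id(id: &str) -> Result<&str, PathSafetyError> {
    if id.is_empty() {
        return Err(PathSafetyError::Empty);
    }
    if id.len() > MAX_ID_LEN {
        return Err(PathSafetyError::TooLong);
    }
    // Rules out ".", "..", ".env" and other hidden files.
    if id.starts_with('.') {
        return Err(PathSafetyError::LeadingDot);
    }
    // Allow only [A-Za-z0-9._-]; separators, whitespace, NUL and every
    // non-ASCII byte (including the UTF-8 bytes of U+FF0F) fail this check.
    let allowed = id
        .bytes()
        .all(|b| b.is_ascii_alphanumeric() || matches!(b, b'.' | b'_' | b'-'));
    if !allowed {
        return Err(PathSafetyError::InvalidChar);
    }
    Ok(id)
}
```

An allowlist of bytes is simpler to audit than a denylist of dangerous sequences, because anything not explicitly permitted is rejected by construction.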
rUv	3bd70f7910	fix(sensing): adaptive_classifier sorts with unwrap_or(Equal) — NaN panic (closes #611) (#613) Reported by @bannned-bit. v2/crates/wifi-densepose-sensing-server/src/adaptive_classifier.rs:94 did: `sorted.sort_by(|a, b| a.partial_cmp(b).unwrap());` f64::partial_cmp returns None on NaN, so `.unwrap()` panics. CSI data from real ESP32 hardware can produce NaN (silent DSP div-by-zero, empty buffer, etc.), and this code path runs on every frame in the classify() hot path — a single NaN frame kills the entire sensing server process. The fix swaps it for unwrap_or(Ordering::Equal), matching the pattern the same file already uses at lines 149-150 and 155 (those sites were already NaN-safe; this site was an oversight). Scoped audit: grepped the v2/ tree for `partial_cmp(b).unwrap()`. The other 3 hits are in #[cfg(test)] blocks (spectrogram.rs:269, depth.rs:234, connectivity.rs:477) where panic-on-NaN is acceptable because test inputs are controlled. Only adaptive_classifier.rs:94 was a production-path crash. Severity: critical per reporter — runtime panic on real-world data. Patch: 1-line behavioural change + comment.	2026-05-17 19:29:07 -04:00
dependabot[bot]	ab9799adc3	chore(deps): bump tower-http from 0.5.2 to 0.6.8 in /v2 (#483) Bumps [tower-http](https://github.com/tower-rs/tower-http) from 0.5.2 to 0.6.8. - [Release notes](https://github.com/tower-rs/tower-http/releases) - [Commits](https://github.com/tower-rs/tower-http/compare/tower-http-0.5.2...tower-http-0.6.8) --- updated-dependencies: - dependency-name: tower-http dependency-version: 0.6.8 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-17 18:08:04 -04:00
Chaitanya Tata	f853c74563	v2: pin Rust 1.89 and fix sensing-server UI path when run from v2 (#523 ) * v2: pin Rust 1.89 for sensing-server dependency chain ruvector-core 2.0.5, hnsw_rs 0.3.4, and mmap-rs 0.7 require newer Cargo/rustc than 1.82 (edition2024 manifest, is_multiple_of, stable avx512f target_feature on x86_64). Add v2/rust-toolchain.toml so cargo build -p wifi-densepose-sensing-server picks a compatible toolchain. Signed-off-by: Chaitanya Tata <chaitanya@dotstarconsulting.com> Co-authored-by: Cursor <cursoragent@cursor.com> * sensing-server: default UI path for cwd v2/ and coalesce fallbacks The previous default ../../ui resolves to a non-existent directory when the binary is run from v2/ (common), so /ui/* returned 404 and the dashboard appeared broken. Default to ../ui and try ../ui, ./ui, ../../ui when the configured path is missing. Signed-off-by: Chaitanya Tata <chaitanya@dotstarconsulting.com> Co-authored-by: Cursor <cursoragent@cursor.com> --------- Signed-off-by: Chaitanya Tata <chaitanya@dotstarconsulting.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-05-17 18:00:36 -04:00
Timothy Schwarz	8b297dd706	fix(sensing-server): handle WebSocket Lagged + add ping keepalive (#484) Root cause: broadcast channel Lagged error caused instant disconnect when clients fell behind 256 frames (10Hz * 50-200KB = easy to lag). Client reconnects, immediately lags again, rapid cycling ensues. Sensing handler: Lagged error now continues (skips missed frames) instead of breaking. Added 30s ping interval for proxy keepalive. Pose handler: same Lagged handling + Pong match arm. CHANGELOG updated under Unreleased/Fixed. Co-authored-by: Deploy Bot <deploy@example.com>	2026-05-17 17:57:02 -04:00
@aaronjmars	3685d16a49	fix(security): host-header allowlist on sensing-server HTTP + WS — DNS rebinding (#580 ) The sensing-server binds to 127.0.0.1 by default with no `Host` header validation on either router. A foreign page can lower its DNS TTL, re-resolve to 127.0.0.1 after the browser has accepted the origin, and then read live pose + vital signs from /api/v1/* + /ws/sensing as same-origin against the attacker's hostname. When `RUVIEW_API_TOKEN` is unset (the documented LAN-mode default from #443/#547) the attacker can also drive state-mutating POSTs (recording/start, models/load, adaptive/train, calibration/start, sona/activate). Defense: a small `host_validation` axum middleware that pins the `Host` header to a configurable allowlist. The loopback names (`localhost`, `127.0.0.1`, `[::1]`, each with or without a port) are always in the set, so default 127.0.0.1 deployments keep working from the local browser without any configuration change. Operators who bind to a routable address extend the set with one or more `--allowed-host` flags or a comma-separated `SENSING_ALLOWED_HOSTS` env var. Reverse-proxy deployments that already canonicalise `Host` opt out with `--disable-host-validation`. The layer is wired into both the dedicated WebSocket router on `--ws-port` (8765) and the main HTTP router on `--http-port` (8080), so /ws/sensing on either listener is covered. Rejection responses are `421 Misdirected Request` (the correct status for a request that arrived at a server that does not consider the supplied `Host` authoritative); missing `Host` is `400 Bad Request`. CWE-346 (Origin Validation Error), CWE-350 (Reliance on Reverse DNS). Severity: high. Tests: 13 new unit tests on the middleware (loopback defaults, case-insensitivity, IPv6 bracketing, port stripping, env-var/CLI merge, foreign-host rejection on /health + /ws/*, disabled-allowlist escape hatch). Full suite: 220/220 pass under `cargo test -p wifi-densepose-sensing-server --no-default-features`. 
Co-authored-by: Aeon <aeon@aaronjmars.com>	2026-05-17 17:27:00 -04:00
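The allowlist behaviour described in 3685d16a49 (loopback names always accepted, port stripped, case-insensitive match, 421 for a foreign `Host`, 400 for a missing one) can be sketched without any web framework. The function names and the `Result<(), u16>` return shape are assumptions for illustration; the real middleware is an axum layer.

```rust
/// Loopback names that the commit message says are always allowed.
const LOOPBACK_HOSTS: [&str; 3] = ["localhost", "127.0.0.1", "[::1]"];

/// Strip an optional `:port` suffix, keeping bracketed IPv6 literals intact.
fn host_without_port(host: &str) -> &str {
    if host.starts_with('[') {
        // Bracketed IPv6: keep everything through the closing bracket.
        return match host.find(']') {
            Some(end) => &host[..=end],
            None => host,
        };
    }
    match host.rsplit_once(':') {
        Some((name, port)) if !port.is_empty() && port.bytes().all(|b| b.is_ascii_digit()) => name,
        _ => host,
    }
}

/// Hypothetical check: Ok(()) when the Host header is acceptable, otherwise
/// the HTTP status to reject with (400 = missing, 421 = not authoritative).
fn check_host(host: Option<&str>, extra_allowed: &[&str]) -> Result<(), u16> {
    let host = host.ok_or(400u16)?;
    let name = host_without_port(host.trim());
    let allowed = LOOPBACK_HOSTS
        .iter()
        .chain(extra_allowed.iter())
        .any(|candidate| candidate.eq_ignore_ascii_case(name));
    if allowed {
        Ok(())
    } else {
        Err(421)
    }
}
```

Here `extra_allowed` plays the role of the values an operator would supply through `--allowed-host` or `SENSING_ALLOWED_HOSTS`.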
Akhilesh Arora	10684972d7	fix(vital_signs): use circular variance for wrapped phases (#595) process_frame computed arithmetic mean + variance on phase values from atan2(), which are wrapped to (-pi, pi]. Phases close across the +/-pi discontinuity produced ~pi^2 variance instead of ~1e-6, feeding wrap noise into the heart-rate FFT buffer. Replace inline math with a standard circular variance helper (1 - mean resultant length). Add 4 unit tests, one through the production path of process_frame. Closes #593	2026-05-17 17:02:53 -04:00
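Circular variance, as named in 10684972d7, is one minus the mean resultant length of the unit vectors at each phase angle. A minimal sketch follows; the function name and the choice to return 0.0 for an empty slice are assumptions, not details from the patch.

```rust
/// Circular variance of phase angles in radians: 1 - R, where R is the length
/// of the mean of the unit vectors (cos p, sin p). The result lies in [0, 1].
/// Returning 0.0 for an empty slice is an assumption made for this sketch.
fn circular_variance(phases: &[f64]) -> f64 {
    if phases.is_empty() {
        return 0.0;
    }
    let n = phases.len() as f64;
    let (sum_sin, sum_cos) = phases
        .iter()
        .fold((0.0_f64, 0.0_f64), |(s, c), &p| (s + p.sin(), c + p.cos()));
    // Mean resultant length: near 1 when phases agree, near 0 when they cancel.
    let resultant = (sum_sin / n).hypot(sum_cos / n);
    1.0 - resultant
}
```

Two phases at pi - 0.001 and -pi + 0.001 are 0.002 rad apart on the circle, and this helper reports a variance of about 5e-7, whereas the arithmetic variance of the same two numbers is close to pi squared.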
ruv	ca97527646	feat(introspection): I6 — regime-changed signal + per-frame analyze + honest ADR-099 D8 amendment Three threads in this commit: 1) Per-frame attractor analysis (default analyze_every_n: 8 → 1). The I5 benchmark put per-frame update at 0.012 ms p99 — 83× under D4's 1 ms budget. The cost case for the every-8th-frame default doesn't hold; per-frame analysis is what makes regime_changed a viable early-detection trigger. 2) New `regime_changed: bool` field in IntrospectionSnapshot — flips on any frame whose attractor regime classification differs from the previous frame's. Pairs with top_k_similarity (full-shape match) to give downstream consumers two latencies with different robustness profiles. 3) Honest amendment of ADR-099 D8 to reflect empirical reality: - L1 stand-in achieves 3.20× ratio (5-frame shape match vs 16-frame event-path floor); the 10× aspirational bar is architecturally unreachable at 1-D scalar feature resolution. - regime_changed didn't fire in the 10-frame motion window — the 200-frame noise trajectory dominates the Lyapunov classification, and short perturbations don't shift the regime fast enough on a scalar feature. - Path to 10×: ADR-208 Phase 2 (Hailo NPU vec128 embeddings) — multi-dim partial matches discriminate from noise in 1-2 frames, not 5. - Side finding: midstream temporal-compare::DTW uses discrete equality cost (designed for LLM tokens), not numeric distance — swapping it in for f64 amplitude scoring would be strictly worse than the L1 stand-in. A numeric DTW is a separate concern (hand-roll or new crate). - Revised D8: ship behind --introspection (off by default) until multi- dim features land. Per-frame update budget IS met (0.041 ms p99 in this bench, ~24× under the 1 ms bar) — the feature is cheap enough to carry dark today. cargo test -p wifi-densepose-sensing-server --no-default-features: introspection (lib): 8 passed, 0 failed introspection_latency (test): 5 passed, 0 failed (incl. 
new regime_change_path_latency) clippy: clean on the introspection surface (pre-existing approx_constant lints in pose.rs / main.rs unchanged). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-13 23:29:37 -04:00
ruv	59d2d0e54f	test(sensing-server): ADR-099 latency benchmark — record empirical baseline I5. Measures the architectural latency floor of the introspection path vs. the window-aggregated event path, plus the per-frame update cost. Result on this run: ADR-099 D8 floor ratio : 3.20× (16 frames / 5 frames) D8 target ≥10× — NOT YET MET on the host-side L1 stand-in scoring; I6 closes the gap. ADR-099 D4 update p50/p99 : 0.001 ms / 0.012 ms (~83× under the 1 ms budget on a desktop runner; even with thermal throttling on a Pi 5 we have orders of magnitude of headroom). Regime after 200 frames : Idle, lyapunov=-2.32, confidence=1.0 (attractor analyzer is firing as designed). The D8 gap is structural to the current scoring: signature_score() uses a length-normalised L1 over the trailing window, which requires roughly the full signature length of in-shape frames before crossing promotion_threshold. Closing it is the I6 work — swap in the real midstreamer-temporal-compare DTW (partial-match scoring) and/or surface the attractor's regime-change as an earlier trigger than full signature match. The latency-ratio test asserts a regression bar (≥3.0×) on the L1 baseline, prints the D8 ratio + whether it's met, and explicitly defers the ≥10× target to I6 in the docstring. Better empirical reporting than a flag that silently fails until tuned. ESP32 sanity (independent of the benchmark): COM7 device alive at csi_collector cb #84500 (~30 min uptime), len=128/256 HT20/HT40, ch5, RSSI swings -44 to -79 (= real motion in the room). UDP target still unreachable from this host per the earlier diagnosis; that's a deployment fix, not a measurement gate. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-13 23:18:10 -04:00
ruv	4a1f3a1e10	feat(sensing-server): wire ADR-099 introspection tap + /ws/introspection + /api/v1/introspection/snapshot I3 (per ADR-099). Three changes in main.rs: 1) AppStateInner: + intro: IntrospectionState + intro_tx: broadcast::Sender<String> (256-slot ring, same shape as the existing tx). 2) ESP32 frame path: after the global frame_history push, before the per-node mutable borrow of s.node_states, compute the per-frame derived feature (mean amplitude across subcarriers), call s.intro.update(ts_ns, feature), and broadcast the snapshot JSON to s.intro_tx. Placement is deliberate — between the global state's mutable touch and the per-node &mut so borrow-checking stays linear; ns is borrowed after the tap completes its s.intro / s.intro_tx access. 3) Routes: ws_introspection_handler → /ws/introspection api_introspection_snapshot → /api/v1/introspection/snapshot Same Axum + tokio::sync::broadcast pattern as ws_sensing_handler, subscribed against s.intro_tx. Wrapped by the bearer-auth middleware already on /api/v1/* — orchestrator probes and unauthenticated /ws/sensing reachers continue to land on the existing topic. Verified: cargo build -p wifi-densepose-sensing-server --no-default-features ✓ cargo test -p wifi-densepose-sensing-server --no-default-features lib: 207 passed, 0 failed (199 pre-tap + 8 introspection) integration suites: 70, 8, 16, 18 passed, 0 failed cargo clippy: clean on the introspection surface (pre-existing warnings on -core / -ruvector / -signal unchanged). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-13 23:00:31 -04:00
ruv	94ef125240	feat(sensing-server): introspection module skeleton (ADR-099 D1+D7+D8) Adds the per-frame introspection state that ADR-099 specifies, plus the two midstream dependencies. Pure addition — no other code touched. v2/crates/wifi-densepose-sensing-server/Cargo.toml + midstreamer-temporal-compare = "0.2" + midstreamer-attractor = "0.2" v2/crates/wifi-densepose-sensing-server/src/introspection.rs (new, 530 lines) pub struct IntrospectionState ├─ midstreamer-attractor's AttractorAnalyzer (regime + Lyapunov) ├─ SignatureLibrary (JSON-loaded labelled segments) ├─ VecDeque<f64> sliding amplitude buffer (default 128 points) └─ update(timestamp_ns, derived_feature) — never window-blocked + snapshot() -> IntrospectionSnapshot { timestamp_ns, frame_count, regime, lyapunov_exponent, attractor_dim, attractor_confidence, top_k_similarity } pub enum Regime { Idle, Periodic, Transient, Chaotic, Unknown } pub struct Signature { id, label, vectors, dtw, promotion_threshold } pub struct SimilarityMatch { signature_id, score, above_threshold } DTW path is currently a host-side stand-in (length-normalised L1 with the real DTW call deferred to I3/I5 once vec128 embeddings exist — ADR-099 P1). The attractor path is wired to midstream directly. The analyze() step only runs every N frames (default 8) to stay under the per-frame ms budget. 8 unit tests (snapshot defaults, frame-count + timestamp advance, empty library, scoring + ordering invariants, threshold gating, empty-signature fault-tolerance, regime classification after 200 frames). 199 → 207 lib tests, 0 failures. cargo build clean (only pre-existing warnings). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-13 22:50:58 -04:00
ruv	c641fc44ae	feat(docker+sensing-server): refresh Docker publish + opt-in bearer-token API auth Closes #520, #514, #443. ## #520 / #514 — stale Docker image, missing UI assets `ruvnet/wifi-densepose:latest` was published before `ui/observatory` and `ui/pose-fusion` were added; users see /app/ui missing those files and the v0.6+ packet format doesn't reach the server. Two fixes: 1. `docker/Dockerfile.rust` now `RUN`s a build-time guard after `COPY ui/` that fails the build if `index.html` / `observatory.html` / `pose-fusion.html` / `viz.html` (or the `observatory/` / `pose-fusion/` / `components/` / `services/` directories) are missing, plus an exec-bit check on `/app/sensing-server`. A stale image can never be silently produced again. 2. New `.github/workflows/sensing-server-docker.yml` rebuilds + pushes on every change to the Dockerfile, the server crate, the signal/vitals/ wifiscan crates, the workspace manifests, the `ui/` tree, or itself — plus `v` tags and manual dispatch. Pushes to both `docker.io/ruvnet/ wifi-densepose` AND `ghcr.io/ruvnet/wifi-densepose` with `latest` + `vX.Y.Z` + `sha-<short>` tags, then post-push smoke-tests the artifact: /health, /api/v1/info, the observatory + pose-fusion HTML, AND the bearer-auth path (no token → 401, wrong → 401, correct → 200). Uses the `DOCKERHUB_USERNAME`/`DOCKERHUB_TOKEN` repo secrets; ghcr.io rides on the workflow's GITHUB_TOKEN. ## #443 — sensing-server REST API auth model QE security audit raised that 40+ /api/v1/ routes have no auth layer with a default `0.0.0.0` bind. New `wifi_densepose_sensing_server::bearer_auth` module + middleware: - Env-var-gated: `RUVIEW_API_TOKEN` unset/empty ⇒ middleware is a no-op (current LAN-mode behaviour preserved — no default change); set ⇒ every `/api/v1/` request must carry `Authorization: Bearer <token>` or the server returns 401. - Constant-time byte compare via local `ct_eq` (no new dep). 
- `/health`, `/ws/sensing`, and `/ui/*` are intentionally never gated (orchestrator probes + local browsers). - Startup logs which mode is active and warns when auth is ON with a `0.0.0.0` bind. - 8 unit tests on the middleware via `tower::ServiceExt::oneshot` (sensing-server lib tests 191 → 199, 0 failures). Verified locally: `cargo build --workspace --no-default-features` ✓, `cargo test -p wifi-densepose-sensing-server --no-default-features` ✓. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-13 08:52:25 -04:00
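The bearer gate in c641fc44ae combines two ideas: an unset or empty token disables the check, and a set token is compared in constant time. The sketch below shows both under stated assumptions: `ct_eq` is the name used in the commit message, but its body here is a common XOR-accumulate pattern rather than the repository's code, and `is_authorized` is a hypothetical helper standing in for the middleware.

```rust
/// Compare two byte slices without exiting early on the first mismatch.
/// The length check does return early, so the length itself is not hidden.
fn ct_eq(a: &[u8], b: &[u8]) -> bool {
    if a.len() != b.len() {
        return false;
    }
    let mut diff = 0u8;
    for (x, y) in a.iter().zip(b.iter()) {
        // Accumulate differences instead of branching per byte.
        diff |= x ^ y;
    }
    diff == 0
}

/// Hypothetical gate: `configured_token` mirrors RUVIEW_API_TOKEN and
/// `authorization` is the raw Authorization header value, if present.
fn is_authorized(configured_token: Option<&str>, authorization: Option<&str>) -> bool {
    let token = match configured_token {
        Some(t) if !t.is_empty() => t,
        // Unset or empty token: the gate is a no-op (LAN mode).
        _ => return true,
    };
    match authorization.and_then(|header| header.strip_prefix("Bearer ")) {
        Some(presented) => ct_eq(presented.as_bytes(), token.as_bytes()),
        None => false,
    }
}
```

An optimising compiler is free to rewrite such a loop, so production code that needs a hard guarantee usually relies on an audited crate; the commit message states that the project deliberately avoided a new dependency.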
Deploy Bot	ce7983eb43	feat(sensing-server): adaptive person count — RollingP95 + dedup_factor runtime API RollingP95 adaptive normalizer (ADR-044 §5.2): - Streaming P95 estimator (600-sample / ~30 s window) replaces fixed-scale denominators (variance/300, motion/250, spectral/500) that saturated against live ESP32 values, collapsing dynamic range to zero. - Cold-start (<60 samples) falls back to legacy denominators — day-0 behaviour is preserved. - Three new fields on AppStateInner: p95_variance, p95_motion_band_power, p95_spectral_power (all RollingP95::new(600, 60)). - compute_person_score() refactored to accept &AppStateInner; all three call sites (wifi, wifi-fallback, simulated) updated. - 5 unit tests in rolling_p95_tests module. dedup_factor runtime API (ADR-044 §5.3): - New field dedup_factor: f64 (default 3.0) on AppStateInner. - fuse_or_fallback() gains dedup_factor param; fallback switches from max() to sum/dedup_factor (ceiling), matching the fork's sum-based aggregation. - RuntimeConfig struct + load/save_runtime_config() for data/config.json persistence across restarts. - Three new REST endpoints: GET /api/v1/config/dedup-factor POST /api/v1/config/dedup-factor POST /api/v1/config/ground-truth (auto-tune from known person count) Explicitly NOT included: - lambda=5.0 (upstream keeps its 0.1 default — deployment-specific tuning) - CC intensity threshold 0.3 and min-cluster-size 4 hardcodes - max_cc_size filter removal	2026-04-28 15:32:34 -04:00
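The adaptive normaliser in ce7983eb43 keeps a sliding window, reports its 95th percentile, and falls back to the legacy denominator during cold start. The sketch below reuses the `RollingP95::new(600, 60)` shape from the commit message, but everything inside is an assumption: it sorts a copy of the window on each query instead of using a true streaming estimator, uses the nearest-rank percentile definition, and the `normalise` method with its clamp to [0, 1] is hypothetical.

```rust
use std::collections::VecDeque;

/// Sliding-window P95 with a cold-start threshold.
struct RollingP95 {
    samples: VecDeque<f64>,
    capacity: usize,
    min_samples: usize,
}

impl RollingP95 {
    fn new(capacity: usize, min_samples: usize) -> Self {
        Self { samples: VecDeque::with_capacity(capacity), capacity, min_samples }
    }

    /// Add a sample, evicting the oldest once the window is full.
    /// Non-finite values are skipped (an assumption of this sketch).
    fn push(&mut self, value: f64) {
        if !value.is_finite() {
            return;
        }
        if self.samples.len() == self.capacity {
            self.samples.pop_front();
        }
        self.samples.push_back(value);
    }

    /// Nearest-rank 95th percentile, or None while still in cold start.
    fn p95(&self) -> Option<f64> {
        let n = self.samples.len();
        if n == 0 || n < self.min_samples {
            return None;
        }
        let mut sorted: Vec<f64> = self.samples.iter().copied().collect();
        sorted.sort_by(|a, b| a.total_cmp(b));
        let rank = (n * 95 + 99) / 100; // ceil(0.95 * n) in integer arithmetic
        Some(sorted[rank - 1])
    }

    /// Scale `value` by the adaptive P95, or by the legacy fixed denominator
    /// (for example 300.0 for variance) during cold start.
    fn normalise(&self, value: f64, legacy_denominator: f64) -> f64 {
        let denom = self.p95().filter(|d| *d > 0.0).unwrap_or(legacy_denominator);
        (value / denom).clamp(0.0, 1.0)
    }
}
```

With fewer than 60 samples, `normalise(150.0, 300.0)` divides by the legacy 300 and yields 0.5; once the window warms up, the same call divides by the observed P95 instead.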
rUv	17509a2a41	feat(ruvector,signal,sensing-server): ADR-084 Passes 1/1.5/2/3 — RaBitQ similarity sensor implementation (#435 ) * feat(ruvector): ADR-084 Pass 1 — sketch module foundation Implements Pass 1 of ADR-084 (RaBitQ similarity sensor): a thin RuView-flavored API over `ruvector_core::quantization::BinaryQuantized`, exposed at `wifi_densepose_ruvector::{Sketch, SketchBank, SketchError}`. API surface: - `Sketch::from_embedding(&[f32], sketch_version: u16)` — sign-quantize a dense embedding into a 1-bit-per-dim packed sketch. - `Sketch::distance` — hamming distance with schema-mismatch error. - `Sketch::distance_unchecked` — hot-path variant for sketches already validated as same-schema. - `SketchBank::insert/topk/novelty` — bank with caller-assigned u32 IDs, schema locked at first insert, novelty = min_distance / embedding_dim. Schema versioning (`sketch_version: u16` + `embedding_dim: u16`) prevents silent comparisons across embedding-model generations. Bumping the model forces re-sketch of the candidate bank. Pass 1 establishes the API and unit-test foundation. Acceptance criteria (8x-30x compare-cost reduction, 90% top-K coverage, <1pp accuracy regression) are measured per-site in Passes 2-5. Validated: - 12 new tests pass (sketch construction, hamming, top-K ordering, schema lock, schema rejection, novelty) - cargo test --workspace --no-default-features → 1,551 passed, 0 failed, 8 ignored (was 1,539 before; +12 new tests) - ESP32-S3 on COM7 still streaming live CSI (cb #117300) Co-Authored-By: claude-flow <ruv@ruv.net> * bench(ruvector): ADR-084 acceptance — sketch-vs-float compare cost Adds sketch_bench measuring the first ADR-084 acceptance criterion (8x-30x compare cost reduction) at three dimensions and a realistic top-K@k=8 over 1024 sketches. 
Measured (Windows host, criterion --warm-up 1s --measurement 3s): compare_d512: float_l2: 197.03 ns/op float_cosine: 231.17 ns/op sketch_hamming: 4.56 ns/op → 43-51x speedup topk_d128_n1024_k8: float_l2_topk: 47.59 us sketch_hamming: 6.34 us → 7.5x speedup Pair-wise compare exceeds the 8-30x acceptance criterion by an order of magnitude. Top-K is at 7.5x — close to the threshold; the sort dominates at this bank size, which is a Pass 1.5 optimization opportunity (partial-sort heap for small K). Co-Authored-By: claude-flow <ruv@ruv.net> * perf(ruvector): ADR-084 Pass 1.5 — partial-sort heap in SketchBank::topk Replace `sort_by_key + truncate` (O(n log n)) with a fixed-size max-heap (O(n log k)) for top-K queries when n > k. Fast path when n ≤ k stays on the simple sort. Bench at d=128, n=1024, k=8 (Windows host, criterion 3s measurement): Before (sort + truncate): 6.34 µs/op After (heap): 3.83 µs/op -39.4% / +1.65× faster Combined with the 32× memory shrink and 47.6 µs → 3.83 µs total path saving: topk_d128_n1024_k8 vs float_l2_topk: Pass 1 sort_by_key: 47.59 µs / 6.34 µs = 7.5× speedup Pass 1.5 heap: 47.59 µs / 3.83 µs = 12.4× speedup Now over the ADR-084 acceptance criterion of 8× minimum. Heap pays off strictly more at larger n; benchmark at n=4096 is a Pass-2 follow-up. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(signal): ADR-084 Pass 2 — sketch-prefilter for EmbeddingHistory::search Adds `EmbeddingHistory::with_sketch(...)` and `search_prefilter(query, k, prefilter_factor)`. The prefilter sketches the query, hamming-ranks the parallel sketch array to take the top `k * prefilter_factor` candidates, then refines those with exact cosine and returns the top-K. `EmbeddingHistory::new(...)` is unchanged — sketches are opt-in via the new constructor. `search_prefilter` falls back to brute-force `search` when sketches are disabled, so callers never see incorrect results. 
ADR-084 acceptance criterion empirically validated: Synthetic 128-d AETHER-shape, n=256, 16 queries: k=8, prefilter_factor=4 → 78.9% top-K coverage (FAIL <90%) k=8, prefilter_factor=8 → ≥90% top-K coverage (PASS) k=16, prefilter_factor=8 → ≥90% top-K coverage (PASS) The factor=4 default that I'd planned in Pass 1 falls below the 90% bar on uniform-random synthetic data. Production callers should use 8 unless their embeddings carry enough structure (real AETHER traces likely will) to clear the bar at lower factors. Documented in the search_prefilter docstring and asserted in test_search_prefilter_topk_coverage_meets_adr_084. FIFO eviction now drains the parallel sketches array in lockstep — test_search_prefilter_evicts_sketches_on_fifo guards against the two arrays drifting (which would silently corrupt top-K via index mismatch). Validated: - cargo test --workspace --no-default-features → 1,554 passed, 0 failed, 8 ignored (was 1,551; +3 new prefilter tests) - ESP32-S3 on COM7 still streaming live CSI (cb #3200) Co-Authored-By: claude-flow <ruv@ruv.net> * bench(signal): ADR-084 Pass 2 — end-to-end search_prefilter speedup Measures EmbeddingHistory::search_prefilter (sketch + cosine refine) vs the brute-force EmbeddingHistory::search baseline at three realistic AETHER bank sizes, with the empirically validated prefilter_factor=8. Measured (Windows host, criterion --warm-up 1s --measurement 3s): d=128, k=8: n=256 brute_force_cosine = 31.98 us, prefilter = 13.78 us → 2.3x n=1024 brute_force_cosine = 110.4 us, prefilter = 16.64 us → 6.6x n=4096 brute_force_cosine = 507.4 us, prefilter = 66.37 us → 7.6x Speedup grows with bank size (sketch overhead is fixed; brute-force scales linearly with n). At n=4k the prefilter approaches the 8x ADR-084 acceptance criterion; at n=10k+ (realistic multi-day deployment banks) it crosses cleanly. Below n=512 the brute-force path is already cheap (sub-50 us) so the prefilter's narrower wins don't materially affect the hot path. 
Coverage acceptance (≥90% top-K agreement) is exercised in the unit-test suite, not the bench. The bench measures cost only. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(signal): ADR-084 Pass 3 — EmbeddingHistory::novelty primitive Adds the cluster-Pi novelty-sensor primitive: `EmbeddingHistory::novelty(query)` returns `Option<f32>` in [0.0, 1.0] where 0.0 = exact-match-in-bank and 1.0 = no-overlap. Returns None when sketches are disabled so callers can fall back gracefully (existing `EmbeddingHistory::new` constructor stays sketch-disabled). This is the building block of the cluster-Pi novelty gate described in ADR-084 §"cluster-Pi novelty sensor": each sensor node maintains a bank of recent feature vectors, the gate scores the incoming frame's novelty against the bank, and the heavy CNN / pose-model wake gate consumes the score. Wiring novelty into sensing-server's NodeState happens in a follow-up — that's a ~50-line surgical change touching main.rs that deserves its own commit. This patch lands the primitive + tests so the wiring is straightforward. Three regression tests added: - test_novelty_returns_none_without_sketches (graceful fallback when bank is sketch-less) - test_novelty_zero_for_exact_match_one_for_empty_bank (semantic boundaries) - test_novelty_decreases_as_bank_grows_around_query (gradient direction — guards against reversed comparator) Validated: - cargo test --workspace --no-default-features → 1,557 passed, 0 failed, 8 ignored (was 1,554; +3 new novelty tests) - ESP32-S3 on COM7 still streaming live CSI (cb #7600) Co-Authored-By: claude-flow <ruv@ruv.net> * feat(sensing-server): ADR-084 Pass 3 — wire novelty into NodeState Wires the EmbeddingHistory::novelty primitive (Pass 3 prior commit) into the per-node frame ingestion path on the cluster Pi. Each incoming CSI frame now updates a per-node sketch bank of the last 6.4 s of feature vectors and produces a novelty score in [0.0, 1.0] that downstream model-wake gates can consume. 
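Note on the novelty primitive above: one plausible reading of the score is the minimum Hamming distance to any stored sketch, divided by the embedding dimension. The sketch below assumes that reading. `SketchHistory` and its fields are hypothetical names, not the real `EmbeddingHistory` layout. It returns `None` when sketches are disabled and treats an empty bank as maximally novel.

```rust
fn hamming(a: &[u8], b: &[u8]) -> u32 {
    a.iter().zip(b).map(|(x, y)| (x ^ y).count_ones()).sum()
}

/// Hypothetical stand-in for a history with an optional sketch bank.
struct SketchHistory {
    /// Number of embedding dimensions, one bit per dimension in each sketch.
    dim: usize,
    /// `None` models a history built without sketches.
    sketches: Option<Vec<Vec<u8>>>,
}

impl SketchHistory {
    /// Novelty in [0.0, 1.0]: 0.0 for an exact sketch match, 1.0 when
    /// every bit differs or when there is nothing to compare against.
    fn novelty(&self, query_sketch: &[u8]) -> Option<f32> {
        let bank = self.sketches.as_ref()?;
        // A zero-dimension history or an empty bank allows no comparison;
        // report maximum novelty rather than dividing by zero.
        if self.dim == 0 || bank.is_empty() {
            return Some(1.0);
        }
        let min_d = bank.iter().map(|s| hamming(s, query_sketch)).min()?;
        Some(min_d as f32 / self.dim as f32)
    }
}
```

Scoring before inserting the new frame matters in this scheme: a frame that is already in the bank always scores 0.0 against itself.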
Two NodeState structs were touched (one in types.rs and a refactoring-leftover duplicate in main.rs that the call site uses); both gain feature_history + last_novelty_score fields and an update_novelty helper that: - truncates / zero-pads incoming amplitudes to NOVELTY_VECTOR_DIM (56) - scores novelty before inserting (so a frame doesn't see itself) - FIFO-evicts when the bank reaches NOVELTY_HISTORY_CAPACITY (64) Wired at the per-node ESP32 frame path in main.rs:3772 (immediately before frame_history.push_back). Existing call sites that operate on the singleton SensingState (not per-node) intentionally untouched — they will be wired in a follow-up alongside the WebSocket update envelope's novelty_score field. Two new unit tests in novelty_tests: - first_frame_yields_max_novelty_then_zero_on_repeat (semantic boundaries: empty bank = 1.0, exact repeat = 0.0) - handles_short_and_long_amplitude_vectors (truncate / zero-pad robustness across hardware variants) Validated: - cargo test --workspace --no-default-features → 1,559 passed, 0 failed, 8 ignored (was 1,557; +2 new novelty tests) - ESP32-S3 on COM7 still streaming live CSI (cb #3900) Co-Authored-By: claude-flow <ruv@ruv.net> * hardening(ruvector): L2 from PR #435 review — overflow on >u16::MAX dims Pass 1.6 hardening, addressing L2 finding from the security review on PR #435 (https://github.com/ruvnet/RuView/pull/435#issuecomment-4321285519): The original `Sketch::from_embedding` used `debug_assert!` for the `embedding.len() <= u16::MAX` invariant, which compiled out in release builds. A caller passing a 65,536+ -dim embedding would silently truncate the dimension count via `as u16` cast — two over-long inputs would then compare as same-dimensional rather than as 64k vs 70k, and the dimension confusion would not surface anywhere. Two-part fix: - `from_embedding` (infallible) now SATURATES `embedding_dim` to `u16::MAX` rather than truncating. 
Two over-long inputs still get packed bit-correctly by `BinaryQuantized` and the saturated dim is consistent across both, so they compare predictably (just with an upper-bounded distance). - `try_from_embedding` (new, fallible) returns `Err(SketchError::EmbeddingDimOverflow{got, max})` when the input exceeds `u16::MAX`. Use this when an over-long input should fail loudly rather than be silently saturated. - New error variant `SketchError::EmbeddingDimOverflow` with the observed `got` and the `max` (`u16::MAX as usize`). - New regression test `try_from_embedding_rejects_over_long_input` asserts both paths: try_ → Err, infallible → saturate. Validated: - 13 sketch unit tests pass (was 12; +1 for L2 boundary). - cargo test --workspace --no-default-features → 1,560 passed, 0 failed, 8 ignored (was 1,559; +1). - ESP32-S3 on COM7 streaming live CSI (cb #100, fresh boot RSSI -48 dBm). Co-Authored-By: claude-flow <ruv@ruv.net> * hardening(ruvector,signal): L1+L3 from PR #435 review Two follow-ups to the security review on PR #435: L1 — Defensive `if let Some(...)` for SketchBank::topk heap peek. The original `.expect("heap len == k > 0")` was mathematically unreachable (k > 0 enforced at function entry, heap.len() >= k branch guards), but a structural pattern makes the impossibility a type property rather than a runtime invariant. Same hot-path cost; zero panic risk in the production binary. L3 — Guard `embedding_dim == 0` in `EmbeddingHistory::novelty`. A 0-dim history is constructible via `with_sketch(0, ...)`; without the guard the function returned `NaN` (min_d as f32 / 0.0), silently poisoning every downstream gate (model-wake, anomaly-emit, etc). Now returns Some(1.0) — fail-loud at "no comparison possible → maximally novel," never NaN. New regression test `test_novelty_zero_dim_history_returns_one_not_nan` pins it down. Validated: - cargo test --workspace --no-default-features → 1,561 passed, 0 failed, 8 ignored (was 1,560; +1 for the L3 NaN guard test). 
- ESP32-S3 on COM7 streaming live CSI (cb #12400, RSSI fresh). L4 (f64→f32 cast) is documentation-only and lands in a follow-up patch; L8 (always-on novelty sensor) is an observation, not a fix. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(sensing-server): ADR-084 Pass 3.5 — novelty_score on PerNodeFeatureInfo Adds an optional `novelty_score: Option<f32>` field to PerNodeFeatureInfo, the per-node WebSocket envelope shape. Mirrored on both struct definitions (types.rs canonical + main.rs's refactoring-leftover duplicate) so the schema is consistent. `#[serde(skip_serializing_if = "Option::is_none")]` keeps existing WebSocket consumers unaffected — old clients see no extra field unless the server populates it. No PerNodeFeatureInfo literal construction sites exist today (all `node_features: None`), so this is a schema-only addition; live population from `NodeState::last_novelty_score` lands in a Pass 3.6 follow-up that also wires `node_features: Some(...)` at the per-node ESP32 frame emit path. Validated: - cargo test --workspace --no-default-features → 1,561 passed, 0 failed, 8 ignored (no change; schema-only). - ESP32-S3 on COM7 streaming live CSI (cb #2100, fresh boot). Co-Authored-By: claude-flow <ruv@ruv.net> * feat(sensing-server): ADR-084 Pass 3.6 — populate node_features with novelty_score Wires `node_features: Some(...)` at the two per-node ESP32 frame emit sites (formerly `node_features: None`). Adds a `build_node_features` helper that constructs `Vec<PerNodeFeatureInfo>` from `s.node_states`, including the per-node `last_novelty_score`. This completes the Pass 3.x track — novelty score now flows from NodeState → PerNodeFeatureInfo → SensingUpdate envelope → WebSocket clients. Cluster-Pi UI / model-wake / anomaly-emit gates can read it without round-tripping back to the server. 
Three other call sites (singleton paths at 1772, 1911, 4170) keep `node_features: None` for now — those are for the offline / simulated paths that don't have per-node ESP32 state. They'll get populated when their parent flows wire up real multi-node fanout. Stale flag uses `ESP32_OFFLINE_TIMEOUT` (5s) — same threshold the rest of the system uses to decide a node has dropped. Validated: - cargo test --workspace --no-default-features → 1,561 passed, 0 failed, 8 ignored (no change; integration test would be wire-format diff in a follow-up). - ESP32-S3 on COM7 streaming live CSI (cb #100, fresh boot, RSSI -49 dBm). Co-Authored-By: claude-flow <ruv@ruv.net> * feat(ruvector): ADR-084 Pass 4 — WireSketch wire-format primitive Adds `WireSketch::serialize` / `deserialize` for transmitting a sketch + novelty score over any byte-stream channel — cluster↔cluster mesh (ADR-066 swarm bridge when it exists), sensor→cluster-Pi UDP (ADR-086 edge gate complement), gateway→cloud QUIC. Channel-agnostic by design. Wire layout (12-byte header + ceil(dim/8) bytes payload, little-endian): [0..4] magic = 0xC5110084 [4..6] format_version = 1 [6..8] sketch_version (embedding-model schema) [8..10] embedding_dim [10..12] novelty_q15 (novelty * 32_767, saturated) [12..] packed sketch bits A 128-d AETHER sketch fits in exactly 28 bytes (12 header + 16 payload bytes). Deserializer is paranoid by design — every untrusted byte buffer gets validated against: - length floor (>= header bytes) - length ceiling (WIRE_SKETCH_MAX_BYTES = 9 KiB; defends against memory-exhaustion attacks via claimed-but-impossible large dims) - magic match - format_version supported - embedding_dim → payload bytes consistency A malformed UDP packet from a non-RuView sender produces a typed `WireSketchError` (variant per failure class), never a panic. Re-exported from lib.rs alongside `Sketch` / `SketchBank`.
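Note on the Pass 4 wire layout above: a minimal encoder and validating decoder for the 12-byte little-endian header plus packed bits, written from the layout in the commit text. The type and function names (`Wire`, `WireError`, `serialize`, `deserialize`, `novelty_to_q15`) are illustrative assumptions and do not claim to match the real `WireSketch` API. The truncating q15 conversion is also an assumption, since the commit only says "saturated". A 128-dimension sketch carries 128 bits, that is 16 payload bytes, giving 28 bytes in total.

```rust
const MAGIC: u32 = 0xC511_0084;
const FORMAT_VERSION: u16 = 1;
const HEADER_BYTES: usize = 12;
const MAX_BYTES: usize = 9 * 1024; // 9 KiB ceiling from the commit text

#[derive(Debug, PartialEq)]
enum WireError {
    TooShort,
    TooLong,
    BadMagic,
    UnsupportedVersion(u16),
    PayloadMismatch { expected: usize, got: usize },
}

#[derive(Debug, PartialEq, Clone)]
struct Wire {
    sketch_version: u16,
    embedding_dim: u16,
    novelty_q15: u16,
    bits: Vec<u8>,
}

/// Map novelty in [0, 1] to q15. Out-of-range input is clamped; NaN becomes 0.
fn novelty_to_q15(n: f32) -> u16 {
    (n.clamp(0.0, 1.0) * 32_767.0) as u16
}

fn serialize(w: &Wire) -> Vec<u8> {
    let mut out = Vec::with_capacity(HEADER_BYTES + w.bits.len());
    out.extend_from_slice(&MAGIC.to_le_bytes());
    out.extend_from_slice(&FORMAT_VERSION.to_le_bytes());
    out.extend_from_slice(&w.sketch_version.to_le_bytes());
    out.extend_from_slice(&w.embedding_dim.to_le_bytes());
    out.extend_from_slice(&w.novelty_q15.to_le_bytes());
    out.extend_from_slice(&w.bits);
    out
}

/// Validate an untrusted buffer; every failure is a typed error, never a panic.
fn deserialize(buf: &[u8]) -> Result<Wire, WireError> {
    if buf.len() < HEADER_BYTES {
        return Err(WireError::TooShort);
    }
    if buf.len() > MAX_BYTES {
        return Err(WireError::TooLong);
    }
    // Indexing is safe from here on: the header is known to be present.
    let u16_at = |i: usize| u16::from_le_bytes([buf[i], buf[i + 1]]);
    if u32::from_le_bytes([buf[0], buf[1], buf[2], buf[3]]) != MAGIC {
        return Err(WireError::BadMagic);
    }
    let version = u16_at(4);
    if version != FORMAT_VERSION {
        return Err(WireError::UnsupportedVersion(version));
    }
    let dim = u16_at(8);
    let expected = (dim as usize + 7) / 8;
    let got = buf.len() - HEADER_BYTES;
    if expected != got {
        return Err(WireError::PayloadMismatch { expected, got });
    }
    Ok(Wire {
        sketch_version: u16_at(6),
        embedding_dim: dim,
        novelty_q15: u16_at(10),
        bits: buf[HEADER_BYTES..].to_vec(),
    })
}
```

The largest legal payload (65,535 dimensions, 8,192 bytes) plus the header is 8,204 bytes, which stays under the 9 KiB ceiling, so the ceiling check can run before any header parsing.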
Seven new tests: - wire_serialize_round_trip (correctness) - wire_rejects_short_buffer (length floor) - wire_rejects_oversized_buffer (length ceiling, DoS guard) - wire_rejects_bad_magic (cross-protocol confusion guard) - wire_rejects_unsupported_format_version (forward-compat) - wire_rejects_payload_size_mismatch (header/body consistency) - wire_envelope_size_for_aether_128d (sizing contract: 28 bytes) Validated: - cargo test --workspace --no-default-features → 1,568 passed, 0 failed, 8 ignored (was 1,561; +7 wire-format tests). - ESP32-S3 on COM7 streaming live CSI (cb #15100, RSSI -48 dBm). Pass 4's wire-format primitive ships first; the channel that carries it (ADR-066 swarm-bridge or ADR-086 sensor→Pi gate) is out-of-scope for this commit and tracked separately. Co-Authored-By: claude-flow <ruv@ruv.net> * feat(ruvector): ADR-084 Pass 5 — privacy-preserving event log + L4 docstring Pass 5 — `PrivacyEventLog` and `NoveltyEvent` types in a new `wifi_densepose_ruvector::event_log` module. Each event stores `(timestamp, sketch_bytes, sketch_version, embedding_dim, novelty, witness_sha256)` — explicitly NOT the raw float embedding. The witness is SHA-256 of the WireSketch serialization (12-byte header + packed bits + q15 novelty), making events content-addressable: two pushes of the same `(sketch, novelty)` produce byte-identical witnesses, enabling dedup at the receiver and verifier. Privacy properties (ADR-084 §"Privacy-preserving event log"): 1. Non-invertibility — 1-bit sign quantization is lossy; an attacker with read access cannot reconstruct the source CSI / embedding. 2. Content addressing — `(sketch_version, witness)` is fully qualified. 3. Bounded memory — fixed capacity ring; misbehaving senders cannot exhaust receiver memory. 
Seven new tests: - push_grows_until_capacity_then_fifo_evicts - zero_capacity_log_silently_drops_pushes (no-op stub case) - witness_is_deterministic_for_same_sketch_and_novelty (witness must NOT depend on timestamp) - witness_differs_for_different_novelty_scores - find_by_witness_returns_most_recent_match - find_by_witness_returns_none_on_miss - event_does_not_carry_raw_embedding (structural privacy guarantee) L4 hardening (PR #435 security review) — the `f64 → f32` cast in NodeState::update_novelty now has a docstring noting the boundary behaviour: `f64::INFINITY` survives as `f32::INFINITY`, `f64::NAN` propagates as `f32::NAN`. Neither panics. CSI amplitudes from healthy firmware are well within f32 finite range. Validated: - cargo test --workspace --no-default-features → 1,575 passed, 0 failed, 8 ignored (was 1,568; +7 event-log tests). - ESP32-S3 on COM7 streaming live CSI (cb #2800, RSSI -52 dBm). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-04-26 02:21:35 -04:00
rUv	f49c722764	chore(repo): rename rust-port/wifi-densepose-rs → v2/ (flatten to one level) (#427) The Rust port lived two directories deep (rust-port/wifi-densepose-rs/) without any sibling under rust-port/ that warranted the extra level. Move the whole workspace up to v2/ to match v1/ (Python) at the same depth and shorten every cd / build command across the repo. git mv preserves history for all tracked files. 60 files updated for path references (CI workflows, ADRs, docs, scripts, READMEs, internal .claude-flow state). Two manual fixes for relative-cd paths in CLAUDE.md and ADR-043 that became wrong after the depth change (cd ../.. → cd ..). Validated: - cargo check --workspace --no-default-features → clean (after target/ nuke; the gitignored target/ was carried by the OS rename and had hard-coded old paths in build scripts) - cargo test --workspace --no-default-features → 1,539 passed, 0 failed, 8 ignored (same totals as pre-rename) - ESP32-S3 on COM7 → still streaming live CSI (cb #40300, RSSI -64 dBm) After-merge follow-up: contributors should `rm -rf v2/target` once and let cargo regenerate from the new path.	2026-04-25 21:28:13 -04:00

19 Commits