wifi-densepose

Commit Graph

Author	SHA1	Message	Date
ruv	26e00e6910	fix(adr-115/test): drive state-test snapshots throughout capture window Iter 46 — second attempt at fixing state_messages_published_on_snapshot_broadcast on CI. The iter-45 SubAck fix proved necessary but not sufficient; the test still returned an empty Vec for presence states. Root cause analysis: the test was front-loading 6 snapshots over 1.2 s after a 3 s warm-up sleep, then capturing for 8 s. That schedule assumes: - mosquitto sidecar is ready - cargo build cache is warm - rumqttc connect + 21 QoS-1 discovery publishes complete in <3 s - the publisher's select! starts draining state_rx in <3 s On the CI runner those assumptions break. The publisher takes >3 s to finish discovery, so all 6 state publishes either land in the rumqttc outbound channel before the broker is reachable OR are emitted while the subscriber's reception path has stalled. Fix: drive snapshots in a background task THROUGHOUT the capture window instead of front-loading them. 40 snapshots × 300 ms = 12 s of steady-state ON/OFF traffic across a 14 s capture window. Even if the first 3-5 s of publishes are missed during slow publisher bootstrap, plenty of ON and OFF messages arrive afterward. This also makes the test more representative of real HA workloads (steady stream of vitals, not a burst then silence). Local cargo test --features mqtt --no-default-features --test mqtt_integration --no-run → compiles green. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 16:02:31 -04:00
ruv	636ca7b52f	test(adr-115): diagnostic dump + wider subscription on state-test failure After 4 surgical fixes the state_messages_published_on_snapshot_broadcast test still reports 'expected ON state, got []' on CI — and we can't tell whether the publisher is publishing nothing, or publishing the wrong topic, or publishing to a session the subscriber lost. Two changes to surface what's actually happening: 1. Widen subscription from `homeassistant/binary_sensor/+/presence/state` to `homeassistant/#`. Now the captured-message dump shows every topic the publisher emitted under the homeassistant prefix — discovery configs, availability heartbeats, state messages, anything else. A narrow filter was hiding which side of the pipeline was broken. 2. Add stderr `[diag]` lines that dump every captured (retain, topic, payload-prefix) on test failure. CI runs `--nocapture` so the lines land in the workflow log. From the next failed-CI log we'll know whether: - publisher isn't emitting state at all (no /state topics in dump) - publisher is emitting to a different topic shape (typo in topic format string) - subscriber connected to a stale session and missed messages (would see discovery + no state but dump would have count > 0) - subscriber is connecting after publisher disconnected (count = 0 even after widening) This is a debugging commit, not a production fix — once we know the exact failure mode from the next CI log we can ship a real fix. Refs PR #778, issue #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 15:58:33 -04:00
ruv	967cede74d	fix(adr-115/test): drive subscriber eventloop until SubAck before returning Third cause of the state_messages_published_on_snapshot_broadcast failure (after timing fix in `5ed8e3451` and client_id fix in `2aeed32a7`): the subscriber's eventloop was NEVER polled between `client.subscribe(...).await` and `collect_published(eventloop, ...)`, so the SUBSCRIBE packet was only queued in rumqttc's outbound channel — it didn't reach the broker until collect_published began polling. By that time the publisher had already emitted all 6 state messages. The retained ones (binary_sensor presence with retain=true) should have been redelivered on the late subscribe, but only the LAST one would land — yet CI was reporting `got []` (zero messages). Theory: the broker may not redeliver retained messages reliably when the subscribe arrives during the publisher's burst, OR the test's collect_published timing budget runs out before redelivery completes. Fix: drain the subscriber's eventloop inside `subscribe_client` until we see the SubAck for our subscribe. That guarantees the subscription is active at the broker BEFORE the function returns, so non-retained publishes from the publisher's send loop arrive normally. Also made the subscriber client_id include a per-call nanosecond suffix so subscribers in back-to-back tests can't collide on a shared ID (paranoia, complementary to the publisher-side fix from `2aeed32a7`). Refs PR #778, issue #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 15:52:52 -04:00
ruv	2aeed32a72	fix(adr-115/test): per-test MQTT client_id so session-takeover doesn't break state test The mqtt-integration test suite still failed `state_messages_published_ on_snapshot_broadcast` after the timing fix (`5ed8e3451`) — but with a new symptom: 'expected ON state, got []'. The subscriber captured ZERO messages on the presence state topic. Root cause: all three integration tests built `client_id` as `ruview-int-test-<pid>` — the same string for every test in the sequential cargo-test run. MQTT brokers default to "session takeover": when a new connect arrives with the same client_id as an existing session, mosquitto disconnects the old one immediately. Sequence on CI (`--test-threads=1`): 1. discovery_topics_appear_on_broker connects (ruview-int-test-1234) 2. test passes; publisher task continues running in background 3. privacy_mode_suppresses_biometric_discovery connects (same id) → mosquitto kicks test 1's publisher mid-rumqttc-disconnect-handshake 4. state_messages_published_on_snapshot_broadcast connects (same id) → mosquitto kicks test 2's publisher → test 3's publisher in turn races with the broker's cleanup and its first publishes may land in a half-cleaned session → state messages dropped silently Fix: include a per-test label in the client_id (`ruview-int-test-<pid>-<label>` — labels: "discovery", "privacy", "state"). Each test gets its own MQTT session; no cross-test takeover. Refs PR #778, issue #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 15:44:56 -04:00
ruv	00e766ec28	merge: main into feat/adr-115-ha-mqtt-matter — resolve CHANGELOG + ADR-115.md Brings the new main (with ADR-110 merged, commit `00a234eda`) into the ADR-115 branch so PR #778 can be merge-able. Conflicts: CHANGELOG.md Both branches added entries to the [Unreleased] → Added section. Resolution: keep BOTH — the ADR-115 HA+Matter entry first (front-facing, this branch's contribution), then the ADR-110 waves 1-5 entries from main (already merged, historical record). No content lost. docs/adr/ADR-115-home-assistant-integration.md Add/add conflict — main got the file in its earlier shape (Status: Proposed, Tracking issue: TBD) via the iter-17-19 cross-branch checkout incident on the adr-110 branch that ended up merged via PR #764. This branch's version has the current Accepted status and the real PR #778 link. Resolution: take this branch's authoritative ADR-115 content. The 3 .rs files I had flagged in PR #778 comment 4526344883 (lib.rs, esp32_parser.rs, tracker_bridge.rs) AUTO-MERGED cleanly — this branch's local state already had the equivalent shape. Verification: cargo check -p wifi-densepose-sensing-server --no-default-features → green (5 warnings, 0 errors). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 15:44:33 -04:00
ruv	5bc081d61d	fix(adr-115/doctest): wrap ASCII endpoint tree in ```text fence (bridge.rs) The module-level doc comment in matter/bridge.rs had a 4-space-indented ASCII tree diagram. Rustdoc parses any 4-space-indented block in a doc comment as a Rust code block (markdown indented-code-block syntax) and runs it as a doctest. The tree text isn't valid Rust → doctest fails. This broke the Rust Workspace Tests workflow on PR #778: test crates/.../src/matter/bridge.rs - matter::bridge (line 6) ... FAILED test result: FAILED. 0 passed; 1 failed error: doctest failed, to rerun pass `-p ... --doc` Wrapping the tree in a `text` fenced block tells rustdoc to render but not compile it. Verified locally: cargo test -p wifi-densepose-sensing-server --no-default-features --doc test result: ok. 0 passed; 0 failed; 1 ignored Refs PR #778, issue #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 15:38:46 -04:00
rUv	00a234eda8	ADR-110: ESP32-C6 firmware extension (#764 ) Closes the firmware-side ADR-110 design at v0.7.0-esp32 after a 38-iter /loop SOTA sprint. Headline (bench, COM9+COM12 ESP32-C6): - 99.56% cross-board RX, 104.1 µs smoothed offset stdev (≤100 µs §2.4 target met) - 3.95× EMA suppression, 1.4 ppm crystal skew preserved 4 firmware releases: v0.6.7 / v0.6.8 / v0.6.9 / v0.7.0-esp32. 42 ADR-110 unit tests, 1761 v2 workspace tests, full Firmware CI + QEMU green.	2026-05-23 15:34:48 -04:00
ruv	5ed8e34510	fix(adr-115/test): state_messages integration test — race on publisher startup state_messages_published_on_snapshot_broadcast was failing under CI (0/3 → 2/3 passes; only this one red). Root cause: the test waited only 700ms after spawn(publisher) before sending the first VitalsSnapshot through the broadcast channel, and used a 3s capture window after a 200ms inter-snapshot delay. What's actually happening on the wire during those 700ms: 1. rumqttc::AsyncClient::new() returns immediately (connection is lazy — happens on first publish) 2. publisher::run() awaits publish_all_discovery() which issues 21+ QoS-1 publishes on the discovery prefix. Each is an ack-waited round-trip — median ~800ms total on local loopback, easily >2s on a fresh GH Actions runner with cold rustls. 3. After discovery, the run loop reaches its tokio::select! and starts draining state_rx. The test was sending broadcasts WHILE the publisher was still in discovery, so the broadcast::Receiver buffer (capacity 32) was draining without the publisher ever processing them — the publisher's select! only polls state_rx between rumqttc events. Fix: - Wait 3s after spawn() (well past observed ramp-up, doubled for CI variance) - Send 6 snapshots in a loop with 200ms gaps (one dropped won't tank the test) - Capture window 8s instead of 3s (room for rate-limited publishes to land) Local impact: test now reliably passes against `mosquitto -c allow_anonymous=true` on loopback in ~12s wall time. CI matrix should pick the same green outcome. Other two integration tests (discovery + privacy_mode) already passed on every prior run — they only assert on discovery topics, which the publisher emits before any state. Refs PR #778, issue #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 15:33:41 -04:00
ruv	cb3ea9fbd2	test(adr-115): property-based fuzzing for Matter commissioning encoder (424 lib tests) 4 new proptest cases in matter::commissioning::tests. Each fuzzes ~256 random (passcode, discriminator) pairs per cargo-test run → ~1,024 additional commissioning-code trials per CI cycle. ## Invariants enforced under random sampling - `manual_code_shape_invariants` — for ANY valid (passcode, disc) in range and not in the §5.1.6.1 disallowed set, from_input MUST produce: exactly 11 ASCII digits, Verhoeff self-consistent body+ check, 4-3-4 display form with dashes at positions 4 and 8. - `disallowed_passcodes_always_rejected` — every passcode in the §5.1.6.1 list MUST be rejected regardless of discriminator. - `oversized_inputs_always_rejected` — passcode ≥ 2^27 OR discriminator ≥ 2^12 MUST be rejected, regardless of the other axis's value. - `manual_code_deterministic_under_random_input` — same input always produces same code (uses prop_assume to skip the spec-disallowed passcodes since they'd Err out before getting to the code-equality check). The DISALLOWED_PASSCODES const is hoisted from the example-based test for reuse across proptest cases. Lib test count: 420 → 424. Effective ADR-115 fuzz coverage rises to ~3,584 fuzzed trials per CI run (1,280 wire-boundary + 1,280 semantic-bus + 1,024 commissioning). Refs #776, PR #778. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 15:24:27 -04:00
ruv	b41fdd75c6	test(adr-115): property-based invariants for the semantic bus (420 lib tests) 5 new proptest cases in semantic:🚌:tests. Each runs ~256 iterations per cargo-test invocation → ~1,280 additional fuzzed snapshot trials per CI run, throwing every variety of RawSnapshot the bus can plausibly see at the 10-primitive FSM dispatch. The `arb_snapshot()` Strategy generates RawSnapshots with: - since_start ∈ 0..86400 s (covers warmup + 24h primitives) - timestamp_ms full positive range - motion deliberately ∈ -0.5..2.0 (out-of-range to test clamping) - motion_energy ∈ -1000..10000 - breathing_rate_bpm ∈ Option<0..200> - heart_rate_bpm ∈ Option<0..250> - n_persons ∈ 0..10 - rssi_dbm ∈ Option<-120..0> - vital_confidence ∈ 0..1 - local_seconds_since_midnight ∈ 0..86400 (covers bed_exit window wrap-around test) - active_zones ∈ random vec of [a-z]{3,8} strings Strategy is split into two nested tuples because proptest only impls Strategy for tuples up to length 12 (we have 13 fields). Invariants enforced: - `bus_tick_never_panics_on_arbitrary_snapshot` — every primitive handles every plausible input without panic. Pathological cases include motion=1.7, HR=Some(0.0), empty zones, NULs nowhere (RawSnapshot doesn't carry those), and odd timestamp combinations. - `bus_events_carry_node_id_and_ts` — no event ever emitted with empty node_id; timestamp_ms exactly matches the input snapshot's. - `boolean_states_always_have_reason_tags` — when `changed=true`, the `reason.tags` MUST be non-empty. The explainability contract is enforced at the bus boundary, not just where convenient. - `per_tick_event_count_bounded_by_primitive_count` — bus emits ≤ 10 events per tick (one per primitive). Catches double-emission bugs where a future primitive accidentally fires twice. - `replay_same_snapshot_is_deterministic_per_fresh_bus` — replaying the same snapshot to N fresh buses produces the same event-kind list every time. Catches uninitialised internal state. Lib test count: 415 → 420 (each proptest function = 1 test slot but fuzzes ~256 cases internally). Effective coverage rises to ~1,955 assertions per CI lib run. Refs #776, PR #778. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 15:15:24 -04:00
ruv	0f7a4bd36e	feat(adr-115): flip Status → Accepted (MQTT track) + property-based fuzz tests + ADR index entry ## Status flip — ADR-115 §Status Per maintainer ACK (#776 issue body + 13 ACK'd open questions) and the shipped implementation in PR #778 (410 lib tests, witness bundle VERIFIED), the MQTT track is now Accepted. The Matter SDK wiring P8b remains Proposed pending the §9.10 deferral to v0.7.1. ADR header table updated: - Status: "Accepted (MQTT track P1-P7 + P8a + P9 + P10 shipped 2026-05-23 in PR #778, 410 lib tests, witness bundle VERIFIED) / Proposed (Matter SDK wiring P8b deferred to v0.7.1 per §9.10)" - Codename: HA-DISCO (MQTT) + HA-FABRIC (Matter) + HA-MIND (semantic primitives) — the third codename always belonged in the masthead. - Tracking issue: now points at #776 + PR #778 `docs/adr/README.md` ADR index gets an ADR-115 row in the "Platform and UI" section with the same Accepted/Proposed split. ## Property-based fuzzing — mqtt::security Added 5 proptest cases (each runs ~256 iterations per cargo-test invocation, so ~1280 additional assertions per CI run): - topic_segment_rejects_anything_with_wildcards_or_separators — random Unicode prefix/suffix + an injected '+', '#', NUL, or '/' MUST be rejected - topic_segment_accepts_safe_alphabet — any string built solely from the safe alphabet MUST be accepted - topic_segment_always_rejects_empty — invariant across seeds - payload_size_check_is_monotonic — every size ≤ MAX is OK, every size > MAX errors with the exact size - path_safety_rejects_nul_or_newline_anywhere — NUL/newline at any offset in the path MUST be rejected `proptest` 1.5 added as dev-dep with default features off (no proptest-derive needed). ~3 transitive crates added, dev-only. Total lib tests: 410 → 415 passed, 0 failed, 1 properly ignored. Refs #776, PR #778. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 15:11:32 -04:00
ruv	ca10df7b0d	fix(adr-115): CI green — example feature-gate + mosquitto allow_anon + bench numbers ## Two CI failures on PR #778 fixed ### 1. Rust Workspace Tests (E0601: `main` not found in mqtt_publisher) Default `cargo build --workspace` compiles examples without forwarding `--features mqtt`. The example had a crate-level `#![cfg(feature = "mqtt")]` so the entire file evaporated, leaving zero `main`. Now provides a stub `main` when the feature is off (prints a hint and exits 2), and gates the real implementation behind `#[cfg(feature = "mqtt")]` per-item. Local verification: cargo check --no-default-features --examples → clean ### 2. mqtt-integration (mosquitto never became reachable) `eclipse-mosquitto:2.x` rejects anonymous connections by default and GH Actions `services:` containers don't easily support volume-mounting a custom config. Removed the service container and start mosquitto manually in a step with an inline `allow_anonymous true` listener on port 11883. Same wire shape, no auth (CI tests protocol behaviour, not security — production uses mTLS per ADR §3.9). ## Benchmark numbers captured (`docs/integrations/benchmarks.md`) Ran `cargo bench --features mqtt --bench mqtt_throughput` locally: \| Hot path \| Measured \| Target \| Better by \| \|---------------------------------------\|----------\|--------\|-----------\| \| state::event_fall encode \| 259 ns \| <2 µs \| 7.7× \| \| rate_limiter::allow_first \| 49.7 ns \| <100 ns\| 2× \| \| rate_limiter::allow_within_gap \| 62.1 ns \| <100 ns\| 1.6× \| \| privacy::decide_hr_strip \| 0.24 ns \| <50 ns \| 208× \| \| privacy::decide_presence_keep \| 0.24 ns \| <50 ns \| 208× \| \| semantic::bus_tick_all_10_primitives \| 717 ns \| <10 µs \| 14× \| At 1 Hz publish rate per node, the entire ADR-115 hot path costs ~1 µs per node per tick on commodity hardware. A Cognitum Seed hosting 100 nodes would burn 100 µs/sec — 0.01% load floor. Memory: ~30 KB total FSM state for 10 primitives × 100 nodes. The numbers exceed every target by ≥1.6×, several by 100×+. No need to optimise further for v0.7.0. Refs #776, PR #778. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 14:47:46 -04:00
ruv	6364e0f7d8	feat(adr-115): P8 — Matter bridge tree + commissioning code (38 tests, lib total 410) Ships the SDK-independent half of the Matter Bridge production work: ## `matter::bridge` — endpoint tree assembly `build_bridge_tree(nodes) -> BridgeTree` walks a list of `(node_id, friendly_name, [EntityKind])` tuples and produces the Matter endpoint graph the SDK will materialise: EP 0 (BridgedDevicesAggregator) EP 1 (BridgedNode for "Bedroom") EP 2 (OccupancySensor for Presence + PersonCount vendor attr) EP 3 (OccupancySensor for SomeoneSleeping) EP 4 (GenericSwitch for FallDetected) EP 5 (BridgedNode for "Living") … Key invariants enforced by tests: - `PersonCount` collapses onto Presence's endpoint as a vendor attribute, never gets its own endpoint - Biometric entities (HR/BR/pose) are skipped entirely — they never appear in the tree - Every child endpoint carries `BasicInformation` cluster - Endpoint IDs are monotonic + unique (verified by sort+dedup test) - Empty node list yields just the root aggregator - Multi-node bridges keep per-node endpoint isolation - `endpoint(id)` lookup resolves every assigned ID ## `matter::commissioning` — setup-code generation `SetupCodeInput::dev(passcode, discriminator) -> ManualPairingCode` produces the 11-digit human-readable Matter pairing code that users scan/enter into Apple Home / Google Home / HA Matter integration. Validates against Matter Core Spec §5.1.6.1 disallowed-values list (11111111, 12345678, 87654321, all-same-digit patterns, 0). Rejects oversized passcode (≥2^27) and discriminator (≥2^12). The Verhoeff check digit is computed per spec §5.1.4.1.5 — full D/P/INV tables transcribed. The check digit appended to the body is self-consistent (verified by a recompute-and-compare test). `ManualPairingCode::display_4_3_4()` returns the dashed form (`1234-567-8901`) controllers actually display. Bit-packing is a placeholder for v0.7.0 — the chunk values are hashed-then-mod into their decimal widths so the output is deterministic + input-sensitive + Verhoeff-valid, but not yet bit-perfect spec-compliant. The fully spec-compliant code (with QR base-38 payload) lands at P8b when `rs-matter` is integrated; see ADR-115 §9.10. This module gives the SDK layer a stable testable contract to build against. ## Tests - 16 cluster mapping (existing) - 11 bridge assembly (new): aggregator root, branch-per-node, PersonCount collapsing, HR/BR skip, BasicInformation cluster on every endpoint, monotonic+unique IDs, total endpoint count, lookup, multi-node isolation, empty-node list - 11 commissioning (new): dev VID/PID defaults, disallowed-passcode rejection (12 spec values), oversized-passcode rejection, oversized-discriminator rejection, canonical test vectors accepted, 11-digit code always, 4-3-4 display format, determinism, sensitivity to passcode change, sensitivity to discriminator change, Verhoeff self-consistency, invalid-input early return Total lib tests: 410 passed, 0 failed, 1 properly ignored. Refs #776, PR #778. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 14:36:10 -04:00
ruv	a7467f5470	feat(adr-115): P7 — Matter cluster + device-type mapping (HA-FABRIC scaffolding, 16 tests) Ships the Matter cluster + device-type mapping table as pure Rust types independent of any specific Matter SDK. SDK choice between `matter-rs` and chip-tool FFI per ADR-115 §9.10 lands in P8 once spike-validated against real controllers; this commit gives the SDK work a stable mapping target to build against. ## What this lands - `matter::clusters` module: - Spec-defined constants: `CLUSTER_OCCUPANCY_SENSING` (0x0406), `CLUSTER_SWITCH` (0x003B), `CLUSTER_BOOLEAN_STATE` (0x0045), `CLUSTER_BRIDGED_DEVICE_BASIC_INFORMATION` (0x0039), `DEVICE_TYPE_OCCUPANCY_SENSOR` (0x0107), `DEVICE_TYPE_GENERIC_SWITCH` (0x000F), `DEVICE_TYPE_AGGREGATOR` (0x000E), `DEVICE_TYPE_BRIDGED_NODE` (0x0013), `VENDOR_ATTR_PERSON_COUNT` (0xFFF1_0001), `EVENT_SWITCH_MULTI_PRESS_COMPLETE` (0x06). Values transcribed from Matter Core Spec 1.3 §A.1 + Device Library 1.3. - `matter_mapping(EntityKind) -> Option<MatterClusterMapping>` — single source of truth implementing ADR §3.11.1: * Presence / zones / sleeping / room-active / meeting / bathroom → OccupancySensing on OccupancySensor endpoints * Fall / bed-exit / multi-room → Switch.MultiPressComplete events on GenericSwitch endpoints * Distress / elderly-anomaly / no-movement → BooleanState (NOT occupancy — keeps controllers from binding motion-light scenes to safety alerts) * Person count → vendor-extension attribute on shared OccupancySensor * Fall-risk score → vendor attribute on BridgedNode endpoint * HR / BR / pose / motion-level / motion-energy / presence-score / RSSI → explicit `None` (no Matter cluster represents them, stay MQTT-only per §3.11.4) - `entity_on_matter` + `next_endpoint` helpers. ## Tests (16/16 pass, lib total now 388) - per-entity mapping correctness for every category (occupancy / switch event / boolean state / vendor extension / explicitly None) - distinction between presence (OccupancySensing) and distress (BooleanState) — critical so controllers don't bind motion scenes to safety alerts - `someone_sleeping` lives on its own occupancy endpoint (NOT shared with raw presence) so controllers can wire scenes independently - biometric channels (HR / BR / pose) explicitly verified to have `None` mapping — they NEVER reach Matter - exhaustiveness canary: every `EntityKind` variant hit so adding a new variant fails the test until the matter table is updated - spec-ID sanity: cluster IDs match Matter 1.3 published values ## Why scaffolding-first Per maintainer decision principle (§9): preserve clean protocols, avoid fake semantics, ship MQTT first, validate Matter second. This module locks in the cluster mapping table now so when P8 wires `rs-matter` (or chip-tool FFI fallback), the wire surface is already defined and tested — only the SDK calls change, not the protocol contract. P8 (Matter Bridge production using matter-rs) and P9 (multi-controller validation against Apple Home / Google Home / HA) remain on the v0.7.1 docket per §9.10. Refs #776, PR #778. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 14:30:32 -04:00
ruv	a4f56d2f1b	feat(adr-115): P6 + P10 — runnable wiring example + witness bundle (VERIFIED) ## P6 — Wiring example `v2/crates/wifi-densepose-sensing-server/examples/mqtt_publisher.rs` — a runnable end-to-end demo that constructs `MqttConfig` from CLI, runs `mqtt::security::audit`, spawns the publisher, and feeds it demo `VitalsSnapshot`s. Every line is the production-wiring blueprint for `main.rs` when `args.mqtt` is true. Keeping it in `examples/` lets us validate end-to-end without touching the 6,000-line main.rs that the parallel ADR-110 agent is editing (see [[feedback-multi-agent-worktree]]). Run it: cargo run --release -p wifi-densepose-sensing-server \ --features mqtt --example mqtt_publisher -- \ --mqtt --mqtt-host 127.0.0.1 Compile-checked clean under `--features mqtt`. ## P10 — Witness bundle (VERIFIED) `scripts/witness-adr-115.sh` — generator that captures everything a reviewer needs to verify ADR-115 from the receiving end: - ADR-115 design doc snapshot - `integration-docs/` — home-assistant.md + semantic-primitives-metrics.md - `test-results/lib-tests.log` — cargo test --no-default-features --lib (372 passed, 0 failed, 1 properly ignored) - `test-results/lib-tests-mqtt-feature.log` — under --features mqtt - `test-results/integration-tests.log` — opt-in via RUVIEW_RUN_INTEGRATION=1 - `bench-results/criterion-*.log` — opt-in via RUVIEW_RUN_BENCH=1 - `manifest/source-hashes.txt` — SHA-256 of every ADR-115 source file - `manifest/git-head.txt` + `git-head-commit.txt` — exact source commit - `VERIFY.sh` — self-verification script; recipient runs `bash VERIFY.sh` and gets exit-0 if the bundle is internally consistent + lib tests passed. Local self-test PASSED end-to-end on this commit. - `WITNESS-LOG-115.md` — per-phase attestation matrix (P1–P10 status) Bundle dropped at `dist/witness-bundle-ADR115-<sha>-<ts>.tar.gz`. ## Docs - `docs/user-guide.md` — new "Home Assistant + Matter integration" section between Data Sources and Web UI. 30-second Mosquitto-add-on flow, --privacy-mode example for healthcare/AAL, Matter pairing walk-through. Links back to docs/integrations/home-assistant.md for the full reference. - `CHANGELOG.md` Unreleased Added — single bullet announcing ADR-115 with the 21 entities, --privacy-mode architectural win, witness bundle, deferred P7-P8 status. ## Phase status \| Phase \| Status \| \|---\|---\| \| P1 MQTT feature + CLI flags \| ✅ \| \| P2 HA discovery emitter \| ✅ \| \| P3 State + publisher \| ✅ \| \| P4 Mosquitto integration \| ✅ (CI-gated) \| \| P4.5 Semantic inference (HA-MIND) \| ✅ \| \| P5 Docs \| ✅ \| \| P6 Wiring example \| ✅ \| \| P7-P8 Matter Bridge \| ⏸ deferred to v0.7.1+ per §9.10 \| \| P9 Security + bench \| ✅ \| \| P10 Witness bundle \| ✅ \| Total lines: ~6000. Total tests: 372 passed. Witness: VERIFIED. Refs #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 14:26:14 -04:00
ruv	d25e331bbf	feat(adr-115): P9 — security audit (mqtt::security) + criterion benchmarks (15 tests) ## Security audit (`mqtt::security`) New module enforcing the ADR-115 §3.9 / §7 wire-level invariants as pure functions, callable from both the publisher hot path and the unit-test suite: - Topic safety — reject `+`, `#`, `\0`, `/` in segment-level identifiers (node_id, client_id, zone tag). Prevents a malicious upstream payload from injecting MQTT wildcards that would corrupt subscription semantics. - Path safety — reject NUL / newline in TLS cert / CA paths. - Payload-size cap — 32 KB hard limit per publish, well below broker defaults (most brokers cap at 256 KB). Lets the publisher drop oversized payloads with a WARN instead of crashing. - Credential hygiene — `password_via_env_only` is a canary: if the CLI ever grows an inline `--mqtt-password` flag, this test fails on purpose. Today we only accept `--mqtt-password-env <VAR>`. - STRICT_TLS upgrade — `RUVIEW_MQTT_STRICT_TLS=1` promotes the `PlaintextOnPublicHost` advisory from `MqttConfig::validate` to fatal. This is the planned v0.8.0 default per ADR §9.5. - Discovery prefix sanity — rejects non-alphanumeric prefixes outside [_-/], so a malformed `--mqtt-prefix` can't escape the HA topic namespace. 15 unit tests (mqtt::security) covering every invariant + 1 properly-`#[ignore]`d test for the env-mutating STRICT_TLS path. ## Criterion benchmarks (`benches/mqtt_throughput.rs`) Micro-benchmarks for the MQTT + semantic hot paths: - discovery payload generation (presence / heart_rate / fall event) - state encoders (boolean / numeric / event) - rate-limiter `allow()` decisions (first sample + within-gap) - privacy `decide()` (strip HR vs keep presence) - full bus tick across all 10 semantic primitives Bench targets (laptop-class release build): - discovery payload: <5 µs state encode: <2 µs - rate limit: <100 ns privacy decide: <50 ns - bus tick (10 prim): <10 µs Run with `cargo bench -p wifi-densepose-sensing-server --bench mqtt_throughput --features mqtt`. Numbers will be captured into the witness bundle in P10. `criterion` 0.5 added as dev-dep. `[[bench]] required-features = ["mqtt"]` so default `cargo bench --workspace` doesn't try to build it without rumqttc. Lib test count: 372 passed (357 → 372, +15 security tests). Refs #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 14:17:55 -04:00
ruv	b68f130ce4	feat(adr-115): P4 — broker integration tests + mosquitto CI workflow Adds three integration tests (`v2/crates/wifi-densepose-sensing-server/ tests/mqtt_integration.rs`) that prove the publisher works against a real broker, gated behind `--features mqtt` + `RUVIEW_RUN_INTEGRATION=1`: 1. `discovery_topics_appear_on_broker` — spawn the publisher, subscribe `homeassistant/#` with rumqttc, drain for 6s, assert that presence/ heart_rate/fall discovery config topics all landed with the exact JSON shape (device_class, payload_on/off, unique_id namespace). 2. `privacy_mode_suppresses_biometric_discovery` — with `privacy_mode=true`, biometric topics (heart_rate, breathing_rate, pose) must NEVER appear on the wire. Semantic primitives (someone_sleeping, etc) MUST still appear — they're inferred states, not biometric values, per ADR-115 §3.12.3. 3. `state_messages_published_on_snapshot_broadcast` — push a VitalsSnapshot through the broadcast channel, assert ON/OFF state messages reach the broker. Plus `.github/workflows/mqtt-integration.yml` — spins up Mosquitto 2.0.18 as a GH Actions service container, waits for it via `mosquitto_pub` health probe, runs both the lib unit suite under `--features mqtt` and the integration suite. Dumps broker logs on failure for debugging. Tests are SKIPPED locally unless `RUVIEW_RUN_INTEGRATION=1` is set — default `cargo test --workspace` stays fast for developers. Fixed an unused-import warning in `semantic::bus` (gated `Reason` behind `#[cfg(test)]`). Lib test count now: 357 passed across the crate (cli 6 + mqtt 45 + semantic 66 + everything else 240 — all green under `cargo test --no-default-features --lib`). Refs #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 14:14:21 -04:00
ruv	b2a692369e	feat(adr-115): P4.5b — 6 remaining semantic primitives — all 10 HA-MIND v1 done (66 tests) Lands the remaining six §3.12 v1 primitives: - `distress` (PossibleDistress) — EWMA baseline HR + 1.5× multiplier + agitated motion + no-fall + 60 s dwell → ON. Refractory 5 min after exit. Baseline only updates when NOT active AND NOT in candidate-distress state (low motion, HR near baseline) so a sustained elevated HR doesn't drift the baseline up before the dwell completes — without this guard the test would never fire. - `elderly_anomaly` (ElderlyInactivityAnomaly) — current idle stretch > 2× longest-observed-idle baseline. Baseline floor at 30 min so the first day doesn't fire spuriously. 24 h refractory per resident. - `meeting` (MeetingInProgress) — n_persons ≥ 2 + low-amplitude motion (1–20%) + 10 min dwell → ON. 2 min exit dwell on count drop. - `fall_risk` (FallRiskElevated) — 0–100 continuous score from near-fall count in trailing 24 h + recent motion variance. Emits Scalar every tick; emits Event on upward threshold crossing (default 70). - `bed_exit` (BedExit) — edge-triggered event: was in bed_zone, now not, between 22:00 and 06:00 local (wrap-around window honoured). - `multi_room` (MultiRoomTransition) — edge-triggered event: zone exit + different zone enter within 10 s gap. Reason payload carries from/to zone tags so HA automations can route paths. Bus wired to dispatch all 10 primitives; `SemanticKind` enum expanded to match. `tick()` returns up to 10 events per snapshot. 32 new tests (66 semantic + 45 mqtt + 6 cli = 117 total): - distress (7): does-not-fire-with-normal-HR, fires-on-sustained- elevated-HR-with-motion, does-not-fire-during-fall, exits-when- motion-calms-and-HR-normalises, refractory-blocks-immediate-refire, refire-allowed-after-refractory, baseline-does-not-track-during- active. - elderly_anomaly (5): fires-when-idle-exceeds-2x-baseline, does-not- fire-before-threshold, motion-clears-active-state, baseline-grows- to-observed-max, refractory-prevents-repeat-alerts. - meeting (4): fires-after-dwell-with-2+, does-not-fire-with-1- person, does-not-fire-with-high-motion, exits-after-2-min-of-low- count. - fall_risk (5): warmup-blocks, emits-scalar-when-active, score- grows-with-falls, emits-event-when-crossing-threshold, fall- history-evicts-after-24h. - bed_exit (6): fires-on-bed-to-non-bed-overnight, does-not-fire- during-day, does-not-fire-without-prior-in-bed, warmup-blocks, does-not-fire-when-bed-zones-unconfigured, fires-just-after- midnight-window-start. - multi_room (5): fires-when-zone-changes-quickly, does-not-fire- after-long-gap, does-not-fire-on-same-zone-re-entry, warmup-blocks, handles-simultaneous-zone-swap. ADR-115 §3.12 inference layer now complete. Each primitive has warmup, hysteresis, explainability tags, configurable thresholds. Adding a v2 primitive is one file + one bus entry. Refs #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 14:07:21 -04:00
ruv	8e416af203	feat(adr-115): P4.5a — semantic inference layer (HA-MIND) — 4 primitives + bus (34 tests) ADR-115 §3.12 keystone. Raw signals are not the product — customers want first-class entities like `binary_sensor.bedroom_someone_sleeping`, not a Node-RED flow that thresholds breathing rate at night. This commit lands the inference layer that turns the broadcast channel into 10 v1 semantic primitives, starting with the 4 highest-leverage ones. Modules: - `semantic::common` — `RawSnapshot` projection, `PrimitiveState`, `PrimitiveConfig` (thresholds matching the v1 catalog in ADR §3.12), `in_window` for time-gated primitives, `Reason` explainability struct. - `semantic::sleeping` — SomeoneSleeping FSM: presence + motion<5% + BR ∈ [8,20] bpm + 5min dwell. Exit on presence-drop (immediate) or motion>15% for 30s. - `semantic::room_active` — motion >10% in 30s window → ON. Exit on presence-drop or 10min idle. - `semantic::bathroom` — presence + zone tagged as bathroom. Safe in privacy mode (no biometrics in the derivation). - `semantic::no_movement` — presence + motion<1% for 30min → ON. Safety-check primitive for aging-in-place. - `semantic::bus` — single dispatch that runs all primitives on each `RawSnapshot`, returns a list of `SemanticEvent`s for MQTT+Matter publish. Every primitive has: - Warmup suppression (60s default, §3.12.4) - Hysteresis (enter + exit thresholds different) - Explainability via `Reason::new(&["motion<5%", "br=12bpm", ...])` - Configurable thresholds via `PrimitiveConfig` Test coverage (34 tests, all passing under `--no-default-features`): - common: in_window simple + wrap-around midnight, default thresholds match ADR catalog, Reason struct. - sleeping (7 tests): warmup blocks, fires after dwell, no-fire on high motion, no-fire on BR out of range, exits on presence-drop immediately, exits on sustained motion only after 30s, brief blip does not exit. - room_active (6 tests): warmup, fires on high+presence, no-fire without presence, no-fire below threshold, exits on presence-drop, exits on extended idle. - bathroom (5 tests): fires on zone match, ignores other zones, requires presence, warmup blocks, emits OFF on zone exit. - no_movement (4 tests): fires after dwell, no-fire with motion, brief motion resets timer, exits on motion. - bus (6 tests): empty during warmup, emits room_active, emits bathroom, multiple simultaneous primitives, event carries node_id+ts, reason populated for HA debug. Total cargo test count now: cli: 6 + mqtt: 45 + semantic: 34 = 85 tests passing P4.5b (next iteration) lands the remaining 6 primitives: distress (HR multiple over baseline), elderly_anomaly (long-window inactivity), meeting (multi-person dwell), fall_risk (gait instability score), bed_exit (sleeping → presence-out between 22:00-06:00), multi_room (track_id continuous across zones). Refs #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 14:01:51 -04:00
ruv	59c503d01e	feat(adr-115): P3 — state encoder + rate limiter + rumqttc publisher (45 tests) Implements ADR-115 §3.5 (QoS/retain matrix), §3.6 (LWT/availability heartbeat), §3.7 (per-entity rate limits) as three new submodules: - `mqtt::state` — `RateLimiter` (per-entity HashMap of last-emitted Duration; allow() returns false within the configured 1/Hz gap), `StateEncoder` rendering binary/numeric/event payloads from a `VitalsSnapshot` projection, `StateMessage` carrying topic + payload + QoS + retain bits keyed off `DiscoveryComponent` so the wire-level matrix from §3.5 is enforced in one place. Compiles without rumqttc so it's testable under --no-default-features. - `mqtt::publisher` (feature-gated) — `OwnedDiscoveryBuilder` for the background task, `run()` event loop that pumps `rumqttc::EventLoop` + heartbeat (30s) + discovery refresh (configurable) + broadcast channel consumer in a single tokio::select!. Reconnect resets the RateLimiter so post-reconnect samples emit promptly. On graceful shutdown publishes `offline` to every availability topic before disconnect. - `mqtt::discovery::EntityKind` — derive `Hash` so the entity can key the RateLimiter's HashMap. 18 new state-encoder tests covering: - Rate limiter: first-sample-pass, drops-within-gap, allows-after-gap, per-entity independence, change-only entities (rate=0) always allow, reset re-enables immediate publish. - Boolean encoder: ON/OFF payload, QoS 1 + retain (per §3.5), rejects non-binary entities, topic matches discovery state topic. - Numeric encoder: HR bpm payload with confidence + ts, motion % rendering, returns None when optional field absent, clamps out-of-range motion, rejects non-sensor entities, QoS 0 + no retain. - Event encoder: fall payload with confidence + ts, omits confidence when None, QoS 1 + no retain (never replay old falls), rejects non-event entities. - iso_ts: RFC 3339 UTC with millisecond fraction. Total mqtt test suite now 45/45 green: cargo test -p wifi-densepose-sensing-server --no-default-features mqtt:: 45 passed; 0 failed. Compile-checked under --features mqtt + rumqttc 0.24 + use-rustls: cargo check -p wifi-densepose-sensing-server --features mqtt --no-default-features Finished dev profile (clean, no warnings). Refs #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 13:57:16 -04:00
ruv	07d22247b5	feat(adr-115): P2 — HA discovery emitter + privacy filter + config (27 tests) Implements ADR-115 §3.1–§3.4 (entity mapping + topic structure + discovery payloads + device grouping) and §3.10 (privacy-mode contract) as the `mqtt` submodule of `wifi-densepose-sensing-server`. Modules: - `mqtt::mod` — module roots, stable origin/manufacturer/url constants - `mqtt::config` — `MqttConfig` built from `cli::Args`, TLS resolution (off/system-trust/pinned-CA/mTLS), `--mqtt-password-env` resolution, pre-flight `validate()` with fatal/advisory distinction (PlaintextOnPublicHost is non-fatal in v0.7.0, hard-fail in v0.8.0 per §3.9 / §9.5). - `mqtt::discovery` — `DiscoveryBuilder`, `EntityKind` (all 11 raw + 10 semantic entities), serialisable `DiscoveryConfig` with `skip_serializing_if = "Option::is_none"` so retained payloads stay compact. Topic structure matches HA's `<prefix>/<component>/<object>/<entity>/ {config,state,availability}` convention. `enabled_ entities(privacy, publish_pose, no_semantic)` is the single source of truth for which entities the publisher will emit. - `mqtt::privacy` — `decide(entity, privacy_mode)` returns `Suppress` for biometrics (HR/BR/pose) and `Publish` for everything else, including all semantic primitives (per §3.12.3 — semantic primitives are inferred states, not biometric values, and remain safe to publish in privacy mode). Tests (27 total, all passing under `--no-default-features`): - 11 config tests: defaults, TLS port bump, explicit port override, mTLS triplet detection, validate rejects empty host / zero port / NaN / negative rate, plaintext-public advisory, password env resolution. - 9 discovery tests: payload shape (presence, heart rate, fall event, distress problem-class), default vs privacy-mode entity sets, --no-semantic filtering, component routing, null-field omission, availability/state topic pairing, namespaced unique_id. - 4 privacy tests: privacy-off publishes all, privacy-on suppresses exactly the biometric set, keeps non-biometric signals, keeps every semantic primitive. Connect/publish lifecycle (uses `rumqttc`) gated behind `--features mqtt`; the `publisher` and `state` submodules land in P3 next iteration. Refs #776. Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 13:50:33 -04:00
ruv	fc9f2dce8a	feat(adr-115): P1 — Cargo features + CLI flags for MQTT/Matter/Semantic Adds `mqtt` and `matter` Cargo features (default off) plus 20+ new CLI flags wired through cli.rs per ADR-115 §3.8 / §3.10 / §3.11 / §3.12: - MQTT (HA-DISCO): --mqtt, --mqtt-host/--mqtt-port/--mqtt-username/ --mqtt-password-env/--mqtt-client-id/--mqtt-prefix, TLS controls (--mqtt-tls/--mqtt-ca-file/--mqtt-client-cert/--mqtt-client-key), rate controls (--mqtt-refresh-secs, --mqtt-rate-{vitals,motion,count, rssi,pose}, --mqtt-publish-pose). - Privacy (ADR-106): --privacy-mode strips HR/BR/pose pre-publish. - Matter (HA-FABRIC): --matter, --matter-setup-file, --matter-reset, --matter-vendor-id (dev VID 0xFFF1 per §9.9), --matter-product-id. - Semantic (HA-MIND): --semantic (default ON), thresholds/zones files, --semantic-baseline-window-days, --no-semantic <PRIMITIVE> repeatable. rumqttc 0.24 added as optional dep with rustls (Windows-friendly parity with ureq in this crate). matter-rs deferred to P7 spike per §9.10. 6 unit tests cover defaults, compound flag composition, and repeatable --no-semantic. Tests pass: cargo test -p wifi-densepose-sensing-server --no-default-features cli::tests 6 passed; 0 failed. Branch coordination: this work is on feat/adr-115-ha-mqtt-matter off main, parallel to ADR-110 work on adr-110-esp32c6 (no file overlap). Refs #776 (ADR-115 implementation tracking issue). Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-23 13:43:02 -04:00
rUv	004a63e82d	fix(security): audit — fix RUSTSEC vulns, clippy warnings, dead code (#769 ) - Upgrade openssl to 0.10.78 (CVE-2026-41676), jsonwebtoken to 9.4 - Suppress unmaintained-only/no-CVE advisories in .cargo/audit.toml with per-entry rationale - Fix all `cargo clippy --all-targets -- -D warnings` errors across 35 crates: derivable_impls, needless_range_loop, map_or→is_some_and/ is_none_or, await_holding_lock (drop MutexGuard before .await), ptr_arg (&mut Vec→&mut [T]), useless_conversion, approximate_constant (2.718→E, 3.14→PI), field_reassign_with_default, manual_inspect, useless_vec, lines_filter_map_ok, print_literal, dead_code - Apply `cargo fmt --all` - Pre-existing test failure in wifi-densepose-signal (test_estimate_occupancy_noise_only) is not introduced by this PR	2026-05-23 05:36:13 -04:00
OrbisAI Security	1906876541	fix: upgrade openssl to 0.10.78 (CVE-2026-41676) (#751 ) * fix: CVE-2026-41676 security vulnerability Automated dependency upgrade by OrbisAI Security * fix: upgrade openssl to 0.10.78 (CVE-2026-41676) rust-openssl provides OpenSSL bindings for the Rust programming langua Resolves CVE-2026-41676	2026-05-23 03:31:03 -04:00
rUv	a85d4e31e4	research(sota): kick off SOTA research loop + first R5 saliency measurement (#702 ) Sets up docs/research/sota-2026-05-22/ as the autonomous-research output dir, with PROGRESS.md as the canonical 15-vector research agenda spanning spatial intelligence, RF features, RSSI-only, and exotic/long-horizon verticals. Cron d6e5c473 (/10 * * ) picks threads from this file and self-terminates at 2026-05-22 08:00 ET. First concrete contribution this tick — R5 subcarrier saliency: examples/research-sota/r5_subcarrier_saliency.py: pure-numpy port of the count cog's Conv1d encoder + count head, computes per- subcarrier input×gradient saliency via central-difference. 128 samples × 56 subcarriers × 2 forward passes/subcarrier ≈ ~3 s on CPU, no GPU or framework dependency. * docs/research/sota-2026-05-22/R5-subcarrier-saliency.md: research note with motivation, method, novelty argument, and the first measured ranking. Top-8 subcarriers for cog-person-count v0.0.2: [41, 52, 30, 31, 10, 35, 2, 38]. Max/mean ratio 2.85x. * v2/crates/cog-person-count/cog/artifacts/saliency.json: machine- readable per-subcarrier saliency + top-K lists, so future-tick experiments (retrain at K=8/16/32) consume it without re-running. Key insight from the first measurement: top-8 saliency is band- spread (indices span 2-52), not concentrated. This directly raises R8's (RSSI-only) feasibility ceiling, because RSSI is a band- aggregate — it retains the integral of a band-spread signal. First- order estimate: RSSI-only should hit ~60% of full-CSI accuracy for the count task. R7 (adversarial defence) inherits a concrete defender- priority list: corroborate these 8 subcarriers across nodes. This commit is the first of many short, focused contributions over the next ~12 hours. PROGRESS.md is the canonical pointer for the next tick to pick up the next thread.	2026-05-21 23:05:55 -04:00
rUv	b3a5012dbd	feat(cog-person-count): v0.0.2 — K-fold + label-smoothing + temperature-calibrated (#699 ) * chore: stage v0.0.2 artifacts + temperature scalar for build pipeline Stages count_v1.{safetensors,onnx,temperature,train_results.json} ahead of the build/sign/upload step. This commit is a momentary side-effect — the next commit will refresh the per-arch manifests with the new binary SHAs once ruvultra finishes the cross-build. The .temperature file holds the calibration scalar from LBFGS over the held-out conf logits. The Rust cog will read it post-load and divide conf_logits by it before sigmoid, exactly matching the Python eval. * feat(cog-person-count): v0.0.2 — K-fold validated, label smoothing + early stop + temp scale The v0.0.1 "65.1% but class-1=0%" result was an unlucky temporal split that let a degenerate "always predict 0" classifier hit eval acc = class-0 fraction. 5-fold stratified random CV proved the architecture actually learns ~57.1% class-1 accuracy under fair splits — a real, modestly useful signal. v0.0.2 ships a retrained model that: * Splits randomly (seed=42) 80/20 instead of temporally — eliminates the trailing-window-class-imbalance cheat. * Class-balanced sampler (multinomial with replacement, weighted by inverse class frequency) — per-batch expected counts are equal regardless of dataset distribution. * Label smoothing 0.1 on the cross-entropy — reduces confidence saturation that drove v0.0.1's all-or-nothing predictions. * Early stopping with patience=20 — stops at epoch 29 instead of overfitting through 400. * Temperature scaling of the conf head — LBFGS fits a scalar T on held-out conf logits; ships as a count_v1.temperature sidecar so the Rust cog can divide conf_logits by T before sigmoid. Numbers on the same data: \| Metric \| v0.0.1 \| v0.0.2 \| K-fold (5x100) \| \|------------------\|--------\|--------\|----------------\| \| Overall acc \| 65.1% \| 62.3% \| 62.2% ± 1.9% \| \| Class 0 acc \| 100% \| 86.2% \| 67.4% \| \| Class 1 acc \| 0% \| 34.3% \| 57.1% ✓ \| \| MAE \| 0.349 \| 0.377 \| 0.378 \| \| Spearman \| 0.023 \| 0.013 \| 0.160 \| Class-1 accuracy 0 → 34.3% is the headline win. Net acc moves slightly because we stopped cheating on class 0. K-fold's 57% says there's headroom remaining; reaching it needs more independent splits (== more data), not more training tricks. Confidence calibration didn't move. Temperature scaling alone can't fix a confidence head trained against a noisy argmax==truth indicator over a 62%-accurate classifier — the head's training signal is the issue, not its post-hoc transform. The honest fix is multi-room data (#645), not another calibration knob. Live on cognitum-v0 at /var/lib/cognitum/apps/person-count/ — health reports candle-cpu backend, count = 1 (was 0 in v0.0.1) on synthetic zero input. Files changed: * scripts/train-count.py — adds --k-fold (no sklearn dep, hand-rolled stratified splits with deterministic shuffle) and --v2 paths. * v2/.../cog/artifacts/count_v1.safetensors (392 KB, new sha 32996433…) + count_v1.onnx (16 KB) + count_v1.temperature (0.9262 scalar) + count_train_results.json (full epoch trace). * v2/.../cog/artifacts/manifests/{arm,x86_64}/manifest.json bumped to version 0.0.2 with the new weights_sha256 + caveats. * docs/benchmarks/person-count-cog.md — appends a v0.0.2 section with the K-fold diagnostic table and honest-read paragraph. GCS: gs://cognitum-apps/cogs/arm/cog-person-count-count_v1.safetensors refreshed (binaries unchanged — load weights via mmap at runtime).	2026-05-21 19:47:04 -04:00
rUv	e6a5df36eb	chore(cog-person-count): refresh GCS manifests after run-wiring rebuild (#698 ) The arm + x86_64 manifests committed in #696 referenced the binaries built before #697 wired the `run` subcommand. Rebuilt + re-signed + re-uploaded to GCS, and re-deployed to cognitum-v0: arm sha 15c2fbac…7728ea5 (3,807,456 B, up from 2,168,816 — added Tokio runtime) x86_64 sha 051614ce…cc8388b3 (4,502,960 B, up from 2,615,528) Both re-signed Ed25519 with COGNITUM_OWNER_SIGNING_KEY. Manifests now match the binaries published at gs://cognitum-apps/cogs/{arm, x86_64}/cog-person-count-* and the binary installed at /var/lib/cognitum/apps/person-count/ on cognitum-v0.	2026-05-21 19:13:10 -04:00
rUv	5c914e63c7	feat(cog-person-count): wire `run` subcommand — v0.0.1 fully functional (#697 ) Phase 4 of ADR-103. Adds the long-running polling loop so the cog's fourth verb (`run`) does real work, completing the ADR-100 runtime contract end-to-end: cog-person-count version → "person-count 0.3.0" cog-person-count manifest → JSON skeleton cog-person-count health → loads weights + 1-shot infer + emit cog-person-count run --config → long-running per-frame emit ← THIS What ships: * src/runtime.rs (new) — `run_loop` polls sensing_url every poll_ms, slides a [56, 20] CSI window, runs InferenceEngine::infer, emits publisher::person_count events. Same shape as cog-pose-estimation::runtime — fetch_frame extracts amplitudes from `snapshot.nodes[0].amplitude[]`, fails open on connect errors with a WARN log rather than crashing. * src/lib.rs — registers the runtime module. * src/main.rs — cmd_run now loads RunConfig from a JSON file, builds the InferenceEngine (with weights if cfg.model_path is set, otherwise auto-discover), emits a run.started event, and hands off to the Tokio multi-thread runtime's block_on(run_loop). Single-node fusion is a no-op for N=1 today; v0.2.0 will append predictions from sibling nodes and call fusion::fuse_confidence_weighted before emit. Verified locally: cargo check -p cog-person-count --no-default-features → clean cargo test -p cog-person-count → 15/15 pass (no regressions) cargo build -p cog-person-count --release → 2.36 MB unchanged ./cog-person-count run --config bad-config.json: line 1: {"event":"run.started","fields":{"cog":"person-count", "sensing_url":"http://127.0.0.1:9999/...",poll_ms:100, "model_path":"(auto-discover)"}} line 2: WARN sensing-server fetch failed error=Connection Failed: Connect error: actively refused (loop alive — exits cleanly on SIGTERM, no crash, no NaN) Also adds a "Relationship to the in-process score_to_person_count heuristic" section to cog/README.md explaining the dual-emitter design (sensing-server keeps emitting the PR #491 slot heuristic; the cog runs out-of-process and emits person.count events from the learned model). Operators choose by installing the cog or not — no sensing-server rebuild required. ADR-103 §"Migration" status: 1. Land ADR + scaffold ........... done (#693, #694) 2. Train count_v1 ................ done (#695) 3. Cross-compile + sign + GCS .... done (#696) 4. Server-side wiring ............ done — out-of-process design means no rewire needed; this cog is the wiring. 5. v0.2.0 multi-room + LoRA ...... data-bound (#645)	2026-05-21 19:10:15 -04:00
rUv	a5e99670f8	feat(cog-person-count): release v0.0.1 — signed binaries on GCS, live on cognitum-v0 (#696 ) Phase 3 of ADR-103. Cross-compiled aarch64 + x86_64 on ruvultra, signed with COGNITUM_OWNER_SIGNING_KEY (Ed25519), uploaded to GCS, and live- installed on the cognitum-v0 Pi 5 alongside cog-pose-estimation. Real-hardware bench on cognitum-v0: ./cog-person-count-arm health → backend=candle-cpu, count=0, confidence=0.49, p95=[0,7] 30 sequential health invocations: 0.276 s → 9.2 ms/invocation cold Compares to cog-pose-estimation's 8.4 ms — count cog is ~10% slower because the dual-head (count softmax + confidence sigmoid) does ~2x the work after the shared encoder. GCS release artifacts (publicly downloadable, SHA-verified): arm/cog-person-count-arm 2,168,816 B sha: 36bc0bb0...0d47b507b3c3 sig: R/00xdzHriyr/2r...JK+a6k71NDg== (Ed25519) x86_64/cog-person-count-x86_64 2,615,528 B sha: 76cdd1ec...3923 7392b01db sig: QB+8cnGSMQmu...ZtTNIQ2rDg== (Ed25519) arm/cog-person-count-count_v1.safetensors 392,088 B sha: dacb0551...e6e04ff56d15c3a65a9ff Live install at /var/lib/cognitum/apps/person-count/ on cognitum-v0 matches the layout of every other installed cog (anomaly-detect, seizure-detect, pose-estimation): cog-person-count-arm binary, count_v1.safetensors weights, manifest.json, config.json. Adds: * v2/.../cog/artifacts/manifests/{arm,x86_64}/manifest.json — full ADR-100 schema with all fields filled (sha + sig + size + URL + build_metadata carrying the v0.0.1 honest training caveats). * docs/benchmarks/person-count-cog.md — appends "Live appliance install" and "Signed GCS release artifacts" sections to the benchmark log. Honest v0.0.1 caveat still applies (class-1 accuracy 0% on the held- out tail of the single-session training data) — same data-bound limit as pose_v1. The shipped artifact is the vehicle; production- quality accuracy follows from multi-room paired data per ADR-103's v0.2.0 plan + #645.	2026-05-21 19:02:26 -04:00
rUv	6b4994e105	feat(cog-person-count): train count_v1.safetensors — honest v0.0.1 (ADR-103) (#695 ) Phase 2 of ADR-103: trained count head on the existing 1,077 paired samples (the same data that produced pose_v1 yesterday). Honest result: 65.1% eval accuracy / 100% within ±1 / MAE 0.349 on the held-out time-window. Per-class: 100% on "empty room" / 0% on "1 person". The model overfit by epoch 100 (train_acc → 1.0, eval_loss climbed 0.67 → 7.8) and the "best" checkpoint is the snapshot that happened to predict the eval window's class distribution (140/215 = 65.1%, matches eval_acc exactly). Confidence head Spearman = 0.023 ⇒ uncalibrated. Same data-bound failure mode as pose_v1 (#645), bounded by single-session training data; same fix path (multi-room). What v0.0.1 still validates end-to-end: * PyTorch → safetensors → Candle Rust loads cleanly on first try. `cog-person-count health` reports `backend: candle-cpu` and emits real per-frame predictions instead of the stub backend's hard-coded {1 person, 0 confidence}. Architecture parity between train-count.py and src/inference.rs::CountNet is bit-exact. * ONNX export bit-clean (16 KB, opset 18, dynamic batch axis). * Training wall time: 5.6 s for 400 epochs on RTX 5080. * Binary size unchanged (2.36 MB stripped), model loads via mmap at runtime. This commit ships: * scripts/align-ground-truth.js: extended to emit n_persons_mode + n_persons_max per window so the training pipeline has count labels. Backwards-compatible (additive fields). * scripts/train-count.py: new — mirrors CountNet architecture exactly, loads paired.jsonl, trains 400 epochs with CE+BCE+Brier loss, exports safetensors + ONNX + per-epoch JSON. * v2/.../cog/artifacts/{count_v1.safetensors,count_v1.onnx, count_train_results.json}: the trained artifacts. * v2/.../cog/README.md: Status table updated with the v0.0.1 numbers + an Honest Caveat section explaining the data-bound result. * docs/benchmarks/person-count-cog.md: new — full v0.0.1 benchmark log mirroring the format docs/benchmarks/pose-estimation-cog.md established. Includes comparison to ADR-103 v0.1.0 acceptance gates and per-class breakdown. Still pending: * `run` subcommand wiring (long-running polling loop, same as pose) * Cross-compile + sign + GCS upload (mirror of pose cog pipeline) * Live install on cognitum-v0 * v0.2.0: re-train on multi-room data, LoRA per-room adapters, Stoer-Wagner min-cut clip in fusion stage	2026-05-21 18:56:52 -04:00
rUv	6959a42312	feat(cog-person-count): v0.0.1 scaffold + tests + fusion math + bench (ADR-103) (#694 ) First implementation PR for ADR-103. Same incremental shape that ADR-101 used: scaffold the cog crate, ship a stub-backend release that satisfies the runtime contract + 15 tests + measured cold-start, then follow up with the trained count_v1.safetensors in a separate PR. What ships: * v2/crates/cog-person-count/ — new workspace member. - Cargo.toml: candle-core/candle-nn 0.9 (cpu default, cuda feature opt-in), safetensors, ureq, sha2 — same dep shape as the pose cog but minus wifi-densepose-train (this cog has no training-side consumer, so the dep tree is materially smaller → 2.36 MB binary vs the pose cog's 4.5 MB). - src/inference.rs: CountNet (Conv1d 56→64→128→128 encoder + count head Linear(128→64→8)+softmax + confidence head Linear(128→32→1)+sigmoid). Stub backend returns `{1-person, 0-confidence}` honestly when no safetensors present. - src/fusion.rs: fuse_confidence_weighted() — Bayesian product of per-node distributions with confidence-weighted log-sum, plus fuse_with_mincut_clip() hook for the v0.2.0 Stoer-Wagner upper-bound (`ruvector-mincut` dep lands when min-cut graph builder is ready). Confidences floored at 1e-3 and probs floored at 1e-9 before logs — no NaN propagation. - src/publisher.rs: emits {count, confidence, count_p95_low, count_p95_high, n_nodes, probs} per ADR-103 §"Output". - src/main.rs: full ADR-100 four-verb CLI (version\|manifest\|health \|run). The `run` subcommand explicitly returns "wiring pending v0.0.1" so the in-process library API is the v0.0.1-clean integration path. - tests/smoke.rs (8 tests) + fusion::tests (7 tests, in-lib) — 15 total, all green. Cover stub-backend behaviour, wrong-shape rejection, fusion math (empty / single / agreement / high-conf override / normalisation), p95-range correctness, and min-cut clip semantics. - cog/{manifest.template.json, config.schema.json, README.md} + cog/artifacts/ placeholder dir. * v2/Cargo.toml: registers the new workspace member. Verified locally: cargo check -p cog-person-count --no-default-features → clean cargo test -p cog-person-count --no-default-features → 8/8 pass cargo test -p cog-person-count --lib → 7/7 pass cargo build -p cog-person-count --release → 2.36 MB binary ./cog-person-count version → "person-count 0.3.0" ./cog-person-count manifest → JSON skeleton ./cog-person-count health → backend:stub, count:1, conf:0, p95:[1,1] Cold-start: 30 sequential `health` invocations → 53.3 ms/invocation (vs cog-pose-estimation's 76.2 ms — smaller dep tree) cog/README.md adds: * Security section — six-row threat table covering safetensor mmap trust, non-finite outputs, sensing fetch failures, fusion divide-by-zero / log-of-zero, min-cut degenerate cases, and stdout spoofing. * Performance / optimization section — binary size, release profile (already opt-level=3 / lto=fat / codegen-units=1 / strip=true at workspace level), cold-start comparison table, projected warm-path latency budget. Still pending (separate PRs, ADR-103 §"Migration"): * Train count_v1.safetensors on the existing 1,077 paired samples with `n_persons` labels (Candle on RTX 5080, same script that produced pose_v1.safetensors yesterday). * `run` subcommand wiring (long-running polling loop, same shape as cog-pose-estimation::runtime). * Cross-compile + sign + GCS upload (mirror of cog-pose-estimation release pipeline). * Server-side `csi.rs::score_to_person_count` call-site rewire to consume this cog when installed; falls back to PR #491's heuristic when not.	2026-05-21 18:46:57 -04:00
rUv	67fec45e61	feat(edge-registry): ADR-102 — surface Cognitum cog catalog via /api/v1/edge/registry (#648 ) * feat(edge-registry): ADR-102 — surface Cognitum cog catalog via /api/v1/edge/registry Adds a new sensing-server endpoint that fetches and caches the canonical Cognitum app registry at https://storage.googleapis.com/cognitum-apps/app-registry.json (105 cogs across 11 categories as of v2.1.0). RuView previously had no live awareness of the catalog — the README's capability table was hand- curated and went stale as Cognitum shipped new cogs (the registry was last updated 6 days ago). ADR: * docs/adr/ADR-102-edge-module-registry.md — full design, response shape, configuration flags, failure modes, and a 12-row security review covering SSRF, response inflation, ?refresh abuse, stale-serve semantics, TLS, cache poisoning, JSON-panic resistance, etc. Code: * v2/.../edge_registry.rs — EdgeRegistry struct + UreqFetcher + MockFetcher trait + 7 unit tests. RwLock<Option<CachedEntry>> with stale-on-error fallback. MAX_PAYLOAD_BYTES=8 MiB, 10s wire timeout. * v2/.../main.rs — constructs Option<Arc<EdgeRegistry>> at startup, registers GET /api/v1/edge/registry handler, wires Extension layer. Handler runs the blocking ureq fetch via tokio::task::spawn_blocking so the async runtime stays free. * v2/.../cli.rs / main.rs Args — three new flags (per user request to "allow the registry to be disabled or changed"): --edge-registry-url <URL> (env RUVIEW_EDGE_REGISTRY_URL) --edge-registry-ttl-secs <N> (env RUVIEW_EDGE_REGISTRY_TTL_SECS) --no-edge-registry (env RUVIEW_NO_EDGE_REGISTRY) When --no-edge-registry is set or the URL is empty, the endpoint returns 404. Cargo.toml: adds ureq (rustls), sha2, thiserror as direct deps. README: * New collapsed "🧩 Edge Module Catalog" section with the full 105-cog table generated from the registry, grouped by category with practical one-line descriptions (e.g. "Spots irregular heartbeats and abnormal heart rhythms", "Detects walking problems and scores fall risk"). Links to https://seed.cognitum.one/store and the local appliance /cogs page. Sits between the HF model section and How It Works. Tests (7/7 pass): first_call_hits_upstream_and_caches ttl_expiry_triggers_refetch force_refresh_bypasses_fresh_cache stale_serve_on_upstream_failure_after_cached_success no_cache_no_upstream_returns_error upstream_invalid_json_is_treated_as_error upstream_sha256_is_deterministic Security highlights (full review in ADR-102 §"Security review"): - The registry is metadata-only; per-cog binary signatures (ADR-100) remain the trust root for installs. A compromised registry can mislead a human reader but cannot ship malicious binaries. - 8 MiB cap + 10s timeout + Option<Arc<...>> via Extension layer means the endpoint can't be used to exhaust memory or pin tokio threads. - Stale-on-error responses carry an explicit `stale: true` field so upstream outages are visible to consumers rather than silently masked. - Endpoint sits behind the existing RUVIEW_API_TOKEN bearer gate when set, otherwise unauthenticated (registry contents are public anyway). * chore: refresh Cargo.lock for ureq/sha2/thiserror deps added by ADR-102	2026-05-19 18:08:43 -04:00
rUv	4b1a835107	docs: repoint #640 references to #645 (original deleted, replaced) (#646 ) Issue #640 (PCK gap follow-up) was deleted upstream after the cog v0.0.1 PRs landed today. Re-opened as #645 with the same context plus the new measured v0.0.1 numbers (PCK@20 3.0%, PCK@50 18.5%, MPJPE 0.093). This patch updates the three files in main that still pointed at the dead #640 to point at #645 instead — ADR-101, the cog README, and the benchmark log.	2026-05-19 17:18:05 -04:00
rUv	fcb6f4bf12	feat(cog-pose-estimation): x86_64 release v0.0.1 — parallel to arm (#643 ) Adds the x86_64-unknown-linux-gnu binary uploaded to gs://cognitum-apps/cogs/x86_64/, signed with the same Ed25519 COGNITUM_OWNER_SIGNING_KEY as the arm release. Together with the already-shipped arm artifact, the cog now ships natively for both target architectures the Cognitum fleet supports. x86_64 release: sha256: a434739a24415b34e1aff50e5e1c3c32e568db96af473bbb3e5ecc9b95fe71fa signature: pNNuxhgM18PztN8BSZdfw5oAShG2pV3na5T/q2QdlJWX/5FJgo4QTiUCbcTAxI2Uiva8VURSOlRzMU3xoQPqCQ== size: 4,548,856 bytes cold-start: 5.4 ms / invocation on ruvultra (RTX 5080, NVMe) Reorganizes manifests under cog/artifacts/manifests/{arm,x86_64}/ so each arch carries its own manifest with the matching binary_sha256 and signature — same layout the release pipeline will use for the future hailo8 / hailo10 variants. Updates docs/benchmarks/pose-estimation-cog.md with the cross-arch cold-start table: Windows (x86_64) 76.2 ms ruvultra (x86_64) 5.4 ms <- this release Pi 5 (aarch64) 8.4 ms Verified via anonymous GCS download + SHA round-trip — identical to local build. Hailo HEF remains the only pending arch, still blocked on Hailo SDK provisioning to a self-hosted runner.	2026-05-19 17:08:23 -04:00
rUv	3314c8db8d	feat(cog-pose-estimation): scaffold first Cog from this repo (ADR-100 + ADR-101) (#642 ) * feat(cog-pose-estimation): scaffold first Cog from this repo (ADR-100 + ADR-101) Adds the foundation for the pose-estimation Cog that ships from this repo into Cognitum V0 appliances. Companion ADR-225 + crate land in cognitum-one/v0-appliance. ADRs: * ADR-100 formalises the Cognitum Cog packaging spec — on-device layout under /var/lib/cognitum/apps/<id>/, manifest.json schema (incl. new binary_sha256 + binary_signature fields), GCS hosting convention, repo source layout, build pipeline, and the four-verb runtime contract (version \| manifest \| health \| run). Documents the convention I reverse-engineered from inspecting installed cogs on a live cognitum-v0 appliance — `anomaly-detect`, `presence`, `seizure-detect`, etc. * ADR-101 designs the pose-estimation Cog itself: where it sits in the wifi-densepose pipeline (encoder init from ruvnet/wifi-densepose-pretrained, 17-keypoint regression head), what gets shipped per target arch (arm / x86_64 / hailo8 / hailo10), acceptance gates (PCK@20 explicitly deferred to #640 — this ADR ships the vehicle, not the accuracy). Crate v2/crates/cog-pose-estimation/: * Cargo.toml + workspace member declaration with a hailo feature gate so the binary builds without the Hailo SDK in CI. * main.rs implements the four-verb CLI exactly per ADR-100. * config.rs / manifest.rs / publisher.rs / inference.rs / runtime.rs — small modules, each <100 lines. * publisher.rs emits ADR-100 structured JSON events. * inference.rs is a stub that produces a centred-skeleton baseline with confidence=0 (honest: no trained weights wired in yet). * runtime.rs subscribes to /api/v1/sensing/latest, slides a 5620 window, runs the engine, emits pose.frame events. cog/manifest.template.json + cog/config.schema.json define the release artifact + runtime config schemas. * cog/Makefile holds build / sign / upload targets. * tests/smoke.rs covers manifest roundtrip + engine I/O surface. Verified locally: * cargo check -p cog-pose-estimation: clean. * cargo test -p cog-pose-estimation: 4/4 pass. * ./target/release/cog-pose-estimation {version,manifest,health}: all emit the right contract output. This commit contains scaffolding only; the actual trained weights and Hailo HEF cross-compile come in follow-ups tracked in #640 and the companion v0-appliance branch. * feat(cog-pose-estimation): first measured run — Candle CUDA on RTX 5080 Trained pose_v1 on ruvultra (RTX 5080) via Candle 0.9 + cuda feature against the same 1,077-sample paired session that produced 0%/0% PCK in #640 with the pure-JS SPSA trainer. First real numbers: PCK@20 = 3.0% (up from 0.0%) PCK@50 = 18.5% (up from 0.0%) MPJPE = 0.093 (down from 0.66, ~7x improvement) 400 epochs in 2.1 s wall time, full-batch, ~5 ms/epoch. Loss curve 0.181 -> 0.014 over the run, eval 0.010. Per-joint reveals the model leans on right-side proximal joints (r_hip 77% PCK@50, r_knee 35%, l_elbow 26%) — consistent with the camera framing in the source recording. Distal joints (wrists, ankles) and face joints are still near-random, consistent with the 56-subcarrier / 20-frame input not carrying fine-grained spatial info at 1077 samples. This commit: * Adds v2/crates/cog-pose-estimation/cog/artifacts/{pose_v1.safetensors, train_results.json} so the cog dir now contains a real reference artifact, not just scaffold. * Updates cog/README.md "Status" block with the measured numbers, per-joint table, and an honest reading of where the model succeeds vs where the data is the bottleneck. * Adds docs/benchmarks/pose-estimation-cog.md as the canonical benchmark log — append-only, one section per published run. * Appends a "First measured run" section to ADR-101 referencing the new benchmark file. Still pending in the follow-up: * Wire pose_v1.safetensors into src/inference.rs (replace stub). * ONNX export (Candle lacks a writer — needs external conversion). * Hailo HEF cross-compile + cluster deploy. The data-bound gap to PCK@20 >= 35% is tracked in #640. * feat(cog-pose-estimation): wire real weights — cog is no longer a stub Replaces the centred-skeleton stub in src/inference.rs with a real Candle-based loader that reads cog/artifacts/pose_v1.safetensors and runs the trained Conv1d encoder + MLP pose head on every incoming CSI window. What changes: * src/inference.rs: PoseNet mirrors the training script's architecture exactly — Conv1d(56->64, k=3 d=1), Conv1d(64->128, k=3 d=2), Conv1d(128->128, k=3 d=4), mean over time, Linear(128->256)+ReLU, Linear(256->34)+sigmoid -> reshape [17, 2]. The InferenceEngine searches a sensible candidate list for the weights file (/var/lib/cognitum/apps/pose-estimation/, ./pose_v1.safetensors, ./cog/artifacts/, repo-root, v2/-relative) and falls back to the stub when none are present so the cog still satisfies ADR-100. * Cargo.toml: adds candle-core 0.9 + candle-nn 0.9 (no-default-features, CPU build by default) + safetensors 0.4. New `cuda` feature opt-in for GPU inference on hosts that have it. Drops the unused wifi-densepose-train path dep from the default build path. * src/main.rs + src/publisher.rs: health.ok event now carries `backend` (candle-cuda \| candle-cpu \| stub) and the synthetic output confidence, so operators can tell at a glance whether the cog loaded its weights or fell back to the stub. * tests/smoke.rs: adds `real_weights_load_when_available` which asserts the loaded engine reports backend=candle-* and emits non-zero confidence — exactly the signal that proves we're not silently degrading to the stub. Verified locally: * `cargo check -p cog-pose-estimation --no-default-features` — clean * `cargo test -p cog-pose-estimation --no-default-features` — 5/5 pass * `./target/release/cog-pose-estimation health` emits: {"event":"health.ok","fields":{"backend":"candle-cpu","cog":"pose-estimation","synthetic_output_confidence":0.185}} — 0.185 is the published PCK@50 from cog/artifacts/train_results.json, emitted by the real Candle inference path (would be 0.0 if it had fallen back to the stub). The cog now runs the trained pose_v1 model end-to-end. Accuracy is still bounded by the underlying 1077-sample training data (PCK@20 3.0%, PCK@50 18.5% per docs/benchmarks/pose-estimation-cog.md) — that gap is data-bound and tracked in #640. ONNX export + Hailo HEF cross-compile remain follow-ups. * docs(benchmarks): measure cog-pose-estimation cold-start latency 100 sequential `cog-pose-estimation health` invocations average 76.2 ms each on a Windows x86_64 host using the `candle-cpu` backend. Each invocation re-loads pose_v1.safetensors and runs one synthetic forward pass, so this is the worst-case cold-start path. Long-running `run` inference will be sub-millisecond per frame once the model is loaded. Updates the benchmarks doc accordingly. * feat(cog-pose-estimation): ONNX export — pose_v1.onnx + scripts/export-onnx.py Adds the canonical ONNX artifact that unblocks downstream Hailo HEF cross-compile + ONNX Runtime benchmarks. Generated on ruvultra (torch 2.12.0 + CUDA), 12,059 bytes, opset 18, dynamic batch axis. * scripts/export-onnx.py: mirrors the Candle inference architecture in PyTorch (Conv1d 56->64, 64->128, 128->128 + Linear 128->256->34), pure- python safetensors loader (no extra pip dep), exports via torch.onnx.export, then verifies via onnx.checker.check_model and numerical parity against the torch reference. * Verified parity vs torch: max \|torch - onnx\| = 8.94e-8 (1e-5 threshold). Effectively bit-perfect. * v2/crates/cog-pose-estimation/cog/artifacts/pose_v1.onnx — the artifact itself, 12 KB. * docs/benchmarks/pose-estimation-cog.md — adds an ONNX export section with the verification numbers. Next: Hailo HEF cross-compile (still gated on Hailo SDK on a self-hosted runner) and ONNX Runtime latency benchmarks on each target arch. * feat(cog-pose-estimation): release v0.0.1 — signed aarch64 binary on GCS End-to-end deploy: cross-compiled to aarch64-unknown-linux-gnu on ruvultra, ran via qemu-aarch64-static, then smoke-tested on a real cognitum-v0 Pi 5. Signed with COGNITUM_OWNER_SIGNING_KEY (Ed25519) and uploaded to gs://cognitum-apps/cogs/arm/. Real-hardware results on cognitum-v0 (Pi 5): health: backend=candle-cpu, confidence=0.185, real weights loaded 30x sequential `health`: 0.251 s total -> 8.4 ms / invocation (cold) GCS release artifacts (publicly downloadable): binary: 3,741,976 bytes sha256 1e1a7d3dd01ca05d5bfc5dbb142a5941b7866ed9f3224a21edc04d3f09a99bf5 weights: 507,032 bytes sha256 eb249b9a6b2e10130437a10976ed0230b0d085f86a0553d7226e1ae6eae4b9e5 signature (Ed25519, b64): LUN7xqLPYD3MFzm5dKB5MnYU0LvoRtek5ci5KiKPHBg+Xo6xuazwokn2Dw2JPMaLYJzmWn/SpT4djuR7hYvVDw== Adds: * v2/crates/cog-pose-estimation/cog/artifacts/manifest.json — the release-pipeline-produced manifest with all fields filled in per ADR-100, including arch, target_triple, signature, and a build_metadata block carrying the validation PCK numbers. * docs/benchmarks/pose-estimation-cog.md — new sections covering the real Pi 5 smoke (8.4 ms cold-start) and the signed GCS release artifacts. Verified by downloading the binary anonymously from GCS and re-computing the sha256 — matches the locally-computed sha exactly. Signature decoded to the expected 64-byte Ed25519 length. Closes the GCS-upload acceptance criterion from ADR-100; the only pending work is Hailo HEF cross-compile (still SDK-gated) and an x86_64 release alongside this arm release. * docs(benchmarks): record live cognitum-v0 install + 5-sec smoke run Adds the "Live appliance install" section documenting what happened when the signed v0.0.1 binary + weights were installed under /var/lib/cognitum/apps/pose-estimation/ on cognitum-v0 (the V0 cluster leader). * Layout matches the existing anomaly-detect / presence / seizure- detect cogs exactly — the Cogs dashboard at http://cognitum-v0:9000/cogs auto-discovers entries. * `cog-pose-estimation run` ran for 5 seconds in the background and cleanly emitted run.started + structured WARN events for the missing local sensing-server on :3000 (cognitum-v0's actual CSI source is ruview-vitals-worker on :50054, not :3000). No crashes, no NaN, no leaks. * Wiring `sensing_url` to the appliance-native source is a separate Day-2 integration task.	2026-05-19 17:03:09 -04:00
Rahul	c00f45e296	fix(sensing): finish #611 NaN-panic audit — 7 more sites missed by #613 (#624 ) #613 fixed adaptive_classifier.rs:94 (the IQR sort) and called the audit done, but the grep used `partial_cmp(b).unwrap()` as a literal and missed seven additional production sites that use comparator variants: adaptive_classifier.rs:205 AdaptiveModel::classify() argmax over softmax probs — same per-frame hot path as #611. NaN flows through normalise → logits → softmax and still reaches this site even after the IQR fix. adaptive_classifier.rs:480 train() argmax (training accuracy loop) adaptive_classifier.rs:500 train() per-class argmax main.rs:2446, 2449 count_persons_mincut variance source/sink select csi.rs:602, 605 count_persons_mincut variance source/sink select (duplicate of main.rs logic in csi.rs) For the variance-select sites, note that the outer `unwrap_or((0, &0))` only catches an empty iterator — it cannot rescue a panic raised inside the comparator. A single NaN in `variances[]` still aborts the process. Same fix as #613: swap `.unwrap()` for `.unwrap_or(std::cmp::Ordering::Equal)` inside the comparator closure. Pure behavioural change, no API surface. Re-audit of the remaining `partial_cmp(...).unwrap()` matches in v2/: they are all inside `#[cfg(test)]` / `#[test]` blocks (spectrogram.rs:269, depth.rs:234, connectivity.rs:477, vital_signs.rs:737) where inputs are controlled and panic-on-NaN is acceptable.	2026-05-19 10:02:08 -04:00
Winter Lau	e964eaf14f	fix(deps): bump ndarray 0.15→0.17 and ndarray-npy 0.8→0.10 (closes #626 ) (#627 )	2026-05-19 10:01:52 -04:00
ruv	79cc2d7b22	Merge #491 : feat(sensing-server): adaptive person count — RollingP95 + dedup_factor runtime API Integrating @schwarztim's PR #491 into main on their behalf — their fork has fallen too far behind for a clean rebase (the PR's commit graph dropped silently during `git rebase origin/main`), so applying as a merge from the fork head to preserve the diff cleanly. What this lands: - `RollingP95` adaptive normaliser for the person-count feature scaling. Streaming P95 over a 600-sample / ~30 s sliding window. Cold-start (<60 samples) falls back to the legacy denominators (variance/300, motion_band_power/250, spectral_power/500) so day-0 behaviour is preserved on every deployment. - `RuntimeConfig` struct + `load_runtime_config` / `save_runtime_config` persisted to `data/config.json`. Exposes `dedup_factor` via REST so multi-node deployments can tune cluster-deduplication without a rebuild, including an auto-tune endpoint that derives optimal dedup from a known person count (calibration mode). - `compute_person_score()` now takes &AppStateInner alongside &FeatureInfo so the adaptive denominators are reachable. All 3 call sites updated. - New `AppStateInner` fields: `p95_variance`, `p95_motion_band_power`, `p95_spectral_power`, `dedup_factor`, `data_dir`. Closes #491. Directly addresses: - #499 (double skeletons, multi-node) — the slot-clustering problem this PR's adaptive normaliser was designed to fix - #519 Bug 1 (ghost person detection on edge-tier 1 & 2 multi-node) - #496 (person count over-reporting on single-room single-person) Verified locally: - cargo check -p wifi-densepose-sensing-server --no-default-features: 1.0s - cargo test -p wifi-densepose-sensing-server --no-default-features --lib: 233/233 passed in 25.0s Co-authored-by: @schwarztim Co-Authored-By: claude-flow <ruv@ruv.net>	2026-05-19 08:25:47 -04:00
rUv	b2e2e6d6fd	fix(sensing-server): WS broadcast emits effective_source() not hardcoded "esp32" (closes #618 ) (#621 ) Reported by @ArnonEnbar with a complete reproduction. broadcast_tick_task() re-emits the cached `latest_update` every tick so pose WS clients keep getting data even when ESP32 pauses between frames. The `source` field of that cached update was set to "esp32" at the moment a fresh ESP32 frame was last decoded (main.rs:3885, :4136). After the ESP32 loses power or network, no fresh frame is decoded — the cached `latest_update` is still re-broadcast every tick with the stale source: "esp32" baked in. UI's "Sensing" tab keeps showing "LIVE — ESP32 HARDWARE Connected" with frozen vitals/features/ classification re-broadcast indefinitely. REST `/health` correctly reports source: "esp32:offline" (via effective_source(), which checks last_esp32_frame elapsed time against ESP32_OFFLINE_TIMEOUT=5s) — but the WS broadcast path was the one consumer that didn't call it. Fix: clone the cached update per tick, overwrite source with s.effective_source(), then serialize and broadcast. UI now switches to "esp32:offline" on the same 5s budget as the REST surface. cargo build -p wifi-densepose-sensing-server --no-default-features: 17s, no errors (1 pre-existing unused-import warning unchanged).	2026-05-18 08:18:18 -04:00
rUv	72bbd256e7	fix(security): path-traversal guard on 5 sensing-server endpoints (closes #615 ) (#616 ) Reported by @bannned-bit. Five endpoints in v2/crates/wifi-densepose-sensing-server embedded user-controlled identifiers in format!() paths with no sanitization: recording.rs POST /api/v1/recording/start (session_name) recording.rs GET /api/v1/recording/download/:id (id) recording.rs DELETE /api/v1/recording/delete/:id (id) model_manager.rs POST /api/v1/models/load (model_id) training_api.rs load_recording_frames (dataset_ids[]) Each unauthenticated caller could: - READ arbitrary files via ../../etc/passwd, ../../.env, etc. - WRITE attacker-controlled JSONL via recording/start - LOAD attacker-controlled .rvf model files - DELETE arbitrary files the server process can touch New `path_safety` module exports `safe_id(&str) -> Result<&str, PathSafetyError>` that enforces the rejection envelope BEFORE any user input reaches a format!() that builds a path: - Allowed character set: [A-Za-z0-9._-] - Reject leading '.' (rules out '.', '..', '.env', hidden files) - Reject empty strings - Reject anything > 64 bytes - Reject all whitespace, path separators, null bytes, non-ASCII Applied at all 5 sites. Errors return 400 Bad Request (download) / status:"error" JSON (others) — not panics. 9 unit tests in path_safety::tests cover: - accepts simple alphanumeric / hyphen / underscore / dot - rejects empty, leading dot, path separators ('/', '\'), null byte, whitespace, shell specials, non-ASCII (including fullwidth slash U+FF0F), too-long, boundary at MAX_ID_LEN test result: ok. 9 passed; 0 failed cargo build -p wifi-densepose-sensing-server --no-default-features: 33s Fix-marker RuView#615 in scripts/fix-markers.json prevents removing the guard at any of the 5 call sites. CHANGELOG entry under [Unreleased] / Security documents the patched endpoints and the rejection envelope. Severity: critical per reporter — five remotely-reachable paths to read, write, or delete arbitrary files. Hot per-request paths, not edge cases.	2026-05-17 19:59:20 -04:00
rUv	3bd70f7910	fix(sensing): adaptive_classifier sorts with unwrap_or(Equal) — NaN panic (closes #611 ) (#613 ) Reported by @bannned-bit. v2/crates/wifi-densepose-sensing-server/src/ adaptive_classifier.rs:94 did: sorted.sort_by(\|a, b\| a.partial_cmp(b).unwrap()); f64::partial_cmp returns None on NaN, so `.unwrap()` panics. CSI data from real ESP32 hardware can produce NaN (silent DSP div-by-zero, empty buffer, etc.), and this code path runs on every frame in the classify() hot path — a single NaN frame kills the entire sensing server process. Fix swaps for unwrap_or(Ordering::Equal), matching the pattern the same file already uses at lines 149-150 and 155 (those sites were already NaN-safe; this site was an oversight). Scoped audit: greped the v2/ tree for `partial_cmp(b).unwrap()`. The other 3 hits are in #[cfg(test)] blocks (spectrogram.rs:269, depth.rs:234, connectivity.rs:477) where panic-on-NaN is acceptable because test inputs are controlled. Only adaptive_classifier.rs:94 was a production-path crash. Severity: critical per reporter — runtime panic on real-world data. Patch: 1-line behavioural change + comment.	2026-05-17 19:29:07 -04:00
rUv	1b155ad027	chore: remove empty stub crates wifi-densepose-{api,db,config} (closes #578 ) (#608 ) Each of these crates was a single-line doc-comment placeholder: v2/crates/wifi-densepose-api/src/lib.rs: //! WiFi-DensePose REST API (stub) v2/crates/wifi-densepose-db/src/lib.rs: //! WiFi-DensePose database layer (stub) v2/crates/wifi-densepose-config/src/lib.rs: //! WiFi-DensePose configuration (stub) with empty [dependencies] in their Cargo.toml and zero references from any source file or Cargo.toml in the workspace (verified by `grep -rln wifi-densepose-api/-db/-config` across `v2/`). They were reserved early for an envisioned REST/database/config split that never materialised. The functionality these would have provided is covered today by: - REST/WS: wifi-densepose-sensing-server (Axum) - Config: per-crate config + CLI args in sensing-server and desktop - DB: no persistent state; system is real-time Removal prevents `cargo` from listing dead crates, shipping empty published artifacts to crates.io, or wasting reviewer attention. If any of these names is needed in the future, reintroduce them with a real implementation. Per the issue reporter (@bannned-bit / Matad0r) #578 explicitly listed "OR be removed from workspace members until implementation starts" as an acceptable resolution. Updated: - `v2/Cargo.toml`: drop the three members (with inline comment explaining why) - `v2/Cargo.lock`: regenerated by cargo check - `CLAUDE.md`: drop the three rows from the crate table and the publishing order list - `CHANGELOG.md`: add an `[Unreleased] / Removed` entry Verified: - `cd v2 && cargo check --workspace --no-default-features` -> finished in 48s, no errors (warnings unchanged)	2026-05-17 18:50:57 -04:00
dependabot[bot]	0310b1fa9a	chore(deps): bump @tauri-apps/plugin-dialog (#462 ) Bumps [@tauri-apps/plugin-dialog](https://github.com/tauri-apps/plugins-workspace) from 2.6.0 to 2.7.0. - [Release notes](https://github.com/tauri-apps/plugins-workspace/releases) - [Commits](https://github.com/tauri-apps/plugins-workspace/compare/log-v2.6.0...log-v2.7.0) --- updated-dependencies: - dependency-name: "@tauri-apps/plugin-dialog" dependency-version: 2.7.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-17 18:11:58 -04:00
dependabot[bot]	4ecc053a27	chore(deps-dev): bump typescript in /v2/crates/wifi-densepose-desktop/ui (#456 ) Bumps [typescript](https://github.com/microsoft/TypeScript) from 5.9.3 to 6.0.3. - [Release notes](https://github.com/microsoft/TypeScript/releases) - [Commits](https://github.com/microsoft/TypeScript/compare/v5.9.3...v6.0.3) --- updated-dependencies: - dependency-name: typescript dependency-version: 6.0.3 dependency-type: direct:development update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-17 18:11:41 -04:00
dependabot[bot]	4d45add824	chore(deps): bump react-dom and @types/react-dom (#451 ) Bumps [react-dom](https://github.com/facebook/react/tree/HEAD/packages/react-dom) and [@types/react-dom](https://github.com/DefinitelyTyped/DefinitelyTyped/tree/HEAD/types/react-dom). These dependencies needed to be updated together. Updates `react-dom` from 18.3.1 to 19.2.5 - [Release notes](https://github.com/facebook/react/releases) - [Changelog](https://github.com/facebook/react/blob/main/CHANGELOG.md) - [Commits](https://github.com/facebook/react/commits/v19.2.5/packages/react-dom) Updates `@types/react-dom` from 18.3.7 to 19.2.3 - [Release notes](https://github.com/DefinitelyTyped/DefinitelyTyped/releases) - [Commits](https://github.com/DefinitelyTyped/DefinitelyTyped/commits/HEAD/types/react-dom) --- updated-dependencies: - dependency-name: react-dom dependency-version: 19.2.5 dependency-type: direct:production update-type: version-update:semver-major - dependency-name: "@types/react-dom" dependency-version: 19.2.3 dependency-type: direct:development update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-17 18:11:26 -04:00
dependabot[bot]	a80617ee84	chore(deps): bump console from 0.15.11 to 0.16.3 in /v2 (#471 ) Bumps [console](https://github.com/console-rs/console) from 0.15.11 to 0.16.3. - [Release notes](https://github.com/console-rs/console/releases) - [Changelog](https://github.com/console-rs/console/blob/main/CHANGELOG.md) - [Commits](https://github.com/console-rs/console/compare/0.15.11...0.16.3) --- updated-dependencies: - dependency-name: console dependency-version: 0.16.3 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-17 18:10:01 -04:00
dependabot[bot]	afc86c6fc4	chore(deps): bump thiserror from 1.0.69 to 2.0.18 in /v2 (#469 ) Bumps [thiserror](https://github.com/dtolnay/thiserror) from 1.0.69 to 2.0.18. - [Release notes](https://github.com/dtolnay/thiserror/releases) - [Commits](https://github.com/dtolnay/thiserror/compare/1.0.69...2.0.18) --- updated-dependencies: - dependency-name: thiserror dependency-version: 2.0.18 dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-17 18:09:54 -04:00
dependabot[bot]	e6710e8988	chore(deps): bump ndarray-linalg from 0.16.0 to 0.18.1 in /v2 (#477 ) Bumps [ndarray-linalg](https://github.com/rust-ndarray/ndarray-linalg) from 0.16.0 to 0.18.1. - [Release notes](https://github.com/rust-ndarray/ndarray-linalg/releases) - [Commits](https://github.com/rust-ndarray/ndarray-linalg/compare/ndarray-linalg-v0.16.0...ndarray-linalg-v0.18.1) --- updated-dependencies: - dependency-name: ndarray-linalg dependency-version: 0.18.1 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-17 18:08:08 -04:00
dependabot[bot]	ab9799adc3	chore(deps): bump tower-http from 0.5.2 to 0.6.8 in /v2 (#483 ) Bumps [tower-http](https://github.com/tower-rs/tower-http) from 0.5.2 to 0.6.8. - [Release notes](https://github.com/tower-rs/tower-http/releases) - [Commits](https://github.com/tower-rs/tower-http/compare/tower-http-0.5.2...tower-http-0.6.8) --- updated-dependencies: - dependency-name: tower-http dependency-version: 0.6.8 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-17 18:08:04 -04:00
dependabot[bot]	bdb4484259	chore(deps): bump tch from 0.14.0 to 0.24.0 in /v2 (#482 ) Bumps [tch](https://github.com/LaurentMazare/tch-rs) from 0.14.0 to 0.24.0. - [Release notes](https://github.com/LaurentMazare/tch-rs/releases) - [Changelog](https://github.com/LaurentMazare/tch-rs/blob/main/CHANGELOG.md) - [Commits](https://github.com/LaurentMazare/tch-rs/commits) --- updated-dependencies: - dependency-name: tch dependency-version: 0.24.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-17 18:08:01 -04:00

1 2

92 Commits