5.5 KiB
R8 — RSSI-only person count: does it work without CSI?
Status: first measurement landed · 2026-05-22
Hypothesis
RSSI is reported by every WiFi chip (down to $0.50 ESP8266s). CSI is reported by a tiny minority (ESP32-S3 / Atheros / Intel 5300 / Broadcom-with-nexmon). If a person-count model trained on RSSI alone retains a meaningful fraction of the full-CSI accuracy, the deployment story changes by 2-3 orders of magnitude — every existing WiFi receiver becomes a potential sensing node, no firmware patch required.
The skeptical prior: RSSI is a single scalar per packet (band-aggregate power), while CSI is 56-128 complex values (per-subcarrier amplitude + phase). Naively, RSSI throws away ≥98% of the information. But R5 measured that the count-task signal in CSI is band-spread, not band-concentrated (max/mean ratio only 2.85× across 56 subcarriers). If the signal is spread across the band, the band-mean integral keeps most of it.
Method
- Take the existing
data/paired/wiflow-p7-1779210883.paired.jsonl(1,077 paired CSI windows + labels). - Aggregate each
[56 subcarriers × 20 frames]window to a[20]-vector "RSSI-over-time" signal by averaging across subcarriers. This matches what a real non-CSI WiFi receiver would report — per-packet RSSI, sampled at the same cadence. - Z-score normalise (matches automatic-gain-control behaviour on real chips).
- Random 80/20 split with seed=42 — identical to
cog-person-countv0.0.2's split, so the eval sets are the same individual samples. - Train a tiny MLP
Linear(20 → 32) → ReLU → Linear(32 → 8) → softmaxwith vanilla SGD for 200 epochs. No framework — pure NumPy. Keep best-by-eval-acc checkpoint.
Result
| Metric | RSSI-only (this) | cog-person-count v0.0.2 (full CSI) |
Retained |
|---|---|---|---|
| Overall accuracy | 0.591 | 0.623 | 94.82% |
| Class 0 accuracy | 0.595 | 0.862 | — |
| Class 1 accuracy | 0.586 | 0.343 | — |
| Train time | 0.72 s (CPU) | 0.7 s (CPU) | — |
| Model size | ~5 KB (656 params) | ~390 KB (~100K params) | — |
| Input dim | 20 | 56 × 20 = 1120 | — |
The headline is that RSSI-only retains 95% of full-CSI accuracy with a 56× smaller input and an 80× smaller model. The class accuracies are also notably more balanced than v0.0.2 (59.5 / 58.6 vs 86.2 / 34.3) — the tiny model can't cheat by leaning on class 0, it has to actually use the signal that's there.
Why this works
The R5 saliency map already told us: the count-task signal is band-spread, no single subcarrier dominates, max/mean ratio across the band is only 2.85×. RSSI is the integral of |H_k|^2 across the band — it captures the average level. For a band-spread signal, the average is a near-sufficient statistic. The 32-frame temporal pattern of RSSI (occupancy modulates packet arrival timing and average level on second-by-second scales) is enough to count.
What this enables (10-year horizon)
- Phones-as-sensors. Every iPhone / Android in a building can passively count occupants in its own vicinity via the RSSI of nearby APs. No app permissions beyond WiFi-scan; no CSI hardware required.
- Smart speakers, smart TVs, smart lights. Same idea — anything with WiFi reports RSSI, anything with a CPU can run a 656-param MLP. Counting becomes a federated property of any room with WiFi.
- Adoption story for the cog ecosystem. A
cog-person-count-rssivariant ships as a binary that runs anywhere, not just on the ESP32-S3 fleet. Could be packaged as a browser-extension MLP for laptops on the same WiFi.
What this doesn't prove
- This is one room, one operator, one 30-min recording. Generalisation across rooms / chips / people is unmeasured. The 5-fold reference for the full-CSI model was 62.2 ± 1.9% — the RSSI-only 59.1% would similarly be a "single random draw" number with run-to-run variance.
- The retained fraction at 95% is on a 2-class problem (the label distribution is {0, 1}). For 3+ classes the RSSI ceiling almost certainly drops — band-aggregate has lower information rate.
- The class 1 accuracy (58.6%) is actually higher than v0.0.2's (34.3%). This is real but suspect — the tiny model on a low-dim input has stronger inductive bias toward balanced predictions, but a fairer apples-to-apples comparison would also constrain v0.0.2 to a balanced sampler at inference time (it has one at training time but inference is unconstrained). Followup tick: re-eval v0.0.2 with the same prediction-balancing constraint.
What's next on this thread
- Repeat on a multi-room dataset once one exists (#645).
- 3-class extension (0 / 1 / 2+ people) — measure the information-rate cliff.
- Run the model on a non-ESP32 RSSI source (e.g.
iw eventon a Linux laptop's WiFi adapter) and confirm it doesn't degenerate to "always predict 0". - Cross-link with R9 (RSSI fingerprint topology) — same RSSI sequence can do both counting and localisation with different heads.
- Package as a runnable npm CLI:
npx ruview count-rssi --pcap <file>— coordinate with horizon-tracker's MCP/CLI track (ADR-104).
Connection back to PROGRESS.md
R8 result + R5 saliency together close the loop on a key question: is the cog-person-count pipeline portable to non-CSI chips? Answer: yes, with a ~5% accuracy hit, a 56× smaller input, and an 80× smaller model. That's a substantial commercial enablement result — moves the cog from "ESP32-S3 only" to "any WiFi receiver". Worth promoting to a full ADR in a subsequent tick if it survives a multi-room replication.