Sidecar that runs model.safetensors inference on the live sensing
stream and rebroadcasts annotated frames so the dashboard renders
real model output without recompiling the Rust crate.
The sensing server accepts --model but the runtime broadcast loop
emits pose_keypoints: None and zero confidences whether or not a
model is loaded. This script subscribes to /ws/sensing, runs the
encoder plus presence head per frame, and serves annotated
sensing_update and pose_data streams on a sibling port.
Includes an in place repair for the published safetensors header
padding so the script works directly against the HuggingFace bundle.