Front camera only. Once the user steps out of the frame, the robot forgets who they were — no memory of the person it was just with.
User in frame.
User out of frame.
User returns — but as a stranger.
Detection is not recognition. And anything behind the robot is invisible.
Improve the user experience while navigating them to the goal.
The robot should remember who's using it — and have an all-around view.
Front + back. The robot remembers who it's leading — through the turn, into the follow.
Two ML models. A body fingerprint and a face fingerprint. Body first, face confirms.
A vision model spots people in the frame and crops out the body and face for the next stage.
ML turns the body and face into mathematical signatures — vectors a computer can compare in microseconds.
Body match scans the memory bank; the face gate confirms it's the same person — even after the user turns and walks away.
Happy-path run: interaction on the front camera, the user moves to the back, robot keeps the same identity.
Cameras mounted in their actual positions on arcmove — same identity tracked across the real front→back perspective the robot sees in deployment.
Cameras moving with the robot — testing the identity lock under real navigation motion: turning, driving forward, the user shifting view as the robot moves.
A working POC, not a finished product.
Three honest limits to know about.
Only front and back cameras. A user crossing left or right disappears between the cones.
Fix: 4 cameras in iteration 3.
Face confirms identity — but only up close. Far away or facing away, only the body is used.
Mitigation: body bank stores multiple poses.
Once selected, the robot locks to that user only. No parallel tracking.
By design: one guest per interaction.
From 2 cameras to 4 — front, back, left, right — with overlapping FoVs so the user never leaves coverage.
Left and right cameras added to the existing front + back. FoVs are sized to overlap at the corners — no gap between cones.
Move the perception stack off the dev machine onto the robot's small on-board Jetson — efficient inference, lower BOM cost, no laptop tethered to the robot.
Feed the identity signal into the robot's navigation and behaviour stack — turning "we know who's there" into "the robot does what we want."
Questions & discussion.