RESEARCH

The field has converged on human video as the foundation for robot learning. The remaining question is quality.

THE CONVERGENCE ON HUMAN VIDEO

Every major robotics lab publishing in 2025-2026 uses egocentric human video as the pretraining foundation. NVIDIA found a log-linear scaling law. The question shifted from “should we use human video?” to “how much, and how to combine it with robot data?”

But vision alone misses half the story. On contact-rich tasks, vision-only models average 21% success; adding tactile sensing pushes that to 71%. For force-sensitive, bimanual manipulation, the data must include what cameras cannot see.

THE DATA THESE MODELS NEED

Egocentric video is the pretraining foundation. Tactile sensing is the missing modality. We deliver both.