From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
FSD (From Seeing to Doing) connects spatial visual reasoning with robotic action by generating structured intermediate representations, which improves generalization to unseen manipulation tasks.
Jan 3, 2026