Constraint-Preserving Data Generation
for Visuomotor Policy Generalization

Conference on Robot Learning (CoRL) 2025

1Stanford University, 2University of Texas at Austin, 3Princeton University

* indicates equal contributions


Given a single demonstration, CP-Gen generates data for new object poses, scales, and geometries, and enables zero-shot sim2real transfer.



Abstract


Large-scale demonstration data has powered key breakthroughs in robot manipulation, but collecting that data remains costly and time-consuming. To this end, we present Constraint-Preserving Data Generation (CP-Gen), a method that uses a single expert trajectory to generate robot demonstrations containing novel object geometries and poses. These generated demonstrations are used to train closed-loop visuomotor policies that transfer zero-shot to the real world. Similar to prior data-generation work focused on pose variations, CP-Gen first decomposes expert demonstrations into free-space motions and robot skills. Unlike prior work, we achieve geometry-aware data generation by formulating robot skills as keypoint-trajectory constraints: keypoints on the robot or grasped object must track a reference trajectory defined relative to a task-relevant object. To generate a new demonstration, CP-Gen samples pose and geometry transforms for each task-relevant object, then applies these transforms to the object and its associated keypoints or keypoint trajectories. We optimize robot joint configurations so that the keypoints on the robot or grasped object track the transformed keypoint trajectory, and then motion plan a collision-free path to the first optimized joint configuration. Using demonstrations generated by CP-Gen, we train visuomotor policies that generalize across variations in object geometries and poses. Experiments on 16 simulation tasks and four real-world tasks, featuring multi-stage, non-prehensile, and tight-tolerance manipulation, show that policies trained using our method achieve an average success rate of 77%, outperforming the best baseline, which achieves an average success rate of 50%.
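To make the pipeline above concrete, the following is a minimal, self-contained sketch (Python/NumPy) of the keypoint-trajectory transform step: a pose and scale transform is sampled for a task-relevant object and applied to the keypoint trajectory defined in that object's frame. The sampling ranges, uniform-scale assumption, and all function names are illustrative placeholders, not the released CP-Gen implementation.

```python
# Sketch of CP-Gen's transform step: sample a pose/scale for a task-relevant
# object and apply the same transform to its reference keypoint trajectory.
# Ranges, names, and the uniform-scale assumption are illustrative only.
import numpy as np

def sample_object_transform(rng):
    """Sample a planar pose perturbation and a uniform scale (assumed ranges)."""
    yaw = rng.uniform(-np.pi, np.pi)
    R = np.array([[np.cos(yaw), -np.sin(yaw), 0.0],
                  [np.sin(yaw),  np.cos(yaw), 0.0],
                  [0.0,          0.0,         1.0]])
    t = np.concatenate([rng.uniform(-0.1, 0.1, size=2), [0.0]])  # x-y translation (m)
    s = rng.uniform(0.8, 1.2)                                     # uniform scale factor
    return R, t, s

def transform_keypoint_traj(traj_obj_frame, R, t, s):
    """Apply scale, rotation, and translation to a (T, K, 3) keypoint trajectory
    expressed in the task-relevant object's frame."""
    return s * traj_obj_frame @ R.T + t

rng = np.random.default_rng(0)
R, t, s = sample_object_transform(rng)
reference_traj = np.zeros((50, 4, 3))   # placeholder: T=50 steps, K=4 keypoints
target_traj = transform_keypoint_traj(reference_traj, R, t, s)
# The remaining (omitted) steps would optimize joint configurations so keypoints
# on the robot or grasped object track target_traj, then motion plan a
# collision-free path to the first optimized configuration.
```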



Real World Evaluations

Digital twin environments for the real-world tasks, shown from the third-person agentview camera.

Quantitative Results

CP-Gen policies successfully transfer zero-shot sim2real. Policies take depth images and segmentation masks as input, and tasks feature both object-geometry and object-pose variations.
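For readers reproducing the sim2real setup, the short sketch below shows one way the depth and segmentation observations could be assembled into a policy input; the normalization constant, stacking order, and tensor shapes are assumptions for illustration, not the exact interface used in the paper.

```python
# Hypothetical observation assembly: normalized depth plus a binary
# segmentation mask, stacked as channels for the visuomotor policy.
import numpy as np

def build_observation(depth_m: np.ndarray, seg_mask: np.ndarray,
                      max_depth_m: float = 2.0) -> np.ndarray:
    """depth_m: (H, W) metric depth; seg_mask: (H, W) object-id mask."""
    depth = np.clip(depth_m, 0.0, max_depth_m) / max_depth_m   # normalize to [0, 1]
    mask = (seg_mask > 0).astype(np.float32)                    # task-relevant pixels
    return np.stack([depth, mask], axis=0)                      # (2, H, W) policy input
```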



CP-Gen for Geometry and Spatial Generalization

Simulation Environments

Geometry Generalization Reset Distribution

Coffee

Kitchen

HammerCleanup

MugCleanup

ThreePieceAssembly

Square

StackThree

Threading

CP-Gen Generated Trajectories

Coffee

Kitchen

HammerCleanup

MugCleanup

ThreePieceAssembly

Square

StackThree

Threading

CP-Gen achieves state-of-the-art results on the MimicGen simulation benchmark. On the original MimicGen benchmark (default task variants), CP-Gen achieves an average success rate of 88%, compared to MimicGen's 67%. On our custom benchmark of Geometry Generalization task variants, which feature novel object geometries (denoted TaskG), CP-Gen achieves an average success rate of 70%, outperforming MimicGen's 37% by 33 percentage points. These results highlight CP-Gen's strong generalization not only to pose variations but also to challenging geometric variations. Bolded numbers indicate the best-performing method within each modality group.




Acknowledgments

Toyota Research Institute provided funds to support this work. Additionally, this work was partially supported by the National Science Foundation (FRR-2145283, EFRI-2318065), the Office of Naval Research (N00014-24-1-2550), the DARPA TIAMAT program (HR0011-24-9-0428), and the Army Research Lab (W911NF-25-1-0065). It was also supported by the Institute of Information & Communications Technology Planning & Evaluation (IITP) grant funded by the Korean Government (MSIT) (No. RS-2024-00457882, National AI Research Lab Project). We additionally thank Timothy Chen, Wil Thomason, Zak Kingston, Tyler Lum, Priya Sundaresan, Fanyun Sun, Yifeng Zhu, Zhenyu Jiang, Mingyo Seo, Megan Hu, William Chong, Marion Lepert, and Brent Yi for helpful discussions throughout the project.