Learning to Manipulate Deformable Objects - Plastic Bags

Last updated on Dec 16, 2023

Introduction

Isaac Sim from Nvidia is the physics engine used to simulate the dynamics of a bag.
The Bag in the simulator is modified in a way to have a different colored rim.
Properties of the simulation bag are that of a cloth to achieve a close-to-realistic simulation of the bag.

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

Environment is created using Isaac Orbit\footnote{Mittal, Mayank, et al. “Orbit: A unified simulation framework for interactive robot learning environments.” IEEE Robotics and Automation Letters (2023).} with a manipulator and a bag.
Various bag properties can be tweaked, and multiple manipulators can be added.
Multiple environments can be trained in parallel.

A visual policy\footnote{Wu, Yilin, et al. “Learning to manipulate deformable objects without demonstrations.” arXiv preprint arXiv:1910.13439 (2019).} is trained to get the pick point of the bag to lift the bag above the ground.
A positive reward is given when the manipulator successfully lifts the bag and a negative reward for an unsuccessful attempt.
UNet is pre-trained to get the segmentation of the bag.

A visual policy\footnote{Blanco-Mulero, David, et al. “QDP: Learning to Sequentially Optimise Quasi-Static and Dynamic Manipulation Primitives for Robotic Cloth Manipulation.” arXiv preprint arXiv:2303.13320 (2023)} is trained to get the pick point and a place point to open the bag to maximum opening.
A pick position is selected from the network, based on the pick position a place position is selected, and a drag from pick $\rightarrow$ place is performed.
Reward is given when the area of the bag rim is increased from the previous configuration after performing an action.

A pick point is predicted from the UNet to give a pick point, using the pick point and crop-centering around the point a place point is predicted.

The visual policy is able to learn the pick and place positions as the simulation environment is perfect in a sense (the bag and the background can be easily distinguished)

-Manipulator approaches pick/place point in only one orientation.

The Inverse Kinematics controller used is not able to solve in some instances.
Bag slips out of manipulator’s grip.
The task at hand is to increase the opening area of the rim at each step, the maximal opening goal set is not reached.

A docker image with Orbit support is built and hosted on Dockerhub.\footnote{DockerHub \href{https://hub.docker.com/layers/harshavguda/orbit-isaacsim/2022.2.1/images/sha256-4ea1a60c11b7b82c510a892441cb16d66c9a95874ef54af6ad9287ca9bb56632?context=repo}{link}}
Isaac Orbit Environments are set to run on Triton HPC using the docker image from above.\footnote{More details can be found at \href{https://harshaguda.notion.site/Isaac-Orbit-on-Triton-60762305bbc244eba68d07bed0c715f6?pvs=25}{documentation.}}
Synthetic camera data generation from the simulator environment.