A visual-linguistic framework that enables open-vocabulary object grasping in robots
To be deployed across a broad range of dynamic real-world settings, robots must be able to complete a variety of manual tasks, from household chores to complex manufacturing or agricultural processes. These ...