Workshop
Generalizable Policy Learning in the Physical World
Young Min Kim · Sergey Levine · Ming Lin · Tongzhou Mu · Ashvin Nair · Hao Su
Fri 29 Apr, 8 a.m. PDT
While the study of generalization has played an essential role in many application domains of machine learning (e.g., image recognition and natural language processing), it did not receive the same amount of attention in common frameworks of policy learning (e.g., reinforcement learning and imitation learning) at the early stage for reasons such as policy optimization is difficult and benchmark datasets are not quite ready yet. Generalization is particularly important when learning policies to interact with the physical world. The spectrum of such policies is broad: the policies can be high-level, such as action plans that concern temporal dependencies and causalities of environment states; or low-level, such as object manipulation skills to transform objects that are rigid, articulated, soft, or even fluid.In the physical world, an embodied agent can face a number of changing factors such as \textbf{physical parameters, action spaces, tasks, visual appearances of the scenes, geometry and topology of the objects}, etc. And many important real-world tasks involving generalizable policy learning, e.g., visual navigation, object manipulation, and autonomous driving. Therefore, learning generalizable policies is crucial to developing intelligent embodied agents in the real world. Though important, the field is very much under-explored in a systematic way.Learning generalizable policies in the physical world requires deep synergistic efforts across fields of vision, learning, and robotics, and poses many interesting research problems. This workshop is designed to foster progress in generalizable policy learning, in particular, with a focus on the tasks in the physical world, such as visual navigation, object manipulation, and autonomous driving. We envision that the workshop will bring together interdisciplinary researchers from machine learning, computer vision, and robotics to discuss the current and future research on this topic.
Schedule
Fri 8:00 a.m. - 8:10 a.m.
|
Introduction and Opening Remarks
(
Introduction
)
>
|
Hao Su 🔗 |
Fri 8:10 a.m. - 8:35 a.m.
|
Invited Talk (Danica Kragic): Learning for contact rich tasks
(
Invited Talk
)
>
SlidesLive Video |
Danica Kragic 🔗 |
Fri 8:35 a.m. - 8:40 a.m.
|
Q&A for Invited Talk (Danica Kragic)
(
Q&A
)
>
|
Danica Kragic 🔗 |
Fri 8:40 a.m. - 9:05 a.m.
|
Invited Talk (Peter Stone): Grounded Simulation Learning for Sim2Real
(
Invited Talk
)
>
SlidesLive Video |
Peter Stone 🔗 |
Fri 9:05 a.m. - 9:10 a.m.
|
Q&A for Invited Talk (Peter Stone)
(
Q&A
)
>
|
Peter Stone 🔗 |
Fri 9:10 a.m. - 9:20 a.m.
|
Break
|
🔗 |
Fri 9:20 a.m. - 10:15 a.m.
|
Poster Session 1 ( Poster Session ) > link | 🔗 |
Fri 10:15 a.m. - 11:15 a.m.
|
Panel Discussion
(
Panel Discussion
)
>
|
Young Min Kim · Peter Stone · Nadia Figueroa · Hao Su · Mrinal Kalakrishnan · Xiaolong Wang · Deepak Pathak · Ming Lin · Danfei Xu 🔗 |
Fri 11:15 a.m. - 11:23 a.m.
|
ManiSkill Challenge Winner Presentation (Zhutian Yang & Aidan Curtis)
(
Contributed Talk
)
>
SlidesLive Video |
Zhutian Yang 🔗 |
Fri 11:23 a.m. - 11:31 a.m.
|
ManiSkill Challenge Winner Presentation (Fattonny)
(
Contributed Talk
)
>
SlidesLive Video |
Kun Wu 🔗 |
Fri 11:31 a.m. - 1:00 p.m.
|
Lunch Break
|
🔗 |
Fri 1:00 p.m. - 1:10 p.m.
|
Contributed Talk (Sim-to-Lab-to-Real: Safe RL with Shielding and Generalization Guarantees)
(
Contributed Talk
)
>
SlidesLive Video |
Kai-Chieh Hsu 🔗 |
Fri 1:10 p.m. - 1:35 p.m.
|
Invited Talk (Shuran Song): Iterative Residual Policy for Generalizable Dynamic Manipulation of Deformable Objects
(
Invited Talk
)
>
SlidesLive Video |
Shuran Song 🔗 |
Fri 1:35 p.m. - 1:40 p.m.
|
Q&A for Invited Talk (Shuran Song)
(
Q&A
)
>
|
Shuran Song 🔗 |
Fri 1:40 p.m. - 2:05 p.m.
|
Invited Talk (Nadia Figueroa): Towards Safe and Efficient Learning and Control for Physical Human Robot Interaction
(
Invited Talk
)
>
SlidesLive Video |
Nadia Figueroa 🔗 |
Fri 2:05 p.m. - 2:10 p.m.
|
Q&A for Invited Talk (Nadia Figueroa)
(
Q&A
)
>
|
Nadia Figueroa 🔗 |
Fri 2:10 p.m. - 2:18 p.m.
|
ManiSkill Challenge Winner Presentation (EPIC Lab)
(
Contributed Talk
)
>
SlidesLive Video |
Weikang Wan 🔗 |
Fri 2:18 p.m. - 2:30 p.m.
|
Break
|
🔗 |
Fri 2:30 p.m. - 2:40 p.m.
|
Contributed Talk (Know Thyself: Transferable Visual Control Policies Through Robot-Awareness)
(
Contributed Talk
)
>
SlidesLive Video |
Edward Hu 🔗 |
Fri 2:40 p.m. - 3:05 p.m.
|
Invited Talk (Mrinal Kalakrishnan): Robot Learning & Generalization in the Real World
(
Invited Talk
)
>
SlidesLive Video |
Mrinal Kalakrishnan 🔗 |
Fri 3:05 p.m. - 3:10 p.m.
|
Q&A for Invited Talk (Mrinal Kalakrishnan)
(
Q&A
)
>
|
Mrinal Kalakrishnan 🔗 |
Fri 3:10 p.m. - 3:35 p.m.
|
Invited Talk (Xiaolong Wang): Generalizing Dexterous Manipulation by Learning from Humans
(
Invited Talk
)
>
SlidesLive Video |
Xiaolong Wang 🔗 |
Fri 3:35 p.m. - 3:40 p.m.
|
Q&A for Invited Talk (Xiaolong Wang)
(
Q&A
)
>
|
Xiaolong Wang 🔗 |
Fri 3:40 p.m. - 3:48 p.m.
|
ManiSkill Challenge Winner Presentation (Silver-Bullet-3D)
(
Contributed Talk
)
>
SlidesLive Video |
Yingwei Pan 🔗 |
Fri 3:48 p.m. - 3:50 p.m.
|
Break
|
🔗 |
Fri 3:50 p.m. - 4:45 p.m.
|
Poster Session 2 ( Poster Session ) > link | 🔗 |
Fri 4:45 p.m. - 5:30 p.m.
|
ManiSkill Challenge Award Ceremony
(
Challenge Award Ceremony
)
>
|
13 presentersHao Su · Weikang Wan · Hao Shen · He Wang · Yingwei Pan · Zhutian Yang · Fabian Dubois · Tom Sonoda · Kun Wu · Kangqi Ma · Liu Kun · Jilei Hou · Tongzhou Mu |
Fri 5:30 p.m. - 6:30 p.m.
|
Closing Remarks
(
Closing Remarks
)
>
|
🔗 |
-
|
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations ( Poster ) > link | Sang Tong · Hongyao Tang · Yi Ma · Jianye HAO · YAN ZHENG · Zhaopeng Meng · Boyan Li · Zhen Wang 🔗 |
-
|
Imitation Learning for Generalizable Self-driving Policy with Sim-to-real Transfer ( Poster ) > link | Zoltán Lőrincz · Márton Szemenyei · Robert Moni 🔗 |
-
|
FlexiBiT: Flexible Inference in Sequential Decision Problems via Bidirectional Transformers ( Poster ) > link |
11 presentersMicah Carroll · Jessy Lin · Orr Paradise · Raluca Georgescu · Mingfei Sun · David Bignell · Stephanie Milani · Katja Hofmann · Matthew Hausknecht · Anca Dragan · Sam Devlin |
-
|
Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations ( Poster ) > link | Hao Shen · Weikang Wan · He Wang 🔗 |
-
|
A Study of Off-Policy Learning in Environments with Procedural Content Generation ( Poster ) > link | Andrew Ehrenberg · Robert Kirk · Minqi Jiang · Edward Grefenstette · Tim Rocktaeschel 🔗 |
-
|
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space ( Poster ) > link | Kuan Fang · Patrick Yin · Ashvin Nair · Sergey Levine 🔗 |
-
|
Learning Transferable Policies By Inferring Agent Morphology ( Poster ) > link | Brandon Trabucco · mariano Phielipp · Glen Berseth 🔗 |
-
|
Using Deep Learning to Bootstrap Abstractions for Robot Planning ( Poster ) > link | Naman Shah · Siddharth Srivastava 🔗 |
-
|
Don't Freeze Your Embedding: Lessons from Policy Finetuning in Environment Transfer ( Poster ) > link | Victoria Dean · Daniel Toyama · Doina Precup · Victoria Dean 🔗 |
-
|
Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning ( Poster ) > link | Ugo Lecerf · Christelle Yemdji-Tchassi · Pietro Michiardi 🔗 |
-
|
Separating the World and Ego Models for Self-Driving
(
Poster
)
>
link
SlidesLive Video |
Vlad Sobal · Alfredo Canziani · Nicolas Carion · Kyunghyun Cho · Yann LeCun 🔗 |
-
|
Multi-objective evolution for Generalizable Policy Gradient Algorithms ( Poster ) > link | Juan Jose Garau-Luis · Yingjie Miao · John Co-Reyes · Aaron Parisi · Jie Tan · Esteban Real · Aleksandra Faust 🔗 |
-
|
ShiftNorm: On Data Efficiency in Reinforcement Learning with Shift Normalization
(
Poster
)
>
link
SlidesLive Video |
Sicong Liu · Xi Zhang · Yushuo Li · Yifan Zhang · Jian Cheng 🔗 |
-
|
Improving performance on the ManiSkill Challenge via Super-convergence and Multi-Task Learning ( Poster ) > link | Fabian Dubois · Eric Platon · Tom Sonoda 🔗 |
-
|
Multi-task Reinforcement Learning with Task Representation Method ( Poster ) > link | Myungsik Cho · Whiyoung Jung · Youngchul Sung 🔗 |
-
|
Deep Sequenced Linear Dynamical Systems for Manipulation Policy Learning ( Poster ) > link | Mohammad Nomaan Qureshi · Ben Eisner · David Held 🔗 |
-
|
Learning Robust Task Context with Hypothetical Analogy-Making ( Poster ) > link | Shinyoung Joo · Sang Wan Lee 🔗 |
-
|
Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation ( Poster ) > link | Yingwei Pan · Yehao Li · Yiheng Zhang · Qi Cai · Fuchen Long · Zhaofan Qiu · Ting Yao · Tao Mei 🔗 |
-
|
Zero-Shot Reward Specification via Grounded Natural Language ( Poster ) > link | Parsa Mahmoudieh · Deepak Pathak · trevor darrell 🔗 |
-
|
Reinforcement Learning for Location-Aware Warehouse Scheduling ( Poster ) > link | Stelios Stavroulakis · Biswa Sengupta 🔗 |
-
|
A Probabilistic Perspective on Reinforcement Learning via Supervised Learning ( Poster ) > link | Alexandre Piche · Rafael Pardinas · David Vazquez · Chris J Pal 🔗 |
-
|
Prompts and Pre-Trained Language Models for Offline Reinforcement Learning ( Poster ) > link | Denis Tarasov · Vladislav Kurenkov · Sergey Kolesnikov 🔗 |
-
|
Compositional Multi-Object Reinforcement Learning with Linear Relation Networks ( Poster ) > link | Davide Mambelli · Frederik Träuble · Stefan Bauer · Bernhard Schoelkopf · Francesco Locatello 🔗 |
-
|
Density Estimation For Conservative Q-Learning ( Poster ) > link | Paul Daoudi · Ludovic Dos Santos · Merwan Barlier · Aladin Virmaux 🔗 |
-
|
Control of Two-way Coupled Fluid Systems with Differentiable Solvers
(
Poster
)
>
link
SlidesLive Video |
Brener Ramos · Felix Trost · Nils Thuerey 🔗 |
-
|
One-Shot Imitation with Skill Chaining using a Goal-Conditioned Policy in Long-Horizon Control ( Poster ) > link | Hayato Watahiki · Yoshimasa Tsuruoka 🔗 |
-
|
Versatile Offline Imitation Learning via State-Occupancy Matching ( Poster ) > link | Yecheng Jason Ma · Andrew Shen · Dinesh Jayaraman · Osbert Bastani 🔗 |
-
|
Let’s Handle It: Generalizable Manipulation of Articulated Objects ( Poster ) > link | Zhutian Yang · Aidan Curtis 🔗 |
-
|
Revisiting Model-based Value Expansion ( Poster ) > link | Daniel Palenicek · Michael Lutter · Jan Peters 🔗 |
-
|
An Empirical Study and Analysis of Learning Generalizable Manipulation Skill in the SAPIEN Simulator ( Poster ) > link | Liu Kun · Huiyuan Fu · Zheng Zhang · huanpu yin 🔗 |
-
|
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning ( Poster ) > link | Denis Yarats · David Brandfonbrener · Hao Liu · Michael Laskin · Pieter Abbeel · Alessandro Lazaric · Lerrel Pinto 🔗 |
-
|
Learning Generalizable Dexterous Manipulation from Human Grasp Affordance ( Poster ) > link | Yueh-Hua Wu · Jiashun Wang · Xiaolong Wang 🔗 |
-
|
Continuous Control on Time ( Poster ) > link | Tianwei Ni · Eric Jang · Tianwei Ni 🔗 |
-
|
A Minimalist Ensemble Method for Generalizable Offline Deep Reinforcement Learning ( Poster ) > link | Kun Wu · Yinuo Zhao · Zhiyuan Xu · Zhen Zhao · Pei Ren · Zhengping Che · Chi Liu · Feifei Feng · Jian Tang 🔗 |
-
|
Know Thyself: Transferable Visual Control Policies Through Robot-Awareness ( Poster ) > link | Edward Hu · Kun Huang · Oleh Rybkin · Dinesh Jayaraman 🔗 |
-
|
Sim-to-Lab-to-Real: Safe RL with Shielding and Generalization Guarantees ( Poster ) > link | Kai-Chieh Hsu · Allen Z. Ren · Duy Nguyen · Anirudha Majumdar · Jaime Fernández Fisac 🔗 |