AW-Opt: Learning Robotic Skills with Imitationand Reinforcement at Scale

Yao Lu

Karol Hausman

Yevgen Chebotar

Mengyuan Yan

Eric Victor Jang

Alexander Herzog

Ted Xiao

Alex Irpan

Mohi Khansari

Dmitry Kalashnikov

Sergey Levine

Conference on Robot Learning 2021 (2021)

Google Scholar

Abstract

This paper proposes a new algorithm "AW-Opt" to combine Imitation Learning (IL) and Reinforcement Learning (RL). Prior methods face significant difficulty with sparse reward, image based input robotics tasks. By carefully designing sample filtering strategy, exploration strategy, and bellman equation, AW-Opt outperforms existing SOTA algorithms. Experimental results in both simulation and with real robots show that AW-Opt can achieve reasonable success rate from initial demonstrations, maintain low inference time, fine tune to reach SOTA success rate and use much less samples than existing algorithms.

Research Areas

Robotics

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

AW-Opt: Learning Robotic Skills with Imitationand Reinforcement at Scale

Abstract

Research Areas

Learn more about how we conduct our research

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

AW-Opt: Learning Robotic Skills with Imitationand Reinforcement at Scale

Abstract

Research Areas

Learn more about how we conduct our research

AI/ML Foundations  & Capabilities