Can RL from pixels be as efficient as RL from state?
By Misha Laskin, Aravind Srinivas, Kimin Lee, Adam Stooke, Lerrel Pinto, Pieter Abbeel
A remarkable characteristic of human intelligence is our ability to learn tasks quickly. Most human...