https://arxiv.org/pdf/2402.15025
We found that simulated and real robots using EES are able to rapidly and continually improve their parameter policies with respect to human-given task distributions.
Our approach is able to improve its performance after 120 and 240 real-world skill-executions respectively. Each seed took 1-3 hours of real robot time.