Displaying 1 to 2 from 2 results

POP3D - Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization

  •    Python

You can download results on three seeds from google drive https://drive.google.com/file/d/1c79TqWn74mHXhLjoTWaBKfKaQOsfD2hg/view?usp=sharing. We release it to make reproduction of this paper easy.