Displaying 1 to 2 from 2 results

pytorch-A3C - Simple A3C implementation with pytorch + multiprocessing

  •    Python

This is a toy example of using multiprocessing in Python to asynchronously train a neural network to play discrete action CartPole and continuous action Pendulum games. The asynchronous algorithm I used is called Asynchronous Advantage Actor-Critic or A3C. I believe it would be the simplest toy implementation you can find at the moment (2018-01).