
These reinforcement learning tutorials cover everything from basic RL algorithms to advanced algorithms developed in recent years. If you speak Chinese, visit 莫烦 Python or my YouTube channel for more.

reinforcement-learning tutorial q-learning sarsa sarsa-lambda deep-q-network a3c ddpg policy-gradient dqn double-dqn prioritized-replay dueling-dqn deep-deterministic-policy-gradient asynchronous-advantage-actor-critic actor-critic tensorflow-tutorials proximal-policy-optimization ppo machine-learning

Deep Reinforcement Learning Course is a free series of blog posts and videos about Deep Reinforcement Learning, where we'll learn the main algorithms and how to implement them with TensorFlow. The articles explain the concepts from the big picture to the mathematical details behind them.

deep-reinforcement-learning qlearning deep-learning tensorflow-tutorials tensorflow ppo a2c actor-critic deep-q-network deep-q-learning

ChainerRL is a deep reinforcement learning library that implements various state-of-the-art deep reinforcement learning algorithms in Python using Chainer, a flexible deep learning framework. ChainerRL is tested with Python 2.7+ and 3.5.1+. For other requirements, see requirements.txt.

chainer reinforcement-learning deep-learning machine-learning dqn actor-critic

This repository contains the code and PDFs for a series of blog posts called "dissecting reinforcement learning", which I published on my blog mpatacchiola.io/blog. It also links to resources that can be useful for a reinforcement learning practitioner. If you have good references that may be of interest, please send me a pull request and I will integrate them into the README. The source code is in src, with subfolders named after the post number. The pdf folder holds the A3 documents of each post for offline reading. The images folder holds the raw svg files of the images used in each post.

reinforcement-learning deep-reinforcement-learning markov-chain temporal-differencing-learning sarsa q-learning actor-critic multi-armed-bandit inverted-pendulum mountain-car drone-landing dissecting-reinforcement-learning genetic-algorithm

Modular Deep Reinforcement Learning framework in PyTorch. A multitask agent solving both OpenAI Cartpole-v0 and Unity Ball2D.

reinforcement-learning pytorch openai-gym framework research dqn artificial-intelligence policy-gradient actor-critic ppo a3c deep-rl

NOTE: THE CODE IS UNDER DEVELOPMENT, PLEASE ALWAYS PULL THE LATEST VERSION FROM HERE.

In recent years, sequence-to-sequence (seq2seq) models have been used in a variety of tasks, from machine translation, headline generation, text summarization, and speech to text, to image caption generation. The underlying framework of all these models is usually a deep neural network that contains an encoder and a decoder. The encoder processes the input data, and the decoder receives the output of the encoder and generates the final output. Although simply using an encoder/decoder model would, most of the time, produce better results than traditional methods on the above-mentioned tasks, researchers have proposed additional improvements over these seq2seq models, such as attention-based models over the input, pointer-generation models, and self-attention models. However, all these seq2seq models suffer from two common problems: 1) exposure bias and 2) inconsistency between train/test measurement. Recently, a completely fresh point of view has emerged for solving these two problems in seq2seq models by using methods from Reinforcement Learning (RL). In this new line of research, we look at seq2seq problems from the RL point of view and try to come up with a formulation that combines the power of RL methods in decision-making with the ability of seq2seq models to retain long-term memory. In this paper, we summarize some of the most recent frameworks that combine concepts from the RL world with deep neural networks, and explain how these two areas could benefit from each other in solving complex seq2seq tasks. In the end, we provide insights on some of the problems of the current models and how we can improve them with better RL models.
We also provide the source code for implementing most of the models discussed in this paper on the complex task of abstractive text summarization.
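
The train/test inconsistency mentioned above is often addressed by optimizing a sequence-level reward directly with a policy gradient. The following is a minimal sketch of a REINFORCE-style loss with a baseline, not code from the repository: the function names, the token probabilities, and the reward and baseline values are all hypothetical and chosen only for illustration.

```python
import math


def sequence_log_prob(step_probs):
    """Sum the log-probabilities the model assigned to each sampled token."""
    return sum(math.log(p) for p in step_probs)


def policy_gradient_loss(step_probs, reward, baseline=0.0):
    """REINFORCE-style loss: -(reward - baseline) * log p(sequence).

    Minimizing this raises the probability of sampled sequences whose
    reward beats the baseline and lowers it for those that fall short.
    """
    return -(reward - baseline) * sequence_log_prob(step_probs)


# Hypothetical numbers: probabilities of three sampled output tokens,
# a sequence-level reward for the sample, and a baseline reward.
sampled_probs = [0.5, 0.25, 0.8]
loss = policy_gradient_loss(sampled_probs, reward=0.6, baseline=0.4)
```

In a real seq2seq model the probabilities would come from the decoder and the loss would be differentiated by the deep learning framework; this sketch only shows how the reward scales the log-likelihood term.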

reinforcement-learning actor-critic policy-gradient abstractive-text-summarization pointer-generator nlp

NOTICE: Please use the next version, SLM-Lab. An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.

keras tensorflow openai experiment policy-gradient actor-critic ddpg deep-reinforcement-learning reinforcement-learning gym lab reinforcement learning

Note that this repo is only compatible with Mujoco in OpenAI gym. If you want to train an agent in the Atari domain, please refer to pytorch-a3c. There are three tasks/modes: train, eval, develop.

pytorch reinforcement-learning a3c mujoco continuous-control actor-critic

This is a toy example of using multiprocessing in Python to asynchronously train a neural network to play the discrete-action CartPole and continuous-action Pendulum games. The asynchronous algorithm I used is called Asynchronous Advantage Actor-Critic, or A3C. I believe it is the simplest toy implementation you can find at the moment (2018-01).
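
As a rough illustration of the "advantage" part of A3C, the sketch below computes bootstrapped discounted returns for a short rollout and subtracts the critic's value estimates from them. It is not taken from the repository: the function names and the reward, value, and discount numbers are hypothetical, and the multiprocessing side of the algorithm is left out.

```python
def n_step_returns(rewards, bootstrap_value, gamma=0.99):
    """Discounted returns for a rollout, bootstrapped from the critic's
    value estimate of the state that follows the last step."""
    returns = []
    running = bootstrap_value
    # Walk the rollout backwards so each return reuses the next one.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(returns, values):
    """Advantage = observed return minus the critic's value estimate."""
    return [g - v for g, v in zip(returns, values)]


# Hypothetical three-step rollout that ends in a terminal state,
# so the bootstrap value is 0.
rewards = [1.0, 1.0, 1.0]
values = [2.0, 1.5, 1.0]  # critic estimates for the visited states
rets = n_step_returns(rewards, bootstrap_value=0.0, gamma=0.5)
advs = advantages(rets, values)
```

In an actor-critic update, each worker would typically use advantages like these to weight the policy's log-probability gradient and use the returns as regression targets for the critic.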

pytorch a3c gym neural-network asynchronous-advantage-actor-critic multiprocessing toy-example actor-critic

Some reinforcement learning algorithms I'm (re)-implementing, all in one place. Also a dumping ground for other ML work.

reptile reinforcement-learning policy-gradient dqn deep-learning deep-q-network deep-q-learning deep-reinforcement-learning lstm ddpg machine-learning meta-learning hierarchical-reinforcement-learning a2c actor-critic

The intent of these IPython Notebooks is mostly to help me practice and understand the papers I read; thus, I will opt for readability over efficiency in some cases. The implementation will be uploaded first, followed by markup explaining each portion of the code. I'll credit any borrowed code in the Acknowledgements section of this README.

python3 pytorch reinforcement-learning deep-reinforcement-learning deep-q-network double-dqn multi-step-learning dueling-dqn noisy-networks prioritized-experience-replay deeprl-tutorials categorical-dqn rainbow quantile-regression deep-recurrent-q-network actor-critic advantage-actor-critic a2c gae ppo