Project work for Autonomous and Adaptive Systems, UNIBO 2022
Updated Jul 2, 2022 - Python
Implementation of PPO on legged robots
CS 5180 Reinforcement Learning: Final Project
Example PPO implementation with ReLAx
Implementations of modern machine-learning papers, including PPO, PPG, and POP3D
A novel approach to solve Contextual Reinforcement Learning
Simple and compact implementations of reinforcement learning benchmark algorithms
RL model for Hola's Haggling Challenge
PyTorch implementations of reinforcement learning algorithms: policy gradient methods (vanilla policy gradient, actor-critic, PPO) and generative adversarial imitation learning.
Implementation of some deep RL algorithms
Reinforcement learning approaches for Mahjong AI
Training PPO with DQN as a critic
An implementation of the Proximal Policy Optimization (PPO) algorithm. This implementation is based on the example available on the official Keras website. The goal of this project is to provide a .NET-based solution for running PPO algorithms.
Training a PPO agent to balance a pendulum in a partially observable environment.
Dialogue Summary LLM - FLAN-T5: An implementation of the FLAN-T5 LLM to summarize dialogues. Prompt engineering, fine-tuning with PEFT, and fine-tuning with RL (PPO) are explored in this project.
Learning to Run NIPS 2017 Competition