ppo

An implementation of the Proximal Policy Optimization (PPO) algorithm. This implementation is based on the example available on the official Keras website. The goal of this project is to provide a .NET-based solution for running PPO algorithms.

machine-learning csharp dotnet keras keras-tensorflow proximal-policy-optimization ppo

Updated Jun 6, 2023
C#

nimahsn / lusr_carracing

Star

tensorflow rl domain-adaptation ppo

Updated Apr 15, 2023
Jupyter Notebook

Ezgii / PPO-on-pendulum-extended

Star

Training a PPO to balance a pendulum in a partially observable environment.

reinforcement-learning openai-gym pytorch pendulum proximal-policy-optimization ppo

Updated May 27, 2023
Python

nithin-kamal / Dialogue-Summary-using-LLMs

Star

Dialogue Summary LLM - FLAN - T5: An implementation of the Flan-t5 LLM to summarize dialogues. Prompt Engineering , Fine tuning with PEFT and fine tuning with RL (PPO) is explored within this project.

natural-language-processing reinforcement-learning ppo prompt-engineering llms generative-ai flan-t5 peft-fine-tuning-llm

Updated Feb 19, 2024
Jupyter Notebook

marioyc / learning-to-run

Star

Learning to Run NIPS 2017 Competition

machine-learning reinforcement-learning tensorflow continuous-control trpo ppo

Updated Aug 18, 2017
Python

Improve this page

Add a description, image, and links to the ppo topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ppo topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ppo

Here are 624 public repositories matching this topic...

manuel-mariani / Proximal-Policy-Optimization-implementation

saolxs / legged-robots

XuTpoKoT / bmstu-sem6-sd

valentinowyhnel / cours_de_java

ruchapendharkar / learning-on-the-highway

nslyubaykin / relax_ppo_example

rusenburn / Axel

rpanackal / rl-msc-pro

huiwenzhang / rl-benchmark

cmlima / PPO

indutny / haggling_rl

SiddharthSingi / Policy-Gradient-Methods

TaoHuang13 / DeepRL

sharedcare / mahjong_rl

nslyubaykin / ppo_with_dqn_critic

samuelcaldas / PPOCartpole.NET

nimahsn / lusr_carracing

Ezgii / PPO-on-pendulum-extended

nithin-kamal / Dialogue-Summary-using-LLMs

marioyc / learning-to-run

Improve this page

Add this topic to your repo