Policy gradient agent - no intention to add? #278

jonilaserson · 2019-01-02T10:16:34Z

I've noticed that the REINFORCE algorithm (aka policy gradient, without the Q function) is not listed in the list of agents, even the not-yet-implemented ones. I presume this was intentional? How come such a basic building block of RL was left out?

RaphaelMeudec · 2019-01-05T13:40:49Z

The list hasn't been updated recently, contributions on algorithm implementations are more than welcome!

piyush01123 · 2019-01-11T04:07:02Z

Hi. I want to take this up. I'll submit a PR once I'm done.
From what I understand, I should create a class at rl/agents and write a test for it at tests/rl/agents. I'll run the tests as per the Contribution guide before submitting PR.

tbmreza · 2019-04-30T04:41:16Z

It has been months, is this issue closing anytime soon?

random-user-x · 2019-04-30T08:35:11Z

I have been working lately to add more algorithms to this software. I am currently busy with my thesis and examination. I will begin this work from mid May. Thank you :)

tbmreza · 2019-04-30T14:18:22Z

@mirraaj Great! If it's not too much, I would really really appreciate it if you make a blog post walking through the process creating it. Translating from equations to implementation has always been a mystical thing to me.

random-user-x · 2019-07-16T11:10:30Z

@tbmreza Sincere apologies for such a delay. I am back working for keras-rl. I will keep you updated. Look for new algorithms soon. :)

hrik2001 · 2022-08-19T05:23:25Z

Hi! Any updates so far on this?

RaphaelMeudec added the contribution-welcome label Jan 5, 2019

RaphaelMeudec added this to To do in Contributing Jan 28, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Policy gradient agent - no intention to add? #278

Policy gradient agent - no intention to add? #278

jonilaserson commented Jan 2, 2019

RaphaelMeudec commented Jan 5, 2019

piyush01123 commented Jan 11, 2019

tbmreza commented Apr 30, 2019

random-user-x commented Apr 30, 2019

tbmreza commented Apr 30, 2019

random-user-x commented Jul 16, 2019

hrik2001 commented Aug 19, 2022

Policy gradient agent - no intention to add? #278

Policy gradient agent - no intention to add? #278

Comments

jonilaserson commented Jan 2, 2019

RaphaelMeudec commented Jan 5, 2019

piyush01123 commented Jan 11, 2019

tbmreza commented Apr 30, 2019

random-user-x commented Apr 30, 2019

tbmreza commented Apr 30, 2019

random-user-x commented Jul 16, 2019

hrik2001 commented Aug 19, 2022