Added feature: Multi-input with dictionaries #195

bklebel · 2018-04-11T21:26:39Z

This is an improvement to the MultiInputProcessor, establishing correct communication between keras-rl and a gym-environment, which has a dictionary-like observation space.

Code examples of model and environment can be found here https://gist.github.com/bklebel/913d8f155e6ed23f8a35fba989c70140. It is not a minimal working example, but contains the important parts of model and environment (corresponding names of input layers and spaces in the observation).

merging keras-rl/keras-rl into bklebel/keras-rl

This is an improvement to the MultiInputProcessor, establishing correct communication between keras-rl and a gym-environment, which has a dictionary-like observation space.

bklebel · 2018-05-07T11:38:58Z

Currently only 2D input is supported (images, matrices, ...) as this was what I needed - an update to variable input dimensions (1d vectors, scalars, cubic matrices, ...) is under way, but not high up in my priorities

previously only 2D inputs (matrices) could be handled, now alll kind of inputs, vectors, 2D matrices or higher dimensions should be handled

bklebel · 2018-05-10T18:45:48Z

PR updated for functionality - now also non-2D arrays can be processed.
the bug failing all checks was removed - should be fully functional now!

bklebel · 2018-05-14T11:48:57Z

For anyone who wants to use this already, in /rl/callbacks.py, in the lines 191-193, the np.mean/min/max need to be exchanged (e.g. with 0) because numpy cannot calculate the mean of dictionaries, which is all to understandable.
Apart from this little piece of code, everything should run now as intended.

bklebel · 2018-05-15T08:46:21Z

I found some problems with nonzero window lengths in memories, I will continue working on it

RaphaelMeudec · 2018-05-15T09:25:21Z

@bklebel Feel free to ping me if you need help/advice on specific points

bklebel · 2018-05-15T10:00:28Z

@RaphaelMeudec Thank you!
It is very good and heartening to know that there is some attention to this topic.

bklebel · 2018-07-23T21:20:56Z

@RaphaelMeudec The nonzero window length option works now, however with an ugly workaround for a rather weird problem: lines 48-50 (in processors.py) exist, because at a nonzero window length, the "last instance in the window" is being wrapped in an additional 0-dim numpy array. I did not find the place in the code where this happens (so arbitrarily), and I could not think of a good implicit and array-like way how to solve this, so I ended up with going through the whole batch and window instances individually. I am sure this can be solved nicer (e.g. finding the place where the mischief is contrived i.e. the dict wrapped in 0-dim array). So, please help!

Test cases for this dict-like multi-input are still on my list.

updating fork

a) One single dict entry could not properly would not properly work b) channels_first is now a possible format of input c) the keras.train_on_batch function was not correctly loaded with target values in case of a dict-input I also completely removed the output of mean, max and min observations at output-verbose=2, since it essentially becomes meaningless as soon as non-scalar observations are used

hopefully

RaphaelMeudec · 2018-08-03T11:19:45Z

@bklebel I'll check all this soon, I'll keep you updated!

bklebel · 2018-08-03T11:24:52Z

Thanks - I think I solved it in the commits after my comment, but I am not entirely sure.
I have tested the dqn_cartpole example, rewriting everything to dictionaries, and got good results, I am just about to show them here and upload the altered environment+agent, maybe this will make everything easier for checking weird behaviour.

bklebel · 2018-08-03T16:08:21Z

In this gist, the there is the code for the keras-rl cartpole example, rewritten for dictionary inputs. The corresponding results are displayed, once for the whole observation put into one value of a dict (dqn_onedict_cartpole.py):

And the same for all inputs in separate values of the dictionary (dqn_multidict_cartpole.py)

In both cases, the agent learns a working policy.

I could integrate them in the tests, there is a TODO to use an environment to see whether it learns something, however I am not quite sure whether the cartpole environment is simple enough for travis.

I'm looking forward to your assessment, @RaphaelMeudec :)

Merge pull request #3 from bklebel/bklebel-codacy-1

I will not change line 60 `order[idx_state, idx_window, i] = state_batch[idx_state][idx_window][key][i]`, in spite of codacy, because I think it is easier to understand what happens if all indices are visible as is.

updating with changes

pittnerf · 2018-12-08T08:39:26Z

The link to your examples changed: https://gist.github.com/bklebel/e3bd43ce228a53d27de119c639ac61ee

update from keras-rl master

using `state` in order to satisfy Codacy line 63 (and following) remains unchanged (with all indices), to ensure the clarity of what is happening.

bklebel added 2 commits April 10, 2018 18:40

Merge pull request #1 from keras-rl/master

4d0e0bb

merging keras-rl/keras-rl into bklebel/keras-rl

Update to MultiInputProcessor

e318c71

This is an improvement to the MultiInputProcessor, establishing correct communication between keras-rl and a gym-environment, which has a dictionary-like observation space.

bklebel added 2 commits May 10, 2018 18:09

debugging, adding multidimensional support

c3d2111

previously only 2D inputs (matrices) could be handled, now alll kind of inputs, vectors, 2D matrices or higher dimensions should be handled

debugging for travis checks

d9ea8c9

bklebel added 5 commits May 13, 2018 20:34

update dqn agent for compatibility

8396057

update sarsa for compatibility

a91d952

update for complete compatibility of dqn agent

effb9b2

update sarsa agent for "full" compatibility

e1ad17f

debug

ec06a0f

bklebel changed the title ~~Add feature for the MultiInputProcessor~~ Added feature: Multi-input with dictionaries May 14, 2018

fix zeroed observation in memory for dict space

f81d034

bklebel and others added 6 commits July 23, 2018 09:08

fixed nonzero window length

c9877f3

nonzero window length not fixed....

c8a12d4

nonzero window length not fixed....continuing work

8627e04

nonzero window length works - super ugly workaround

d46ace0

fix if condition with .item()

6694636

fix for travis PEP8-checks

875029c

bklebel added 6 commits July 24, 2018 18:19

cleanup

a367e11

cleanup

f483f68

Merge pull request #2 from keras-rl/master

b149a46

updating fork

cleaned personal changes

02cfd5f

cleanup

419aebb

bklebel added 3 commits July 28, 2018 13:51

fixing the fix

45f95ab

hopefully

fixing

ae9b827

fix for scalar values

5126403

bklebel mentioned this pull request Aug 3, 2018

Keras-RL adding extra dimension to RGB input #229

Closed

bklebel added 8 commits August 8, 2018 16:40

update processors.py

a300691

update memory.py

478fb84

Update dqn.py

90fbdef

resolve issues shown by codacy

1e7e241

Merge pull request #3 from bklebel/bklebel-codacy-1

update processors.py for codacy review

2f1348d

I will not change line 60 `order[idx_state, idx_window, i] = state_batch[idx_state][idx_window][key][i]`, in spite of codacy, because I think it is easier to understand what happens if all indices are visible as is.

Update processors.py

d9ab4de

Update processors.py

c34e87f

Merge pull request #4 from keras-rl/master

5b344e1

updating with changes

bklebel mentioned this pull request Nov 30, 2018

spaces.Dict is being packed into arrays openai/gym#984

Closed

ishit approved these changes Dec 31, 2018

View reviewed changes

Merge pull request #5 from keras-rl/master

1fd63d5

update from keras-rl master

bklebel mentioned this pull request Jan 9, 2019

Multiple-Input models #254

Closed

Codacy clean

b6c6b9c

using `state` in order to satisfy Codacy line 63 (and following) remains unchanged (with all indices), to ensure the clarity of what is happening.

bklebel mentioned this pull request Jan 17, 2019

DDPG with MultiInputProcessor not working #287

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added feature: Multi-input with dictionaries #195

Added feature: Multi-input with dictionaries #195

bklebel commented Apr 11, 2018

bklebel commented May 7, 2018

bklebel commented May 10, 2018

bklebel commented May 14, 2018

bklebel commented May 15, 2018

RaphaelMeudec commented May 15, 2018

bklebel commented May 15, 2018

bklebel commented Jul 23, 2018 •

edited

RaphaelMeudec commented Aug 3, 2018

bklebel commented Aug 3, 2018

bklebel commented Aug 3, 2018 •

edited

pittnerf commented Dec 8, 2018

Added feature: Multi-input with dictionaries #195

Are you sure you want to change the base?

Added feature: Multi-input with dictionaries #195

Conversation

bklebel commented Apr 11, 2018

bklebel commented May 7, 2018

bklebel commented May 10, 2018

bklebel commented May 14, 2018

bklebel commented May 15, 2018

RaphaelMeudec commented May 15, 2018

bklebel commented May 15, 2018

bklebel commented Jul 23, 2018 • edited

RaphaelMeudec commented Aug 3, 2018

bklebel commented Aug 3, 2018

bklebel commented Aug 3, 2018 • edited

pittnerf commented Dec 8, 2018

bklebel commented Jul 23, 2018 •

edited

bklebel commented Aug 3, 2018 •

edited