il-by-rl

A reference implementation of a reduction from imitation learning to reinforcement learning, presented in the following paper:

Kamil Ciosek Imitation Learning by Reinforcement Learning, ICLR 2022.

Getting Started

The implementation was tested on Python 3.9. To run the code, you need to install packages from requirements.txt. Using this repository requires the git-lfs extension. See here for installation instructions.

To get started, simply follow these steps:

Clone the repo locally with: git clone https://github.com/spotify-research/il-by-rl.git
Move to the repository with: cd il-by-rl
install the dependencies: pip install -r requirements.txt

Running the experiments

Since running the experiments is computationally expensive, we provide pre-computed logs in the sample-logs directory. These can be plotted using the plots.ipynb notebook. Due to minor changes to the code, these logs are not absolutely identical to the ones used for the paper, but they support the exact same qualitative conclusion (ILR is as good as other methods while being simpler).

If you want to re-run the experiments (regenerating the logs), you can run the command python train.py --env=ENV --method=METHOD, where ENV is one of hopper, ant, walker, halfcheetah and METHOD is one of bc, il, gail, gmmil, sqil. The new logs will be saved to the current directory.

Support

Create a new issue

Contributing

We feel that a welcoming community is important and we ask that you follow Spotify's Open Source Code of Conduct in all interactions with the community.

Authors

Kamil Ciosek

Follow @SpotifyResearch on Twitter for updates.

License

Licensed under the Apache License, Version 2.0: https://www.apache.org/licenses/LICENSE-2.0

Security Issues?

Please report sensitive security issues via Spotify's bug-bounty program (https://hackerone.com/spotify) rather than GitHub.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
datasets		datasets
sample-logs		sample-logs
.gitattributes		.gitattributes
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
d4rl_sac.py		d4rl_sac.py
d4rl_train.py		d4rl_train.py
d4rl_train_sqil.py		d4rl_train_sqil.py
plots.ipynb		plots.ipynb
requirements.txt		requirements.txt
train.py		train.py
train_bc_functions.py		train_bc_functions.py
train_gail_functions.py		train_gail_functions.py
train_gmmil_functions.py		train_gmmil_functions.py
train_il_functions.py		train_il_functions.py
train_sqil_functions.py		train_sqil_functions.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

il-by-rl

Getting Started

Running the experiments

Support

Contributing

Authors

License

Security Issues?

About

Releases

Packages

Languages

License

spotify-research/il-by-rl

Folders and files

Latest commit

History

Repository files navigation

il-by-rl

Getting Started

Running the experiments

Support

Contributing

Authors

License

Security Issues?

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages