Exploring Self-supervised Self-Adaption Mechanisms Based on Continuous Reinforcement Learning with Application to Game Systems

hpi-sam/RL_SelfSupervised_GameEngine

RL_SelfSupervised_GameEngine

Motivation

Self-adaption is a desired feature of real-time systems such as flight control and driverless vehicles. It is particularly challenging in the real world because these systems are subject to unpredictable changes that can be endogenous (e.g., partial failures or performance degradation) or exogenous (e.g., changes in the environment or in other concurrent systems). The current solution has been to train these self-adaptation mechanisms in a supervised way, which implies the strong assumption that all change situations are known at training time. To avoid this assumption, an alternative is to rely on simulators that produce change situations, which can then be used to train the adaptation mechanism in real time (continuous learning).

Problem

However, designing simulators is a domain-dependent task that is difficult to generalize across domains. Recent research on games showed that it is possible to train an agent without human input while it plays the game. While this is a step toward continuous learning, it does not address two major problems: (1) how to learn when the environment is changing (e.g., changes in the game rules) and (2) when to trigger learning (i.e., which events and which magnitudes of effects to monitor).
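To make problem (2) concrete, here is a minimal sketch of one possible trigger: a Page-Hinkley test that watches a monitored signal (for example, reward per episode) and raises an alarm after a sustained drop. This technique is our illustration, not something the project prescribes; the class name and the `delta` and `threshold` defaults are hypothetical and would need tuning per system.

```python
from dataclasses import dataclass


@dataclass
class PageHinkley:
    """Flags a sustained drop in a monitored signal (hypothetical helper)."""

    delta: float = 0.05      # tolerated deviation per observation (assumed default)
    threshold: float = 5.0   # alarm level (assumed default)
    n: int = 0               # number of observations seen
    mean: float = 0.0        # running mean of the signal
    cum: float = 0.0         # cumulative deviation from the running mean
    cum_max: float = 0.0     # largest cumulative deviation seen so far

    def update(self, x: float) -> bool:
        """Feed one observation; return True when a downward change is detected."""
        self.n += 1
        self.mean += (x - self.mean) / self.n
        self.cum += x - self.mean + self.delta
        self.cum_max = max(self.cum_max, self.cum)
        # A large gap between the peak and the current value means the
        # signal has stayed below its historical mean for a while.
        return self.cum_max - self.cum > self.threshold
```

An agent could call `update` after every episode and start retraining only when it returns `True`, which ties the trigger to the magnitude and persistence of an effect rather than to individual noisy events.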

Approach

We are interested in exploring reinforcement learning techniques to continuously train an adaptation mechanism in a self-supervised way. The adaptation mechanism is part of an agent that interacts with an environment (the system) by making decisions in the form of actions, where the environment is subject to endogenous and exogenous changes. Our insight is to model the problem as a simulation architecture that is executed at run time alongside the deployed system. As a proof of concept, we plan to run experiments with both a game engine and an e-commerce engine.
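The agent and environment interaction described above can be sketched with a toy example. Everything here is an assumption made for illustration: `ChangingEnv` is a hypothetical two-action system whose best action flips halfway through (standing in for an exogenous change such as a rule change), and `OnlineAgent` is a simple epsilon-greedy learner, not the project's adaptation mechanism.

```python
import random


class ChangingEnv:
    """Toy two-action system; the best action flips at step `change_at`."""

    def __init__(self, change_at: int = 500, seed: int = 0):
        self.t = 0
        self.change_at = change_at
        self.rng = random.Random(seed)

    def step(self, action: int) -> float:
        best = 0 if self.t < self.change_at else 1
        self.t += 1
        p = 0.9 if action == best else 0.1  # success probability of this action
        return 1.0 if self.rng.random() < p else 0.0


class OnlineAgent:
    """Epsilon-greedy learner with a constant step size, so old experience fades."""

    def __init__(self, n_actions: int = 2, alpha: float = 0.1,
                 epsilon: float = 0.1, seed: int = 1):
        self.q = [0.0] * n_actions
        self.alpha = alpha
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def act(self) -> int:
        if self.rng.random() < self.epsilon:  # explore
            return self.rng.randrange(len(self.q))
        return max(range(len(self.q)), key=self.q.__getitem__)  # exploit

    def learn(self, action: int, reward: float) -> None:
        self.q[action] += self.alpha * (reward - self.q[action])


def run(steps: int = 1000) -> list:
    """Interact and learn at every step; return the final value estimates."""
    env, agent = ChangingEnv(), OnlineAgent()
    for _ in range(steps):
        action = agent.act()
        agent.learn(action, env.step(action))
    return agent.q
```

The constant step size `alpha` is the design choice that matters: it weights recent rewards more heavily, so the value estimates can follow the change instead of averaging over the whole history.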

Preliminary References

Sequential Decision Making Techniques

  • Reinforcement Learning (e.g., Deep Q-Networks, DQN)
  • Multi-Armed Bandits (Bayesian Bandits)
  • Monte Carlo Tree Search (MCTS)
  • Machine Learning on Graphs (Graph Neural Nets, Causal Inference)
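As a small illustration of one technique from this list, the sketch below implements a Bayesian bandit as Thompson sampling with Beta-Bernoulli posteriors. Reading "Bayesian Bandits" as Thompson sampling is our assumption, and the function name, the arm probabilities in the usage line, and the step count are hypothetical.

```python
import random


def thompson_sampling(true_probs, steps=2000, seed=0):
    """Beta-Bernoulli Thompson sampling; returns how often each arm was pulled."""
    rng = random.Random(seed)
    n_arms = len(true_probs)
    wins = [1] * n_arms     # Beta prior alpha = 1 (uniform prior)
    losses = [1] * n_arms   # Beta prior beta = 1
    pulls = [0] * n_arms
    for _ in range(steps):
        # Draw one plausible success rate per arm from its posterior,
        # then play the arm whose draw is highest.
        samples = [rng.betavariate(w, l) for w, l in zip(wins, losses)]
        arm = samples.index(max(samples))
        if rng.random() < true_probs[arm]:  # simulated Bernoulli reward
            wins[arm] += 1
        else:
            losses[arm] += 1
        pulls[arm] += 1
    return pulls


pulls = thompson_sampling([0.2, 0.5, 0.8])
```

Because each arm is chosen in proportion to the probability that it is the best one, exploration shrinks on its own as the posteriors sharpen. In a changing environment the counts would need to be discounted or reset, which this sketch does not do.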
