🎯
Focusing
PhD in Reinforcement Learning, LLM Alignment, RLHF
-
University of Cambridge
- https://holarissun.github.io/
- @HolarisSun
Highlights
- Pro
Block or Report
Block or report holarissun
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned
-
Prompt-OIRL
Prompt-OIRL Publiccode for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning
-
RewardShifting
RewardShifting PublicCode for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
-
PCHID_code
PCHID_code PublicCode for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
Jupyter Notebook 15
-
YangRui2015/AWGCSL
YangRui2015/AWGCSL PublicCode for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
-
Accountable-Offline-RL
Accountable-Offline-RL PublicCode for NeurIPS 2023 paper Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.