Organizations

@TARTRL @OpenRL-Lab
huangshiyu13/README.md

OpenRL | Zhihu (知乎) | Google Scholar | LinkedIn | Personal Website

  • Hi, I am a researcher at Zhipu AI. Before that, I was a research scientist at 4Paradigm Inc. and the leader of OpenRL Lab. I received my B.E. and Ph.D. degrees (co-advised by Prof. Jun Zhu and Prof. Ting Chen) from the Department of Computer Science and Technology, Tsinghua University, in July 2017 and June 2022, respectively. My research focuses on deep reinforcement learning, multi-agent reinforcement learning, distributed reinforcement learning, RL for robotics, LLMs as agents, artificial general intelligence (AGI), and generative artificial intelligence (GAI). I have also spent time working at RealAI Inc., Huawei Noah's Ark Lab, Tencent AI Lab, Carnegie Mellon University, and SenseTime Inc. I am also the founder of the OpenRL Lab and the TARTRL group.
  • We are looking for self-motivated interns and full-time hires who have a strong background in mathematics or computer science and are eager to get involved in cutting-edge, fundamental AI research. Please feel free to email me if you are interested in collaborating.
  • 📫 Email: [email protected]

Pinned

  1. OpenRL-Lab/openrl Public

    Unified Reinforcement Learning Framework

    Python · 567 stars · 57 forks

  2. OpenRL-Lab/TiZero Public

    Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). A football game agent.

    Python · 44 stars · 6 forks

  3. OpenRL-Lab/Wandb_Tutorial Public

    How to use wandb?

    Python · 546 stars · 46 forks

  4. webtemplate Public

    A collection of website front-end templates

    HTML · 559 stars · 280 forks

  5. RPNplus Public

    RPN+ (TensorFlow) for people detection

    Python · 181 stars · 87 forks

  6. couplet_generation Public

    Couplet generation (对联生成) with TensorFlow

    Python · 36 stars · 5 forks