puyuan1996

Pinned

  1. opendilab/DI-engine (Public)

    OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

    Python · 3.6k stars · 423 forks

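  2. opendilab/PPOxFamily (Public)

    PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons that help you sort out the algorithm theory, straighten out the code logic, and master practical decision AI applications)

    Python · 2.5k stars · 207 forks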

  3. opendilab/LightZero (Public)

    [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

    Python · 1.5k stars · 186 forks

  4. opendilab/awesome-model-based-RL (Public)

    A curated list of awesome model-based RL resources (continually updated)

    1.3k stars · 73 forks

  5. opendilab/awesome-exploration-rl (Public)

    A curated list of awesome exploration RL resources (continually updated)

    619 stars · 21 forks

  6. opendilab/LightRFT (Public)

    LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement Fine-Tuning Framework

    Python · 66 stars · 5 forks