Skip to content
View ChengpengLi1003's full-sized avatar

Block or report ChengpengLi1003

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. CoRT CoRT Public

    Python 49

  2. Awesome-Long-Chain-of-Thought-Reasoning-with-tools Awesome-Long-Chain-of-Thought-Reasoning-with-tools Public

    A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.

    37 3

  3. DotaMath DotaMath Public

    30 2

  4. Q-learning Q-learning Public

    针对最经典的表格型Q learning算法进行了复现,能够支持gym中大多数的离散动作和状态空间的环境,譬如CliffWalking-v0。

    Python 9 1

  5. RL4CO RL4CO Public

    A open-sourced codebase for using offline reinforcement learning in combinatorial optimization

    Python 2

  6. tensorflowbook tensorflowbook Public

    Forked from csmhwu/tensorflowbook

    for tensorflow book writting

    1