Skip to content
View WooooDyy's full-sized avatar
🤡
🤡

Block or report WooooDyy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. LLM-Agent-Paper-List LLM-Agent-Paper-List Public

    The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

    7k 415

  2. AgentGym AgentGym Public

    Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

    Python 369 49

  3. LLM-Reverse-Curriculum-RL LLM-Reverse-Curriculum-RL Public

    Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

    Python 80 5

  4. MathCritique MathCritique Public

    Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".

    Python 43

  5. Self-Polish Self-Polish Public

    Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Tao Gui, Qi Zh…

    Python 29 4

  6. EarlyRobust EarlyRobust Public

    Codes for the EMNLP 2022 paper "Efficient Adversarial Training with Robust Early-Bird Tickets" by Zhiheng Xi, Rui Zheng, Tao Gui, Qi Zhang and Xuanjing Huang.

    Python 2 1