@xlang-ai

XLANG NLP Lab

Building language model agents that ground language instructions into code or actions executable in real-world environments

Welcome to the Executable Language Grounding (XLANG) Lab! We are part of the HKU NLP Group at the University of Hong Kong. XLang focuses on building language model agents that ground language instructions into code or actions executable in real-world environments, including databases (data agents), web applications (plugins/web agents), and the physical world (robotic agents). This grounding lies at the heart of language model agents and natural language interfaces that can interact with and learn from these real-world environments, letting people carry out data analysis, use web applications, and instruct robots through conversation. Recent advances in XLang incorporate techniques such as LLM + external tools, code generation, semantic parsing, and dialog or interactive systems.

Pinned

  1. OSWorld Public

    [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

    Python 1.5k 164

  2. OpenAgents Public

    [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

    Python 4k 455

  3. instructor-embedding Public

    [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

    Python 1.9k 139

  4. text2reward Public

    [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"

    Jupyter Notebook 136 8

  5. Binder Public

    [ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"

    Python 304 36

  6. DS-1000 Public

    [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".

    Python 229 27

Repositories

Showing 10 of 18 repositories
  • Spider2 Public

    Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

    HTML 254 Apache-2.0 17 23 0 Updated Dec 27, 2024
  • aguvis Public

    Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

    Python 98 3 1 0 Updated Dec 24, 2024
  • OSWorld Public

    [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

    Python 1,489 Apache-2.0 164 31 0 Updated Dec 20, 2024
  • text2reward Public

    [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"

    Jupyter Notebook 136 8 1 0 Updated Dec 17, 2024
  • EVOR Public
    Python 51 Apache-2.0 6 3 0 Updated Dec 15, 2024
  • Pai-Megatron-Patch Public Forked from alibaba/Pai-Megatron-Patch

    The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

    Python 0 Apache-2.0 110 0 0 Updated Nov 27, 2024
  • OpenAgents Public

    [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

    Python 4,048 Apache-2.0 455 11 2 Updated Nov 18, 2024
  • DS-1000 Public

    [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".

    Python 229 CC-BY-SA-4.0 27 2 0 Updated Oct 30, 2024
  • BRIGHT Public

    BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

    Python 60 CC-BY-4.0 3 0 1 Updated Oct 22, 2024
  • Spider2-V Public

    [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

    Jupyter Notebook 113 Apache-2.0 7 1 0 Updated Aug 26, 2024
