Skip to content
@CLUEbenchmark

CLUE benchmark

Organization of Language Understanding Evaluation benchmark for Chinese: tasks & datasets, baselines, pre-trained Chinese models, corpus and leaderboard

Pinned

  1. CLUE CLUE Public

    中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python 3.8k 540

  2. SuperCLUE SuperCLUE Public

    SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

    2.6k 88

  3. SuperCLUE-Safety SuperCLUE-Safety Public

    SC-Safety: 中文大模型多轮对抗安全基准

    75 4

  4. SuperCLUE-Auto SuperCLUE-Auto Public

    汽车行业中文大模型测评基准,基于多轮开放式问题的细粒度评测

    17 1

  5. SuperCLUE-Agent SuperCLUE-Agent Public

    SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准

    71 2

  6. SuperCLUE-RAG SuperCLUE-RAG Public

    中文原生检索增强生成测评基准

    59 3

Repositories

Showing 10 of 45 repositories