    Repositories list

    • Large-scale Pre-training Corpus for Chinese 100G
      MIT License
      Updated Feb 6, 2026
    • CLUE

      Public
      Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
      Python
      Updated Feb 6, 2026
    • SuperCLUE

      Public
      SuperCLUE: A Comprehensive Benchmark for Foundation Models in Chinese
      Updated Feb 6, 2026
    • Chinese precise instruction-following evaluation benchmark (open-source edition)
      Python
      Updated Aug 12, 2025
    • Math24o

      Public
      Math24o: High School Olympiad Mathematics Chinese Benchmark
      Python
      Updated Mar 27, 2025
    • 2024h1

      Public
      Chinese LLM Benchmark Evaluation Report, First Half of 2024
      Updated Jul 9, 2024
    • SuperCLUE-Video

      Public
      Chinese-native multi-level text-to-video evaluation benchmark
      Updated Jul 8, 2024
    • Chinese-native multimodal understanding evaluation benchmark (evaluation plan)
      Updated Jul 8, 2024
    • SuperCLUE-Long

      Public
      Chinese-native long-text evaluation benchmark
      Updated Jul 8, 2024
    • SuperCLUE-Image

      Public
      Chinese-native text-to-image evaluation benchmark
      Updated Jul 8, 2024
    • Chinese-native code assistant evaluation benchmark, product-level
      Updated Jul 8, 2024
    • SuperCLUE Langya Leaderboard (琅琊榜): anonymous head-to-head battle evaluation benchmark for Chinese general-purpose large models
      Updated Jun 19, 2024
    • Chinese financial LLM evaluation benchmark: six categories, twenty-five tasks, graded evaluation; domestic models earned an A grade
      Updated May 6, 2024
    • Evaluation benchmark for large models in automotive smart cockpits
      Updated Apr 25, 2024
    • Llama3 Chinese Evaluation with SuperCLUE: comprehensive evaluation of Chinese versions of the open-source Llama3 models, based on the SuperCLUE benchmark
      Updated Apr 21, 2024
    • Chinese-native retrieval-augmented generation evaluation benchmark
      Updated Apr 18, 2024
    • Chinese-native graded coding ability test benchmark
      Updated Apr 11, 2024
    • SuperCLUE-Role: Chinese-native role-playing evaluation benchmark
      Updated Apr 3, 2024
    • Chinese-native industrial evaluation benchmark
      Updated Mar 21, 2024
    • SC-Safety: multi-turn adversarial safety benchmark for Chinese large models
      Updated Mar 15, 2024
    • SuperCLUE-Math6: exploring a new generation of Chinese-native multi-turn, multi-step mathematical reasoning datasets
      Python
      Updated Feb 5, 2024
    • Chinese LLM evaluation benchmark for the automotive industry: fine-grained evaluation based on multi-turn open-ended questions
      Updated Dec 26, 2023
    • SuperCLUE-Agent: evaluation benchmark for core agent capabilities, based on Chinese-native tasks
      Updated Nov 9, 2023
    • An Open Domain Multi-Turn Benchmark for Foundation Models in Chinese
      Updated Aug 25, 2023
    • Llama2 Chinese Evaluation with SuperCLUE: comprehensive evaluation of Chinese versions of the open-source Llama2 models, based on SuperCLUE's OPEN benchmark
      Updated Aug 2, 2023
    • SuperCLUE automated machine grading system for Gaokao (college entrance exam) essays
      Updated Jun 8, 2023
    • PyCLUE

      Public
      Python toolkit for the Chinese Language Understanding Evaluation (CLUE) benchmark
      Python
      MIT License
      Updated May 22, 2023
    • CLUENER2020: Fine-Grained Named Entity Recognition for Chinese
      Python
      Updated Nov 21, 2022
    • Search all Chinese NLP datasets, with commonly used English NLP datasets included
      Python
      Updated Nov 21, 2022
    • LGEB

      Public
      LGEB: Benchmark of Language Generation Evaluation
      Python
      Updated Oct 21, 2022