Issues: open-compass/opencompass

Issues list

[Bug] Using max-num-worker causes SSH disconnection
#1887 opened Feb 23, 2025 by timturing
2 tasks done
[Bug] The relative path in tools/list_configs.py
#1873 opened Feb 15, 2025 by Sibyl233
2 tasks done
[Bug] Module import fails when using prompt attack
#1860 opened Feb 8, 2025 by pomliuxj
2 tasks done
[Bug] chinese simpleQA dataset is not working
#1858 opened Feb 7, 2025 by hailsham
2 tasks done
[Feature] Add codeeval_pro & serval models
#1848 opened Jan 24, 2025 by Zhudongsheng75
[Bug] SuperGLUE_RTE_gen KeyError
#1816 opened Jan 10, 2025 by shedding-ash
2 tasks done
[Feature] How to add evaluation metrics
#1803 opened Jan 3, 2025 by Lichunyan3
1 task
[Feature] Support downloading dataset from OpenMind
#1792 opened Dec 27, 2024 by FightingZhen
6 tasks done
[Draft] Async pipeline
#1763 opened Dec 15, 2024 by HAOCHENYE
6 tasks
[Feature] dataset for humaneval-multipl
#1673 opened Nov 9, 2024 by jyshee
1 task
[Bug] mmlupro regex extraction error
#1661 opened Nov 4, 2024 by bittersweet1999
2 tasks done
[Enhance] Enhance volc
#1642 opened Oct 26, 2024 by HAOCHENYE
6 tasks
SafetyBench dataset evaluation bug
#1622 opened Oct 18, 2024 by shutttttdown
2 tasks done
[Bug] Silent GPU failures, works with --debug
#1585 opened Oct 4, 2024 by anuragprat1k
2 tasks done
[Bug] pid_params.py dumps not successfully.
#1580 opened Sep 29, 2024 by tonysy
2 tasks done
[Bug] debug does not show logs
#1578 opened Sep 29, 2024 by HaltonJiang
2 tasks done