Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add more language tasks and fix fewshot evaluation bugs #228

Merged
merged 4 commits into from
Sep 6, 2024
Merged

Conversation

Luodian
Copy link
Contributor

@Luodian Luodian commented Sep 5, 2024

This pull request adds LMMS evaluation tasks for various subjects, including law, math, other, health, biology, history, physics, business, chemistry, economics, philosophy, and psychology. Each task consists of multiple-choice questions with answers, and each task has a corresponding description and process documentation.

Remove unused LM object if model is not an instance of LM

The code changes in this commit add LMMS evaluation tasks for various subjects, including law, math, other, health, biology, history, physics, business, chemistry, economics, philosophy, and psychology. These tasks consist of multiple-choice questions with answers, and each task has a corresponding description and process documentation.

Remove not used txt writer
Bring back anls
Update README.md
@Luodian Luodian requested a review from pufanyi September 5, 2024 16:24
The code changes in this commit fix a bug in the evaluator where the check for the LM object was incorrectly performed on the model object instead. This resulted in the LM object not being properly removed when it should have been. The fix ensures that the check is performed on the correct object, lm, and removes the LM object if it is not an instance of LM.

Remove not used txt writer
@Luodian Luodian merged commit 432d445 into main Sep 6, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant