add more language tasks and fix fewshot evaluation bugs #228

Luodian · 2024-09-05T16:23:49Z

This pull request adds LMMS evaluation tasks for various subjects, including law, math, other, health, biology, history, physics, business, chemistry, economics, philosophy, and psychology. Each task consists of multiple-choice questions with answers, and each task has a corresponding description and process documentation.

Remove unused LM object if model is not an instance of LM

The code changes in this commit add LMMS evaluation tasks for various subjects, including law, math, other, health, biology, history, physics, business, chemistry, economics, philosophy, and psychology. These tasks consist of multiple-choice questions with answers, and each task has a corresponding description and process documentation. Remove not used txt writer Bring back anls Update README.md

The code changes in this commit fix a bug in the evaluator where the check for the LM object was incorrectly performed on the model object instead. This resulted in the LM object not being properly removed when it should have been. The fix ensures that the check is performed on the correct object, lm, and removes the LM object if it is not an instance of LM. Remove not used txt writer

Luodian added 3 commits September 5, 2024 16:17

chore: Remove unused LM object if model is not an instance of LM

f7111b6

Merge remote-tracking branch 'origin/main' into dev/fewshot

764d6bf

Luodian requested a review from pufanyi September 5, 2024 16:24

Luodian merged commit 432d445 into main Sep 6, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add more language tasks and fix fewshot evaluation bugs #228

add more language tasks and fix fewshot evaluation bugs #228

Luodian commented Sep 5, 2024

add more language tasks and fix fewshot evaluation bugs #228

add more language tasks and fix fewshot evaluation bugs #228

Conversation

Luodian commented Sep 5, 2024