From 93dddead475176903c5dbfb7ad666bd300a9116a Mon Sep 17 00:00:00 2001 From: Jindong Wang Date: Sun, 17 Dec 2023 21:36:00 -0800 Subject: [PATCH] upd: readme --- README.md | 14 ++++++++------ 1 file changed, 8 insertions(+), 6 deletions(-) diff --git a/README.md b/README.md index 6b3794e..26fd177 100644 --- a/README.md +++ b/README.md @@ -171,20 +171,22 @@ We support a range of datasets to facilitate comprehensive analysis, including: - google/flan-t5-large - databricks/dolly-v1-6b -- llama2 (7b, 13b, 7b-chat, 13b-chat) +- Llama2 (7b, 13b, 7b-chat, 13b-chat) - vicuna-13b, vicuna-13b-v1.3 -- cerebras/Cerebras-GPT-13B +- Cerebras/Cerebras-GPT-13B - EleutherAI/gpt-neox-20b -- google/flan-ul2 -- palm -- chatgpt, gpt4 +- Google/flan-ul2 +- PaLM 2 +- ChatGPT +- GPT-4 +- Phi ## Benchmark Results Please refer to our [benchmark website](llm-eval.github.io) for benchmark results on Prompt Attacks, Prompt Engineering and Dynamic Evaluation DyVal. ## TODO -- [ ] Add prompt attacks and prompt engineering documents. +- [ ] Add support for multi-modal models such as LlaVa and BLIP2. ## Acknowledgements