From 1cef54113bbc83569b1d15480a94f323cb392cf6 Mon Sep 17 00:00:00 2001
From: Shahul ES
Date: Tue, 9 May 2023 20:02:51 +0530
Subject: [PATCH] Update README.md

---
 README.md | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/README.md b/README.md
index 73515e9..bb196a3 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,6 @@
 # Reward-Model
 Reward Model training framework for LLM RLHF. For in-depth understanding of Reward modeling, checkout our [blog](https://explodinggradients.com/)
+The word nemesis originally meant the distributor of fortune, neither good nor bad, simply in due proportion to each according to what was deserved.
 ### Quick Start
 * Inference
 ```python
@@ -16,5 +17,7 @@ tokenizer = AutoTokenizer.from_pretrained(MODEL)
 python src/training.py --config-name
 ```
 
+
+
 ## Contributions
 * All contributions are welcome. Checkout #issues