Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
shahules786 authored May 9, 2023
1 parent 60c917b commit 1cef541
Showing 1 changed file with 3 additions and 0 deletions.
3 changes: 3 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,6 @@
# Reward-Model
Reward Model training framework for LLM RLHF. For in-depth understanding of Reward modeling, checkout our [blog](https://explodinggradients.com/)
The word nemesis originally meant the distributor of fortune, neither good nor bad, simply in due proportion to each according to what was deserved.
### Quick Start
* Inference
```python
Expand All @@ -16,5 +17,7 @@ tokenizer = AutoTokenizer.from_pretrained(MODEL)
python src/training.py --config-name <your-config-name>
```



## Contributions
* All contributions are welcome. Checkout #issues

0 comments on commit 1cef541

Please sign in to comment.