v0.0.1: initial release
What's Changed
- separate data processing to data.py by @aryamanarora in #1
- Initial attempt to adapt to HF trainer by @aryamanarora in #2
- add weight decay param by @aryamanarora in #3
- [Bug fix] fix layer parsing step after dataset creation and others by @frankaging in #4
- separate argparse from training function by @aryamanarora in #8
- change math and commonsense to LLM adaptor template by @frankaging in #7
- Adding in stsb support by @frankaging in #10
- add an option for normalized input; GLUE in training HF eval by @frankaging in #11
- gsm8k splits by @aryamanarora in #12
- sharing interventions across positions by @frankaging in #13
- fix padding on intervention locations by @aryamanarora in #15
- more updates to the padding fix for gsm8k and others by @frankaging in #16
- add gd option by @frankaging in #17
- adjust decode stra by @frankaging in #18
- Zen/gsm8k by @frankaging in #19
- minor fix by @frankaging in #20
- move generation args to config file by @aryamanarora in #21
- Update README.md by @PinetreePantry in #22
- Verified README code, fix a bug preventing proper save and load by @PinetreePantry in #23
- Update README.md by @eltociear in #32
New Contributors
- @aryamanarora made their first contribution in #1
- @frankaging made their first contribution in #4
- @PinetreePantry made their first contribution in #22
- @eltociear made their first contribution in #32
Full Changelog: https://github.com/stanfordnlp/pyreft/commits/v0.0.1