Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Release 2025.03 #58

Merged
merged 9 commits into from
Mar 28, 2025
Merged

Release 2025.03 #58

merged 9 commits into from
Mar 28, 2025

Conversation

mariusarvinte
Copy link
Collaborator

@mariusarvinte mariusarvinte commented Mar 21, 2025

🎉 Major Updates

  • Preliminary support for automatic swap batch size optimization using accelerate.find_executable_batch_size. This can speed-up single-device llmart runtime by up to 10x compared to the default value of 1. Enabled in command line using per_device_bs=-1. Closes Dynamic batch size #5

🎈 Minor Updates

🚧 Bug Fixes


mariusarvinte and others added 8 commits March 21, 2025 15:11
Implements PRs opened by `dependabot` PRs
Fixes #51 by running the final forward pass in training mode.
Fixes #35 and #36

---------

Signed-off-by: Ajith Raj <[email protected]>
Signed-off-by: harshit-parikh-28 <[email protected]>
Signed-off-by: Adarsh Anand <[email protected]>
Co-authored-by: Adarsh Anand <[email protected]>
Co-authored-by: Ajith Raj <[email protected]>
Co-authored-by: harshit-parikh-28 <[email protected]>
Minor patch to some requirements
- Add user configurable `max_new_tokens` for generation. Fixes #6
- Add customizable `bad_token_ids` policies. Fixes #8 
- Add config option to track `best_attack_token` based on the teacher
forcing loss. Fixes #12
- Add CLI reference docs. Fixes #30

---------

Signed-off-by: harshit-parikh-28 <[email protected]>
Signed-off-by: Marius Arvinte <[email protected]>
Signed-off-by: Adarsh Anand <[email protected]>
Co-authored-by: Adarsh Anand <[email protected]>
Co-authored-by: harshit-parikh-28 <[email protected]>
dxoigmn
dxoigmn previously approved these changes Mar 28, 2025
Copy link
Contributor

@dxoigmn dxoigmn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

Copy link
Contributor

@dxoigmn dxoigmn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment