v1.6.1
What's new
Added π
- Added
retries
field toBeakerLaunchConfig
. - Allow running on Augusta cluster with existing train scripts.
- Added
olmo_core.utils.logging_configured()
function to check if logging has been configured.
Fixed β
- Fixed a potential distributed deadlock bug when training without a separate CPU-only bookkeeping backend.
- Removed some unnecessary host-device syncs in
olmo_core.distributed.utils
. - Added
Trainer(Config).async_bookkeeping
field to toggle async bookkeeping.
Commits
cae88f5 (chore) prepare for release v1.6.1
83db5f7 Some fixes/improvements around synchronous bookkeeping operations (#83)
c435c94 increase timeout for CI checks
4a56200 update cluster list (#82)
e27ba74 Update throughput numbers, add logging_configured()
util function (#81)
bec0a3c Allow running on Augusta cluster (#80)
c7c3a5a Set env vars for Augusta cluster
b9351e2 Add retries
field to BeakerLaunchConfig
(#79)