Port distributed training support from existing PR #1315

Open
lrzpellegrini opened this issue Mar 7, 2023 · 0 comments
lrzpellegrini commented Mar 7, 2023

I'm opening this issue to keep track of the work needed to port the content of PR #996 to the main branch.

The plan is to split that PR, which is very large and based on a fairly old version of the codebase, and port its main elements as smaller PRs built on the current state of the main branch.
I'll keep this issue updated as I work on this.

Many changes are not strictly related to supporting distributed training but may benefit Avalanche in general.

  1. I'm starting by porting the modernized object detection/segmentation dataset, strategies, and metrics. I'll also port the generalized batch collate functionality.

Changes in Distributed Training PR #996:

Legend:

  • 🔲 Not ported
  • ⌛ Work in progress
  • 💬 PR opened, discussion in progress
  • ✔️ Merged into main branch

Base elements

Strategies and plugins

Models

Detection

Data Loader

Loggers and metrics

Unit tests

Typing
