-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
train error #39
Comments
(ViTMatte) jyw@rtx6000:~/vitmatte/ViTMatte$ python main.py --config-file configs/ViTMatte_S_100ep.py --num-gpus 1 sys.platform linux PyTorch built with:
[11/11 19:26:30 detectron2]: Command line arguments: Namespace(config_file='configs/ViTMatte_S_100ep.py', dist_url='tcp://127.0.0.1:50166', eval_only=False, machine_rank=0, num_gpus=1, num_machines=1, opts=[], resume=False) train.max_iter = int(43100 / 16 / 2 * 100) optimizer.lr=5e-4 train.init_checkpoint = './pretrained/dino_vit_s_fna.pth' dataloader.train.batch_size=16 WARNING [11/11 19:26:33 d2.config.lazy]: The config contains objects that cannot serialize to a valid yaml. ./output_of_train/ViTMatte_S_100ep/config.yaml is human-readable but cannot be loaded. |
Traceback (most recent call last):
File "main.py", line 132, in
launch(
File "/home/jyw/anaconda3/envs/ViTMatte/lib/python3.8/site-packages/detectron2/engine/launch.py", line 87, in launch
main_func(*args)
File "main.py", line 126, in main
do_train(args, cfg)
File "main.py", line 78, in do_train
train_loader = instantiate(cfg.dataloader.train)
File "/home/jyw/anaconda3/envs/ViTMatte/lib/python3.8/site-packages/detectron2/config/instantiate.py", line 67, in instantiate
cfg = {k: instantiate(v) for k, v in cfg.items()}
File "/home/jyw/anaconda3/envs/ViTMatte/lib/python3.8/site-packages/detectron2/config/instantiate.py", line 67, in
cfg = {k: instantiate(v) for k, v in cfg.items()}
File "/home/jyw/anaconda3/envs/ViTMatte/lib/python3.8/site-packages/detectron2/config/instantiate.py", line 83, in instantiate
return cls(**cfg)
File "/home/jyw/anaconda3/envs/ViTMatte/lib/python3.8/site-packages/torch/utils/data/distributed.py", line 68, in init
num_replicas = dist.get_world_size()
File "/home/jyw/anaconda3/envs/ViTMatte/lib/python3.8/site-packages/torch/distributed/distributed_c10d.py", line 1181, in get_world_size
return _get_group_size(group)
File "/home/jyw/anaconda3/envs/ViTMatte/lib/python3.8/site-packages/torch/distributed/distributed_c10d.py", line 566, in _get_group_size
default_pg = _get_default_group()
File "/home/jyw/anaconda3/envs/ViTMatte/lib/python3.8/site-packages/torch/distributed/distributed_c10d.py", line 697, in _get_default_group
raise RuntimeError(
RuntimeError: Default process group has not been initialized, please make sure to call init_process_group.
The text was updated successfully, but these errors were encountered: