./tools/dist_train.sh configs/softgroup/softgroup_stpls3d.yaml 1
2024-01-25 08:18:39,300 - INFO - Config:
model:
  channels: 16
  num_blocks: 7
  semantic_classes: 15
  instance_classes: 14
  sem2ins_classes: []
  semantic_only: False
  semantic_weight: [1.0, 1.0, 44.0, 21.9, 1.8, 25.1, 31.5, 21.8, 24.0, 54.4, 114.4,
                    81.2, 43.6, 9.7, 22.4]
  ignore_label: -100
  with_coords: False
  grouping_cfg:
    score_thr: 0.2
    radius: 0.9
    mean_active: 3
    class_numpoint_mean: [-1., 10408., 58., 124., 1351., 162., 430., 1090., 451., 26., 43.,
                          61., 39., 109., 1239]
    npoint_thr: 0.05  # absolute if class_numpoint == -1, relative if class_numpoint != -1
    ignore_classes: [0]
  instance_voxel_cfg:
    scale: 3
    spatial_shape: 20
  train_cfg:
    max_proposal_num: 300
    pos_iou_thr: 0.5
    match_low_quality: True
    min_pos_thr: 0.1
  test_cfg:
    x4_split: False
    cls_score_thr: 0.001
    mask_score_thr: -0.5
    min_npoint: 15
    eval_tasks: ['semantic', 'instance']
  fixed_modules: []
data:
  train:
    type: 'stpls3d'
    data_root: 'dataset/stpls3d'
    prefix: 'train'
    suffix: '_inst_nostuff.pth'
    training: True
    repeat: 4
    voxel_cfg:
      scale: 3
      spatial_shape: [128, 512]
      max_npoint: 250000
      min_npoint: 5000
  test:
    type: 'stpls3d'
    data_root: 'dataset/stpls3d'
    prefix: 'val_250m'
    suffix: '_inst_nostuff.pth'
    training: False
    voxel_cfg:
      scale: 3
      spatial_shape: [128, 512]
      max_npoint: 250000
      min_npoint: 5000
dataloader:
  train:
    batch_size: 4
    num_workers: 4
  test:
    batch_size: 1
    num_workers: 1
optimizer:
  type: 'Adam'
  lr: 0.004
eval_min_npoint: 10
fp16: False
epochs: 108
step_epoch: 20
save_freq: 4
pretrain: './work_dirs/softgroup_stpls3d_backbone/latest.pth'
work_dir: ''
2024-01-25 08:18:39,301 - INFO - Distributed: True
2024-01-25 08:18:39,301 - INFO - Mix precision training: False
Traceback (most recent call last):
  File "./tools/train.py", line 206, in <module>
    main()
  File "./tools/train.py", line 178, in main
    train_set = build_dataset(cfg.data.train, logger)
  File "/home/ubuntu18/SoftGroup/softgroup/data/__init__.py", line 22, in build_dataset
    return STPLS3DDataset(**_data_cfg)
  File "/home/ubuntu18/SoftGroup/softgroup/data/custom.py", line 37, in __init__
    self.filenames = self.get_filenames()
  File "/home/ubuntu18/SoftGroup/softgroup/data/custom.py", line 42, in get_filenames
    assert len(filenames) > 0, 'Empty dataset.'
AssertionError: Empty dataset.
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: 1) local_rank: 0 (pid: 24126) of binary: /home/ubuntu18/.conda/envs/softgroup/bin/python
Traceback (most recent call last):
  File "/home/ubuntu18/.conda/envs/softgroup/bin/torchrun", line 8, in <module>
    sys.exit(main())
  File "/home/ubuntu18/.conda/envs/softgroup/lib/python3.7/site-packages/torch/distributed/elastic/multiprocessing/errors/__init__.py", line 345, in wrapper
    return f(*args, **kwargs)
  File "/home/ubuntu18/.conda/envs/softgroup/lib/python3.7/site-packages/torch/distributed/run.py", line 724, in main
    run(args)
  File "/home/ubuntu18/.conda/envs/softgroup/lib/python3.7/site-packages/torch/distributed/run.py", line 718, in run
    )(*cmd_args)
  File "/home/ubuntu18/.conda/envs/softgroup/lib/python3.7/site-packages/torch/distributed/launcher/api.py", line 131, in __call__
    return launch_agent(self._config, self._entrypoint, list(args))
  File "/home/ubuntu18/.conda/envs/softgroup/lib/python3.7/site-packages/torch/distributed/launcher/api.py", line 247, in launch_agent
    failures=result.failures,
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:
./tools/train.py FAILED
Failures:
<NO_OTHER_FAILURES>
Root Cause (first observed failure):
[0]:
time : 2024-01-25_08:18:44
host : ubuntu18
rank : 0 (local_rank: 0)
exitcode : 1 (pid: 24126)
error_file: <N/A>
traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html
Hi, I am using the STPLS3D dataset. As the log above shows, training fails with this error:
File "/home/ubuntu18/SoftGroup/softgroup/data/custom.py", line 42, in get_filenames
assert len(filenames) > 0, 'Empty dataset.'
AssertionError: Empty dataset.
I don't understand why the error is raised in custom.py rather than stpls3d.py.
Could you please take a look?
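For reference, here is a minimal sketch of the check that appears to be failing. It assumes that get_filenames collects files matching data_root/prefix/*suffix; the log does not show that, so treat it as a guess. The helper name find_dataset_files is hypothetical and is not part of SoftGroup. The traceback enters custom.py directly from STPLS3DDataset, which suggests get_filenames is inherited from a base class defined in that file, but that is also only an inference from the log.

```python
import glob
import os.path as osp


def find_dataset_files(data_root, prefix, suffix):
    """Return the sorted files matching <data_root>/<prefix>/*<suffix>.

    Hypothetical helper: it mirrors what get_filenames is assumed to do.
    """
    pattern = osp.join(data_root, prefix, '*' + suffix)
    return sorted(glob.glob(pattern))


if __name__ == '__main__':
    # Values copied from data.train in the logged config.
    data_root, prefix, suffix = 'dataset/stpls3d', 'train', '_inst_nostuff.pth'
    files = find_dataset_files(data_root, prefix, suffix)
    # data_root is relative, so it resolves against the current working directory.
    print('searching in:', osp.abspath(osp.join(data_root, prefix)))
    print('matching files:', len(files))
```

Running this from the same directory as ./tools/dist_train.sh prints the absolute folder being searched. If the count is 0 under that assumption, either the folder does not exist at that location or it contains no files ending in _inst_nostuff.pth.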