Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

使用BISRNRT作为Net训练模型时出现问题 #7

Open
Gchang9 opened this issue May 22, 2022 · 6 comments
Open

使用BISRNRT作为Net训练模型时出现问题 #7

Gchang9 opened this issue May 22, 2022 · 6 comments

Comments

@Gchang9
Copy link

Gchang9 commented May 22, 2022

您好,在使用SSCDl作为Net训练时,没有问题。在使用BiSRNet作为Net训练时,出现了问题,参数和数据及数据读取方式全部是按照您的代码做的。
出现的问题是:
训练的时候显示:WARNING:Nan or Inf found in Input tensor,且train_seg loss bn_loss均为nan

@Gchang9
Copy link
Author

Gchang9 commented May 22, 2022

似乎关闭Tensorboard就不会出现这个问题,请问是为什么呢?

@Gchang9
Copy link
Author

Gchang9 commented May 23, 2022

关闭Tensorboard之后不会报warning,但训练几个epoch之后还是会出现train_seg loss bn_loss均为nan,想请问您一下是什么情况呢?

@Gchang9
Copy link
Author

Gchang9 commented May 23, 2022

您好,由于我把近3000组图像划分为了训练、测试、验证。导致训练的时候最后一个batch不能整除。目前我把drop_last设置为True可以正常训练了。但Tensorboard的暂时还未发现解决方案、

@DingLei14
Copy link
Owner

Hi. Is the batch size too small? The BiSRNet is a bit hard to train but the loss shouldn't be crazy.
I got accuracy improvements while freezing the other model parts and training only the SR modules. You can also try that.

@Gchang9
Copy link
Author

Gchang9 commented May 23, 2022 via email

@Joshua00044444
Copy link

请问是如何修改drop_last的参数的

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants