-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cls loss & Total loss too big #285
Comments
what's the training parameter and command? Try update the latest code and train with |
python train.py -c 4 -p crowdhuman --batch_size 8 --lr 1e-5 --num_epochs 100 |
I think there is something wrong with the dataset. Can you run the tutorial? Try training on the shape dataset. If cls loss drop to less than 10 in a few epochs, the code is fine. It's take about 5 minutes. |
@ChuyiZhong see #252 |
It's not normal. It should drop to under 1.0 real soon. |
Here is the results after 10 iterations:
For 1-class model. I have cross-checked my The command used for training is:
@zylo117 Does it look normal? |
Yes, now it is. |
Yes I cloned today's repo: One doubt still remaining:
@zylo117 Is this normal behaviour for this repo? |
It is. It might be caused by a lower lr. |
@zylo117 Thanks for proactive replies. Basically the person in the dataset is in lookdown or top-down view. I have only 10k training images. I trained with the above command. Why this would have happened? My intuition is since it is the person class but from a different view it should have improved at least better than pre-trained model? |
if it's not overfitting like your logs say. it's underfitting. |
Yeah, after 100 epochs AP results were improved. |
after 100 epochs ,loss = ? ,thanks |
@lzh18628137361 approx after 50-60 epochs the losses are in the range of: classification loss: 0.00256, Reg loss: 0.0018 . |
This issue is caused by the initialization of the classifier layer. In practice, it is desirable to initialize the classifier layer such that it predicts
because sigmoid(-4.59) = 0.01 |
Hi all,
I am training it on my own dataset with one class, but the cls loss and total loss seem strange to me. How to get this right?
Thanks in advance.
The text was updated successfully, but these errors were encountered: