
Some curiosity about the baseline training configuration. #18

Open
icecream-Tnak opened this issue Sep 26, 2022 · 0 comments
icecream-Tnak commented Sep 26, 2022

Thank you for your contribution to the community; the creation of this Chinese text recognition benchmark is critical to the advancement of the field.

I would like to know more details about the training configuration of the baseline models for the "scene" sub-dataset, such as the specific number of epochs, batch size, learning rate, weight decay, max_length, and gradient clipping.
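
To make the question concrete, here is a minimal, purely hypothetical sketch of the fields I mean. The names and values below are my own placeholders, not the configuration actually used for the baselines:

```python
from dataclasses import dataclass

# Hypothetical placeholders only -- these are the hyperparameters I am
# asking about, NOT the settings actually used to train the baselines.
@dataclass
class TrainConfig:
    epochs: int = 100        # main.py defaults to 1000; was that value really used?
    batch_size: int = 64
    lr: float = 1e-4
    weight_decay: float = 0.0
    max_length: int = 25     # maximum decoded sequence length
    grad_clip: float = 5.0   # gradient clipping threshold, if any

print(TrainConfig())
```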

In particular, I noticed that the TransOCR results reported in the paper (arXiv:2112.15093) differ from the results on GitHub. Were better results obtained with different hyperparameters after the paper was submitted to arXiv?

Thank you again for your outstanding work, and I hope you can get back to me. I am asking because I could not find a specific configuration file, and the default parameters in TransOCR/main.py (e.g., epoch = 1000) imply a training run that my current hardware unfortunately cannot support.
