I increased the batch size by padding the query (`Instances` in the code) to the same shape across samples with 0 or -1, which made it possible to train with batch size > 1 during pre-processing while keeping the post-processing batch by batch. However, the loss is large, grad_norm is small, and the loss decreases slowly. Has anyone else tried to train with a larger batch size, and did you run into the same problem?
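A minimal sketch of the kind of padding I mean (the `pad_and_stack` helper and the `[N_i, D]` query shapes here are illustrative, not the actual code in this repo): each sample's queries are padded with the fill value up to the largest count in the batch so they can be stacked into one tensor, and a mask records which rows are padding so post-processing can still run per sample.

```python
import torch

def pad_and_stack(queries, pad_value=-1):
    """Pad a list of per-sample query tensors (hypothetical shape [N_i, D])
    to a common length and stack them into one [B, N_max, D] batch tensor.
    The returned boolean mask marks padded rows (True = padding) so the
    per-sample post-processing can drop them again."""
    max_len = max(q.shape[0] for q in queries)
    padded_list, masks = [], []
    for q in queries:
        pad_rows = max_len - q.shape[0]
        # Fill the missing rows with pad_value (0 or -1 in my experiment)
        padded = torch.cat([q, q.new_full((pad_rows, q.shape[1]), pad_value)], dim=0)
        mask = torch.zeros(max_len, dtype=torch.bool)
        mask[q.shape[0]:] = True
        padded_list.append(padded)
        masks.append(mask)
    return torch.stack(padded_list), torch.stack(masks)

# Example: two samples with different numbers of queries
q1, q2 = torch.randn(5, 4), torch.randn(3, 4)
stacked, pad_mask = pad_and_stack([q1, q2])
print(stacked.shape, pad_mask.shape)  # torch.Size([2, 5, 4]) torch.Size([2, 5])
```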
DengYizhe changed the title from "I tried to increase batchsize" to "I tried to increase batchsize but met some difficulties" on Dec 23, 2024.