Request: Training by Multi-GPU #25

Open
RGTails opened this issue Jan 26, 2023 · 2 comments

Comments

@RGTails

RGTails commented Jan 26, 2023

It's all in the title.

@yggdrasil75

Would be nice, but I don't know how well it would work if the two GPUs aren't linked. Training two different models on two different GPUs would be useful, though, and so would queuing that training: train 50 epochs of model 1 on GPU 0 and 50 epochs of model 2 on GPU 1; GPU 1 is faster, so it moves on to model 3 automatically; GPU 0 finishes, sees that model 2 isn't currently active, and continues training it; GPU 1 then moves on to model 1, and so on. Over a week the two GPUs train the models 50 epochs at a time and split the workload fairly evenly.
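
A minimal sketch of how that queuing might look, assuming a hypothetical `train_epochs()` helper stands in for the real trainer and each queued chunk is just a (model, epochs) pair; the model names, devices, and chunk sizes are placeholders, not anything from this repo:

```python
import threading
import time

# train_epochs() is a placeholder for whatever actually runs one
# 50-epoch chunk of training on the given device.
def train_epochs(model_name, epochs, device):
    print(f"[{device}] training {model_name} for {epochs} epochs")
    time.sleep(1)  # stand-in for real work

def worker(device, jobs, lock, active):
    while True:
        with lock:
            if not jobs:
                return  # queue drained, this GPU is done
            # Pick the first queued chunk whose model no other GPU is holding.
            idx = next((i for i, (m, _) in enumerate(jobs) if m not in active), None)
            if idx is None:
                job = None  # every remaining model is busy elsewhere
            else:
                job = jobs.pop(idx)
                active.add(job[0])
        if job is None:
            time.sleep(5)  # wait for the other GPU to release a model
            continue
        model_name, epochs = job
        try:
            train_epochs(model_name, epochs, device)
        finally:
            with lock:
                active.discard(model_name)

# Queue three 50-epoch chunks per model, as described above.
jobs = [(name, 50) for _ in range(3) for name in ("model1", "model2", "model3")]
lock, active = threading.Lock(), set()
threads = [threading.Thread(target=worker, args=(f"cuda:{i}", jobs, lock, active))
           for i in range(2)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

Each worker skips chunks for a model the other GPU currently holds, so the faster GPU naturally picks up more chunks over time.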

One way this could possibly be done so that it actually trains one model with both GPUs: train one epoch on each, use the difference in training time to split the concepts so the faster GPU gets slightly more and the slower one slightly less, then merge the models every x epochs (5-10 for a small dataset, 1 or 2 for a large one). Maybe also randomly move concepts between the two to keep the balance while preventing either copy from missing part of the model because it never saw some of the concepts.
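
A rough sketch of that merge step, assuming two copies of the same PyTorch model are trained on separate GPUs (each on its own slice of the concepts) and their weights are simply averaged every x epochs; `train_one_epoch()` is a placeholder, and real diffusion checkpoints would likely need something smarter than plain averaging:

```python
import copy
import torch

def train_one_epoch(model, concepts):
    # Placeholder: the real trainer would run one epoch over `concepts` here.
    pass

def merge_state_dicts(sd_a, sd_b):
    # Simple merge: element-wise average of matching parameters, done on CPU
    # so tensors from different GPUs can be combined.
    return {k: (sd_a[k].detach().cpu().float() + sd_b[k].detach().cpu().float()) / 2
            for k in sd_a}

def train_two_gpus(model, concepts_gpu0, concepts_gpu1, epochs, merge_every):
    # Two copies of the same model, one per GPU, each with its own concept slice.
    model_a = copy.deepcopy(model).to("cuda:0")
    model_b = copy.deepcopy(model).to("cuda:1")
    for epoch in range(1, epochs + 1):
        train_one_epoch(model_a, concepts_gpu0)
        train_one_epoch(model_b, concepts_gpu1)
        if epoch % merge_every == 0:
            # Merge every x epochs so neither copy drifts too far.
            merged = merge_state_dicts(model_a.state_dict(), model_b.state_dict())
            model_a.load_state_dict(merged)
            model_b.load_state_dict(merged)
    # Final merge so any epochs after the last sync are included.
    model_a.load_state_dict(merge_state_dicts(model_a.state_dict(), model_b.state_dict()))
    return model_a.cpu()
```

Per the numbers above, `merge_every` would be around 5-10 for a small dataset and 1 or 2 for a large one.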

@rbbrdckybk
Owner

Adding training is something that's on my list, but I'll probably need a solid weekend to get a quick & dirty version implemented. Been busier than usual, so it's probably a few weeks off at least. Will leave this open as a reminder though!
