You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Merlin models needs to differentiate itself relative to other RecSys library solutions. One of those areas of differentiation needs to be performance on the GPU. If our libraries don't follow best practices and achieve fast performance that we can measure on GPU then our potential customers have no reason to use the library.
Goal:
Provide performant retrieval models in production
Follow best practices by our colleagues for GPU optimization
viswa-nvidia
changed the title
[RMP] Performance optimization of model training and serving
[RMP] Performance optimization of model training
May 25, 2022
karlhigley
changed the title
[RMP] Performance optimization of model training
[RMP] Improve the speed of training models with Merlin Models
May 25, 2022
Check this bug is done NVIDIA-Merlin/models#339 and this issue should be closed. This is an ongoing effort. The profiling portion will be spun off as a separate RMP ticket
gabrielspmoreira
changed the title
[RMP] Improve the speed of training models with Merlin Models
[RMP] Improve the speed of training retrieval models with Merlin Models
Oct 26, 2022
@EvenOldridge@viswa-nvidia As we discussed in the Grooming meeting today, I have tested the pending runtime issue of retrieval models training (NVIDIA-Merlin/models#339) with the current implementation (for both V1 and V2) and it doesn't occur anymore. So that bug was closed.
I also extracted the profiling task of retrieval model pipelines from this RMP to a new RMP #709 , which also addresses ranking model pipelines, as profiling tasks will require an external support (Valerie).
So I am closing this RMP, as it already delivers value with the finished perf improvements on retrieval models.
Problem:
Merlin models needs to differentiate itself relative to other RecSys library solutions. One of those areas of differentiation needs to be performance on the GPU. If our libraries don't follow best practices and achieve fast performance that we can measure on GPU then our potential customers have no reason to use the library.
Goal:
Constraints
Possible Optimizations
Retrieval models
The text was updated successfully, but these errors were encountered: