[RMP] Tensorflow support for session based recommendations integration in Merlin #433

viswa-nvidia · 2022-07-05T15:13:06Z

Problem:

Session-based and sequential-based models are an active research area for providing personalized recommendations. Transformers4Rec library was built to support the definition of such architectures and the results of our experiments conducted in the T4Rec paper showed the effectiveness of Transformers in modeling short sequences observed in session-based tasks. The T4Rec library was also used to win various RecSys challenges. We also observe a growing interest and active engagement from customers in using Transformers4Rec.

T4Rec was not actively updated for several months as the team shifted its focus to developing the Merlin Model library. MM does not currently support sequential and session-based recsys architectures. MM should support these sequential architectures and provide all necessary support to our users so that they can build such effective models.

Goal:

Port the Transformers4Rec TF API to MM.
The main blocks are: Masking, Transformer, RNN, and NextItemPredictionTask
Provide an example that demonstrates < @jsohn-nvidia to clarify>

Definition of Done

Have an example that serves TF session based model in conjunction with a NVT workflow where the session based models scores the whole catalog

Constraints:

T4Rec TF API is not as stable and complete as torch API. The main missing points are:
- Two out of the 4 masking classes are missing: PLM and RTD
- Support for training techniques embedding in HuggingFace trainer class: multi-gpu, early stopping, checkpoints saving...
- Conduct experiments with real-world datasets (like the one conducted in the T4Rec paper with the Pytorch API)

Starting Point:

Training

Proposed API:

inputs = InputBlock(
  schema=con_schema + seq_schema,
  post=BroadcastToSequence(con_schema, seq_schema)
)
model = RetrievalModel(
  XLNetEncoder(inputs, n_head=4, n_layer=2),
  CategoricalOutput(inputs.select_by_tag(Tags.ITEM_ID))
)

topk = TopKEncoder(model)
topk.evaluate(...)

loader = mm.Loader(
  dataset, 
  batch_size=1000,
  transforms=PredictMasked(seq_schema, target=Tags.ITEM_ID)
)

model.fit(loader)

Inputs

Add support for sequential-inputs & shared embeddings
@edknv, @oliverholworthy & @marcromeyn

Masking

Add training strategies for sequence models
@gabrielspmoreira

RetrievalModel

Make RetrievalModel more generic to session-based use-cases + allow encoders being served
@marcromeyn

Outputs

Improve model-outputs to handle session-based recsys
@marcromeyn & @sararb

Port Sequence Architectures

Add sequence-encoding blocks like transformers
@sararb

Fixes for the GTC tutorial on session-based recommendation

Inference support

Save schema on model save

Session-base model can be used as a candidate generation model, for that we need the following options:
Define NextITemPredictionTask as a sub-class of RetrievalModel.Note: Make sure we can export the encoder block (Transformer or RNN) together with a SequenceSummary post layer as the query tower. models#736
Note: Make sure we can export the encoder block (Transformer or RNN) together with a SequenceSummary post layer as the query tower.
Export the item embeddings table for the ANN index models#735
Constraint: Provide information about input list features to Merlin System [RMP] Establish a metadata standard for serializing information about Merlin components #489

Systems

Documentation

Examples

Small scale (see #352)

(WIP) Switch ragged multi-hot columns with two Triton arrays to a single ColumnSchema systems#173
[FEA] Add list column support in Merlin Systems systems#135 ( Note: Ability to server session based depends on this work )
[Task] Transformer-Based End-To-End Example with TensorFlow Merlin Models models#734

Blocker

The text was updated successfully, but these errors were encountered:

EvenOldridge · 2022-08-03T16:38:13Z

@sararb @gabrielspmoreira can you flesh this out as best as possible in @marcromeyn's absence.

EvenOldridge · 2022-08-03T16:43:54Z

@gabrielspmoreira what does the architecture look like for the system for session based? Are we planning to use session generation to feed into a candidate generation stage?

gabrielspmoreira · 2022-08-03T21:05:09Z

@

@gabrielspmoreira what does the architecture look like for the system for session based? Are we planning to use session generation to feed into a candidate generation stage?

@EvenOldridge the session-based recommendation works as a next-item prediction task. It can be seen as a retrieval model, where the query tower users a sequential model (e.g. RNN, Transformer) and outputs a query representation/vector. During inference, such vectors can be used to retrieve the similar items from ANN the same way a retrieval model does.
So we believe we wouldn't need anything special on Merlin Systems related to the output of a session-based recommendation model. The main different is in the input, as such sequential models expect list features as input. NVTabular already supports processing and storing such list features as you know, but there might be some challenges on Systems building the Triton ensemble with list features support.

viswa-nvidia · 2022-08-25T16:17:30Z

@karlhigley , is the systems section here updated. Pleaes review

karlhigley · 2022-08-25T16:38:13Z

It is up to date with the current state of our knowledge of the work

viswa-nvidia · 2022-08-29T19:50:39Z

@marcromeyn , in one of the meetings, I made a note that this task is dependent on some tasks covered in RMP479-EMBEDDINGS initiative [RMP] Enable users to pass embedding tables directly into the input block in order to more easily support new functionality (non-trainable embeddings, different dimensions, model parallel, etc) . Is this correct ? which are these tasks. ? @EvenOldridge for vis.

viswa-nvidia · 2022-09-26T17:04:21Z

@rnyak , please link the systems - multi hot related development

viswa-nvidia · 2022-11-15T17:42:56Z

@oliverholworthy , please add the input output schema related tickets to this ticket

oliverholworthy · 2022-11-15T17:50:28Z

@viswa-nvidia For the saving method:

This is the parent issue for that:

[FEA] Save input and output schema when .save methods are called on models models#669

We have implemented the save method to save input schema, but currently missing output schema.

oliverholworthy · 2022-11-15T17:51:45Z

Also identified an error in Merlin Models that may impact ability to serve Transformer-based models that affects issues with saving a model after loading.

NVIDIA-Merlin/models#878

viswa-nvidia · 2023-01-10T17:49:07Z

@karlhigley , please add the Triton related PR ( serving signatures haven't matched up with what model expects ) in the ticket.

viswa-nvidia · 2023-04-11T16:50:23Z

@rnyak to follow up with @radekosmulski for blocker ( 23.04 )

EvenOldridge · 2023-04-26T02:31:41Z

It was pointed out to me that we should be consistent about when we consider something done or not so I'm going to reopen this and move it to 22.05. @bbozkaya You've had the only remaining ticket (review the API) on your todo for the past two weeks with no progress. Is this something you're able to take on so that we can close the ticket. If not let us know and we can reassign.

viswa-nvidia added the roadmap label Jul 5, 2022

viswa-nvidia assigned sararb Jul 5, 2022

EvenOldridge added this to the Merlin 22.09 milestone Jul 13, 2022

sararb changed the title ~~[RMP]Session based recommendations integration in to T4R and MM~~ [RMP]Session based recommendations integration in MM Jul 14, 2022

sararb mentioned this issue Jul 14, 2022

[Task] First support of session-based recommendation with RNNs block #451

Closed

6 tasks

viswa-nvidia changed the title ~~[RMP]Session based recommendations integration in MM~~ [ERMP]Session based recommendations integration in MM Jul 20, 2022

viswa-nvidia changed the title ~~[ERMP]Session based recommendations integration in MM~~ [RMP]Session based recommendations integration in MM Jul 29, 2022

viswa-nvidia changed the title ~~[RMP]Session based recommendations integration in MM~~ [RMP]Session based recommendations integration in Merlin Jul 29, 2022

EvenOldridge changed the title ~~[RMP]Session based recommendations integration in Merlin~~ [RMP] Tensorflow support for session based recommendations integration in Merlin Jul 29, 2022

EvenOldridge modified the milestones: Merlin 22.09, Merlin 22.10 Aug 3, 2022

EvenOldridge assigned gabrielspmoreira, radekosmulski, marcromeyn and sararb and unassigned sararb Aug 3, 2022

viswa-nvidia modified the milestones: Merlin 22.10, Merlin 22.11 Sep 26, 2022

viswa-nvidia modified the milestones: Merlin 22.11, Merlin 22.12 Oct 25, 2022

karlhigley mentioned this issue Nov 21, 2022

[BUG] SavedModel serving signature contains all input features even when they aren't used NVIDIA-Merlin/models#898

Closed

viswa-nvidia modified the milestones: Merlin 22.12, Merlin 23.01 Nov 29, 2022

viswa-nvidia added 23.01 22.12 22.11 22.10 labels Dec 15, 2022

viswa-nvidia modified the milestones: Merlin 23.01, Merlin 23.02 Dec 20, 2022

karlhigley mentioned this issue Dec 21, 2022

[INF] Merlin Commons #776

Open

11 tasks

viswa-nvidia modified the milestones: Merlin 23.02, Merlin 23.03 Jan 24, 2023

viswa-nvidia modified the milestones: Merlin 23.03, Merlin 23.04 Feb 28, 2023

viswa-nvidia closed this as completed Apr 25, 2023

EvenOldridge reopened this Apr 26, 2023

EvenOldridge modified the milestones: Merlin 23.04, Merlin 23.05 Apr 26, 2023

gabrielspmoreira mentioned this issue Apr 26, 2023

[RMP] Quick Start for Session-Based Recommendation #927

Open

21 tasks

bbozkaya mentioned this issue May 2, 2023

[Task] Add/Fix the doc strings for TF based session based API documentation NVIDIA-Merlin/models#1078

Closed

EvenOldridge modified the milestones: Merlin 23.05, Merlin 23.06 May 30, 2023

viswa-nvidia closed this as completed Jun 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RMP] Tensorflow support for session based recommendations integration in Merlin #433

[RMP] Tensorflow support for session based recommendations integration in Merlin #433

viswa-nvidia commented Jul 5, 2022 •

edited by edknv

Loading

EvenOldridge commented Aug 3, 2022

EvenOldridge commented Aug 3, 2022

gabrielspmoreira commented Aug 3, 2022

viswa-nvidia commented Aug 25, 2022

karlhigley commented Aug 25, 2022

viswa-nvidia commented Aug 29, 2022 •

edited

Loading

viswa-nvidia commented Sep 26, 2022

viswa-nvidia commented Nov 15, 2022

oliverholworthy commented Nov 15, 2022

oliverholworthy commented Nov 15, 2022

viswa-nvidia commented Jan 10, 2023

viswa-nvidia commented Apr 11, 2023

EvenOldridge commented Apr 26, 2023

[RMP] Tensorflow support for session based recommendations integration in Merlin #433

[RMP] Tensorflow support for session based recommendations integration in Merlin #433

Comments

viswa-nvidia commented Jul 5, 2022 • edited by edknv Loading

Problem:

Goal:

Definition of Done

Constraints:

Starting Point:

Training

Inputs

Masking

RetrievalModel

Outputs

Port Sequence Architectures

Fixes for the GTC tutorial on session-based recommendation

Inference support

Save schema on model save

Systems

Documentation

Examples

Blocker

EvenOldridge commented Aug 3, 2022

EvenOldridge commented Aug 3, 2022

gabrielspmoreira commented Aug 3, 2022

viswa-nvidia commented Aug 25, 2022

karlhigley commented Aug 25, 2022

viswa-nvidia commented Aug 29, 2022 • edited Loading

viswa-nvidia commented Sep 26, 2022

viswa-nvidia commented Nov 15, 2022

oliverholworthy commented Nov 15, 2022

oliverholworthy commented Nov 15, 2022

viswa-nvidia commented Jan 10, 2023

viswa-nvidia commented Apr 11, 2023

EvenOldridge commented Apr 26, 2023

viswa-nvidia commented Jul 5, 2022 •

edited by edknv

Loading

viswa-nvidia commented Aug 29, 2022 •

edited

Loading