
[WIP] Wide & Deep migration to PyTorch #2168

Draft
wants to merge 11 commits into staging

Conversation

@daviddavo daviddavo (Collaborator) commented Sep 18, 2024

Description

Migrating Wide & Deep out of TensorFlow.

Note: previously I have only used models from high-level libraries like Keras. I'm doing this to learn PyTorch, so feel free to give me pointers, or even scrap everything if it is not useful.

Related Issues

References

Checklist:

  • I have followed the contribution guidelines and code style for this project.
  • I have added tests covering my contributions.
  • I have updated the documentation accordingly.
  • I have signed the commits, e.g. git commit -s -m "your commit message".
  • This PR is being made to staging branch AND NOT TO main branch.

WIP Tasks

  • Implementing the model as PyTorch's nn.Module (see the sketch after this list)
    • Allowing additional embeddings
    • Allowing additional continuous features
    • Allowing cross-features (sorry, I don't understand them yet)
  • Creating a wrapper with .fit() and .recommend_k_items() methods
    • Allowing additional embeddings in the wrapper's .fit()
    • Allowing additional continuous features in the wrapper's .fit()
    • Caching the "ranking pool"
    • Saving the model every save_checkpoints_steps iterations in model_dir
    • Evaluating the test loss every log_every_n_iter iterations instead of every iteration
  • Creating a torch.utils.data.Dataset to pass to the wrapper class
  • Creating tests
  • Updating Jupyter Notebooks
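
Below is a minimal sketch of the structure I have in mind for the nn.Module. The field names, dimensions, and hyperparameters are placeholders, not the final API of this PR:

```python
import torch
import torch.nn as nn


class WideAndDeep(nn.Module):
    """Wide & Deep scorer with a single sigmoid output per (user, item) example."""

    def __init__(self, field_cardinalities, n_continuous, embedding_dim=32,
                 hidden_dims=(256, 128, 64), dropout=0.1):
        super().__init__()
        # Wide part: a linear term per categorical field
        # (an embedding of size 1 behaves like a one-hot linear layer).
        self.wide = nn.ModuleList(
            [nn.Embedding(card, 1) for card in field_cardinalities]
        )
        # Deep part: one embedding table per categorical field, concatenated
        # with the continuous features and fed through an MLP.
        self.deep_embeddings = nn.ModuleList(
            [nn.Embedding(card, embedding_dim) for card in field_cardinalities]
        )
        layers = []
        in_dim = embedding_dim * len(field_cardinalities) + n_continuous
        for h in hidden_dims:
            layers += [nn.Linear(in_dim, h), nn.ReLU(), nn.Dropout(dropout)]
            in_dim = h
        self.mlp = nn.Sequential(*layers)
        self.head = nn.Linear(in_dim, 1)  # single logit, i.e. P(Y=1|X) after sigmoid

    def forward(self, categorical, continuous):
        # categorical: (batch, n_fields) int64; continuous: (batch, n_continuous) float32
        wide_logit = sum(emb(categorical[:, i]) for i, emb in enumerate(self.wide))
        deep_input = torch.cat(
            [emb(categorical[:, i]) for i, emb in enumerate(self.deep_embeddings)]
            + [continuous],
            dim=1,
        )
        deep_logit = self.head(self.mlp(deep_input))
        return torch.sigmoid(wide_logit + deep_logit).squeeze(-1)
```

For example, `WideAndDeep(field_cardinalities=[943, 1682], n_continuous=3)` would cover MovieLens 100k's 943 users and 1682 items plus three extra continuous features.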

@miguelgfierro miguelgfierro marked this pull request as ready for review September 21, 2024 07:39
@miguelgfierro miguelgfierro (Collaborator) commented Sep 21, 2024

Sorry @daviddavo, I pressed the wrong action. I changed the PR to ready for review because we are having some problems with the tests. Hopefully, we can fix them by next week.

@daviddavo daviddavo marked this pull request as draft September 21, 2024 15:09
@daviddavo daviddavo (Collaborator, Author) commented:

NP, I still have quite some work to do

@miguelgfierro miguelgfierro (Collaborator) commented:

FYI @daviddavo the tests should be working now after #2169

@daviddavo daviddavo (Collaborator, Author) commented:

The TensorFlow Estimators approach currently used by recommenders uses a binary regressor (the default value of n_classes). To get the recommendations, all user-item pairs are used as input (created using the user_item_pairs method).

In the PyTorch notebook the output is not binary; instead, there is a class for each movie. The aim of that notebook is to predict the next movie to watch, not to make top-k recommendations, which is a different problem.

The question is: what does the original paper do? Does it output a single scalar, or a vector with a value corresponding to each item? As I understand it, it should be a single value ( $P\left(Y=1|X\right)=\sigma\left(...\right)$ ), but that notebook made me doubt it.

Nevertheless, I think I will modify the current model so the "head" returns a scalar, and to get the top-k recommendations we pass all possible user-item pairs, as recommenders' TensorFlow implementation does.

Edit: NVIDIA's deep learning examples also output a single value. My doubts have been resolved but I'll keep this post as some kind of documentation. I'll finish the model and do the training soon.
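
To make the idea concrete, here is a rough sketch (not the final wrapper code) of how .recommend_k_items() could score every user-item pair with a scalar-output model and keep the top k per user. The model call signature below matches the sketch in the description and is an assumption:

```python
import torch


def recommend_k_items(model, user_ids, item_ids, k=10, device="cpu"):
    """Score all user-item pairs with a scalar-output model and keep top-k per user."""
    model.eval()
    items = torch.as_tensor(item_ids, dtype=torch.long, device=device)
    # No extra continuous features in this sketch, hence the zero-width tensor.
    no_continuous = torch.zeros(len(item_ids), 0, device=device)
    recommendations = {}
    with torch.no_grad():
        for user in user_ids:
            users = torch.full_like(items, user)
            pairs = torch.stack([users, items], dim=1)  # (n_items, 2) categorical input
            scores = model(pairs, no_continuous)        # (n_items,) probabilities
            top = torch.topk(scores, k)
            recommendations[user] = [
                (item_ids[i], score)
                for i, score in zip(top.indices.tolist(), top.values.tolist())
            ]
    return recommendations
```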

@daviddavo daviddavo (Collaborator, Author) commented:

The loss function decreases over time in my Jupyter notebook. The only thing remaining is the "software engineering" part.

@daviddavo daviddavo (Collaborator, Author) commented:

Now that I'm testing it with the full 100k dataset, I realized it's very slow. I will profile it next weekend, but I have a hunch that the problem is the DataLoader, which uses a lot of slow .loc calls.
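
If that hunch is right, one common fix (just a sketch under that assumption, with illustrative column names) is to convert the DataFrame columns to tensors once in __init__, so __getitem__ only indexes tensors instead of calling .loc per sample:

```python
import pandas as pd
import torch
from torch.utils.data import Dataset


class InteractionsDataset(Dataset):
    """Pre-converts a ratings DataFrame to tensors to avoid per-item .loc lookups."""

    def __init__(self, df: pd.DataFrame):
        # One vectorized conversion up front instead of one .loc call per sample.
        self.users = torch.as_tensor(df["userID"].to_numpy(), dtype=torch.long)
        self.items = torch.as_tensor(df["itemID"].to_numpy(), dtype=torch.long)
        self.labels = torch.as_tensor(df["rating"].to_numpy(), dtype=torch.float32)

    def __len__(self):
        return len(self.labels)

    def __getitem__(self, idx):
        return self.users[idx], self.items[idx], self.labels[idx]
```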

@miguelgfierro miguelgfierro (Collaborator) commented:

@daviddavo how are you doing with this PR? Let me know if you need any help
