
[BUG] Error when loading the session-based model in example #908

Closed
FredHJC opened this issue Nov 30, 2022 · 5 comments
Labels: bug (Something isn't working), P0, status/needs-triage

FredHJC commented Nov 30, 2022

❓ Questions & Help

Details

https://github.com/NVIDIA-Merlin/models/blob/main/examples/usecases/ecommerce-session-based-next-item-prediction-for-fashion.ipynb

We are trying to implement the session-based models shown in the above notebook. However, there is a consistent error when loading the saved model: TypeError: ('Keyword argument not understood:', 'layer was saved without config')

We can run the example notebook and save the trained model without problems; the error only occurs when trying to load it back. It looks like custom layers / prediction tasks need a manually specified config in order to round-trip through serialization.

[Screenshot: TypeError traceback raised when loading the saved model]
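For context, here is a toy sketch (plain Python, not Merlin or Keras code; all names are hypothetical) of how this particular TypeError can arise: Keras-style serialization stores each layer as a class name plus the dict returned by get_config(). If a custom layer cannot provide a config, a placeholder dict may be saved instead, and feeding that placeholder back into a constructor as keyword arguments trips the "Keyword argument not understood" check.

```python
# Toy sketch (not Merlin/Keras internals) of the reported TypeError.
class BaseLayer:
    def __init__(self, units=None, **kwargs):
        if kwargs:  # mimics Keras rejecting unrecognized constructor kwargs
            raise TypeError(
                "Keyword argument not understood:", next(iter(kwargs))
            )
        self.units = units

    def get_config(self):
        raise NotImplementedError  # subclasses must opt in to serialization


def save_layer(layer):
    """Serialize a layer; fall back to a placeholder if it has no config."""
    try:
        config = layer.get_config()
    except NotImplementedError:
        config = {"layer was saved without config": True}
    return {"class_name": type(layer).__name__, "config": config}


def load_layer(saved):
    """Rebuild a layer by passing the saved config back as keyword args."""
    return BaseLayer(**saved["config"])


saved = save_layer(BaseLayer())  # this layer never implemented get_config
try:
    load_layer(saved)
except TypeError as err:
    print(err)  # ('Keyword argument not understood:', 'layer was saved without config')
```

Under this reading, the fix is for each custom layer/prediction task to implement proper config serialization, which is what the original report speculates.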

@zhiruiwang

Steps/Code to reproduce bug

  1. Run the Dressipi notebook Transformer-based model example and train the model:
model_transformer.fit(loader, 
                      validation_data=val_loader,
                      epochs=EPOCHS
                     )
  2. Save the model to disk, which succeeds:
model_transformer.save(os.path.join('/ecom_modeling/11-30-session', 'transformer'))

It does have some warnings though:

WARNING:tensorflow:Skipping full serialization of Keras layer TFSharedEmbeddings(
  (_feature_shapes): Dict(
    (f_47_list_seq): TensorShape([1024, None])
    (f_68_list_seq): TensorShape([1024, None])
    (item_id_list_seq): TensorShape([1024, None])
    (item_id_last): TensorShape([1024, 1])
  )
  (_feature_dtypes): Dict(
    (f_47_list_seq): tf.int32
    (f_68_list_seq): tf.int32
    (item_id_list_seq): tf.int32
    (item_id_last): tf.int32
  )
), because it is not built.
.......
INFO:tensorflow:Unsupported signature for serialization: ((Prediction(outputs={'purchase_id_first/categorical_output': TensorSpec(shape=(None, 23272), dtype=tf.float32, name='outputs/outputs/purchase_id_first/categorical_output')}, targets={'purchase_id_first/categorical_output': TensorSpec(shape=(None, 23272), dtype=tf.float32, name='outputs/targets/purchase_id_first/categorical_output')}, sample_weight={'purchase_id_first/categorical_output': None}, features=None, negative_candidate_ids=None), <tensorflow.python.framework.func_graph.UnknownArgument object at 0x7fb613ff95e0>), {}).
  3. Load the model from disk, which raises the error:
model_loaded = tf.keras.models.load_model(
        os.path.join('/ecom_modeling/11-30-session', 'transformer'))
TypeError: ('Keyword argument not understood:', 'layer was saved without config')

We are wondering if we did something wrong with the saving and loading of the model, or if there's a bug in saving and loading Merlin session-based models.

We're also not sure whether this is related to #898 or #889.

@rnyak rnyak added this to the Merlin 22.12 milestone Dec 14, 2022
@rnyak rnyak added bug Something isn't working P0 labels Dec 14, 2022
@rnyak rnyak changed the title [QST] Error when loading the session-based model in example [BUG] Error when loading the session-based model in example Dec 19, 2022
@rnyak rnyak modified the milestones: Merlin 22.12, Merlin 23.01 Dec 19, 2022

sararb commented Dec 19, 2022

Thank you for reporting the bug.

I was able to reproduce this error when loading the model without importing merlin.models.tf. The custom layers will not be understood by TensorFlow if Merlin Models (MM) is not imported.

The recommended way to load a trained MM model is to import merlin.models.tf before calling tf.keras.models.load_model, as follows:

import tensorflow as tf
import merlin.models.tf as mm
model = tf.keras.models.load_model('transformer-model')

Please let us know if this fixes the loading issue. Thanks!
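To illustrate why the import matters, here is a toy sketch in plain Python (not the actual Keras/Merlin internals; the registry, decorator, and class names are hypothetical): frameworks like Keras typically add custom classes to a global registry at import time, and the loader resolves saved class names against that registry. If the module defining the classes is never imported, the names are unknown at load time.

```python
# Toy sketch of an import-time custom-object registry.
CUSTOM_OBJECTS = {}  # stands in for the framework's custom-object registry


def register(cls):
    """Decorator that registers a class when its module is imported."""
    CUSTOM_OBJECTS[cls.__name__] = cls
    return cls


@register  # runs as a side effect of importing the defining module
class TransformerBlock:
    @classmethod
    def from_config(cls, config):
        return cls()


def load_layer(saved):
    """Resolve the saved class name and rebuild the layer from its config."""
    cls = CUSTOM_OBJECTS.get(saved["class_name"])
    if cls is None:
        raise ValueError(f"Unknown layer class: {saved['class_name']!r}")
    return cls.from_config(saved["config"])


layer = load_layer({"class_name": "TransformerBlock", "config": {}})
print(type(layer).__name__)  # TransformerBlock
```

In this analogy, `import merlin.models.tf` plays the role of the import that populates the registry before `tf.keras.models.load_model` is called.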

@oliverholworthy
Member

We also have a classmethod on our Model class, so you can also load the model as follows:

import merlin.models.tf as mm

mm.Model.load('<path-to-saved-model-directory>')


rnyak commented Jan 5, 2023

@zhiruiwang @FredHJC closing this issue since it should be solved via #927.

@rnyak rnyak closed this as completed Jan 5, 2023
@zhiruiwang

@rnyak We used the Merlin-tensorflow 22.12 image and refactored our pipeline to use the latest API of the merlin-models codebase; saving and loading two-tower, LSTM, and transformer models all work now. Thanks for the help!
