
[Draft] Add Experimental limited sparse embedding bag #8905

Draft · amjames wants to merge 4 commits into master from amjames/sparse_embedding_bag
Conversation

@amjames (Collaborator) commented Mar 28, 2025

Users of torch_xla encounter an issue when using the sparse=True option with the Embedding or EmbeddingBag modules.

The gradient for weight is created as a sparse tensor, but no dispatch is registered for the sparse creation APIs with the XLA key, nor for the Sparse functionality key used in conjunction with the XLA backend key.
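A minimal reproduction might look like the following sketch (the exact error message depends on the torch_xla version):

```python
import torch
import torch.nn as nn
import torch_xla.core.xla_model as xm

device = xm.xla_device()

# EmbeddingBag with sparse=True produces a sparse COO gradient for `weight`.
emb = nn.EmbeddingBag(10, 4, mode="sum", sparse=True).to(device)
inp = torch.tensor([1, 2, 4, 5], device=device)
offsets = torch.tensor([0, 2], device=device)

out = emb(inp, offsets)
# backward() tries to construct a native sparse tensor on the XLA device,
# for which no kernel is registered, so this step fails.
out.sum().backward()
```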

This is a workaround that can be removed, ported to C++, or extended later:

  • SparseCOOTensor: a tensor subclass implementing the optimizations and semantics of the upstream SparseTensor. It is composable with the XLA device (see the sketch after this list).
  • Drop-in replacements for F.embedding, F.embedding_bag, nn.Embedding, and nn.EmbeddingBag, which forward to a custom implementation of the backward pass and produce the above tensor subclass rather than a native torch sparse tensor (a backward sketch follows further below).
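A minimal sketch of such a subclass (illustrative only, not the PR's actual implementation) could use `torch.Tensor._make_wrapper_subclass` so the shell reports a dense layout to dispatch while the COO components live on the XLA device:

```python
import torch

class SparseCOOTensor(torch.Tensor):
    """Illustrative wrapper subclass holding COO components on any device."""

    @staticmethod
    def __new__(cls, indices, values, size):
        # The wrapper reports a dense layout, so no sparse dispatch key is
        # involved; only the component tensors carry the XLA device.
        return torch.Tensor._make_wrapper_subclass(
            cls, size, dtype=values.dtype, device=values.device
        )

    def __init__(self, indices, values, size):
        self._indices = indices  # shape (ndim, nnz)
        self._values = values    # shape (nnz, ...) for hybrid tensors

    def to_dense(self):
        # Scatter-add the values into a dense tensor. For an embedding
        # weight gradient, indices is (1, nnz) and values is (nnz, dim).
        out = torch.zeros(
            self.shape, dtype=self._values.dtype, device=self._values.device
        )
        out.index_put_(tuple(self._indices), self._values, accumulate=True)
        return out
```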

The tensor subclass's component tensors, indices and values, can be on the XLA device without issue.
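To illustrate how a drop-in replacement could hand this subclass back to autograd, here is a hedged sketch of a custom autograd Function for the sum-mode embedding-bag backward. All names here are hypothetical, and whether autograd accepts a subclass as the grad for a plain weight depends on the subclass implementing enough `__torch_dispatch__` overrides, which this sketch elides:

```python
import torch
import torch.nn.functional as F

class _EmbeddingBagSparseGrad(torch.autograd.Function):
    """Sketch: sum-mode embedding_bag whose weight grad is a SparseCOOTensor."""

    @staticmethod
    def forward(ctx, weight, input, offsets):
        ctx.save_for_backward(input, offsets)
        ctx.weight_shape = weight.shape
        return F.embedding_bag(input, weight, offsets, mode="sum")

    @staticmethod
    def backward(ctx, grad_out):
        input, offsets = ctx.saved_tensors
        # In sum mode, each index in a bag receives that bag's output grad.
        end = torch.tensor([input.numel()], device=offsets.device)
        bag_sizes = torch.diff(offsets, append=end)
        grad_rows = torch.repeat_interleave(grad_out, bag_sizes, dim=0)
        # SparseCOOTensor is the wrapper subclass sketched earlier.
        grad_weight = SparseCOOTensor(
            input.unsqueeze(0), grad_rows, ctx.weight_shape
        )
        return grad_weight, None, None
```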

fixes #8719

@amjames requested a review from ysiraichi on Mar 28, 2025 22:23
@amjames force-pushed the amjames/sparse_embedding_bag branch from aacfd1b to c67588f on Mar 28, 2025 22:27