Thanks for the excellent work! I have a question about the `sample_sparse_structure` function. Since it samples a different number of coordinates from each image's encoded latents, how can this operation work with a batch size greater than one during training?
Hi,
Batched attention over sequences of different lengths can be handled efficiently by modern attention implementations such as flash-attn and xformers. See the code here for more details.
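To illustrate the idea (this is a plain NumPy sketch of the block-diagonal masking that flash-attn's varlen kernels and xformers' `BlockDiagonalMask` implement much more efficiently, not the actual library code): variable-length samples are concatenated into one packed sequence, and a block-diagonal mask keeps tokens from attending across sample boundaries.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def packed_attention(q, k, v, seqlens):
    """Attention over variable-length sequences packed along one axis.

    q, k, v: (total_tokens, dim) -- all samples concatenated.
    seqlens: per-sample lengths; sum(seqlens) == total_tokens.
    A block-diagonal mask prevents attention across samples, which is
    the effect flash-attn's varlen API / xformers' BlockDiagonalMask
    achieve without materializing the mask.
    """
    total, dim = q.shape
    # cumulative offsets, analogous to the cu_seqlens that varlen kernels take
    cu = np.concatenate([[0], np.cumsum(seqlens)])
    scores = q @ k.T / np.sqrt(dim)
    mask = np.full((total, total), -np.inf)
    for s, e in zip(cu[:-1], cu[1:]):
        mask[s:e, s:e] = 0.0  # tokens may attend only within their own sample
    return softmax(scores + mask) @ v

rng = np.random.default_rng(0)
seqlens = [3, 5, 2]  # three samples with different coordinate counts
total = sum(seqlens)
q = rng.standard_normal((total, 8))
k = rng.standard_normal((total, 8))
v = rng.standard_normal((total, 8))
out = packed_attention(q, k, v, seqlens)

# the packed result matches running attention on each sample separately
start = 0
for n in seqlens:
    ref = packed_attention(q[start:start+n], k[start:start+n],
                           v[start:start+n], [n])
    assert np.allclose(out[start:start+n], ref)
    start += n
```

The check at the end confirms that packing plus a block-diagonal mask is equivalent to per-sample attention, which is why a single fused kernel can serve a whole batch of differently sized coordinate sets.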