Various transformer updates to improve performance #182

coryMosaicML · 2024-11-26T06:14:06Z

This PR includes a few updates and fixes to the diffusion transformer to improve performance.

Changes include:

corystephenson-db added 8 commits November 22, 2024 06:23

Add synthetic dataloader

4d17362

Some tweaks

f9dbdb5

Selectable attention backends

53d2874

Add DiT blocks and block groups

1212003

Remove masking

059d741

Clean up old mask code

30ba06a

Add option to not use masks

22816dc

Need a no grad when precomputing embeddings for logged images

603b23c
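
For context on the "Add synthetic dataloader" commit: the PR page does not show the implementation, so the following is only a minimal sketch of the general idea, which is to yield fixed-shape random batches so model throughput can be measured without data loading in the way. The class name `SyntheticLoader`, its parameters, and the batch keys `latents` and `timesteps` are all hypothetical and are not taken from the repository. It uses only the standard library so it runs anywhere; a real version would presumably return tensors.

```python
import random
from typing import Dict, Iterator, List


class SyntheticLoader:
    """Hypothetical loader that yields fixed-shape random batches without reading from disk."""

    def __init__(self, num_batches: int, batch_size: int, seq_len: int, dim: int, seed: int = 0):
        self.num_batches = num_batches
        self.batch_size = batch_size
        self.seq_len = seq_len
        self.dim = dim
        self.seed = seed

    def __len__(self) -> int:
        return self.num_batches

    def __iter__(self) -> Iterator[Dict[str, List]]:
        # Re-seed on every pass so each epoch sees identical data.
        rng = random.Random(self.seed)
        for _ in range(self.num_batches):
            # Nested lists with shape (batch_size, seq_len, dim); key names are assumptions.
            latents = [
                [[rng.gauss(0.0, 1.0) for _ in range(self.dim)] for _ in range(self.seq_len)]
                for _ in range(self.batch_size)
            ]
            # One value in [0, 1) per sample, standing in for a diffusion timestep.
            timesteps = [rng.random() for _ in range(self.batch_size)]
            yield {"latents": latents, "timesteps": timesteps}
```

Usage would look like `for batch in SyntheticLoader(num_batches=10, batch_size=8, seq_len=256, dim=16): ...`, with the shapes chosen to match whatever the model under test expects.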

coryMosaicML merged commit b7e5029 into mosaicml:main Nov 26, 2024
5 checks passed
