Releases · facebookresearch/xformers
v0.0.10
Fixed
- Expose bias flag for feedforwards, same default as Timm [#220]
- Update eps value for layernorm, same default as torch [#221]
- PreNorm bugfix, only one input was normalized [#233]
Added
- Add DeepNet (DeepNorm) residual path and init [#227]
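The DeepNorm addition in [#227] follows the DeepNet recipe of scaling the residual branch before a post-LayerNorm. Below is a minimal plain-PyTorch sketch of that residual pattern, not the xformers implementation; the `DeepNormResidual` name, the example `alpha`, and the toy feedforward are illustrative assumptions.

```python
# Minimal plain-PyTorch sketch of a DeepNorm-style residual block
# (illustrative only; not the xformers implementation from #227).
import torch
import torch.nn as nn


class DeepNormResidual(nn.Module):
    """Post-LayerNorm residual with DeepNet scaling: LN(alpha * x + sublayer(x))."""

    def __init__(self, sublayer: nn.Module, dim_model: int, alpha: float):
        super().__init__()
        self.sublayer = sublayer
        self.norm = nn.LayerNorm(dim_model)  # eps defaults to torch's 1e-5, in line with #221
        self.alpha = alpha  # e.g. (2 * num_layers) ** 0.25 for an encoder-only model

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.norm(self.alpha * x + self.sublayer(x))


# Usage: wrap a feedforward whose Linear layers keep bias=True (the Timm-style default of #220).
ff = nn.Sequential(nn.Linear(64, 256, bias=True), nn.GELU(), nn.Linear(256, 64, bias=True))
block = DeepNormResidual(ff, dim_model=64, alpha=(2 * 12) ** 0.25)
out = block(torch.randn(2, 16, 64))
```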
v0.0.9
Added
- Compositional Attention [#41]
- Experimental Ragged attention [#189]
- Mixture of Experts [#181]
- BlockSparseTensor [#202]
- N-d tensor support for the Triton softmax [#210] (sketch below)
Fixed
- Bugfix in Favor with a single feature map [#183]
- Sanity-check blocksparse settings [#207]
- Fixed some picklability issues [#204]
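A hedged sketch of using the Triton softmax on an N-d tensor ([#210]) with a plain PyTorch fallback. The `xformers.triton.softmax` import path is an assumption that may vary by version, and the fused path requires a CUDA GPU with Triton installed.

```python
# Use the Triton softmax on an N-d tensor when available, fall back to torch otherwise.
import torch

try:
    from xformers.triton import softmax as fused_softmax  # assumption: requires Triton + CUDA
except ImportError:
    fused_softmax = None

x = torch.randn(4, 8, 128, 128)  # e.g. [batch, heads, seq, seq] attention logits
if fused_softmax is not None and x.is_cuda:
    y = fused_softmax(x)          # N-d inputs supported since #210
else:
    y = torch.softmax(x, dim=-1)  # plain PyTorch fallback
```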
v0.0.8
Fixed
- Much faster fused dropout
- Fused dropout repeatability
Added
- Embedding weight tying option
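A plain-PyTorch illustration of what the embedding weight tying option amounts to: the output projection reuses the input embedding matrix. This is a sketch of the idea, not the xformers configuration flag itself; all names here are illustrative.

```python
# Embedding weight tying: the output projection shares the input embedding matrix.
import torch
import torch.nn as nn

vocab_size, dim_model = 32000, 512
embedding = nn.Embedding(vocab_size, dim_model)
lm_head = nn.Linear(dim_model, vocab_size, bias=False)
lm_head.weight = embedding.weight  # tie: one shared parameter instead of two

tokens = torch.randint(0, vocab_size, (2, 16))
logits = lm_head(embedding(tokens))  # [2, 16, vocab_size]
```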
v0.0.7
Fixed
- Dropout setting not properly passed in many attentions
v0.0.6
Fixed
- Fix self-attention optimization not being triggered and a broken residual path [#119]
- Improve speed by not using contiguous Tensors when not needed [#119]
Added
- Attention mask wrapper [#113] (sketch below)
- ViT comparison benchmark [#117]
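A plain-PyTorch sketch of the two mask conventions an attention mask wrapper such as [#113] typically reconciles: boolean "may attend" masks versus additive float masks. Names and shapes are illustrative assumptions, not the xformers API.

```python
# Two common attention mask conventions and how they relate.
import torch

seq = 8
# Boolean convention: True means "may attend".
bool_causal = torch.tril(torch.ones(seq, seq, dtype=torch.bool))
# Additive convention: 0 where attention is allowed, -inf where it is masked out.
additive_causal = torch.zeros(seq, seq).masked_fill(~bool_causal, float("-inf"))

scores = torch.randn(seq, seq)
attn = torch.softmax(scores + additive_causal, dim=-1)  # future positions get zero weight
```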
v0.0.5
Fixes the broken 0.0.4 pip package; the next release will aim to expose pre-built binaries.
v0.0.4
- Fixing causality not being respected by the scaled dot product attention
- Fixing Favor causal trainability
- Enabling FusedLayerNorm by default if Triton is available (sketch below)
- Fixing Favor with fp16
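A hedged sketch of the "use the fused kernel when Triton is present" pattern behind the FusedLayerNorm default; the `xformers.triton.FusedLayerNorm` import path is an assumption and may differ between versions.

```python
# Prefer the Triton-backed layer norm when it can be imported, otherwise use torch's.
try:
    from xformers.triton import FusedLayerNorm as LayerNorm  # assumption: Triton + CUDA only
except ImportError:
    from torch.nn import LayerNorm  # plain PyTorch fallback

norm = LayerNorm(512)
```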
v0.0.3
[0.0.3] - 2021-11-01
Fixed
- Nystrom causal attention [#75]
v0.0.2
[0.0.2] - 2021-11-01
Fixed
- More robust blocksparse [#24]
Added
- Rotary embeddings [#32] (sketch after this list)
- More flexible layernorm [#50]
- More flexible blockfactory config (key deduplication)
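A plain-PyTorch sketch of applying rotary position embeddings to query/key tensors, to illustrate what [#32] adds. The half-rotation layout, the 10000 base, and the helper names are assumptions, not necessarily the exact behavior of the xformers `RotaryEmbedding` module.

```python
# Rotary position embeddings applied to Q/K (illustrative sketch).
import torch


def rotate_half(x: torch.Tensor) -> torch.Tensor:
    # Swap and negate the two halves of the last dimension.
    x1, x2 = x.chunk(2, dim=-1)
    return torch.cat((-x2, x1), dim=-1)


def apply_rotary(q: torch.Tensor, k: torch.Tensor):
    # q, k: [batch, heads, seq, head_dim] with an even head_dim.
    head_dim, seq = q.shape[-1], q.shape[-2]
    inv_freq = 1.0 / (10000 ** (torch.arange(0, head_dim, 2).float() / head_dim))
    t = torch.arange(seq).float()
    freqs = torch.einsum("i,j->ij", t, inv_freq)  # [seq, head_dim / 2]
    emb = torch.cat((freqs, freqs), dim=-1)       # [seq, head_dim]
    cos, sin = emb.cos(), emb.sin()
    q_rot = (q * cos) + (rotate_half(q) * sin)
    k_rot = (k * cos) + (rotate_half(k) * sin)
    return q_rot, k_rot


q = k = torch.randn(2, 8, 16, 64)
q_rot, k_rot = apply_rotary(q, k)
```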