- Convolutional Sequence to Sequence Learning: paper (appeared at ICML 2017) / official code
- Attention Is All You Need: paper / official code / PyTorch code
To learn more about the self-attention mechanism, you could read "A Structured Self-attentive Sentence Embedding".