* Code for The Annotated Transformer blog post (GitHub): https://github.com/harvardnlp/annotated-transformer
Rush also presents a workshop paper on this model (http://aclweb.org/anthology/W18-2509).
Of course all of that is in reference to the original Google Brain/Research paper, "Attention Is All You Need"
* arXiv landing page: https://arxiv.org/abs/1706.03762
* PDF: https://arxiv.org/pdf/1706.03762.pdf