Every Layer Counts: Multi-Layer Multi-Head Attention for Neural Machine Translation
2020 ◽ Vol 115 (1) ◽ pp. 51-82
2019 ◽ Vol 28 (4) ◽ pp. 1-29