by Synced 2017-09-25 1 AI Research A Brief Overview of Attention Mechanism Attention is simply a vector, often the outputs of dense layer using softmax function.
by Synced 2017-07-11 Number of comments0 Research Massive Exploration of Neural Machine Translation Architectures Although no innovations for the NMT architecture was introduced, the authors claim that a classic NMT baseline system with carefully tuned hyperparameters can still achieve comparable result to the state-of-the-art.