Tag: attention model

AI Research

A Brief Overview of Attention Mechanism

Attention is simply a vector, often the outputs of dense layer using softmax function.

Research

Memory, Attention, Sequences

In the article “Memory, attention, sequences”, the author predicts that future work on neural networks will emphasize understanding complex spatio-temporal data from the real world, which is highly contextual and noisy.