"In fact, I’d go as far as to say that
The concept of attention is the most interesting recent architectural innovation in neural networks."