Machine Learning Research
Taming Transformers: Researchers find new strategies to accelerate transformer architecture.
The transformer architecture is astonishingly powerful but notoriously slow. Researchers have developed numerous tweaks to accelerate it — enough to warrant a look at how these alternatives work, their strengths, and their weaknesses.