A Framework for Understanding Learnability in Transformers

Publication
ICML