Separations in the Representational Capabilities of Transformers and Recurrent Architectures

Publication
Annual Conference on Neural Information Processing Systems (NeurIPS 2024)