Discovering Interpretable Algorithms by Decompiling Transformers to RASP

Publication
arXiv