Discovering Interpretable Algorithms by Decompiling Transformers to RASP (bibtex)
by Xinting Huang, Aleksandra Bakalova, Satwik Bhattamishra, William Merrill, Michael Hahn
Reference:
Discovering Interpretable Algorithms by Decompiling Transformers to RASPXinting Huang, Aleksandra Bakalova, Satwik Bhattamishra, William Merrill, Michael HahnarXiv preprint, 2026.
Bibtex Entry:
@article{huang2026discoveringinterpretablealgorithmsdecompiling,
      title={Discovering Interpretable Algorithms by Decompiling Transformers to RASP},
      author={Xinting Huang and Aleksandra Bakalova and Satwik Bhattamishra and William Merrill and Michael Hahn},
      year={2026},
      journal={arXiv preprint},
      url={https://arxiv.org/abs/2602.08857},
}
Powered by bibtexbrowser