Separations in the Representational Capabilities of Transformers and Recurrent Architectures (bibtex)
by Satwik Bhattamishra, Michael Hahn, Phil Blunsom, Varun Kanade
Reference:
Separations in the Representational Capabilities of Transformers and Recurrent ArchitecturesSatwik Bhattamishra, Michael Hahn, Phil Blunsom, Varun KanadeThe Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024), 2024.
Bibtex Entry:
@inproceedings{bhattamishra2024separationsrepresentationalcapabilitiestransformers,
      title={Separations in the Representational Capabilities of Transformers and Recurrent Architectures},
      author={Satwik Bhattamishra and Michael Hahn and Phil Blunsom and Varun Kanade},
      year={2024},
booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024)},
      eprint={2406.09347},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2406.09347},
}
Powered by bibtexbrowser