Language, Computation and Cognition Lab (LaCoCo)
Language, Computation and Cognition Lab (LaCoCo)
Home
People
Publications
Teaching
Joining
Light
Dark
Automatic
InversionView: A General-Purpose Method for Reading Information from Neural Activations
Xinting Huang
,
Madhur Panwar
,
Navin Goyal
,
Michael Hahn
February, 2024
Preprint
Type
Conference paper
Publication
Annual Conference on Neural Information Processing Systems (NeurIPS 2024)
Previously: Mechanistic Interpretability Workshop at ICML 2024 (oral)
🏆 Awarded Second Place Prize
Cite
×