Publications

Divyansh Singhvi, Andrej Erkelens, Raghav Jain, Diganta Misra, Naomi Saphra (2023). Shapley Interactions for Complex Feature Attribution. NeurIPS Workshop on Attribution at Scale.

Cite URL

Jeevesh Juneja, Rachit Bansal, Kyunghyun Cho, João Sedoc, Naomi Saphra (2023). Linear Connectivity Reveals Generalization Strategies. International Conference on Learning Representations (ICLR).

Cite URL

Naomi Saphra (2023). Interpretability Creationism. The Gradient.

Cite URL

Michael Hu, Angelica Chen, Naomi Saphra, Kyunghyun Cho (2023). Delays, Detours, and Forks in the Road: Latent State Models of Training Dynamics. Transactions of Machine Learning Research (TMLR).

Cite URL

Thibault Sellam, Steve Yadlowsky, Ian Tenney, Jason Wei, Naomi Saphra, Alexander D'Amour, Tal Linzen, Jasmijn Bastings, Iulia Raluca Turc, Jacob Eisenstein, Dipanjan Das, Ellie Pavlick (2022). The MultiBERTs: BERT Reproductions for Robustness Analysis. International Conference on Learning Representations (ICLR).

Cite URL

Bingchen Zhao, Yuling Gu, Jessica Zosa Forde, Naomi Saphra (2022). One Venue, Two Conferences: The Separation of Chinese and American Citation Networks. NeurIPS Workshop on Cultures of AI and AI for Culture.

Cite URL

Josef Valvoda, Naomi Saphra, Jonathan Rawski, Ryan Cotterell, Adina Williams (2022). Learning Transductions to Test Systematic Compositionality. International Conference on Computational Linguistics (COLING).

PDF Cite

Jennifer C. White, Tiago Pimentel, Naomi Saphra, Adina Williams, Ryan Cotterell (2021). A Non-Linear Structural Probe. North American Association for Computational Linguistics (NAACL).

Cite URL

Mohammad Tahaei, Kami Vaniea, Naomi Saphra (2020). Understanding Privacy-Related Questions on Stack Overflow. Conference on Human Factors in Computing Systems (CHI).

Cite URL

Tiago Pimentel, Naomi Saphra, Adina Williams, Ryan Cotterell (2020). Pareto Probing: Trading Off Accuracy for Complexity. Empirical Methods in Natural Language Processing (EMNLP).

Cite URL