Publications

(2023). Shapley Interactions for Complex Feature Attribution. NeurIPS Workshop on Attribution at Scale.
(2023). Linear Connectivity Reveals Generalization Strategies. International Conference on Learning Representations (ICLR).
(2023). Interpretability Creationism. The Gradient.
(2023). Delays, Detours, and Forks in the Road: Latent State Models of Training Dynamics. Transactions of Machine Learning Research (TMLR).
(2022). The MultiBERTs: BERT Reproductions for Robustness Analysis. International Conference on Learning Representations (ICLR).
(2022). One Venue, Two Conferences: The Separation of Chinese and American Citation Networks. NeurIPS Workshop on Cultures of AI and AI for Culture.
(2022). Learning Transductions to Test Systematic Compositionality. International Conference on Computational Linguistics (COLING).
(2021). A Non-Linear Structural Probe. North American Association for Computational Linguistics (NAACL).
(2020). Understanding Privacy-Related Questions on Stack Overflow. Conference on Human Factors in Computing Systems (CHI).
(2020). Pareto Probing: Trading Off Accuracy for Complexity. Empirical Methods in Natural Language Processing (EMNLP).