Naomi Saphra

Research Fellow

About Me

I am a research fellow at the Kempner Institute at Harvard University and incoming Assistant Professor in Boston University’s faculty of Computing & Data Science. I am interested in NLP training dynamics: how models learn to encode linguistic patterns or other structure and how we can encode useful inductive biases into the training process. Recently, I have begun collaborating with natural and social scientists to use interpretability to understand the world around us. I have become particularly interested in fish. Previously, I earned a PhD from the University of Edinburgh on Training Dynamics of Neural Language Models; worked at NYU, Google and Facebook; and attended Johns Hopkins and Carnegie Mellon University. Outside of research, I play roller derby under the name Gaussian Retribution, perform standup comedy, and shepherd disabled programmers into the world of code dictation.

I am recruiting PhD students to begin in 2026 at Boston University.

Download CV

Interests

Language modeling
Interpretability
Training Dynamics
Generalization
AI for Scientific Understanding

Education

PhD in Informatics
University of Edinburgh
MEng in Computer Science
Johns Hopkins University
BSc Artificial Intelligence
Carnegie Mellon University

My Research

My core agenda focuses on a single goal: to completely and comprehensively understand language model training. This objective combines linguistics, optimization, learning dynamics, science of deep learning, interpretability, and behavioral analysis. Recently, I have begun using similar approaches to study scientific discovery models and enhance broader scientific understanding.

My current publication list is available on my Google Scholar.

The AI Researcher's Guide to a Non-Boring Bluesky Feed

Apr 24, 2025

How to migrate to bsky without a boring feed.

Apr 24, 2025

The Parable of the Prinia's Egg: An Allegory for AI Science

Sep 17, 2023

I discuss what counts as strong evidence for an explanation of model behavior.

Sep 17, 2023

Interpretability Creationism

Jun 7, 2022

Nothing in Deep Learning Makes Sense Except in the Light of SGD.

Jun 7, 2022

Against Monodomainism

Apr 28, 2021

A petty rant on the exceptional treatment of computer vision applications, directed at the machine learning community.

Apr 28, 2021

See all

Featured Publications

Distributional Scaling Laws for Emergent Capabilities

Large Language Models

Michael Y. Hu, Shreyans Jain, Sangam Chaulagain, Naomi Saphra (2025). How to visualize training dynamics in neural networks. Blog Post Track at International Conference on Learning Representations (ICLR BlogPosts).

Cite

Oskar Van Der Wal, Pietro Lesci, Max Müller-Eberstein, Naomi Saphra, Hailey Schoelkopf, Willem Zuidema, Stella Biderman (2025). PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs. International Conference on Learning Representations (ICLR).

PDF Cite

USVSN Sai Prashanth, Alvin Deng, Kyle O'Brien, Jyothir S V, Mohammad Aflah Khan, Jaydeep Borkar, Christopher A. Choquette-Choo, Jacob Ray Fuehne, Stella Biderman, Tracy Ke, Katherine Lee, Naomi Saphra (2025). Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon. International Conference on Learning Representations (ICLR).

Cite URL

Rosie Zhao, Tian Qin, David Alvarez-Melis, Sham Kakade, Naomi Saphra (2025). Distributional Scaling Laws for Emergent Capabilities.

Cite URL

Tian Qin, Naomi Saphra, David Alvarez-Melis (2025). Sometimes I am a Tree: Data drives fragile hierarchical generalization.

Cite URL

See all publications