Distributional Scaling Laws for Emergent CapabilitiesMar 1, 2025·Rosie Zhao,Tian Qin,David Alvarez-Melis,Sham KakadeNaomi Saphra· 0 min read Cite URLTypePreprintLast updated on Mar 1, 2025Large Language Models Science of Deep Learning Compositionality AuthorsNaomi SaphraResearch Fellow ← Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon May 1, 2025Sometimes I am a Tree: Data drives fragile hierarchical generalization Feb 1, 2025 →