Naomi Saphra
Open Menu
Close Menu
Bio
Posts
Publications
Language Model Training
TRAM: Bridging Trust Regions and Sharpness Aware Minimization
Jan 1, 2024
Fast Forwarding Low-Rank Training
Jan 1, 2024
Dynamic Masking Rate Schedules for MLM Pretraining
Jan 1, 2024