news

Feb 18, 2025 Excited to share our work on understanding how silent data corruption errors affect LLM training! We study silent errors ocurring in real-world unhealthy hardware swept out by production fleet management, characterize the magnitude of these errors, and analyze they impact tensors during training and model quality.
Sep 16, 2024 I’ve started as an Student Researcher on the Learning2Perf Team at Google, working on improving code understanding in LLMs and automating code optimization at scale! I’ll be interning remotely from the Cambridge area and around the Google Cambridge office: let me know if you’d ever like to chat!
May 13, 2024 I’ve started as an Applied Scientist intern in the AWS AI Research and Education (AIRE) Lab at Amazon NYC this summer, working on fault resiliency in LLM training! Definitely reach out if you’re in the area and want to chat!
Aug 28, 2023 Started as a PhD student at Harvard, working on ML + systems, large language models, and code generation!