publications

publications by category in reverse chronological order.

2026

  1. SWE-fficiency: Can Language Models Optimize Real-World Repositories on Real Workloads?
    Jeffrey Jian Ma, Milad Hashemi, Amir Yazdanbakhsh, Kevin Swersky, Ofir Press, Enhui Li, Vijay Janapa Reddi, and Parthasarathy Ranganathan
    In The Forty-Third International Conference on Machine Learning, 2026
  2. QuArch: A Benchmark for Evaluating LLM Reasoning in Computer Architecture
    Shvetank Prakash, Andrew Cheng, Arya Tschand, Mark Mazumder, Varun Gohil, Jeffrey Ma, Jason Yik, Zishen Wan, Jessica Quaye, Elisavet Lydia Alvanaki, Avinash Kumar, Chandrashis Mazumdar, Tuhin Khare, Alexander Ingare, Ikechukwu Uchendu, Radhika Ghosal, Abhishek Tyagi, Chenyu Wang, Andrea Mattia Garavagno, Sarah Gu, Alice Guo, Grace Hur, Luca Carloni, Tushar Krishna, Ankita Nayak, Amir Yazdanbakhsh, and Vijay Janapa Reddi
    2026

2025

  1. QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture
    Shvetank Prakash, Andrew Cheng, Jason Yik, Arya Tschand, Radhika Ghosal, Ikechukwu Uchendu, Jessica Quaye, Jeffrey Ma, Shreyas Grampurohit, Sofia Giannuzzi, Arnav Balyan, Fin Amin, Aadya Pipersenia, Yash Choudhary, Ankita Nayak, Amir Yazdanbakhsh, and Vijay Janapa Reddi
    IEEE Computer Architecture Letters, 2025
  2. Understanding Silent Data Corruption in LLM Training
    Jeffrey Ma, Hengzhi Pei, Leonard Lausen, and George Karypis
    2025
  3. A2Perf: Real-World Autonomous Agents Benchmark
    Ikechukwu Uchendu, Jason Jabbour, Korneel Van Berghe, Joel Runevic, Matthew Stewart, Jeffrey Ma, Srivatsan Krishnan, Izzeddin Gur, Austin Huang, Colton Bishop, Paige Bailey, Wenjie Jiang, Ebrahim M. Songhori, Sergio Guadarrama, Jie Tan, Jordan K. Terry, Aleksandra Faust, and Vijay Janapa Reddi
    2025
  4. When Silicon Fails Silently: Characterizing Hardware-Induced Corruption in LLM Training
    Jeffrey Ma, Hengzhi Pei, Leonard Lausen, and George Karypis
    In 2025 IEEE 31st International Symposium on On-Line Testing and Robust System Design (IOLTS), 2025
  5. When Silicon Fails Silently: Characterizing Hardware-Induced Corruption in LLM Training
    Jeffrey Ma, Hengzhi Pei, Leonard Lausen, and George Karypis
    2025
  6. SwizzlePerf: Hardware-Aware LLMs for GPU Kernel Performance Optimization
    Arya Tschand, Kesavan Ramakrishnan, Muhammad A. Awad, Ryan Swann, Jeffrey Jian Ma, Keith Lowery, and Vijay Janapa Reddi
    In NeurIPS 2025 Workshop on Machine Learning for Systems, 2025

2024

  1. FedStaleWeight: Buffered Asynchronous Federated Learning with Fair Aggregation via Staleness Reweighting
    Jeffrey Ma, Alan Tu, Yiling Chen, and Vijay Janapa Reddi
    2024

2021

  1. Polymatrix Competitive Gradient Descent
    Jeffrey Ma, Alistair Letcher, Florian Schäfer, Yuanyuan Shi, and Anima Anandkumar
    Nov 2021

2020

  1. Diagnostic Image Quality Assessment and Classification in Medical Imaging: Opportunities and Challenges
    Jeffrey Jian Ma, Ukash Nakarmi, Cedric Yue Sik Kin, Christopher Sandino, Joseph Y. Cheng, Ali B. Syed, Peter Wei, John M. Pauly, and Shreyas Vasanawala
    Proceedings of the 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI), May 2020
  2. Analysis of Deep Learning models for Diagnostic Image Quality Assessment in Magnetic Resonance Imaging
    Jeffrey Jian Ma, Ukash Nakarmi, Cedric Yue Sik Kin, Joseph Y. Cheng, Christopher Sandino, Ali B. Syed, Peter Wei, John M. Pauly, and Shreyas Vasanawala
    Proceedings of the 2020 28th International Society for Magnetic Resonance in Medicine (ISMRM) Annual Meeting, Aug 2020
  3. Toward Continuous Social Phenotyping: Analyzing Gaze Patterns in an Emotion Recognition Task for Children With Autism Through Wearable Smart Glasses
    Anish Nag, Nick Haber, Catalin Voss, Serena Tamura, Jena Daniels, Jeffrey Jian Ma, Bryan Chiang, Shasta Ramachandran, Jessey Schwartz, Terry Winograd, Carl Feinstein, and Dennis P Wall
    Journal of Medical Internet Research (JMIR), Apr 2020