Ahmad Humayun

prof_pic.jpg

4th Floor Gilbert Place

220 Gilbert St

Blacksburg, VA 24060

ahmad35@vt.edu
LinkedIn
GitHub
Google Scholar
CV

Hi, I’m Ahmad Humayun, a PhD candidate in Computer Science at Virginia Tech, advised by Prof. Muhammad Ali Gulzar on automated software testing and security of distributed data-intensive scalable computing (DISC) programs. I also collaborate closely with Prof. Miryung Kim @ UCLA.

My research focuses on developing novel methods to improve testing for big data analytics applications, targeting DISC frameworks like Apache Spark a Apache Flink. I’ve published my work at top-tier venues including ESEC/FSE and IEEE/ACM ASE. My tools have discovered multiple previously unknown bugs in Apache Spark and Apache Flink.

I recently completed an internship as an Applied Scientist at Amazon Web Services (Summer 2025), where I developed an LLM-powered application to automate the modeling of complex distributed algorithms in low-resource programming languages, deployed both as a standalone application and an MCP server. Previously, as an Applied Scientist intern at AWS (Summer 2024), I enhanced the automated testing infrastructure of critical AWS Services.

news

Oct 11, 2025 πŸŽ‰ We just submitted exciting work on DAG-based fuzzing for Dataflow Frameworks to OOPSLA β€˜26! Our work has exposed optimizer issues in Apache Spark and Apache Flink!
Aug 15, 2025 πŸš€ I was offered to return for another Applied Science internship at AWS!
Nov 07, 2024 πŸŽ–οΈ Honored to serve on the Program Committee for TaPP 2024 - Workshop on the Theory and Practice of Provenance!
Aug 15, 2024 πŸš€ Excited to start my internship at AWS as an Applied Scientist working with Ankush Desai and Aman Goel!
Apr 15, 2024 πŸŽ‰ Our paper on natural symbolic execution-based testing for big data analytics has been accepted at ESEC/FSE 2024!
Sep 13, 2023 🎀 I presented our work on Natural Input Generation for Big Data Analytics at ASE 2023 in Luxembourg!
Sep 11, 2023 🎀 I presented Co-dependence Aware Fuzzing for Dataflow-based Big Data Analytics at ESEC/FSE 2023 in San Francisco you can find my talk here!
Aug 21, 2023 πŸ† I was awarded a SIGSOFT grant to present my work at the 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023) in Luxembourg.
Jul 17, 2023 πŸŽ‰ Our paper on natural input generation for data intensive applications has been accepted at ASE β€˜23!
May 04, 2023 πŸŽ‰ Our paper on co-dependence aware fuzzing for dataflow-based big data analytics has been accepted at ESEC/FSE 2023!

selected publications

  1. ESEC/FSE 2024
    Natural Symbolic Execution-Based Testing for Big Data Analytics
    Yaoxuan Wu, Ahmad Humayun, Muhammad Ali Gulzar, and 1 more author
    Proc. ACM Softw. Eng., Jul 2024
  2. ESEC/FSE 2023
    Co-dependence Aware Fuzzing for Dataflow-Based Big Data Analytics
    Ahmad Humayun, Miryung Kim, and Muhammad Ali Gulzar
    In Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, San Francisco, CA, USA, Jul 2023
  3. ASE 2023
    NaturalFuzz: Natural Input Generation for Big Data Analytics
    Ahmad Humayun, Yaoxuan Wu, Miryung Kim, and 1 more author
    In 2023 38th IEEE/ACM International Conference on Automated Software Engineering (ASE), Jul 2023