Kunal Mukherjee, Ph.D

I am a Postdoctoral Research Associate in the Department of Computer Science at Virginia Tech (Blacksburg, VA), working closely with Dr. Murat Kantarcıoğlu. I received my M.S. and Ph.D. in Computer Science from The University of Texas at Dallas.

🔍 My current research focus is on adversarial robustness of graph learning and the governed, forensic use of LLMs in system provenance and security operations. I develop agentic, retrieval-augmented pipelines that transform threat reports into actionable signals, and I study how to make GNNs and LLMs trustworthy under real-world attacks and constraints.

  • Agentic RAG for Threat Intel: Designed a retrieval-augmented pipeline where an LLM-based agent parses threat reports and extracts indicators of compromise (IOCs) and TTPs for downstream detection and hunt workflows.
  • Adversarial Attacks on GNNs: Building realistic attack generators and evaluation suites for domains beyond system logs, including blockchain and social networks, to stress-test link/node classification and detection tasks.
  • LLM-Guided Forensics & Governance: Developing methods for LLM-assisted provenance forensics (triage, explanation, evidence tracing) alongside policy and guardrails for reliable, auditable agent behavior in security settings.
  • Provenance-Centered Detection: Advancing PIDS pipelines that fuse graph-structured system activity with LLM reasoning and GNN inference for robust anomaly detection and interpretable investigations.

Prior work (Ph.D., thesis): I expanded the scope of provenance-based intrusion detection systems (PIDS) to IoT settings, validated their robustness via an adversarial attack generation framework, and improved explainability for GNN-based PIDS. I also collaborated with industry partners (e.g., A*STAR Institute for Infocomm Research, Acronis, Inc., and Guardora) and applied GNNs to increase relevance and diversity in recommendation systems, including building an explanation framework for link-prediction during my time at Zillow Group.

He is currently in the academic job market for Fall '26 or applied scientist researcher positions. Please contact him (kunmukh at gmail.com) if you would like to discuss potential collaborations. Please find his research statement, teaching statement, and community statement.

Recent News

Security Research

  • Degree: Doctorate
  • City: Blacksburg, Virginia
  • Phone: +1 812-550-3890

Research Interests

  • Anomaly Detection and Malware Classification
  • Explainable ML
  • Adversarial ML
  • Differential Privacy
  • Security and Goverannce in LLMs
  • Recommendation Systems
  • Graph Learning: Data Mining and Pattern Recognition
  • System/IoT/Network Security

Projects

Realizable Adversarial System Action Generation Framework

Automated, data-driven framework that generates adversarial real-world attacks capable of evading ML-based IDS, [USENIX '23].

LLM-driven Forencis and Intrusion Detection Agents

Built an LLM-based agent that mines and interprets threat reports, then queries provenance data sources to investigate reported attacks, [ arXiv '25, Demo].

Adversarial Robustness of Graph Neural Networks

Designing realistic adversarial attacks on GNNs across domains (e.g., blockchain, social media, and system provenance) to evaluate and strengthen robustness of graph learning models.

Explainability Framework for GNN-based Intrusion Detectors

Ground truth–aware explanation framework for enhancing the interpretability of GNN-based IDS, [KDD '25, arXiv '23].

Privacy-Preserving Federated Intrusion Detection for IoT

Federated IDS framework customized for IoT-specific constraints, integrating differential privacy to detect evasive attacks effectively, [ACNS '24].

Differential Privacy for System Provenance Graphs

Developed a differential privacy framework for heterogeneous provenance graphs, balancing privacy guarantees with detection accuracy, [ACNS '25].

"

Dissertation

IoT Integration, Adversarial Attacks, and Threat Explanations in Provenance-Based Intrusion Detection Systems

Kunal Mukherjee.

UTD Press. May, 2025.

Publications

Evading Provenance-Based ML Detectors with Adversarial System Actions

Kunal Mukherjee, Joshua Wiedemeier, Tianhao Wang, James Wei, Feng Chen, Muhyun Kim, Murat Kantarcioglu, and Kangkook Jee.

In Proceedings of Usenix Security. Aug, 2023.

Artifacts evaluated and badges awarded: Available, Functional, Reproducible.

ProvIoT: Detecting Stealthy Attacks in IoT through Federated Edge-Cloud Security

Kunal Mukherjee, Joshua Wiedemeier, Qi Wang, Junpei Kamimura, John Junghwan Rhee, James Wei, Zhichun Li, Xiao Yu, Lu-An Tang, Jiaping Gui, Kangkook Jee.

In Proceedings of 22nd International Conference on Applied Cryptography and Network Security. March, 2024.

ProvDP: Differential Privacy for Provenance Dataset

Kunal Mukherjee, Jonathan Yu, Partha De, Dinil Mon Divakaran

In Proceedings of 23nd International Conference on Applied Cryptography and Network Security. June, 2025.

Z-REx: GNN-based Recommendation Explanation using Human-interpretable Language

Kunal Mukherjee, Zachary Harrison, Saeid Balaneshin

(Oral) KDD Workshop on ML on Graphs in the Era of Generative AI (MLoG-GenAI@KDD). August, 2025.

Robust Explanation of GNN-based IDS for System Provenance with Graph Structural Features

Kunal Mukherjee, Joshua Wiedemeier, Tianhao Wang, Muhyun Kim, Feng Chen, Murat Kantarcioglu, Kangkook Jee.

arXiv. Jun, 2023. (submitted to NDSS 2026).

ProvSEEK: LLM-Powered Threat Intelligence Extraction and Correlation Framework

Kunal Mukherjee, Murat Kantarcioglu.

arXiv. Sept, 2025. (submitted to USENIX 2026).

ProvCreator: Synthesizing Complex Heterogenous Graphs with Node and Edge Attributes

Tianhao Wang, Simon Klancher, Kunal Mukherjee, Joshua Wiedemeier, Feng Chen, Murat Kantarcioglu, Kangkook Jee.

arXiv. Jul, 2025. (submitted to NeurIPS 2025).

Resume

Summary

Kunal Mukherjee

Postdoctoral Research Associate at Virginia Tech working with Dr. Murat Kantarcıoğlu on adversarial robustness of Graph Neural Networks (GNNs) and agentic AI for system provenance forensics. His research bridges graph learning and large language models (LLMs) to develop scalable, interpretable, and privacy-preserving solutions for cybersecurity. He designs frameworks that generate realistic adversarial attacks on GNNs, build RAG-based agent pipelines for automated threat intelligence extraction, and explore LLM-guided forensic analysis and governance for trustworthy adoption of AI in security operations.

His broader expertise spans Adversarial ML, Explainable ML, Anomaly Detection, and Privacy-preserving Generative AI, with a strong record of publishing at top-tier venues (e.g., USENIX Security, ACNS, KDD). He has also collaborated with industry leaders (e.g., Zillow Group) to apply GNNs to recommendation systems, resulting in impactful research and a patent filing.

Education

Doctorate and M.S, Computer Science

Aug 2019 - May 2025

University of Texas at Dallas, Dallas, TX

Bachelor of Science, Computer Engineering

Aug 2016 - Jun 2019

University of Evansville, Evansville, IN

  • Senior Thesis: Location Dependent Cryptosystem
  • Advisor: Late Dr. Dick Blandford and Dr. Donald Roberts
  • Minors: Computer Science and Engineering Management

Professional Experience

Postdoctoral Research Associate

08/2025 - Present

Department of Computer Science, Virginia Tech, Blacksburg, VA

  • Conducting research under Dr. Murat Kantarcıoğlu on adversarial robustness of GNNs and LLM adoption in system provenance.
  • Implementing realistic adversarial attacks on GNNs across domains such as blockchain and social networks to evaluate detection robustness.
  • Exploring LLM-guided forensics and AI governance for reliable adoption of agentic AI in cybersecurity investigations.

Applied Scientist Intern

05/2024 – 12/2024

Zillow Group, Inc., Dallas, TX

  • Designed a GNN-based recommendation system, yielding a 40x increase in nDCG and a 60x boost in diversity.
  • Engineered a novel explainability framework for recommendations to improve transparency and accountability.
  • Work resulted in an oral research paper at KDD '25 (MLoG-GenAI workshop) and a patent application.

Hardware Research Intern

05/2017 - 07/2019

Ciholas, Inc., Evansville, IN

  • Designed a proprietary quaternion-based sensor fusion model to accurately extrapolate device orientation.
  • Deployed in production after extensive regression testing, generating $5M in revenue.