Publications
2026
Trustworthy AI Software Engineers
Preprint, 2026 PDF
Outrunning LLM Cutoffs: A Live Kernel Crash Resolution Benchmark for All
ICML 2026 PDF
Code Quality Analysis of Translations from C to Rust
Preprint, 2026 PDF
SWE-Spot: Building Small Repo-Experts with Repository-Centric Learning
Preprint, 2026 PDF
Your compiler is backdooring your model: Understanding and exploiting compilation inconsistency vulnerabilities in deep learning compilers
IEEE Symposium on Security and Privacy (S&P) 2026 PDF
★ Distinguished Paper AwardCodeSense: a Real-World Benchmark and Dataset for Code Semantic Reasoning
ICLR 2026 PDF
Understanding APR Agents Through the Lens of Traceability: An Empirical Study
ISSTA 2026 PDF
2025
C2SaferRust: Transforming C Projects into Safer Rust with NeuroSymbolic Techniques
IEEE Transactions on Software Engineering, 2025 PDF
Benchmarking large language models under data contamination: A survey from static to dynamic evaluation
EMNLP 2025 PDF
Mechanics of Learned Reasoning 1: TempoBench, A Benchmark for Interpretable Deconstruction of Reasoning System Performance
Preprint, 2025 PDF
AppForge: From Assistant to Independent Developer--Are GPTs Ready for Software Development?
ICLR 2025 PDF
REFINE: Enhancing Program Repair Agents through Context-Aware Patch Refinement
Preprint, 2025 PDF
Red Teaming Program Repair Agents: When Correct Patches can Hide Vulnerabilities
Preprint, 2025 PDF
FaultLine: Automated proof-of-vulnerability generation using LLM agents
Preprint, 2025 PDF
Code Reasoning for Software Engineering Tasks: A Survey and A Call to Action
Preprint, 2025 PDF
CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation
2025 IEEE/ACM International Workshop on Large Language Models for Code PDF
CrashFixer: A crash resolution agent for the Linux kernel
Preprint, 2025 PDF
EditLord: Learning Code Transformation Rules for Code Editing
ICML 2025 PDF
DyCodeEval: Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination
ICML 2025 PDF
Vulnerability detection with code language models: How far are we?
ICSE 2025 PDF
2024
Semcoder: Training code language models with comprehensive semantics reasoning
Advances in Neural Information Processing Systems 37, 60275-60308 PDF
Can LLM prompting serve as a proxy for static analysis in vulnerability detection
Preprint, 2024 PDF
Kgym: A platform and dataset to benchmark large language models on Linux kernel crash resolution
Advances in Neural Information Processing Systems 37, 78053-78078 PDF
Comment on Revisiting Neural Program Smoothing for Fuzzing
Preprint, 2024 PDF
Yuga: Automatically Detecting Lifetime Annotation Bugs in the Rust Language
IEEE Transactions on Software Engineering 50(10), 2602-2613 PDF
Spectra: Enhancing the code translation ability of language models by generating multi-modal specifications
Preprint, 2024 PDF
Cycle: Learning to self-refine the code generation
Proceedings of the ACM on Programming Languages 8 (OOPSLA1), 392-418 PDF
Automated Code Editing with Search-Generate-Modify
IEEE Transactions on Software Engineering, 2024 PDF
Towards causal deep learning for vulnerability detection
ICSE 2024 PDF
TRACED: Execution-aware Pre-training for Source Code
ICSE 2024 PDF
Beyond accuracy: Evaluating self-consistency of code LLMs with IdentityChain
ICLR 2024 PDF
2023
Language-guided traffic simulation via scene-level diffusion
Conference on Robot Learning, 144-177 PDF
Guided conditional diffusion for controllable traffic simulation
ICRA 2023, 3560-3566 PDF
On ML-based program translation: perils and promises
ICSE 2023 PDF
Summarize and generate to back-translate: Unsupervised translation of programming languages
EACL 2023 PDF
Tracefixer: Execution trace-driven program repair
Preprint, 2023 PDF
Concord: Clone-aware contrastive learning for source code
ISSTA 2023 PDF
★ Distinguished Paper AwardA static evaluation of code completion by large language models
ACL 2023 PDF
Cameo: A causal transfer learning approach for performance optimization of configurable computer systems
ACM Symposium on Cloud Computing 2023, 555-571 PDF
2022
Learning approximate execution semantics from traces for binary function similarity
IEEE Transactions on Software Engineering 49(4), 2776-2790 PDF
Natgen: generative pre-training by 'naturalizing' source code
ESEC/FSE 2022 PDF
NeuDep: neural binary memory dependence analysis
ESEC/FSE 2022 PDF
Multi-lingual evaluation of code generation models
Preprint, 2022 PDF
Cargo: AI-guided dependency analysis for migrating monolithic applications to microservices architecture
ASE 2022 PDF
★ Distinguished Paper AwardNeural network guided evolutionary fuzzing for finding traffic violations of autonomous vehicles
IEEE Transactions on Software Engineering 49(4), 1860-1875 PDF
Detecting multi-sensor fusion errors in advanced driver-assistance systems
ISSTA 2022 PDF
Automatic map generation for autonomous driving system testing
Preprint, 2022 PDF
Unicorn: Reasoning about configurable system performance through the lens of causality
EuroSys 2022, 199-217 PDF
Repairing Group-Level Errors for DNNs Using Weighted Regularization
Preprint, 2022 PDF
VELVET: a noVel Ensemble Learning approach to automatically locate VulnErable sTatements
SANER 2022 PDF
Deep learning based vulnerability detection: Are we there yet?
IEEE Transactions on Software Engineering 48(9), 3280-3296 PDF
★ Best Paper Award Runner-upTowards Learning (Dis)-Similarity of Source Code from Program Contrasts
ACL 2022 PDF
2021
A survey on scenario-based testing for automated driving systems in high-fidelity simulation
Preprint, 2021 PDF
On multi-modal learning of editing source code
ASE 2021 PDF
Retrieval augmented code generation and summarization
Findings of EMNLP 2021, 2719-2734 PDF
Stateformer: Fine-grained type recovery from binaries using generative state modeling
ESEC/FSE 2021 PDF
DIRECT: A Transformer-based Model for Decompiled Identifier Renaming
NLP4Prog 2021 PDF
Understanding local robustness of deep neural networks under natural variations
FASE 2021 PDF
Unified pre-training for program understanding and generation
NAACL 2021 PDF
CADET: Debugging and fixing misconfigurations using counterfactual reasoning
Preprint, 2021 PDF
2020
Patching as translation: the data and the metaphor
ASE 2020 PDF
Repairing confusion and bias errors for DNN-based image classifiers
ESEC/FSE 2020 PDF
MTFuzz: fuzzing with a multi-task neural network
ESEC/FSE 2020 PDF
Codit: Code editing with tree-based neural models
IEEE Transactions on Software Engineering 48(4), 1385-1399 PDF
Multitask learning strengthens adversarial robustness
ECCV 2020, 158-174 PDF
ConEx: Efficient exploration of big-data system configurations for better performance
IEEE Transactions on Software Engineering 48(3), 893-909 PDF
A transformer-based approach for source code summarization
ACL 2020 PDF
Testing DNN image classifiers for confusion & bias errors
ICSE 2020 PDF
Neutaint: Efficient dynamic taint analysis with neural networks
IEEE Symposium on Security and Privacy 2020, 1527-1543 PDF
2019
Metric learning for adversarial robustness
NeurIPS 2019 PDF
Bringing engineering rigor to Deep Learning
ACM SIGOPS Operating Systems Review 53(1), 59-67 PDF
Neuzz: Efficient fuzzing with neural program smoothing
IEEE Symposium on Security and Privacy 2019, 803-817 PDF
Toward optimal selection of information retrieval models for software engineering tasks
SCAM 2019 PDF
2018
Building language models for text with named entities
ACL 2018 PDF
DeepTest: Automated testing of deep-neural-network-driven autonomous cars
ICSE 2018 PDF
Searching for high-performing software configurations with metaheuristic algorithms
ICSE 2018 PDF
Which similarity metric to use for software documents? A study on information retrieval based software engineering tasks
ICSE 2018 PDF
Entropy guided spectrum based bug localization using statistical language model
Preprint, 2018 PDF
A case study on the impact of similarity measure on information retrieval based software engineering tasks
Preprint, 2018 PDF
ARiSE Lab