Trustworthy and Robust AI Systems

Investigating the reliability, safety, and robustness of AI systems in software engineering and beyond.

Overview

AI systems are increasingly deployed in safety-critical settings, from autonomous vehicles to software development tools. Our research studies how to make these systems more robust, how to detect and repair their failures, and how to understand the security risks introduced by the AI toolchain itself.

This research spans DNN testing, adversarial robustness, autonomous driving simulation, compiler-level attacks on ML models, and causal approaches to system configuration.

Key Directions

DNN Testing & Repair: Systematic methods for finding confusion and bias errors in neural networks, and techniques for repairing them.
Autonomous Driving: Simulation-based testing, traffic generation, and scenario-based evaluation for self-driving systems.
Adversarial Robustness: Metric learning and multitask approaches that strengthen model robustness under adversarial and natural variations.
ML Toolchain Security: Discovering and exploiting vulnerabilities in deep learning compilers and infrastructure.
Causal Configuration: Using causal reasoning to understand and optimize the performance of configurable systems.

Impact

DeepTest was a pioneering effort in automated testing of autonomous driving systems and has been highly cited. Our work on compiler backdoors, accepted at IEEE S&P 2026, revealed a new attack surface in the ML supply chain. The Unicorn and Cameo systems brought causal reasoning to system performance optimization.

Contributors

Baishakhi Ray, Simin Chen, Ira Ceka, Ziyuan Zhong, Yuchi Tian, Rahul Krishna, Saikat Chakraborty

Selected Publications

Trustworthy AI Software Engineers

A Aleti, B Ray, R Hoda, S Chen · Preprint, 2026

Your compiler is backdooring your model: Understanding and exploiting compilation inconsistency vulnerabilities in deep learning compilers

S Chen, J Peng, Y He, J Yang, B Ray · IEEE Symposium on Security and Privacy (S&P) 2026

DyCodeEval: Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination

Towards causal deep learning for vulnerability detection

Language-guided traffic simulation via scene-level diffusion

Guided conditional diffusion for controllable traffic simulation

Cameo: A causal transfer learning approach for performance optimization of configurable computer systems

Neural network guided evolutionary fuzzing for finding traffic violations of autonomous vehicles

Detecting multi-sensor fusion errors in advanced driver-assistance systems

Automatic map generation for autonomous driving system testing

Unicorn: Reasoning about configurable system performance through the lens of causality

Repairing Group-Level Errors for DNNs Using Weighted Regularization