Research Output

Publications

A selection of work on robustness, interpretability, detection, unlearning and reliable foundation models. Full lists on Google Scholar and DBLP.

20264 papers
Decomposition-Enhanced Training for Post-Hoc Attributions in Language Models
S. Balasubramanian, S. Basu, K. Goswami, R. Rossi, V. Manjunatha, R. Santhosh, R. Zhang, S. Feizi, N. Lipka
EACL Main
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
A. Zarei, S. Basu, M. Pournemat, S. Nag, R. A. Rossi, S. Feizi
CVPR
Revisiting the Past: Data Unlearning with Model State History
K. Rezaei, M. Saberi, A. Ravichander, S. Feizi
ICLR
GHOST: Hallucination-Inducing Image Generation for Multimodal LLMs
A. Y. Parast, P. Hosseini, H. Asadollahzadeh, A. S. Moakhar, B. Azam, S. Feizi, N. Akhtar
ICLR
202517 papers
RESTOR: Knowledge Recovery in Machine Unlearning
K. Rezaei, K. Chandu, S. Feizi, Y. Choi, F. Brahman, A. Ravichander
TMLR
Can AI-Generated Text be Reliably Detected? Stress Testing AI Text Detectors Under Various Attacks
V. S. Sadasivan, A. Kumar, S. Balasubramanian, W. Wang, S. Feizi
Localizing Knowledge in Diffusion Transformers
A. Zarei, S. Basu, K. Rezaei, Z. Lin, S. Nag, S. Feizi
NeurIPS
RePanda: pandas-powered tabular verification and reasoning
A. Chegini, K. Rezaei, H. Eghbalzadeh, S. Feizi
ACL Main
Tool Preferences in Agentic LLMs are Unreliable
K. Faghih, W. Wang, Y. Cheng, S. Bharti, G. Sriramanan, S. Balasubramanian, P. Hosseini, S. Feizi
EMNLP
On Mechanistic Circuits for Extractive Question-Answering
S. Basu, C. Zhao, J. Wang, R. Rossi, V. Morariu, S. Feizi, V. Manjunatha
COLM
Almost AI, Almost Human: The Challenge of Detecting AI-polished Writing
S. Saha, S. Feizi
ACL
A Closer Look at Bias and Chain-of-Thought Faithfulness of Large (Vision) Language Models
S. Balasubramanian, S. Basu, S. Feizi
EMNLP
Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text
Y. Cheng, V. S. Sadasivan, M. Saberi, S. Saha, S. Feizi
NeurIPS
DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors
Y. Cheng, W. Wang, M. Moayeri, S. Feizi
EMNLP
Unearthing Skill-level Insights for Understanding Trade-offs of Foundation Models
M. Moayeri, V. Balachandran, V. Chandrasekaran, S. Yousefi, T. Fel, S. Feizi, B. Nushi, N. Joshi, V. Vineet
ICLR
AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models
A. Zarei, J. Pan, M. Gwilliam, S. Feizi, Z. Yang
Preprint
Reasoning Under Uncertainty: Exploring Probabilistic Reasoning Capabilities of LLMs
M. Pournemat, K. Rezaei, G. Sriramanan, A. Zarei, J. Fu, Y. Wang, H. Eghbalzadeh, S. Feizi
Preprint
SpurLens: Automatic Detection of Spurious Cues in Multimodal LLMs
P. Hosseini, S. Nawathe, M. Moayeri, S. Balasubramanian, S. Feizi
Preprint
IConMark: Robust Interpretable Concept-Based Watermark for AI Images
V. S. Sadasivan, M. Saberi, S. Feizi
ICLR Workshop on GenAI Watermarking
Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness against Reference Corruption
W. Wang, P. Hosseini, S. Feizi
Preprint
How Learnable Grids Recover Fine Detail in Low Dimensions: An NTK Analysis of Multigrid Parametric Encodings
S. Audia, S. Feizi, M. Zwicker, D. Manocha
Preprint
202416 papers
Certifying LLM Safety against Adversarial Prompting
A. Kumar, C. Agarwal, S. Srinivas, A. Li, S. Feizi, H. Lakkaraju
LLM-Check: Investigating Detection of Hallucinations in Large Language Models
G. Sriramanan, S. Bharti, V. S. Sadasivan, S. Saha, P. Kattakinda, S. Feizi
NeurIPS
PRIME: Prioritizing Interpretability in Failure Mode Extraction
K. Rezaei, M. Saberi, M. Moayeri, S. Feizi
ICLR
Robustness of AI-Image Detectors: Fundamental Limits and Practical Attacks
M. Saberi, V. S. Sadasivan, K. Rezaei, A. Kumar, A. Chegini, W. Wang, S. Feizi
Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP
S. Balasubramanian, S. Basu, S. Feizi
NeurIPS
On Mechanistic Knowledge Localization in Text-to-Image Generative Models
S. Basu, K. Rezaei, P. Kattakinda, V. Morariu, N. Zhao, R. A. Rossi, V. Manjunatha, S. Feizi
ICML
Understanding Information Storage and Transfer in Multi-Modal Large Language Models
S. Basu, M. Grayson, C. Morrison, B. Nushi, S. Feizi, D. Massiceti
NeurIPS
Localizing and Editing Knowledge in Text-to-Image Generative Models
S. Basu, N. Zhao, V. Morariu, S. Feizi, V. Manjunatha
Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP
S. Basu, S. Xu Hu, M. Sanjabi, D. Massiceti, S. Feizi
DRSM: De-Randomized Smoothing on Malware Classifier Providing Certified Robustness
S. Saha, W. Wang, Y. Kaya, S. Feizi, T. Dumitras
ICLR
Fast Adversarial Attacks on Language Models In One GPU Minute
V. S. Sadasivan, S. Saha, G. Sriramanan, P. Kattakinda, A. Chegini, S. Feizi
ICML
WorldBench: Quantifying Geographic Disparities in LLM Factual Recall
M. Moayeri, E. Tabassi, S. Feizi
FAccT
Efficient Attention using Low-Dimensional Keys (Loki)
P. Singhania, S. Singh, S. He, S. Feizi, A. Bhatele
NeurIPS
Strong Baselines for Parameter-Efficient Few-Shot Learning
S. Basu, S. Xu Hu, D. Massiceti, S. Feizi
AAAI
Data-Centric Debugging: Mitigating Model Failures via Targeted Image Retrieval
S. Singla, A. M. Chegini, M. Moayeri, S. Feizi
WACV
Rethinking Artistic Copyright Infringements in the Era of Text-to-Image Generative Models
M. Moayeri, S. Basu, S. Balasubramanian, P. Kattakinda, A. Chegini, R. Brauneis, S. Feizi
Preprint
202316 papers
Identifying and Mitigating the Security Risks of Generative AI
C. Barrett, B. Boyd, S. Feizi, and others
Foundations and Trends in Privacy and Security
Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses
C. P. Lau, J. Liu, H. Souri, W. Lin, S. Feizi, R. Chellappa
IEEE TPAMI
Goal-Conditioned Q-Learning as Knowledge Distillation
A. Levine, S. Feizi
Provable Robustness against Wasserstein Distribution Shifts via Input Randomization
A. Kumar, A. Levine, S. Feizi
Run-Off Election: Improved Provable Defense against Data Poisoning Attacks
K. Rezaei, K. Banihashem, A. Chegini, S. Feizi
Text-To-Concept (and Back) via Cross-Model Alignment
M. Moayeri, K. Rezaei, M. Sanjabi, S. Feizi
Spuriosity Rankings: Sorting Data to Measure and Mitigate Biases
M. Moayeri, W. Wang, S. Singla, S. Feizi
NeurIPS
Identifying Interpretable Subspaces in Image Representations
N. Kalibhat, S. Bhardwaj, C. Bruss, H. Firooz, M. Sanjabi, S. Feizi
Adapting Self-Supervised Representations to Multi-Domain Setups
N. Kalibhat, S. Sharpe, J. Goodsitt, C. Bruss, S. Feizi
BMVC
Exploring Geometry of Blind Spots in Vision Models
S. Balasubramanian, G. Sriramanan, V. S. Sadasivan, S. Feizi
NeurIPS
Towards Improved Input Masking for Convolutional Neural Networks
S. Balasubramanian, S. Feizi
Hard-Meta-Dataset++: Towards Understanding Few-Shot Performance on Difficult Tasks
S. Basu, M. Stanley, J. Bronskill, S. Feizi, D. Massiceti
Sample Efficient Detection and Classification of Adversarial Attacks via Self-Supervised Embeddings
S. Balasubramanian, S. Feizi
ICCV
Diffused Redundancy in Pre-trained Representations
V. Nanda, T. Speicher, J. Dickerson, K. P. Gummadi, S. Feizi, A. Weller
NeurIPS
CUDA: Convolution-based Unlearnable Datasets
V. S. Sadasivan, M. Soltanolkotabi, S. Feizi
Certifiably Robust Policy Learning against Adversarial Multi-Agent Communication
Y. Sun, R. Zheng, P. Hassanzadeh, Y. Liang, S. Feizi, S. Ganesh, F. Huang
202214 papers
Mutual Adversarial Training: Learning together is better than going alone
J. Liu, C. P. Lau, H. Souri, S. Feizi, R. Chellappa
IEEE TIFS
Provable Adversarial Robustness for Fractional Lp Threat Models
A. Levine, S. Feizi
AISTATS
Policy Smoothing for Provably Robust Reinforcement Learning
A. Kumar, A. Levine, S. Feizi
Toward Efficient Robust Training against Union of Lp Threat Models
G. Sriramanan, M. Gor, S. Feizi
NeurIPS
Segment and Complete: Defending Object Detectors against Adversarial Patch Attacks
J. Liu, A. Levine, C. P. Lau, R. Chellappa, S. Feizi
Explicit Tradeoffs between Adversarial and Natural Distributional Robustness
M. Moayeri, K. Banihashem, S. Feizi
NeurIPS
Hard ImageNet: Segmentations for Objects with Strong Spurious Cues
M. Moayeri, S. Singla, S. Feizi
NeurIPS Datasets & Benchmarks
A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds and Visual Attributes
M. Moayeri, Y. Balaji, P. Pope, S. Feizi
CVPR★ Oral — top 3%
FOCUS: Familiar Objects in Common and Uncommon Settings
P. Kattakinda, S. Feizi
Improved deterministic L2 robustness on CIFAR-10 and CIFAR-100
S. Singla, S. Feizi
ICLR★ Oral — top 3%
Improved Techniques for Deterministic L2 Robustness
S. Singla, S. Feizi
NeurIPS
Salient ImageNet: How to Discover Spurious Features in Deep Learning?
S. Singla, S. Feizi
Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation
W. Wang, A. Levine, S. Feizi
Lethal Dose Conjecture on Data Poisoning
W. Wang, A. Levine, S. Feizi
NeurIPS
Selected Earlier Work17 papers
Improved, Deterministic Smoothing for L1 Certified Robustness
A. Levine, S. Feizi
ICML 2021★ Long talk — top 3%
Deep Partition Aggregation: Provable Defense against General Poisoning Attacks
A. Levine, S. Feizi
ICLR 2021★ Best Paper, KDD AdvML Workshop
Perceptual Adversarial Robustness: Defense Against Unseen Threat Models
C. Laidlaw, S. Singla, S. Feizi
ICLR 2021
Influence Functions in Deep Learning Are Fragile
S. Basu, P. Pope, S. Feizi
ICLR 2021
Improving Deep Learning Interpretability by Saliency Guided Training
A. Ismail, H. Bravo, S. Feizi
NeurIPS 2021
GANs with Conditional Independence Graphs: On Subadditivity of Probability Divergences
M. Ding, C. Daskalakis, S. Feizi
AISTATS 2021★ Oral — top 3%
Network Functional Compression (Book Chapter)
S. Feizi, M. Medard
Information-Theoretic Methods in Data Science, Cambridge Univ. Press 2021
Robustness Certificates for Sparse Adversarial Attacks by Randomized Ablation
A. Levine, S. Feizi
AAAI 2020★ Oral — top 5%
(De)Randomized Smoothing for Certifiable Defense against Patch Attacks
A. Levine, S. Feizi
NeurIPS 2020
Curse of Dimensionality on Randomized Smoothing for Certifiable Robustness
A. Kumar, A. Levine, T. Goldstein, S. Feizi
ICML 2020
Understanding GANs in the LQG Setting: Formulation, Generalization and Stability
S. Feizi, F. Farnia, T. Ginart, D. Tse
IEEE J-SAIT 2020
Functional Adversarial Attacks
C. Laidlaw, S. Feizi
NeurIPS 2019
Are adversarial examples inevitable?
A. Shafahi, W. Huang, C. Studer, S. Feizi, T. Goldstein
ICLR 2019
Spectral Alignment of Graphs
S. Feizi, M. Mendoza, G. Quon, M. Medard, M. Kellis, A. Jadbabaie
IEEE TNSE 2018
Network Maximal Correlation
S. Feizi*, A. Makhdoumi*, K. Duffy, M. Kellis, M. Medard
IEEE TNSE 2017★ Best Paper (3-year period)
Network Functional Compression
S. Feizi, M. Medard
IEEE Trans. Information Theory, 2014
Network Deconvolution as a General Method to Distinguish Direct Dependencies in Networks
S. Feizi, D. Marbach, M. Medard, M. Kellis
Nature Biotechnology, 2013