The Inherent Adversarial Robustness of Analog In-Memory Computing. Corey Liam Lammie, Julian Büchel, et al. Nature Communications, 2025.
Privacy without Noisy Gradients: Slicing Mechanism for Generative Model Training. Kristjan Greenewald, Yuancheng Yu, et al. NeurIPS 2024.
Unified Lookup Tables: Privacy-Preserving Foundation Models. Nikita Janakarajan, Irina Espejo Morales, et al. NeurIPS 2024.
Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI. Ambrish Rawat, Stefan Schoepf, et al. NeurIPS 2024.
Membership Inference Attacks Against Time-Series Models. Noam Koren, Abigail Goldsteen, et al. ACML 2024.
MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks. Giandomenico Cornacchia, Kieran Fraser, et al. AIES 2024.
On Robustness-Accuracy Characterization of Language Models using Synthetic Datasets. Ching-yun Ko, Pin-Yu Chen, et al. COLM 2024.
Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts. Zhi-yi Chin, Chieh-ming Jiang, et al. ICML 2024.
Be Your Own Neighborhood: Detecting Adversarial Examples by the Neighborhood Relations Built on Self-Supervised Learning. Zhiyuan He, Yijun Yang, et al. ICML 2024.
What Would Gauss Say About Representations? Probing Pretrained Image Models using Synthetic Gaussian Benchmarks. Irene Ko, Pin-Yu Chen, et al. ICML 2024.