Publications

165 results for Explainable AI

3rd TrustAI Workshop: Building Public Awareness and Engagement
- - Miriam Rateike
  - Brian Mboya
  - et al.
- 2025
- DLI 2025
Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models
- - George Kour
  - Itay Nakash
  - et al.
- 2025
- ACL 2025
Learn more about our Explainable AI work
NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning
- - Zheyuan Zhang
  - Yiyang Li
  - et al.
- 2025
- ACL 2025
Multi-Level Explanations for Generative Language Models
- - Lucas Monteiro Paes
  - Dennis Wei
  - et al.
- 2025
- ACL 2025
BI-Bench : A Comprehensive Benchmark Dataset and Unsupervised Evaluation for BI Systems
- - Ankush Gupta
  - Aniya Aggarwal
  - et al.
- 2025
- ACL 2025
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents
- - Ivoline Ngong
  - Swanand Ravindra Kadhe
  - et al.
- 2025
- ACL 2025
ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features
- - Alec Helbling
  - Tuna Meral
  - et al.
- 2025
- ICML 2025
Avoiding Leakage Poisoning: Concept Interventions Under Distribution Shifts
- - Mateo Espinosa Zarlenga
  - Gabriele Dominici
  - et al.
- 2025
- ICML 2025
Learning interpretable positional encodings in transformers depends on initialization
- - Taku Ito
  - Luca Cocchi
  - et al.
- 2025
- ICML 2025
Predicting Glucose Levels in Diabetic Kidney Transplant Recipients Using Multivariate Temporal Modeling Across Clinically Defined Subcohorts
- - Carla Agurto Rios
  - Eduardo Castro
  - et al.
- 2025
- ICDH 2025