3rd TrustAI Workshop: Building Public Awareness and Engagement. Miriam Rateike, Brian Mboya, et al. 2025. DLI 2025.
Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models. George Kour, Itay Nakash, et al. 2025. ACL 2025.
NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning. Zheyuan Zhang, Yiyang Li, et al. 2025. ACL 2025.
Multi-Level Explanations for Generative Language Models. Lucas Monteiro Paes, Dennis Wei, et al. 2025. ACL 2025.
BI-Bench: A Comprehensive Benchmark Dataset and Unsupervised Evaluation for BI Systems. Ankush Gupta, Aniya Aggarwal, et al. 2025. ACL 2025.
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents. Ivoline Ngong, Swanand Ravindra Kadhe, et al. 2025. ACL 2025.
ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features. Alec Helbling, Tuna Meral, et al. 2025. ICML 2025.
Avoiding Leakage Poisoning: Concept Interventions Under Distribution Shifts. Mateo Espinosa Zarlenga, Gabriele Dominici, et al. 2025. ICML 2025.
Learning interpretable positional encodings in transformers depends on initialization. Taku Ito, Luca Cocchi, et al. 2025. ICML 2025.
Predicting Glucose Levels in Diabetic Kidney Transplant Recipients Using Multivariate Temporal Modeling Across Clinically Defined Subcohorts. Carla Agurto Rios, Eduardo Castro, et al. 2025. ICDH 2025.