SKIP-SALSA: Skip Synchronous Fusion of ASR LLM DecodersAshish MittalDarshan Prabhuet al.2025INTERSPEECH 2025
BI-Bench : A Comprehensive Benchmark Dataset and Unsupervised Evaluation for BI SystemsAnkush GuptaAniya Aggarwalet al.2025ACL 2025
Multi-Sense Embeddings for Language Models and Knowledge DistillationQitong WangMohammed Zakiet al.2025ACL 2025
Query-driven Document-level Scientific Evidence Extraction from Biomedical StudiesMassimiliano PronestiJoao Bettencourt-Silvaet al.2025ACL 2025
ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific LanguagesMehant KammakomatiSameer Pimparkhedeet al.2025ACL 2025
MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation SystemsYannis KatsisSara Rosenthalet al.2025ACL 2025
Combining Domain and Alignment Vectors Provides Better Knowledge-Safety Trade-offs in LLMsMegh ThakkarQuentin Fournieret al.2025ACL 2025