SKIP-SALSA: Skip Synchronous Fusion of ASR LLM Decoders. Ashish Mittal, Darshan Prabhu, et al. INTERSPEECH 2025.
Improving End-to-end Mixed-case ASR with Knowledge Distillation and Integration of Voice Activity Cues. Sashi Novitasari, Takashi Fukuda, et al. INTERSPEECH 2025.
Voice Activity-based Text Segmentation for ASR Text Denormalization. Sashi Novitasari, Takashi Fukuda, et al. INTERSPEECH 2025.
Unraveling Cocaine and Heroin Addiction Patterns: A Robust LLM Approach Using iRISA Elements to Analyze Short and Spontaneous Speech Samples. Carla Agurto Rios, Bo Wen, et al. ICDH 2025.
Voice-based AI Agents: Filling the Economic Gaps in Digital Health Delivery. Bo Wen, Chen Wang, et al. ICDH 2025.
Beyond the Clinic: Leveraging Speech Acoustics and Phonetics for Cognitive Monitoring in PKU. Kely Norel, Carla Agurto Rios, et al. ICDH 2025.
Comprehensive Layer-Wise Analysis of SSL Models for Audio Deepfake Detection. Yassine Elkheir, Younes Samih, et al. NAACL 2025.
LLM based Text Generation for Improved Low-resource Speech Recognition Models. Tohru Nagano, Gakuto Kurata, et al. ICASSP 2025.
Knowledge Distillation Based Training of Unified Conformer CTC Models for Multi-form ASR. Takashi Fukuda, Gakuto Kurata, et al. ICASSP 2025.