Improving End-to-end Mixed-case ASR with Knowledge Distillation and Integration of Voice Activity Cues. Sashi Novitasari, Takashi Fukuda, et al. 2025. INTERSPEECH 2025.
Voice Activity-based Text Segmentation for ASR Text Denormalization. Sashi Novitasari, Takashi Fukuda, et al. 2025. INTERSPEECH 2025.
SKIP-SALSA: Skip Synchronous Fusion of ASR LLM Decoders. Ashish Mittal, Darshan Prabhu, et al. 2025. INTERSPEECH 2025.