Publications

117 results for Speech

Spoken question answering for visual queries
- - Nimrod Shabtay
  - Zvi Kons
  - et al.
- 2025
- INTERSPEECH 2025
Improving End-to-end Mixed-case ASR with Knowledge Distillation and Integration of Voice Activity Cues
- - Sashi Novitasari
  - Takashi Fukuda
  - et al.
- 2025
- INTERSPEECH 2025
Learn more about our Speech work
Exploring the Limits of Conformer CTC-Encoder for Speech Emotion Recognition using Large Language Models
- - Edmilson Da Silva Morais
  - Hagai Aronowitz
  - et al.
- 2025
- INTERSPEECH 2025
Voice Activity-based Text Segmentation for ASR Text Denormalization
- - Sashi Novitasari
  - Takashi Fukuda
  - et al.
- 2025
- INTERSPEECH 2025
SKIP-SALSA: Skip Synchronous Fusion of ASR LLM Decoders
- - Ashish Mittal
  - Darshan Prabhu
  - et al.
- 2025
- INTERSPEECH 2025
Unraveling Cocaine and Heroin Addiction Patterns: A Robust LLM Approach Using iRISA Elements to Analyze Short and Spontaneous Speech Samples
- - Carla Agurto Rios
  - Bo Wen
  - et al.
- 2025
- ICDH 2025
Voice-based AI Agents: Filling the Economic Gaps in Digital Health Delivery
- - Bo Wen
  - Chen Wang
  - et al.
- 2025
- ICDH 2025
Beyond the Clinic: Leveraging Speech Acoustics and Phonetics for Cognitive Monitoring in PKU
- - Kely Norel
  - Carla Agurto Rios
  - et al.
- 2025
- ICDH 2025
Comprehensive Layer-Wise Analysis of SSL Models for Audio Deepfake Detection
- - Yassine Elkheir
  - Younes Samih
  - et al.
- 2025
- NAACL 2025
LLM based Text Generation for Improved Low-resource Speech Recognition Models
- - Tohru Nagano
  - Gakuto Kurata
  - et al.
- 2025
- ICASSP 2025