Keywords:
- Aho-Corasick algorithm
- Air traffic control
- ASR
- ASR robustness
- Automatic Speech Recognition
- automatic speech recognition (ASR)
- contextual biasing
- Contextualisation and adaptation of ASR
- Criminal investigations
- Data Selection
- DISPLACE-2
- domain adaptation
- Domain Classification
- Dual mode encoder
- ECAPA-TDNN embedding
- embedding
- Foundation Models
- language identification
- LLM
- LLM-based ASR
- local speaker segmentation
- multitask training
- named entity recognition
- network analysis
- prompt projection
- pseudo-labelling
- rare word recognition
- real-time ASR
- ROXANNE
- ROXSD
- shallow fusion
- Speaker change detection
- Speaker Diarization
- Speaker identification
- speech recognition
- Speech-to-LLM alignment
- speech-to-text alignment
- streaming ASR
- streaming transducer
- text denoising
- Text fine-tuning
- TRACY · Law Enforcement Agencies · Suspect Detection· Non-Content Data· Social Influence Analysis· Link Prediction
- TRACY· Non-Content data· Law Enforcement Agencies · Suspect Detection· Mobile Signaling Data· ROXANNE
- transformer transducer
- whisper
- XLSR-Transducer
- Zipformer
Publications of Pradeep Rangappa sorted by first author
B
| CONTEXTUAL BIASING METHODS FOR IMPROVING RARE WORD DETECTION IN AUTOMATIC SPEECH RECOGNITION, , , , , , , and , in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024 |
|
C
| Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering, , , , , , , , , , , , and , in: Interspeech 2025, Rotterdam, The Netherlands, pages 3618--3622, 2025 |
[DOI] [URL] |
I
| Unifying Global and Near-Context Biasing in a Single Trie Pass., , , , , , , , , , , and , in: Text, Speech, and Dialogue. TSD 2025. Lecture Notes in Computer Science, Springer, Springer, 2025 |
[DOI] [URL] |
| Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , Idiap-RR-10-2024 |
|
K
| TokenVerse++: Towards Flexible Multitask Learning with Dynamic Task Activation, , , , , , , , , , and , in: 2025 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), IEEE, 2025 |
|
| Performance Evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward, , , , , , , , , and , in: SALMA Workshop, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025 |
[URL] |
L
| TRACY Canvas: A Criminal Network Visualization Tool, , , , , and , Idiap-RR-03-2025 |
|
M
| Autocrime - open multimodal platform for combating organized crime, , , , , , , , , , , , , , , , , and , in: Forensic Science International: Digital Investigation, 54, 2025 |
[DOI] [URL] |
| ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and , in: Odyssey 2024: The Speaker and Language Recognition Workshop, pages 17-24, 2024 |
[DOI] [URL] |
R
| Detecting Criminal Networks via Non-Content Communication Data Analysis Techniques from the TRACY Project, , , , , , , , , and , in: Digital Forensics and Cyber Crime. ICDF2C 2024, Dubrovnik, Croatia, 2024 |
[DOI] [URL] |
| Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering, , , , , , , , , , , , and , in: Proc. Interspeech, 2025 |
|
| Accelerating Criminal Investigations with TRACY, , , , , , , and , in: 16th EAI International Conference on Digital Forensics & Cyber Crime, 2025 |
|
| Enhancing Speaker Diarization using Correlation-Based Clustering Initialization, , , and , Idiap-RR-09-2025 |
|
| Speech Data Selection for Efficient ASR Fine-Tuning using Domain Classifier and Pseudo-Label Filtering, , , , , , , , , , , and , in: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), 2025 |
[DOI] [URL] |
T
| Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, , , , , , , , and , in: Findings of the Association for Computational Linguistics: EMNLP 2024, pages 16747–16762, Association for Computational Linguistics (ACL), 2024 |
[DOI] [URL] |