Keywords:
- automatic speech recognition (ASR)
- contextual adaptation
- data selection
- domain classification
- F1 score
- finite-state transducers
- GPU decoding
- multitask learning
- multitask training
- named entity recognition
- pseudo-labelling
- real-time speech recognition
- shallow fusion
- speaker change detection
- speaker turn detection
- speech recognition
- streaming transducer
- Whisper
- XLSR-Transducer
- Zipformer
Publications of Karthik Pandia D S sorted by journal and type
Publications of type Idiap-RR
2024
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, Idiap-RR-10-2024
TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR, Idiap-RR-07-2024
2023
Implementing contextual biasing in GPU decoder for online ASR, Idiap-RR-02-2023
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (2024)
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, ACL, 2024
Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024 (2024)
Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Republic of Korea, pages 12592-12596, IEEE, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (2024)
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 20988–20995, Association for Computational Linguistics (ACL), 2024
Proc. Interspeech 2023 (2023)
Implementing contextual biasing in GPU decoder for online ASR, in: Proc. Interspeech 2023, 2023
Proc. of Interspeech 2014 (2014)
Feature Switching in the i-vector Framework for Speaker Verification, in: Proc. of Interspeech 2014, pages 5, 2014