Keywords:
- ASR-free
- audio&text embeddings
- Automatic accent evaluation
- Automatic Speech Recognition
- BCN corpus
- BERT
- broadcast news
- Decompensation
- deep learning
- domain adaptation
- dynamic programming
- explainability
- German ASR
- ICU
- just-in-time retrieval
- keyword spotting
- KL-divergence
- language modeling
- Large Language Models
- LIWC
- LM compensation
- machine learning
- multimedia IR
- non-native speech
- open vocabulary
- Personality traits inference
- phonetic representation
- Posterior features
- psychological correlates
- punctuation
- Punctuation prediction N-grams Gradient Boosted Machine Audio Features
- recurrent neural network
- sentence boundary prediction
- speech recognition
- speech-based IR
- Vital Signs
Publications of Alexandre Nanchen sorted by first author
I
MediaParl: Bilingual mixed language accented speech database, , , , , and , Idiap-RR-03-2013 |
|
MediaParl: Bilingual mixed language accented speech database, , , , , and , in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263--268, 2012 |
|
Exploiting un-transcribed foreign data for speech recognition in well-resourced languages, , , , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Florence, IT, pages 2322 - 2326, IEEE, 2014 |
[DOI] |
N
Towards interfacing large language models with ASR systems using confidence measures and prompting, , , , and , in: Proceedings of Interspeech, pages 2980-2984, 2024 |
[DOI] |
EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, and , Idiap-RR-01-2019 |
|
EMPIRICAL EVALUATION AND COMBINATION OF PUNCTUATION PREDICTION MODELS APPLIED TO BROADCAST NEWS, and , in: Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019 |
|
Keep Sensors in Check: Disentangling Country-Level Generalization Issues in Mobile Sensor-Based Models with Diversity Scores, , , and , in: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023 |
|
P
The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, , , and , Idiap-RR-26-2010 |
|
The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites, , , and , in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010 |
|
Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives, , , , , and , in: Proceedings of the 33rd Annual ACM SIGIR Conference, Geneva, Switzerland, pages 703, 2010 |
[DOI] |
A Multimedia Retrieval System Using Speech Input, , , , , , , , , , and , in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009 |
|
User Interface Design in a Just-in-time Retrieval System for Meetings, , , , , , and , Idiap-RR-38-2009 |
|
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , Idiap-RR-31-2011 |
|
A Speech-based Just-in-Time Retrieval System using Semantic Search, , , and , in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), Portland, OR, pages 80-86, 2011 |
[URL] |
A Just-in-Time Document Retrieval System for Dialogues or Monologues, , , and , in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, Portland, OR, pages 350-352, 2011 |
|
Language model domain adaptation for automatic speech recognition, , and , Idiap-RR-05-2020 |
|
R
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , Idiap-RR-12-2015 |
|
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , in: Proceedings of Interspeech, 2015 |
|
S
Open-Vocabulary Keyword Spotting With Audio And Text Embeddings, , , and , in: Proceedings of Interspeech 2019, 2019 |
[DOI] |
Automatic Speech Indexing System of Bilingual Video Parliament Interventions, , , , , and , Idiap-RR-25-2013 |
|
W
Comparative Study on Sentence Boundary Prediction for German and English Broadcast News, , , , and , Idiap-RR-18-2017 |
|