Keywords:
- Accent Identification
- ASR
- Automatic Speech Recognition
- Benchmarking
- benchmarks
- bilingual speakers
- BNF
- Code-Switching
- continuous F0 coding
- Deep neural network
- deep neural networks
- dialectal lexicon
- domain adaptation
- duration
- emphasis
- end-to-end
- French accents
- French Regional Accents
- German
- German language
- GMM Modelling
- HMM-based speech synthesis
- HSMM explicit duration modelling
- i-vectors
- intonation
- laboratory phonology
- Language targets
- Lexicon
- LID
- multi-dialect
- multilayer perceptron
- Multilingual
- NLP
- open vocabulary
- open-vocabulary
- parametric speech synthesis
- phone duration modelling
- Phonological speech representation
- probabilistic amplitude demodulation
- spectral amplitude modulation phase hierarchy
- speech corpus
- speech prosody
- speech recognition
- speech synthesis
- speech-to-speech translation
- spiking neural networks
- subword segmentation
- Subword unit
- Support Vector Regression
- SVM
- Swiss German
- Swiss prosody
- Swisscom
- TV Box
- Very low bit rate speech coding
- voice assistant
- Wav2vec
Publications of Alexandros Lazaridis sorted by title
A
An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021
C
Comparative Study on Sentence Boundary Prediction for German and English Broadcast News, Idiap-RR-18-2017
Comparison of Subword Segmentation Methods for Open-vocabulary ASR using a Difficulty Metric
Comparison of Subword Segmentation Methods for Open-Vocabulary End-to-End Speech Recognition, Idiap-RR-34-2020
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, Idiap-RR-11-2016
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, in: IEEE/ACM Trans. on Audio, Speech and Language Processing, 2016
Context-Aware Attention Mechanism for Speech Emotion Recognition, in: IEEE Workshop on Spoken Language Technology, Athens, Greece, pages 126-131, 2018
D
DNN-based Speech Synthesis: Importance of input features and training data, in: International Conference on Speech and Computer, SPECOM, pages 193-200, Springer Berlin Heidelberg, 2015
I
Incremental Syllable-Context Phonetic Vocoding, Idiap-RR-05-2015
Incremental Syllable-Context Phonetic Vocoding, in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(6), 2015
Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages, in: Proceedings of the International Workshop on Spoken Language Translation, Seattle, WA, USA, 2016
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, Idiap-RR-22-2016
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, in: 9th ISCA Speech Synthesis Workshop, 2016
L
Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, in: IEEE Automatic Speech Recognition and Understanding Workshop, Cartagena, Colombia, IEEE, 2021
M
Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, in: Proceedings of Interspeech, 2021
O
On the Vulnerability of Speaker Verification to Realistic Voice Spoofing, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1-8, IEEE, 2015
P
Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, Idiap-RR-12-2016
Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, in: Proceedings of Interspeech, San Francisco, USA, 2016
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, Idiap-RR-04-2014
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis, in: Speech Prosody, 2014
S
Self-attention for Speech Emotion Recognition, in: Proc. Interspeech 2019, 2019
Speech vocoding for laboratory phonology, Idiap-RR-07-2015
Speech vocoding for laboratory phonology, in: Computer Speech and Language, 2016
Spoken Language Identification using Language Bottleneck Features, Idiap-RR-08-2019
Spoken language identification using language bottleneck features, in: Proceedings of TSD, 2019
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, in: Interspeech, 2014
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, Idiap-RR-10-2014
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, Idiap-RR-03-2014
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis, in: Speech Prosody, 2014
Swiss French Regional Accent Identification, in: Odyssey: The Speaker and Language Recognition Workshop, 2014
Syllable-based Regional Swiss French Accent Identification using Prosodic Features, in: Nouveaux cahiers de linguistique française, 2014
T
The SIWIS database: a multilingual speech database with acted emphasis, Idiap-RR-13-2016
The SIWIS database: a multilingual speech database with acted emphasis, in: Proceedings of Interspeech, San Francisco, USA, pages 1532-1535, 2016
The SIWIS French Speech Synthesis Database: Design and recording of a high quality French database for speech synthesis, Idiap-RR-03-2017
Translation and Prosody in Swiss Languages, in: Nouveaux cahiers de linguistique française, 2014