Keywords:
- accent embedding
- Accented speech
- acoustic modeling
- Anti-spoofing
- Articulation
- ASR-free
- audio&text embeddings
- Automatic accent assessment
- Automatic accent evaluation
- Automatic prosodic event detection
- automatic reading tutor
- Binary pattern matching
- Bob toolbox
- child speech recognition
- cognition
- Compressive sampling
- computer aided learning
- connectionist temporal classification
- continuous F0 coding
- Convolutional Neural Networks
- Deep neural network (DNN)
- deep neural networks
- dynamic programming
- Dysarthria
- end-to-end
- Fast $k$NN
- intelligibility
- Kaldi toolkit
- keyword spotting
- KL-divergence
- KL-HMM
- Kullback-Leibler divergence
- laboratory phonology
- lan- guage identification
- lexical model
- Linguistic parsing
- low bit rate speech coding
- Low bit rate speech vocoding
- multi-task
- Multilingual automatic speech recognition
- nasal sounds
- nearest neighbour rule of classification.
- neural computing
- non-modal phonation
- non-native speech
- open science
- open vocabulary
- parametric speech synthesis
- Parametric vocoding
- Parkinson's disease
- Phonation
- Phone attributes
- Phoneme classification
- phonetic representation
- Phonological features
- phonological posteriors
- Phonological speech representation
- phonological vocoding
- phonology
- pitch analysis
- Posterior features
- Posterior representatives
- probabilistic amplitude demodulation
- prosody
- python
- Quantized posterior hashing
- Reproducible research
- software
- speaker verification
- spectral amplitude modulation phase hierarchy
- Speech Analysis
- speech coding
- speech emphasis
- speech perception
- speech processing
- speech production
- speech prosody
- speech recognition
- speech synthesis
- spiking neural networks
- Structured sparse representation
- Structured sparsity
- triphone mapping
- Very low bit rate speech coding
- word emphasis
Publications of Milos Cernak sorted by journal and type
| 1 | 2 |
Publications of type Idiap-RR
2022
End-to-end Accented Speech Recognition, , and , Idiap-RR-04-2022 |
|
2017
Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, , , , , and , Idiap-RR-16-2017 |
|
NASAL SPEECH SOUNDS DETECTION USING CONNECTIONIST TEMPORAL CLASSIFICATION, and , Idiap-RR-28-2017 |
|
Perceptual Information Loss due to Impaired Speech Production, , and , Idiap-RR-20-2017 |
|
2016
An Analysis of Rhythmic Staccato-Vocalization Based on Frequency Demodulation for Laughter Detection in Conversational Meetings, , , and , Idiap-RR-02-2016 |
|
Cognitive speech coding, and , Idiap-RR-27-2016 |
|
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, , , and , Idiap-RR-11-2016 |
|
Information Theoretic Analysis of Production-Perception Efficiency: Case Study of Speech Pathology, , and , Idiap-RR-30-2016 |
|
Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, , , and , Idiap-RR-22-2016 |
|
On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, , and , Idiap-RR-07-2016 |
[URL] |
On the impact of non-modal phonation on phonological features, , , , , , , , , , , , , and , Idiap-RR-28-2016 |
|
Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures, , , and , Idiap-RR-10-2016 |
|
Probabilistic Amplitude Demodulation features in Speech Synthesis for Improving Prosody, , and , Idiap-RR-12-2016 |
|
Sound Pattern Matching for Automatic Prosodic Event Detection, , , , and , Idiap-RR-03-2016 |
|
2015
A simple continuous excitation model for parametric vocoding, , and , Idiap-RR-03-2015 |
|
An Empirical Model of Emphatic Word Detection, and , Idiap-RR-11-2015 |
|
Automatic Accentedness Evaluation of Non-Native Speech Using Phonetic and Sub-Phonetic Posterior Probabilities, , , and , Idiap-RR-12-2015 |
|
HMM-based Non-native Accent Assessment using Posterior Features, , and , Idiap-RR-32-2015 |
|
Incremental Syllable-Context Phonetic Vocoding, , , , and , Idiap-RR-05-2015 |
|
Neuromorphic Based Oscillatory Device for Incremental Syllable Boundary Detection, and , Idiap-RR-14-2015 |
|
Phonological vocoding using artificial neural networks, , and , Idiap-RR-04-2015 |
|
Speech vocoding for laboratory phonology, , and , Idiap-RR-07-2015 |
|
Syllabic Pitch Tuning for Neutral-to-Emotional Voice Conversion, , and , Idiap-RR-31-2015 |
|
2014
Development of Bilingual ASR System for MediaParl Corpus, , , and , Idiap-RR-21-2014 |
|
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding, , , and , Idiap-RR-10-2014 |
|
2013
Automatic Speech Indexing System of Bilingual Video Parliament Interventions, , , , , and , Idiap-RR-25-2013 |
|
ON THE (UN)IMPORTANCE OF THE CONTEXTUAL FACTORS IN HMM-BASED SPEECH SYNTHESIS AND CODING, , and , Idiap-RR-06-2013 |
|
Robust triphone mapping for acoustic modeling, , and , Idiap-RR-02-2013 |
|
Syllable-based Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, , and , Idiap-RR-24-2013 |
|
2012
Baseline System for Automatic Speech Recognition with French GlobalPhone Database, and , Idiap-RR-26-2012 |
|
Progress report of a project in very low bit-rate speech coding, , and , Idiap-RR-08-2012 |
|
Computer Speech and Language
Characterisation of voice quality of Parkinson's disease using differential phonological posterior features, , , , , and , in: Computer Speech and Language, 2017 |
|
Speech vocoding for laboratory phonology, , and , in: Computer Speech and Language, 2016 |
|
Digital Signal Processing
NeuroSpeech: An open-source software for Parkinson's speech analysis, , , , , , , , , , , , , , and , in: Digital Signal Processing, 2017 |
[DOI] |
IEEE Signal Processing Letters
A Simple Continuous Pitch Estimation Algorithm, , and , in: IEEE Signal Processing Letters, 20(1):102--105, 2013 |
[URL] |
IEEE Signal Processing Magazine
Cognitive Speech Coding: Examining the Impact of Cognitive Speech Processing on Speech Compression, , and , in: IEEE Signal Processing Magazine, 35(3):97-109, 2018 |
[DOI] |
IEEE/ACM Trans. on Audio, Speech and Language Processing
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, , , and , in: IEEE/ACM Trans. on Audio, Speech and Language Processing, 2016 |
|
IEEE/ACM Transactions on Audio, Speech, and Language Processing
Perceptual Information Loss due to Impaired Speech Production, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017 |
|
| 1 | 2 |