Keywords:
- Acoustic reverberation sparsity models
- Ad hoc array calibration
- Ad-hoc microphone array
- Automatic prosodic event detection
- Automatic Speech Recognition
- automatic speech recognition (ASR)
- Autoregressive modeling
- Beamforming
- Binary pattern matching
- cognition
- Compressive Acoustic Measurements
- Compressive sampling
- Compressive Sensing
- continuous F0 coding
- Convex optimization
- Convolutive source separation
- Deep neural network
- Deep neural network (DNN)
- Deep neural network posterior features
- Deep neural network posterior probabilities
- deep neural networks
- Dictionary learning
- Diffuse sound field
- Distant speech recognition
- Distributed source localization
- Euclidean distance matrix
- exemplar-based modeling
- far-field asr
- Fast $k$NN
- Generalized cross correlation
- Generalized Trust Region Subproblem (GTRS)
- hidden variable
- Image Model
- Joint sparse recovery
- k-nearest neighbor (kNN) search
- Keyword Detection
- kNN classifier
- Least square solution
- Linguistic parsing
- Low bit rate speech vocoding
- low-rank representation (LRR)
- low-rank sparsity
- Matrix completion
- microphone array
- Microphone array calibration
- missing data
- Model-Based Compressive Sensing
- Model-based sparse component analysis
- Model-based sparse recovery
- models
- Multi-party Speech
- Multi-party Speech Recognition
- Multi-speaker Localization
- nearest neighbour rule of classification
- Non-negative matrix factorization
- Over-determined linear equation
- Overlapping Speech
- Pairwise distance estimation
- Phase transform
- Phone posterior
- Phoneme classification
- Phonological features
- Phonological posterior
- phonological posteriors
- posterior feature
- Posterior feature space
- Posterior hashing
- posterior probability
- Posterior probability structures
- Posterior representatives
- posterior space properties
- posterior-based metrics
- Pronunciation dewarping
- Quantized posterior hashing
- query by example
- Reverberant enclosure
- Reverberation
- Robust microphone placement
- Room acoustic characterization
- Room acoustic estimation
- Room Geometry
- Room geometry estimation
- Single-channel source localization
- skew-symmetric matrices
- soft targets
- Source localization
- sparse coding
- Sparse Component Analysis
- Sparse modeling
- Sparse Recovery
- sparse representation
- Sparse Signal Recovery
- Sparse word posterior probabilities
- Speaker localization
- speaker verification
- Spectrographic sparsity models
- Speech
- speech coding
- Speech dereverberation
- speech perception
- speech production
- speech recognition
- Speech source localization
- speech sparsity
- Speech spectral structures
- spiking neural networks
- spoken term detection
- Structural similarity measure
- Structured Sparse Coding
- Structured sparse representation
- Structured sparsity
- structured sparsity models
- Subspace detection
- synchronisation
- TDOA denoising
- TDOA estimation
- underdetermined convolutive speech separation
- Very low bit rate speech coding
- word emphasis
Publications of Afsaneh Asaei sorted by first author
C
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, , , and , Idiap-RR-11-2016 |
|
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding, , , and , in: IEEE/ACM Trans. on Audio, Speech and Language Processing, 2016 |
|
D
On quantifying the quality of acoustic models in hybrid DNN-HMM ASR, , and , in: Speech Communication, 119:24-35, 2020 |
[DOI] |
Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models, , and , in: Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017 |
|
Exploiting Eigenposteriors for Semi-supervised Training of DNN Acoustic Models with Sequence Discrimination, , and , in: Proceedings of Interspeech, 2017 |
|
Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-based Speech Recognition, , and , in: Speech Communication: Special Issue on Advances in Sparse Modeling and Low-rank Modeling for Speech Processing, 76:230–244, 2016 |
[DOI] |
Sparse Hidden Markov Models for Exemplar-based Speech Recognition Using Deep Neural Network Posterior Features, , and , Idiap-RR-19-2016 |
|
Sparse Modeling of Neural Network Posterior Probabilities for Exemplar-Based Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015 |
|
Dictionary Learning for Sparse Representation of Neural Network Exemplars in Speech Recognition, , and , in: Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, pages 1093, 2015 |
|
Far-field ASR Using Low-rank and Sparse Soft Targets from Parallel Data, , and , in: IEEE Workshop on Spoken Language Technology, Athens, Greece, pages 581-587, IEEE, 2018 |
|
Exploiting Low-dimensional Structures to Enhance DNN based Acoustic Modeling in Speech Recognition, , , and , in: Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), Shanghai, pages 5690-5694, IEEE, 2016 |
|
G
Manifold Sparse Beamforming, , and , in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013 |
[DOI] |
H
A New Identity for the Least-square Solution of Overdetermined Set of Linear Equations, , and , Idiap-RR-35-2015 |
|
L
Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, , , and , Idiap-RR-04-2016 |
|
Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, , , and , in: Interspeech, 2016 |
|
M
Convexity in source separation: Models, geometry, and algorithms, , , , and , in: IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications, 2013 |
|
R
Sparse Subspace Modeling for Query by Example Spoken Term Detection, , and , in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018 |
|
Phonetic Subspace Features for Improved Query by Example Spoken Term Detection, , and , in: Speech Communication, 103:27-36, 2018 |
[DOI] |
Subspace Regularized Dynamic Time Warping for Spoken Query Detection, , and , in: Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017 |
|
Sparse Subspace Modeling for Query by Example Spoken Term Detection, , and , Idiap-RR-01-2016 |
|
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, , and , Idiap-RR-06-2016 |
|
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, , and , in: Interspeech, 2016 |
|
Sparse Modeling of Posterior Exemplars for Keyword Detection, , , and , in: Proceedings of Interspeech, pages 3690-3694, 2015 |
|
T
Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, , , and , in: Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, Villers-les-Nancy, pages 1-5, IEEE, 2014 |
[DOI] |
Spatial Sound Localization via Multipath Euclidean Distance Matrix Recovery, , , , and , in: IEEE Journal of Selected Topics in Signal Processing, 9(5):802-814, 2015 |
|
An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, , , , and , Idiap-RR-16-2011 |
|
An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, , , , and , in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011 |
|
Robust Microphone Placement for Source Localization from Noisy Distance Measurements, , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2579-2583, IEEE, 2015 |
[DOI] |
Ad Hoc Microphone Array Calibration: Euclidean Distance Matrix Completion Algorithm and Theoretical Guarantees, , , , and , in: Signal Processing, 107:123–140, 2015 |
[DOI] |
V
TDOA Matrices: Algebraic Properties and their Application to Robust Denoising with Missing Data, , , and , in: IEEE Transactions on Signal Processing, 64(20):5242-5254, 2016 |
[DOI] [URL] |
Novel GCC-PHAT Model in Diffuse Sound Field for Microphone Array Pairwise Distance Based Calibration, , , , , , and , in: IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2669-2673, 2015 |
|