CONF
Mahmoodzadeh_IST-2_2010/IDIAP
Single Channel Speech Separation with a Frame-based Pitch Range Estimation Method in Modulation Frequency
Mahmoodzadeh, Azar
Abutalebi, Hamid Reza
Soltanianzadeh, Hamid
Sheikhzadeh, Hamid
acoustic frequency
modulation frequency
pitch frequency
speech separation
EXTERNAL
https://publications.idiap.ch/attachments/papers/2010/Mahmoodzadeh_IST-2_2010.pdf
PUBLIC
Proceedings of 5th International Symposium on Telecommunications
2010
December 2010
Computational Auditory Scene Analysis (CASA) has attracted considerable interest for segregating speech from monaural mixtures. In this paper, we propose a new method for single-channel speech separation with frame-based pitch range estimation in the modulation frequency domain. The pitch range is estimated in each frame of the modulation spectrum of speech by analyzing onsets and offsets. In the proposed method, the target speaker is separated from the interfering speaker by filtering the mixture signal with a mask extracted from the modulation spectrogram of the mixture signal. Systematic evaluation shows an acceptable level of separation compared with classic methods.