Keywords:
- Acoustic features
- ASR
- breathing patterns
- Convolution Neural Network
- cross-transfer knowledge
- Customer satisfaction
- depression detection
- domain adaptation
- Emotion Recognition
- end-to-end modelling
- Expressive Vocalizations
- fine-tuning
- Finetuning
- Formant transitions
- Foundation Model
- Foundation Models
- LoRA
- modalities fusion
- Multi-task learning
- Parkinson’s disease
- PC-GITA
- Peft
- phoneme modeling
- Phonetic information
- pre-trained embedding
- Self-supervised embedding
- self-supervised learning
- Speech Analysis
- Speech Emotion Recognition
- Speech for health
- Speech in health
- Spoken Language Understanding
- Stop-consonants
- wav2vec2.0
Publications of Tilak Purohit
2025
Automatic Parkinson’s disease detection from speech: Layer selection vs adaptation of foundation models, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025 |
![]() |
Emotion information recovery potential of wav2vec2 network fine-tuned for speech recognition task, and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025 |
![]() |
Exploring the Complexity of Parkinson’s Patient Speech for Depression Detection task: A Qualitative Analysis, , , , and , in: Proceedings of Workshop on Speech Pathology Analysis and DEtection (SPADE), Hyderabad, India, IEEE, 2025 |
![]() |
2024
Cross-transfer Knowledge between Speech and Text Encoders to Evaluate Customer Satisfaction, , , , and , in: Proceedings of Interspeech, Kos Island, Greece, ISCA, 2024 |
![]() |
2023
A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence, , , and , in: Proceedings of Interspeech, Dublin, Ireland, ISCA, 2023 |
![]() |
Implicit phonetic information modeling for speech emotion recognition, , and , in: Proceedings of Interspeech, Dublin, Ireland, ISCA, 2023 |
![]() |
Towards learning emotion information from short segments of speech, , , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes Island, Greece, IEEE, 2023 |
![]() |
2022
Comparing Biosignal and Acoustic feature Representation for Continuous Emotion Recognition, , , , and , in: International Multimodal Sentiment Analysis Workshop and Challenge, 2022 |
![]() |
Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track, , , and , in: Proceedings of the ICML Expressive Vocalizations Workshop held in conjunction with the 39th International Conference on Machine Learning, Maryland, USA, 2022 |
![]() |