Keywords:
- Acoustic features
- ASR
- breathing patterns
- Convolution Neural Network
- cross-transfer knowledge
- Customer satisfaction
- depression detection
- domain adaptation
- Emotion Recognition
- end-to-end modelling
- Expressive Vocalizations
- fine-tuning
- Finetuning
- Formant transitions
- Foundation Model
- Foundation Models
- Interpretable features
- LoRA
- modalities fusion
- Multi-task learning
- Parkinson’s disease
- PC-GITA
- Peft
- phoneme modeling
- Phonetic information
- pre-trained embedding
- Self-supervised embedding
- self-supervised learning
- Speech Analysis
- Speech Emotion Recognition
- Speech for health
- Speech Foundation Models
- Speech in health
- Spoken Language Understanding
- Stop-consonants
- wav2vec2.0
Publications of Tilak Purohit sorted by title
A
| A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence, , , and , in: Proceedings of Interspeech, Dublin, Ireland, ISCA, 2023 |
|
| Automatic Parkinson’s disease detection from speech: Layer selection vs adaptation of foundation models, , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025 |
|
C
| Comparing Biosignal and Acoustic feature Representation for Continuous Emotion Recognition, , , , and , in: International Multimodal Sentiment Analysis Workshop and Challenge, 2022 |
|
| Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track, , , and , in: Proceedings of the ICML Expressive Vocalizations Workshop held in conjunction with the 39th International Conference on Machine Learning, Maryland, USA, 2022 |
|
| Cross-transfer Knowledge between Speech and Text Encoders to Evaluate Customer Satisfaction, , , , and , in: Proceedings of Interspeech, Kos Island, Greece, ISCA, 2024 |
|
E
| Emotion information recovery potential of wav2vec2 network fine-tuned for speech recognition task, and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Hyderabad, India, IEEE, 2025 |
|
| Exploring the Complexity of Parkinson’s Patient Speech for Depression Detection task: A Qualitative Analysis, , , , and , in: Proceedings of Workshop on Speech Pathology Analysis and DEtection (SPADE), Hyderabad, India, IEEE, 2025 |
|
I
| Implicit phonetic information modeling for speech emotion recognition, , and , in: Proceedings of Interspeech, Dublin, Ireland, ISCA, 2023 |
|
O
| On Detection of Depression in Parkinson's Disease Patients' Speech: Handcrafted Features vs. Speech Foundation Models, , , and , in: Automatic Assessment of Parkinsonian Speech, Springer Nature Switzerland AG, 2025 |
[URL] |
T
| Towards learning emotion information from short segments of speech, , , , and , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Rhodes Island, Greece, IEEE, 2023 |
|