Keywords:
- animal vocalizations
- bandwidth
- bioacoustics
- Biometrics
- call type classification
- call-type and caller classification
- discretization
- Face Recognition
- feature representations
- fine-tuning
- human speech
- low-rank adaptation
- machine learning
- Morphing Attack
- pre-training domain
- quantization
- self-supervised learning
- signal processing
- Speech Analysis
- speech and audio
- speech and audio feature representations
- StyleGAN 2
- token sequences
- transfer learning
- Vector quantization
- voice activity detection
- Vulnerability Analysis
- zero-frequency filtering
Publications of Eklavya Sarkar sorted by recency
| Towards Leveraging Sequential Structure in Animal Vocalizations, in: Neural Information Processing Systems workshop: AI for Non-Human Animal Communication, 2025 |
|
| Tokenwise Contrastive Speech and Text Pre-Training for Speech Emotion Recognition, Idiap-RR-07-2025 |
|
| Transferability of Learnt Speech Representations for Decoding Non-Human Vocal Communication, Ecole Polytechnique Fédérale de Lausanne, 2025 |
|
| Leveraging Sequential Structure in Animal Vocalizations, Idiap-RR-06-2025 |
|
| Adaptation of Speech and Bioacoustics Models, Idiap-RR-05-2025 |
|
| On feature representations for marmoset vocal communication analysis, in: Bioacoustics: The International Journal of Animal Sound and its Recording, pp. 1-15, 2025 |
|
| Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing, in: International Conference on Acoustics, Speech and Signal Processing, 2025 |
|
| Feature Representations for Automatic Meerkat Vocalization Classification, in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
| On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis, in: 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, 2024 |
|
| Feature Representations for Automatic Meerkat Vocalization Classification, Idiap-RR-06-2024 |
|
| Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?, in: Proceedings of Interspeech, 2023 |
|
| Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering, in: Proceedings of Interspeech, 2022 |
|
| Are GAN-based Morphs Threatening Face Recognition?, in: International Conference on Acoustics, Speech and Signal Processing, 2022 |
|
| Modeling Source and System characteristics using Zero Frequency Filtering for Voice Activity Detection, Idiap-Internal-RR-80-2021 |
| Vulnerability Analysis of Face Morphing Attacks from Landmarks and Generative Adversarial Networks, Idiap-RR-38-2020 |
|