Keywords:
- adaptation
- air surveillance data
- Air traffic control
- air traffic control communications
- air traffic controller
- air traffic management
- Assistant Based Speech Recognition
- Automatic Speech Recognition
- automatic speech recognition and understanding
- batch norm
- batch normalization
- call sign detection
- chunking
- command recognition rate
- Contextual Adaptation
- Cross-modal Attentio
- Cross-modal Attention
- diarization
- electronic flight strips
- finite-state transducers
- GDPR
- GPU decoding
- Human-Computer Interaction
- legal framework
- logistic regression
- multi-lingual automatic speech recognition
- multi-lingual SAD
- Multilingual automatic speech recognition
- multiple remote tower
- multitask acoustic modeling
- named entity recognition
- Natural language processing
- online speech recognition
- OpenSky Network
- personal data processing
- Robust Automatic Speech Recognition
- self-supervised pre-training
- signal processing
- Speaker change detection
- speaker recognition
- speaker role classification
- speaker role detection
- speaker role identification
- speaker verification
- Speech activity detection
- speech recognition
- speech understanding
- Spoken Language Understanding
- supervised adaptation
- Text-based speaker diarization
- tower utterances
- transfer learning
- wav2vec 2.0
- Word Consensus Networks
Publications of Seyyed Saeed Sarfjoo sorted by first author
F
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , Idiap-RR-01-2023 |
[URL] |
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data, , , and , in: 1st ISCA Symposium on Security and Privacy in Speech Communication, pages 10--13, 2021 |
[DOI] |
I
A two-step approach to leverage contextual data: speech recognition in air-traffic communications, , , , and , in: Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 |
|
J
How Does Pre-trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications, , , , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
BERTraffic: A Robust BERT-Based Approach for Speaker Change Detection and Role Identification of Air-Traffic Communications, , , , , , and , Idiap-RR-15-2021 |
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications, , , , , , and , in: 2023 IEEE Spoken Language Technology Workshop (SLT), IEEE, 2023 |
[URL] |
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, , , , , , , , , , , , , , , and , in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020 |
[DOI] [URL] |
M
Idiap submission to the NIST SRE 2018 Speaker Recognition Evaluation, , , and , Idiap-RR-17-2019 |
|
O
Assistant Based Speech Recognition Support for Air Traffic Controllers in a Multiple Remote Tower Environment, , , , , , , , , , , , and , in: Aerospace, 10(6), 2023 |
[DOI] [URL] |
Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances, , , , , , , and , in: Interspeech, 2021 |
|
P
Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR, , , , , , and , Idiap-RR-22-2021 |
|
Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition, , , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator, , , , , and , in: 12th SESAR Innovation Days, 2022 |
|
S
Idiap submission to the NIST SRE 2019 Speaker Recognition Evaluation, , , , and , Idiap-RR-15-2019 |
|
Speech Activity Detection Based on Multilingual Speech Recognition System, , and , in: Interspeech, 2021 |
|
Supervised domain adaptation for text-independent speaker verification using limited data, , , and , in: Interspeech, pages 3815-3819, 2020 |
[URL] |
Domain Adaptation and Investigation of Robustness of DNN-based Embeddings for Text-Independent Speaker Verification Using Dilated Residual Networks, , and , Idiap-RR-10-2019 |
|
V
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks, , , , , , , , and , in: Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023 |
|