Keywords:
- air surveillance data
- Air traffic control
- Assistant Based Speech Recognition
- Automatic Speech Recognition
- Building Blocks
- Call-sign Detection
- Call-sign Recognition
- Command Prediction Model
- Contextual Adaptation
- Conversational technologies
- dialogue
- Direction of arrival estimation
- Discourse Annotation
- Gaussian mixture
- GDPR
- Kalman filters
- legal framework
- localization
- machine learning
- microphone arrays
- Monte Carlo methods
- Multimodal interaction
- Multiple speaker localization
- multiple speakers
- named entity recognition
- OpenSky Network
- personal data processing
- Representation and Processing
- ROXANNE
- ROXSD
- Steered response power
- tracking
- unsupervised learning
Publications of Dietrich Klakow sorted by title
A
A Context-Aware Speech recognition and Understanding System for Air Traffic Control Domain, in: Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, Okinawa, Japan, 2017
A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, Idiap-RR-09-2012
A Multiple Hypothesis Gaussian Mixture Filter for Acoustic Source Localization and Tracking, in: 13th International Workshop on Acoustic Signal Enhancement, pages 233-236, 2012
A Probabilistic Framework for Multiple Speaker Localization, Idiap-RR-37-2012
A Probabilistic Framework for Multiple Speaker Localization, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013
A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, Idiap-RR-10-2012
A TDOA Gaussian Mixture Model for Improving Acoustic Source Tracking, in: 20th European Signal Processing Conference, 2012
Adaptation of Assistant Based Speech Recognition to New Domains and Its Acceptance by Air Traffic Controllers, in: Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems, San Diego, California, USA, pages 820-826, 2019
[DOI]
Adaptive Beamforming with a Maximum Negentropy Criterion, Idiap-RR-06-2008
Adaptive Beamforming with a Maximum Negentropy Criterion, in: Proceedings of the Joint Workshop on Hands-free Speech Communication and Microphone Arrays, Italy, 2008
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications, in: Proceedings of 8th OpenSky Symposium 2020, OpenSky Network, pages 1-10, MDPI, 2020
[DOI] [URL]
B
Beamforming with a Maximum Negentropy Criterion, in: IEEE Transactions on Audio Speech and Language Processing, 17(5), 2009
Boosting of contextual information in ASR for air-traffic call-sign recognition, in: Interspeech 2021, 2021
Building Blocks of Assistant Based Speech Recognition for Air Traffic Management Applications, in: Conference: SESAR Innovation Days 2018, European Union, Eurocontrol, Salzburg, Austria, SESARJU, 2018
[URL]
F
Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, Idiap-RR-77-2007
Filter Bank Design based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, in: Proceedings of ICASSP 2008, Las Vegas, USA, 2008
Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, Idiap-RR-02-2008
J
Joint Detection and Localization of Multiple Speakers using a Probabilistic Interpretation of the Steered Response Power, in: Statistical and Perceptual Audition Workshop, 2012
M
Maximum Negentropy Beamforming, Idiap-RR-07-2008
R
ROCKIT: Roadmap for Conversational Interaction Technologies, in: Proceedings of the 2014 Workshop on Roadmapping the Future of Multimodal Interaction Research including Business Opportunities and Challenges (RFMIR '14), pages 39-42, ACM, 2014
[DOI]
ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations, in: Odyssey 2024: The Speaker and Language Recognition Workshop, pages 17-24, 2024
[DOI] [URL]
S
Semi-supervised Adaptation of Assistant Based Speech Recognition Models for different Approach Areas, in: 37th AIAA/IEEE Digital Avionics Systems Conference, AIAA/IEEE, London, 2018
[URL]
T
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues, in: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, European Language Resources Association (ELRA), 2014
[URL]