Keywords:
- association rules
- audio processing
- confidence estimation
- data analysis
- direction of arrival
- high-definition video-conferencing
- microphone arrays
- mobile computing
- multi-face tracking
- multi-modal database
- multimodal signal processing
- pattern matching
- real-time audio processing
- reliability estimation
- Reverberation
- sensor fusion
- signal processing
- sound source localization
- source coding
- Speaker localization
- speech meta-data
- temporal alignment
- time synchronisation
- time synchronization
- time-frequency analysis
- time-quefrency analysis
- voice-activity detection
Publications of Danil Korchagin sorted by journal and type
Publications of type Idiap-RR
2011
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , Idiap-RR-08-2011 |
|
Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, , Idiap-RR-20-2011 |
|
Just-in-Time Multimodal Association and Fusion from Home Entertainment, , , and , Idiap-RR-10-2011 |
|
Multimodal Cue Detection Engine for Orchestrated Entertainment, , , and , Idiap-RR-34-2011 |
|
Social Focus of Attention as a Time Function Derived from Multimodal Signals, and , Idiap-RR-09-2011 |
|
2010
Automatic Time Skew Detection and Correction, , Idiap-RR-42-2010 |
|
Hands Free Audio Analysis from Home Entertainment, , and , Idiap-RR-27-2010 |
|
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , Idiap-RR-37-2010 |
|
2009
Automatic Temporal Alignment of AV Data, , and , Idiap-RR-39-2009 |
|
Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , Idiap-RR-40-2009 |
|
Memoirs of Togetherness from Audio Logs, , Idiap-RR-36-2009 |
|
Out-of-Scene AV Data Detection, , Idiap-RR-31-2009 |
|
Real-Time ASR from Meetings, , , , , , , , and , Idiap-RR-15-2009 |
|
Publications of type Idiap-Com
Multimodal Data Flow Controller, , Idiap-Com-01-2009 |
|
Advances in Multimedia
Real-Time Audio-Visual Analysis for Multiperson Videoconferencing, , , , , , , , and , in: Advances in Multimedia, 2013:21, 2013 |
[DOI] [URL] |
International Journal of Computer and Electrical Engineering
The TA2 Database – A Multi-Modal Database From Home Entertainment, , and , in: International Journal of Computer and Electrical Engineering, 4(5):670-673, 2012 |
[URL] |
Publications of type Book
2012
Together Anywhere, Together Anytime, Technologies for Intimate Interactions, , , and , Centrum Wiskunde & Informatica, 2012 |
Proceedings International Conference on MultiMedia Modeling (2012)
Multimodal Cue Detection Engine for Orchestrated Entertainment, , , and , in: Proceedings International Conference on MultiMedia Modeling, Klagenfurt, Austria, 2012 |
|
Proceedings of the 19th European Signal Processing Conference (EUSIPCO) (2011)
A BSS-based Approach for Localization of Simultaneous Speakers in Reverberant Conditions, , , and , in: Proceedings of the 19th European Signal Processing Conference (EUSIPCO), 2011 |
|
Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (2011)
Audio Spatio-Temporal Fingerprints for Cloudless Real-Time Hands-Free Diarization on Mobile Devices, , in: Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, Edinburgh, UK, 2011 |
|
Proceedings International Conference on Signal Acquisition and Processing (2011)
Automatic Time Skew Detection and Correction, , in: Proceedings International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Proceedings European Signal Processing Conference (2011)
Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation, , in: Proceedings European Signal Processing Conference, Barcelona, Spain, 2011 |
|
Proceedings IEEE International Conference on Multimedia & Expo (2011)
Just-in-Time Multimodal Association and Fusion from Home Entertainment, , , and , in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011 |
|
Social Focus of Attention as a Time Function Derived from Multimodal Signals, and , in: Proceedings IEEE International Conference on Multimedia & Expo, Barcelona, Spain, 2011 |
|
International Conference on Signal Acquisition and Processing (2011)
The TA2 Database - A Multi-Modal Database from Home Entertainment, , and , in: International Conference on Signal Acquisition and Processing, Singapore, 2011 |
|
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2010)
Automatic Temporal Alignment of AV Data with Confidence Estimation, , and , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010 |
|
Proceedings of Interspeech (2010)
Hands Free Audio Analysis from Home Entertainment, , and , in: Proceedings of Interspeech, Makuhari, Japan, 2010 |
|
Proceedings International ICST Conference on User Centric Media (2009)
Memoirs of Togetherness from Audio Logs, , in: Proceedings International ICST Conference on User Centric Media, Venice, Italy, 2009 |
|
Proceedings IADIS International Conference Applied Computing (2009)
Out-of-Scene AV Data Detection, , in: Proceedings IADIS International Conference Applied Computing, Rome, Italy, 2009 |
|
Proceedings of Interspeech (2009)
Real-Time ASR from Meetings, , , , , , , , and , in: Proceedings of Interspeech, Brighton, UK., 2009 |
|