IM2 - Idiap Publications

Update cookies preferences

Name:

IM2

| 1 | 2 | 3 | 4 | 5 | 6 |

From Research to Reality: Evaluation of a Single-Computer Real-Time LVCSR System for Speech-Based Retrieval, Andrei Popescu-Belis, Maryam Habibi, Philip N. Garner and Nan Li, Idiap-RR-12-2017

attachment

Joint Similarity Learning for Predicting Links in Networks with Multiple-type Links, Majid Yazdani and Andrei Popescu-Belis, Idiap-RR-29-2015

attachment

Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array, Weifeng Li, Longbiao Wang, Yicong Zhou, John Dines, Mathew Magimai-Doss, Hervé Bourlard and Qingmin Liao, Idiap-RR-17-2014

attachment

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, Idiap-RR-01-2013

attachment

End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, Idiap-RR-40-2013

attachment

Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, Idiap-RR-13-2013

attachment

MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, Idiap-RR-03-2013

attachment

Recurrent Convolutional Neural Networks for Scene Labeling, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-41-2013

attachment

Recurrent Convolutional Neural Networks for Scene Parsing, Pedro H. O. Pinheiro and Ronan Collobert, Idiap-RR-22-2013

attachment

Using out-of-language data to improve an under-resourced speech recognizer, David Imseng, Petr Motlicek, Hervé Bourlard and Philip N. Garner, Idiap-RR-09-2013

attachment

Automatic Social Role Recognition In Professional Meetings, A. Sapru and Hervé Bourlard, Idiap-RR-35-2012

attachment

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, Idiap-RR-15-2012

attachment

Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, David Imseng, Ramya Rasipuram and Mathew Magimai-Doss, Idiap-RR-01-2012

attachment

IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, Petr Motlicek, Fabio Valente and Igor Szoke, Idiap-RR-36-2012

attachment

Improving Object Classification using Pose Information, Hugo Penedones, Ronan Collobert, Francois Fleuret and David Grangier, Idiap-RR-30-2012

attachment

Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, Maryam Habibi and Andrei Popescu-Belis, Idiap-RR-14-2012

attachment

A Speech-based Just-in-Time Retrieval System using Semantic Search, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, Idiap-RR-31-2011

attachment

AN INTEGRATED FRAMEWORK FOR MULTI-CHANNEL MULTI-SOURCE LOCALIZATION AND VOICE ACTIVITY DETECTION, Mohammad J. Taghizadeh, Philip N. Garner, Hervé Bourlard, Hamid Reza Abutalebi and Afsaneh Asaei, Idiap-RR-16-2011

attachment

BROADBAND BEAMPATTERN FOR MULTI-CHANNEL SPEECH ACQUISITION AND DISTANT SPEECH RECOGNITION, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, Idiap-RR-39-2011

attachment

Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition., Philip N. Garner, Idiap-RR-15-2011

attachment

Continuous Speech Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, Idiap-RR-35-2011

attachment

Finding Information in Multimedia Records of Meetings, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, Idiap-RR-32-2011

attachment

IMPROVING MICROPHONE ARRAY SPEECH RECOGNITION WITH COCHLEAR IMPLANT-LIKE SPECTRALLY REDUCED SPEECH, Cong-Thanh Do, Mohammad J. Taghizadeh and Philip N. Garner, Idiap-RR-40-2011

attachment

Improving non-native ASR through stochastic multilingual phoneme space transformations, David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai-Doss, Idiap-RR-19-2011

attachment

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai-Doss and John Dines, Idiap-RR-13-2011

attachment

When Users Meet Technology: The Meeting Browser Development Helix, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, Idiap-RR-05-2011

attachment

Advances in Fast Multistream Diarization based on the Information Bottleneck Framework, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-23-2010

attachment

An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and Gerald Friedland, Idiap-RR-02-2010

attachment

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-22-2010

attachment

Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-13-2010

attachment

English Spoken Term Detection in Multilingual Recordings, Petr Motlicek, Fabio Valente and Philip N. Garner, Idiap-RR-21-2010

attachment

Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior, Hayley Hung and Daniel Gatica-Perez, Idiap-RR-12-2010

attachment

Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-01-2010

attachment

Fast Bounding Box Estimation based Face Detection, Venkatesh Bala Subburaman and Sébastien Marcel, Idiap-RR-38-2010

attachment

Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-14-2010

attachment

Hierarchical Tandem Features for ASR in Mandarin, Joel Praveen Pinto, Mathew Magimai-Doss and Hervé Bourlard, Idiap-RR-39-2010

attachment

Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, Idiap-RR-29-2010

attachment

KL Realignment for Speaker Diarization with Multiple Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, Idiap-RR-24-2010

attachment

Modeling and Understanding Flickr Communities through Topic-based Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, Idiap-RR-19-2010

attachment

The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites, Andrei Popescu-Belis, Jonathan Kilgour, Alexandre Nanchen and Peter Poller, Idiap-RR-26-2010

attachment

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai-Doss, Idiap-RR-15-2010

attachment

Tracter: A Lightweight Dataflow Framework, Philip N. Garner and John Dines, Idiap-RR-10-2010

attachment

Tuning-Robust Initialization Methods for Speaker Diarization, David Imseng and Gerald Friedland, Idiap-RR-35-2010

attachment

A MAP Approach to Noise Compensation of Speech, Philip N. Garner, Idiap-RR-08-2009

attachment

APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, Idiap-RR-35-2009

attachment

Automatic Out-of-Language Detection based on Confidence Measures derived from LVCSR Word and Phone Lattices, Petr Motlicek, Idiap-RR-06-2009

attachment

Automatic vs. human question answering over multimedia meeting recordings, Quoc Anh Le and Andrei Popescu-Belis, Idiap-RR-13-2009

attachment

ClusterRank: A Graph Based Method for Meeting Summarization, Nikhil Garg, Benoit Favre, Korbinian Reidhammer and Dilek Hakkani Tür, Idiap-RR-09-2009

attachment

Comparing meeting browsers using a task-based evaluation method, Andrei Popescu-Belis, Idiap-RR-11-2009

attachment

Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, Anindya Roy and Sébastien Marcel, Idiap-RR-28-2009

attachment

Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, Idiap-RR-12-2009

attachment

Multiple Object Tracking using Flow Linear Programming, Jerome Berclaz, Francois Fleuret and Pascal Fua, Idiap-RR-10-2009

attachment

Novel initialization methods for Speaker Diarization, David Imseng, Idiap-RR-07-2009

attachment

On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, Mathew Magimai-Doss, Guillermo Aradilla and Hervé Bourlard, Idiap-RR-24-2009

attachment

Real-Time ASR from Meetings, Philip N. Garner, John Dines, Thomas Hain, Asmaa El Hannani, Martin Karafiat, Danil Korchagin, Mike Lincoln, Vincent Wan and Le Zhang, Idiap-RR-15-2009

attachment

Robust Speaker Diarization for Short Speech Recordings, David Imseng and Gerald Friedland, Idiap-RR-26-2009

attachment

SNR Features for Automatic Speech Recognition, Philip N. Garner, Idiap-RR-25-2009

attachment

Speaker Change Detection with Privacy-Preserving Audio Cues, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez and Hervé Bourlard, Idiap-RR-23-2009

attachment

Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features, Hayley Hung and Silèye O. Ba, Idiap-RR-20-2009

attachment

User Interface Design in a Just-in-time Retrieval System for Meetings, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Mike Flynn, Sebastian Germesin, Alexandre Nanchen and Majid Yazdani, Idiap-RR-38-2009

attachment

Visual activity context for focus of attention estimation in dynamic meetings, Silèye O. Ba, Hayley Hung and Jean-Marc Odobez, Idiap-RR-02-2009

attachment

Entropy coding of Quantized Spectral Components in FDLP audio codec, Petr Motlicek, Sriram Ganapathy and Hynek Hermansky, Idiap-RR-71-2008

attachment

Identifying Dominant People in Meetings from Audio-Visual Sensors, Hayley Hung and Daniel Gatica-Perez, Idiap-RR-65-2008

attachment

Kernel Based Text-Independnent Speaker Verification, Johnny Mariéthoz, Samy Bengio and Yves Grandvalet, Idiap-RR-68-2008

attachment

Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-75-2008

attachment

MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, Sriram Ganapathy, Petr Motlicek and Hynek Hermansky, Idiap-RR-74-2008

attachment

Modulation Frequency Features For Phoneme Recognition In Noisy Speech, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, Idiap-RR-70-2008

attachment

Multi-layer Boosting for Pattern Recognition, Francois Fleuret, Idiap-RR-76-2008

attachment

Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Hynek Hermansky and Mathew Magimai-Doss, Idiap-RR-69-2008

attachment

Who Wants To Be A Millionaire? (II), Huseyn Gasimov, Petr Motlicek and Hervé Bourlard, Idiap-Com-02-2013

attachment

Face Detection using Ferns, Venkatesh Bala Subburaman and Sébastien Marcel, Idiap-Com-01-2011

attachment

Finding without searching, Andrei Popescu-Belis, Idiap-Com-01-2010

attachment

Ad Hoc Microphone Array Calibration: Euclidean Distance Matrix Completion Algorithm and Theoretical Guarantees, Mohammad J. Taghizadeh, Reza Parhizkar, Philip N. Garner, Hervé Bourlard and Afsaneh Asaei, in: Signal Processing, 107:123–140, 2015

attachment

[DOI]

What Your Face Vlogs About: Expressions of Emotion and Big-Five Traits Impressions in YouTube, Lucia Teijeiro-Mosquera, Joan-Isaac Biel, Jose Luis Alba-Castro and Daniel Gatica-Perez, in: IEEE Transactions Affective Computing, 2014

attachment

Broadcasting oneself: Visual Discovery of Vlogging Styles, Oya Aran, Joan-Isaac Biel and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 16(1):201-215, 2014

attachment

[DOI]

Mining Crowdsourced First Impressions in Online Social Video, Joan-Isaac Biel and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 16(7), 2014

attachment

Enhanced Diffuse Field Model for Ad Hoc Microphone Array Calibration, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, in: Signal Processing, 101:242-255, 2014

attachment

Hi YouTube! Personality Impressions and Verbal Content in Social Video, Joan-Isaac Biel, Daniel Gatica-Perez, John Dines and Vagia Tsminiaki, in: 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013, 2013

attachment

Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, Majid Yazdani and Andrei Popescu-Belis, in: Artificial Intelligence Journal, 194:176–202, 2013

attachment

[DOI]

A Savitzky-Golay Filtering Perspective of Dynamic Feature Computation, S. R. Krishnan, Mathew Magimai-Doss and C. S. Seelamantula, in: IEEE Signal Processing Letters, 20(3):281 -- 284, 2013

[DOI]

Convexity in source separation: Models, geometry, and algorithms, Michael McCoy, Volkan Cevher, Quoc Tran Dinh, Afsaneh Asaei and Luca Baldassarre, in: IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications, 2013

attachment

Applying multi- and cross-lingual stochastic phone space transformations to non-native speech recognition, David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai-Doss, in: IEEE Transactions on Audio, Speech, and Language Processing, 2013

attachment

[DOI]

Using out-of-language data to improve an under-resourced speech recognizer, David Imseng, Petr Motlicek, Hervé Bourlard and Philip N. Garner, in: Speech Communication, 2013

attachment

[DOI]
[URL]

Finding Information in Multimedia Records of Meetings, Andrei Popescu-Belis, Denis Lalanne and Hervé Bourlard, in: IEEE Multimedia, 19(2):48-57, 2012

[DOI]
[URL]

The ICSI RT-09 Speaker Diarization System, Gerald Friedland, Adam Janin, David Imseng, Xavier Anguera, Luke Gottlieb, Marijn Huijbregts, Mary Tai Knox and Oriol Vinyals, in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371--381, 2012

[DOI]

Transcribing meetings with the AMIDA systems, Thomas Hain, Lukas Burget, John Dines, Philip N. Garner, Frantisek Grezl, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiat, Mike Lincoln and Vincent Wan, in: IEEE Transactions on Audio, Speech, and Language Processing, 20(2):486--498, 2012

attachment

[DOI]
[URL]

A Fast Parts-based Approach to Speaker Verification using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE Transactions on Information Forensics and Security, 7(1):241-254, 2012

attachment

The YouTube Lens: Crowdsourced Personality Impressions and Audiovisual Analysis of Vlogs, Joan-Isaac Biel and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 2012

attachment

Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Speech Communication, 54(1), 2012

[DOI]

Phase AutoCorrelation (PAC) features for noise robust speech recognition, Shajith Ikbal, Hemant Misra, Hynek Hermansky and Mathew Magimai-Doss, in: Speech Communication, 54(7):867–880, 2012

[DOI]

Automatic Identification of Discourse Markers in Multiparty Dialogues: An In-Depth Study of Like and Well, Andrei Popescu-Belis and Sandrine Zufferey, in: Computer Speech and Language, 25(3):499-518, 2011

attachment

[DOI]

Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues, Silèye O. Ba and Jean-Marc Odobez, in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 33(1):101-116, 2011

attachment

Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Mathew Magimai-Doss, Hynek Hermansky and Hervé Bourlard, in: IEEE Transcations on Audio, Speech, and Language Processing, 19(2):225-241, 2011

attachment

Current trends in multilingual speech processing, Hervé Bourlard, John Dines, Mathew Magimai-Doss, Philip N. Garner, David Imseng, Petr Motlicek, Hui Liang, Lakshmi Saheer and Fabio Valente, in: Sadhana, 36(5):885–915, 2011

attachment

[DOI]
[URL]

Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition, Philip N. Garner, in: Speech Communication, 53(8):991--1001, 2011

attachment

[DOI]

Privacy-sensitive recognition of group conversational context with sociometers, Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland and Daniel Gatica-Perez, in: Springer Multimedia Systems Journal, 2011

attachment

VlogSense: Conversational Behavior and Social Attention in YouTube, Joan-Isaac Biel and Daniel Gatica-Perez, in: Transactions on Multimedia Computing, Communications and Applications, 2011

attachment

Tuning-Robust Initialization Methods for Speaker Diarization, David Imseng and Gerald Friedland, in: IEEE Transactions on Audio, Speech, and Language Processing, 18(8):2028-2037, 2010

attachment

[DOI]

Mining group nonverbal conversational patterns using probabilistic topic models, Dinesh Babu Jayagopi and Daniel Gatica-Perez, in: IEEE Transactions on Multimedia, 2010

attachment

Contextual classification of image patches with latent aspect models, Florent Monay, Pedro Quelhas, Jean-Marc Odobez and Daniel Gatica-Perez, in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009

attachment

Capturing Order in Social Interactions, Alessandro Vinciarelli, in: IEEE Signal Processing Magazine, 2009

attachment

An Information Theoretic Approach to Speaker Diarization of Meeting Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: IEEE Transactions on Audio Speech and Language Processing, 17(7), 2009

attachment

[DOI]

Automatic Role Recognition in Multiparty Recordings: Using Social Affiliation Networks for Feature Extraction, Hugues Salamin, Sarah Favre and Alessandro Vinciarelli, in: IEEE Transactions on Multimedia, 11(7), 2009

attachment

Recognizing Human Visual Focus of Attention from Head Pose in Meetings, Silèye O. Ba and Jean-Marc Odobez, in: IEEE Transactions on Systems, Man, Cybernetics, Part-B, Vol. 39(No. 1), 2009

attachment

Social Signal Processing: Survey of an Emerging Domain, Alessandro Vinciarelli, Maja Pantic and Hervé Bourlard, in: Image and Vision Computing, 2009

attachment

The FEMTI guidelines for contextual MT evaluation: principles and tools, Paula Estrella, Andrei Popescu-Belis and Margaret King, in: Linguistica Antverpiensia New Series, 8, 2009

Multi-layer Boosting for Pattern Recognition, Francois Fleuret, in: Pattern Recognition Letter, 30, 2009

Tracking the visual focus of attention for a varying number of wandering people, Kevin C. Smith, Silèye O. Ba, Jean-Marc Odobez and Daniel Gatica-Perez, in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 30(7), 2008

attachment

Modeling Dominance in Group Conversations using NonVerbal Activity Cues, Dinesh Babu Jayagopi, Hayley Hung, Chuohao Yeo and Daniel Gatica-Perez, in: IEEE Transactions on Audio, Speech and Language Processing, 2008

attachment

Fast Recognition of Anticipation Related Potentials, Gangadhar Garipelli, Ricardo Chavarriaga and José del R. Millán, in: IEEE Transactions on Biomedical Engineering, 2008

attachment

Classification-based Probabilistic Modeling of Texture Transition for Fast Line Search Tracking and Delineation, Ali Shahrokni, Tom Drummond, Francois Fleuret and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008

Multi-Camera People Tracking with a Probabilistic Occupancy Map, Francois Fleuret, Jerome Berclaz, Richard Lengagne and Pascal Fua, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(2), 2008

Modulation Frequency Features For Phoneme Recognition In Noisy Speech, Sriram Ganapathy, Samuel Thomas and Hynek Hermansky, in: Journal of Acoustical Society of America - Express Letters, 2008

attachment

Stationary Features and Cat Detection, Francois Fleuret and Donald Geman, in: Journal of Machine Learning Research, 9, 2008

Dimensionality of Dialogue Act Tagsets: An Empirical Analysis of Large Corpora, Andrei Popescu-Belis, in: Language Resources and Evaluation, 42(1), 2008

attachment

[DOI]

Interactive Multimodal Information Management, Hervé Bourlard and Andrei Popescu-Belis, EPFL Press, 2013

Multimodal Signal Processing: Human Interactions in Meetings, Steve Renals, Hervé Bourlard, Jean Carletta and Andrei Popescu-Belis, Cambridge University Press, 2012

[URL]

Machine Learning for Multimodal Interaction IV, Andrei Popescu-Belis, Hervé Bourlard and Steve Renals, Springer-Verlag, LNCS, volume 4892, 2008

[DOI]

Machine Learning for Multimodal Interaction V, Andrei Popescu-Belis and Rainer Stiefelhagen, Springer-Verlag, LNCS, volume 5237, 2008

[DOI]

Interactive Multimodal Information Management: Shaping the Vision, Andrei Popescu-Belis and Hervé Bourlard, in: Interactive Multimodal Information Management, pages 1-17, EPFL Press, 2013

attachment

Learning to learn new models of human activities in indoor settings1, Fabian Nater, Tatiana Tommasi, Luc Van Gool and Barbara Caputo, in: Interactive Multimodal Information Management, EPFL Press, 2013

Learning to learn new models of human activities in indoor settings1, Fabian Nater, Tatiana Tommasi, Luc Van Gool and Barbara Caputo, in: Interactive Multimodal Information Management, EPFL Press, 2013

attachment

Medical image annotation, Barbara Caputo, in: Interactive Multimodal Information Management, EPFL Press, 2013

attachment

Speech Processing, Mathew Magimai-Doss, in: Interactive Multimodal Information Management, pages 221--245, EPFL Press, 2013

Evaluation of Meeting Support Technology, Simon Tucker and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 237-252, Cambridge University Press, 2012

Multimodal Signal Processing for Meetings: an Introduction, Andrei Popescu-Belis and Jean Carletta, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012

attachment

Speaker Diarization, Fabio Valente and Gerald Friedland, in: Multimodal Signal Processing: Human Interactions in Meetings, Cambridge University Press, 2012

[URL]

User Requirements for Meeting Support Technology, Denis Lalanne and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 210-221, Cambridge University Press, 2012

Analysis of Group Conversations: Modeling Social Verticality, Oya Aran and Daniel Gatica-Perez, in: Computer Analysis of Human Behavior, pages 293-322, Springer London, 2011

Call me Guru: user categories and large-scale behavior in YouTube, Joan-Isaac Biel and Daniel Gatica-Perez, in: Social Media Computing, Springer, 2011

attachment

Accessing a Large Multimodal Corpus using an Automatic Content Linking Device, Andrei Popescu-Belis, Jean Carletta, Jonathan Kilgour and Peter Poller, in: Multimodal Corpora: From Models of Natural Interaction to Systems and Applications, Springer-Verlag, 2009

attachment

[DOI]

Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions, Andrei Popescu-Belis, in: Multimodal Signal Processing for Human-Computer Interaction, Elsevier / Academic Press, 2009

Towards an Objective Test for Meeting Browsers: the BET4TQB Pilot Experiment, Andrei Popescu-Belis, Philippe Baudrion, Mike Flynn and Pierre Wellner, in: Machine Learning for Multimodal Interaction IV, Springer-Verlag, 2008

attachment

[DOI]

Gender Classification by LUT based boosting of Overlapping Block Patterns, Rakesh Metha, Manuel Günther and Sébastien Marcel, in: Scandinavian Conference on Image Analysis, pages 530-542, Springer International Publishing, 2015

attachment

[DOI]
[URL]

Ad-Hoc Microphone Array Calibration from Partial Distance Measurements, Mohammad J. Taghizadeh, Afsaneh Asaei, Philip N. Garner and Hervé Bourlard, in: Proceedings of the 4th Joint Workshop on Hands-free speech communication and Microphone Arrays, Villers-les-Nancy, pages 1 - 5, IEEE, 2014

attachment

[DOI]

Automatic Speech Recognition and Translation of a Swiss German Dialect: Walliserdeutsch, Philip N. Garner, David Imseng and Thomas Meyer, in: Proceedings of Interspeech, 2014

attachment

Detecting speaker roles and topic changes in multiparty conversations using latent topic models, A. Sapru and Hervé Bourlard, in: Proceedings of Interspeech, 2014

attachment

Enforcing Topic Diversity in a Document Recommender for Conversations, Maryam Habibi and Andrei Popescu-Belis, in: Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics), Dublin, Ireland, pages 746-759, IEEE, 2014

attachment

Model-based Sparse Component Analysis for Reverberant Speech Localization, Afsaneh Asaei, Hervé Bourlard, Mohammad J. Taghizadeh and Volkan Cevher, in: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1439 - 1443, IEEE, 2014

attachment

[DOI]

Recurrent Convolutional Neural Networks for Scene Labeling, Pedro H. O. Pinheiro and Ronan Collobert, in: 31st International Conference on Machine Learning (ICML), Beijing, China, pages 82-90, JMLR, 2014

attachment

[URL]

The Workshop on Computational Personality Recognition 2014, Fabio Celli, Bruno Lepri, Joan-Isaac Biel, Giuseppe Riccardi, Daniel Gatica-Perez and Fabio Pianesi, in: Proceedings of the ACM International Conference on Multimedia, 2014

attachment

A Probabilistic Framework for Multiple Speaker Localization, Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel and Dietrich Klakow, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013

attachment

Automatic Social Role Recognition In Professional Meetings Using Conditional Random Fields, A. Sapru and Hervé Bourlard, in: Proceedings of Interspeech, 2013

attachment

Computing Text Semantic Relatedness using the Contents and Links of a Hypertext Encyclopedia, Majid Yazdani and Andrei Popescu-Belis, in: International Joint Conference on artificial intelligence, 2013

attachment

Context Aware Addressee Estimation for Human Robot Interaction, Samira Sheikhi, Dinesh Babu Jayagopi, Vasil Khalidov and Jean-Marc Odobez, in: Proceedings of the 6th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Gaze in Multimodal Interaction, 2013

Cross-Domain Personality Prediction: From Video Blogs to Small Group Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013

attachment

Diverse Keyword Extraction from Conversations, Maryam Habibi and Andrei Popescu-Belis, in: Proceedings of the ACL 2013 (51th Annual Meeting of the Association for Computational Linguistics ), Short Papers, Sofia, Bulgaria, pages 651-657, ACL, 2013

attachment

Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks, Dimitri Palaz, Ronan Collobert and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2013

attachment

Euclidean Distance Matrix Completion for Ad-hoc Microphone Array Calibration, Mohammad J. Taghizadeh, Reza Parhizkar, Philip N. Garner and Hervé Bourlard, in: Proceedings IEEE International Conference On Digital Signal Processing, 2013

attachment

Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition, Aniruddha Adiga, Mathew Magimai-Doss and Chandra Sekhar Seelamantula, in: Proceedings of IEEE TENCON, 2013

attachment

Investigating the Impact of Language Style and Vocal Expression on Social Roles of Participants in Professional Meetings, A. Sapru and Hervé Bourlard, in: Affective Computing and Intelligent Interaction, Geneva, pages 324-329, IEEE, 2013

attachment

[DOI]

Learning to Rank on Network Data, Majid Yazdani, Ronan Collobert and Andrei Popescu-Belis, in: Mining and Learning with Graphs, 2013

attachment

Leveraging the robot dialog state for visual focus of attention recognition, Samira Sheikhi, Vasil Khalidov, David Klotz, Britta Wrede and Jean-Marc Odobez, in: Proceedings of the 15th ACM on International Conference on Multimodal Interaction, 2013

Manifold Sparse Beamforming, Baran Gözcü, Afsaneh Asaei and Volkan Cevher, in: IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Saint Martin, France, pages 113-116, IEEE, 2013

attachment

[DOI]

One of a Kind: Inferring Personality Impressions in Meetings, Oya Aran and Daniel Gatica-Perez, in: 15th ACM International Conference on Multimodal Interaction, 2013

attachment

Assessing the Impact of Language Style on Emergent Leadership Perception from Ubiquitous Audio, Dairazalia Sanchez-Cortes, Petr Motlicek and Daniel Gatica-Perez, in: Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, Ulm, Germany, 2012

attachment

Automatic detection of conflict escalation in spoken conversations, Samuel Kim, Sree Harsha Yella and Fabio Valente, in: INTERSPEECH, ISCA, Portland, Oregon, USA., 2012

attachment

Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates, Samuel Kim, Fabio Valente and Alessandro Vinciarelli, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012

attachment

Boosting localized binary features for speech recognition, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: Symposium on Machine Learning in Speech and Language Processing (MLSLP), 2012

attachment

Boosting under-resourced speech recognizers by exploiting out of language data - Case study on Afrikaans, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings of the 3rd International Workshop on Spoken Languages Technologies for Under-resourced Languages, Cape Town, pages 60--67, 2012

attachment

Combination of Sparse Classification and Multilayer Perceptron for Noise Robust ASR, Yang Sun, Mathew Magimai-Doss, Jort F. Gemmeke, B. Cranen, Louis ten Bosch and Lou Boves, in: Proceedings of Interspeech, 2012

attachment

COMBINING CEPSTRAL NORMALIZATION AND COCHLEAR IMPLANT-LIKE SPEECH PROCESSING FOR MICROPHONE ARRAY-BASED SPEECH RECOGNITION, Cong-Thanh Do, Mohammad J. Taghizadeh and Philip N. Garner, in: Proceedings of the IEEE Workshop on Spoken Language Technology, 2012

attachment

Comparing different acoustic modeling techniques for multilingual boosting, David Imseng, John Dines, Petr Motlicek, Philip N. Garner and Hervé Bourlard, in: Proceedings of Interspeech, Portland, Oregon, 2012

attachment

FaceTube: predicting personality from facial expressions of emotion in online conversational video, Joan-Isaac Biel, Lucia Teijeiro-Mosquera and Daniel Gatica-Perez, in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2012

attachment

IMPROVING ACOUSTIC BASED KEYWORD SPOTTING USING LVCSR LATTICES, Petr Motlicek, Fabio Valente and Igor Szoke, in: Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, Japan, pages 4413-4416, 2012

MediaParl: Bilingual mixed language accented speech database, David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorvé and Alexandre Nanchen, in: Proceedings of the 2012 IEEE Workshop on Spoken Language Technology, pages 263--268, 2012

attachment

Microphone Array Beampattern Characterization for Hands-free Speech Applications, Mohammad J. Taghizadeh, Philip N. Garner and Hervé Bourlard, in: IEEE 7th Sensor Array and Multichannel Signal Processing Workshop(SAM), Hoboken, NJ, USA, pages 473-476, 2012

attachment

Modeling dominance effects on nonverbal behaviors using granger causality, Kyriaki Kalimeri, Bruno Lepri, Oya Aran, Dinesh Babu Jayagopi, Daniel Gatica-Perez and Fabio Pianesi, in: Proceedings of International Conference on Multimodal Interaction, ICMI 2012, Santa Monica, CA, 2012

attachment

The Good, the Bad, and the Angry: Analyzing Crowdsourced Impressions of Vloggers, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2012

attachment

Using Crowdsourcing to Compare Document Recommendation Strategies for Conversations, Maryam Habibi and Andrei Popescu-Belis, in: RecSys, Recommendation Utility Evaluation (RUE 2012), Dublin, Ireland, pages 15-20, 2012

attachment

Using KL-divergence and multilingual information to improve ASR for under-resourced languages, David Imseng, Hervé Bourlard and Philip N. Garner, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Kyoto, pages 4869--4872, 2012

attachment

Using Sparse Classification Outputs as Feature Observations for Noise Robust ASR, Yang Sun, B. Cranen, Jort F. Gemmeke, Lou Boves, Louis ten Bosch and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2012

attachment

A Just-in-Time Document Retrieval System for Dialogues or Monologues, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, Portland, OR, pages 350-352, 2011

attachment

A Speech-based Just-in-Time Retrieval System using Semantic Search, Andrei Popescu-Belis, Majid Yazdani, Alexandre Nanchen and Philip N. Garner, in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), Portland, OR, pages 80-86, 2011

[URL]

An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection, Mohammad J. Taghizadeh, Philip N. Garner, Hervé Bourlard, Hamid Reza Abutalebi and Afsaneh Asaei, in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011

attachment

Analysis and Comparison of Recent MLP Features for LVCSR Systems, Fabio Valente, Mathew Magimai-Doss and Wen Wang, in: Proceedings of Interspeech 2011, 2011

attachment

Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition, David Imseng, Ramya Rasipuram and Mathew Magimai-Doss, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Hawaii, USA, pages 348-353, 2011

attachment

Fast Speaker Verification on Mobile Phone data using Boosted Slice Classifiers, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IAPR IEEE International Joint Conference on Biometrics, Washington DC, 2011

attachment

Hierarchical Tandem Features for ASR in Mandarin, Joel Pinto, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, 2011

Improving non-native ASR through stochastic multilingual phoneme space transformations, David Imseng, Hervé Bourlard, John Dines, Philip N. Garner and Mathew Magimai-Doss, in: Proceedings of Interspeech, Florence, Italy, pages 537-540, 2011

attachment

Information Bottleneck Features for HMM/GMM Speaker Diarization of Meetings Recordings, Sree Harsha Yella and Fabio Valente, in: Interspeech, Florence, Italy, pages 953-956, 2011

attachment

Language dependent universal phoneme posterior estimation for mixed language speech recognition, David Imseng, Hervé Bourlard, Mathew Magimai-Doss and John Dines, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Prag, CZ, pages 5012-5015, 2011

attachment

Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus, Fabio Valente and Alessandro Vinciarelli, in: Proceedings of Interspeech, 2011

attachment

MULTISTREAM SPEAKER DIARIZATION THROUGH INFORMATION BOTTLENECK SYSTEM OUTPUTS COMBINATION, Deepu Vijayasenan, Fabio Valente and Petr Motlicek, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2011

attachment

Phoneme Recognition using Boosted Binary Features, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011

attachment

Speaker Diarization of Meetings based on Speaker Role N-gram Models, Fabio Valente, Deepu Vijayasenan and Petr Motlicek, in: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011

attachment

The Kaldi Speech Recognition Toolkit, Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukas Burget, Ondrej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlicek, Yanmin Qian, Petr Schwarz, Jan Silovsky, Georg Stemmer and Karel Vesely, in: IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village, Big Island, Hawaii, US, IEEE Signal Processing Society, 2011

attachment

Using a Wikipedia-based Semantic Relatedness Measure for Document Clustering., Majid Yazdani and Andrei Popescu-Belis, in: Graph-based Methods for Natural Language Processing, 2011

attachment

You Are Known by How You Vlog: Personality Impressions and Nonverbal Behavior in YouTube, Joan-Isaac Biel, Oya Aran and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, Barcelona, 2011

attachment

A Comparative Study of MLP Front-ends for Mandarin ASR, Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Ravuri Suman and Wang Wen, in: Proceedings of Interspeech, Japan, 2010

attachment

A Multimodal Corpus for Studying Dominance in Small Group Conversations, Oya Aran, Hayley Hung and Daniel Gatica-Perez, in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010

attachment

A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks, Majid Yazdani and Andrei Popescu-Belis, in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010 ), Carnegie Mellon University, Pittsburgh, PA, USA, 2010

attachment

An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and Gerald Friedland, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, pages 4946-4949, 2010

attachment

An Alternative Scanning Strategy to Detect Faces, Venkatesh Bala Subburaman and Sébastien Marcel, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

attachment

Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010

attachment

Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives, Andrei Popescu-Belis, Jonathan Kilgour, Peter Poller, Alexandre Nanchen, Erik Boertjes and Joost de Wit, in: Proceedings of the 33rd Annual ACM SIGIR Conference, Geneva, Switzerland, pages 703, 2010

[DOI]

BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION, Anindya Roy, Mathew Magimai-Doss and Sébastien Marcel, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010

attachment

English Spoken Term Detection in Multilingual Recordings, Petr Motlicek, Fabio Valente and Philip N. Garner, in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010

attachment

Fast Bounding Box Estimation based Face Detection, Venkatesh Bala Subburaman and Sébastien Marcel, in: ECCV, Workshop on Face Detection: Where we are, and what next?, 2010

attachment

[URL]

Floor Holder Detection and End of Speaker Turn Prediction in Meetings, Alfred Dielmann, Giulia Garau and Hervé Bourlard, in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, ISCA, 2010

attachment

Hierarchical Multilayer Perceptron based Language Identification, David Imseng, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of Interspeech, Makuhari, Japan, pages 2722-2725, 2010

attachment

Introducing Crossmodal Biometrics:Person Identification from Distinct Audio & Visual Streams, Anindya Roy and Sébastien Marcel, in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010

attachment

Leveraging speaker diarization for meeting recognition from distant microphones, Andreas Stolcke, Gerald Friedland and David Imseng, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4390--4393, 2010

Multistream Speaker Diarization beyond Two Acoustic Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: International Conference on Acoustics, Speech, and Signal Processing, 2010

attachment

Recognizing conversational context in group interaction using privacy-sensitive mobile sensors, Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland and Daniel Gatica-Perez, in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010

attachment

Social Signal Processing: Understanding Nonverbal Communication in Social Interactions, Alessandro Vinciarelli and Fabio Valente, in: Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands), 2010

attachment

Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment, Afsaneh Asaei, Hervé Bourlard and Philip N. Garner, in: Proceedings of Interspeech, Makuhari, Japan, 2010

attachment

The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites, Andrei Popescu-Belis, Jonathan Kilgour, Alexandre Nanchen and Peter Poller, in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010

attachment

Towards a standard for dialogue act annotation, Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Fang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, Claudia Soria and Traum. David, in: 7th International Conference on Language Resources and Evaluation, Malta, 2010

attachment

[URL]

Towards mixed language speech recognition systems, David Imseng, Hervé Bourlard and Mathew Magimai-Doss, in: Proceedings of Interspeech, Makuhari, Japan, pages 278-281, 2010

attachment

Tracter: A Lightweight Dataflow Framework, Philip N. Garner and John Dines, in: Proceedings of Interspeech, Makuhari, Japan, 2010

attachment

Using Audio and Visual Cues for Speaker Diarisation Initialisation, Giulia Garau and Hervé Bourlard, in: International Conference on Acoustics, Speech and Signal Processing, 2010

attachment

VARIATIONAL BAYESIAN SPEAKER DIARIZATION OF MEETING RECORDINGS, Fabio Valente, Petr Motlicek and Deepu Vijayasenan, in: Proceedings of ICASSP, 2010

attachment

View-Based Appearance Model Online Learning for 3D Deformable Face Tracking, Stéphanie Lefèvre and Jean-Marc Odobez, in: Proc. Int. Conf. on Computer Vision Theory and Applications, Angers, 2010

attachment

Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010

attachment

Voices of Vlogging, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 2010

attachment

Audioâ€“Visual Synchronisation for Speaker Diarisation, Giulia Garau, Alfred Dielmann and Hervé Bourlard, in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010

attachment

A Multimedia Retrieval System Using Speech Input, Andrei Popescu-Belis, Peter Poller, Jonathan Kilgour, Erik Boertjes, Jean Carletta, Sandro Castronovo, Michal Fapso, Alexandre Nanchen, Theresa Wilson, Joost de Wit and Majid Yazdani, in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009

attachment

APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION, Sriram Ganapathy, Samuel Thomas, Petr Motlicek and Hynek Hermansky, in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009

attachment

[URL]

Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models, Sarah Favre, Alfred Dielmann and Alessandro Vinciarelli, in: ACM International Conference on Multimedia, 2009

attachment

Automatic vs. human question answering over multimedia meeting recordings, Quoc Anh Le and Andrei Popescu-Belis, in: 10th Annual Conference of the International Speech Communication Association, 2009

attachment

Characterising Conversationsal Group Dynamics Using Nonverbal Behaviour, Dinesh Babu Jayagopi, Raducanu Bogdan and Daniel Gatica-Perez, in: Proceedings ICME 2009, 2009

attachment

Discovering Group Nonverbal Conversational Patterns with Topics, Dinesh Babu Jayagopi and Daniel Gatica-Perez, in: Proceedings ICMI-MLMI, 2009

attachment

Implicit Human Centered Tagging, Alessandro Vinciarelli, Nicolae Suditu and Maja Pantic, in: Proceedings of IEEE Conference on Multimedia and Expo, 2009

attachment

Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard and Daniel Gatica-Perez, in: Proceedings of Interspeech 2009, 2009

attachment

Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, Giulia Garau, Silèye O. Ba, Hervé Bourlard and Jean-Marc Odobez, in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009

attachment

Joint Pose Estimator and Feature Learning for Object Detection, Karim Ali, Francois Fleuret, David Hasler and Pascal Fua, in: Proceedings of the IEEE International Conference on Computer Vision, 2009

KL Realignment for Speaker Diarization with Multiple Feature Streams, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: 10th Annual Conference of the International Speech Communication Association, 2009

Learning Large Margin Likelihood for Realtime Head Pose Tracking, Elisa Ricci and Jean-Marc Odobez, in: IEEE Int. Conference on Image Processing, Cairo, Egypt, IEEE, 2009

attachment

Learning Rotational Features for Filament Detection, German Gonzalez, Francois Fleuret and Pascal Fua, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2009

MLP Based Hierarchical System for Task Adaptation in ASR, Joel Praveen Pinto, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009

attachment

MULTI-MODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING COMPRESSED-DOMAIN VIDEO FEATURES, Gerald Friedland, Hayley Hung and Chuohao Yeo, in: International Conference on Audio, Speech and Signal Processing, 2009

attachment

MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009

attachment

Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, Deepu Vijayasenan, Fabio Valente and Hervé Bourlard, in: Proceedings of International conference on acoustics speech and signal processing, 2009

Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, Weifeng Li, John Dines, Mathew Magimai-Doss and Hervé Bourlard, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009

attachment

Posterior features applied to speech recognition tasks with user-defined vocabulary, Guillermo Aradilla, Hervé Bourlard and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009

attachment

Predicting Remote Versus Collocated Group Interactions using Nonverbal Cues, Dairazalia Sanchez-Cortes, Dinesh Babu Jayagopi and Daniel Gatica-Perez, in: Proc. Int. Conf. on Multimodal Interfaces, Workshop on Multimodal Sensor-Based Systems and Mobile Phones for Social Computing,, Cambridge, 2009

[DOI]

Real-Time ASR from Meetings, Philip N. Garner, John Dines, Thomas Hain, Asmaa El Hannani, Martin Karafiat, Danil Korchagin, Mike Lincoln, Vincent Wan and Le Zhang, in: Proceedings of Interspeech, Brighton, UK., 2009

attachment

Robust Speaker Diarization for Short Speech Recordings, David Imseng and Gerald Friedland, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, pages 432-437, 2009

attachment

SNR Features for Automatic Speech Recognition, Philip N. Garner, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009

attachment

Speaker Change Detection with Privacy-Preserving Audio Cues, Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez and Hervé Bourlard, in: Proceedings of ICMI-MLMI 2009, 2009

attachment

Steerable Features for Statistical 3D Dendrite Detection, German Gonzalez, Francois Aguet, Francois Fleuret, Michael Unser and Pascal Fua, in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention, 2009

Structure and appearance features for robust 3D facial actions tracking, Stéphanie Lefèvre and Jean-Marc Odobez, in: IEEE Proc. Int. Conf. on Multimedia and Expo, IEEE, 2009

attachment

Visual Activity Context For Focus of Attention Estimation in Dynamic Meetings, Silèye O. Ba, Hayley Hung and Jean-Marc Odobez, in: International Conference on Multimedia & Expo, 2009

attachment

Visual Speaker Localization Aided by Acoustic Models, Gerald Friedland, Chuohao Yeo and Hayley Hung, in: ACM Multimedia, 2009

Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, Joel Praveen Pinto, G. S. V. S. Sivaram, Hynek Hermansky and Mathew Magimai-Doss, in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009

attachment

Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior, Joan-Isaac Biel and Daniel Gatica-Perez, in: Proceedings of the 17th ACM International Conference on Multimedia, ACM, 2009

attachment

Analyzing Flickr Groups, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: Proc. of the Intl. Conf. on Image and Video Retrieval, ACM, 2008

Automated Delineation of Dendritic Networks in Noisy Image Stacks, German Gonzalez, Francois Fleuret and Pascal Fua, in: proceedings of the European Conference on Computer Vision, 2008

Identifying Dominant People in Meetings from Audio-Visual Sensors, Hayley Hung and Daniel Gatica-Perez, in: International Conference on Automatic Face and Gesture Recognition, Amsterdam, The Netherlands, 2008

attachment

Investigating Automatic Dominance Estimation in Groups From Visual Attention and Speaking Activity, Hayley Hung, Dinesh Babu Jayagopi, Silèye O. Ba, Jean-Marc Odobez and Daniel Gatica-Perez, in: International Conference on Multi-modal Interfaces, 2008

attachment

Multi-Camera Tracking and Atypical Motion Detection with Behavioral Maps, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: proceedings of the European Conference on Computer Vision, 2008

Principled Detection-by-classification from Multiple Views, Jerome Berclaz, Francois Fleuret and Pascal Fua, in: proceedings of the International Conference on Computer Vision Theory and Applications, 2008

Reference-based vs. task-based evaluation of human language technology, Andrei Popescu-Belis, in: LREC 2008 ELRA Workshop on Evaluation, ELRA, Marrakech, Morocco, 2008

attachment

Role Recognition for Meeting Participants: an Approach Based on Lexical Information and Social Network Analysis, Sarah Favre, Hugues Salamin, Alessandro Vinciarelli, Dilek Hakkani Tür and N. P. Garg, in: ACM International Conference on Multimedia, Vancouver, Canada, 2008

attachment

Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions, Sarah Favre, Hugues Salamin, John Dines and Alessandro Vinciarelli, in: International Conference on Multimodal Interfaces, Chania, Greece, 2008

attachment

Social Signal Processing: State-of-the-Art and Future Perspectives of an Emerging Domain, Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard and Alex Pentland, in: Proceedings of the ACM International Conference on Multimedia, 2008

attachment

Social Signals, their Function, and Automatic Analysis: A Survey, Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard and Alex Pentland, in: Proceedings of International Conference on Multimodal Interfaces (to appear), 2008

attachment

Task-based evaluation of meeting browsers: from BET task elicitation to user behavior analysis, Andrei Popescu-Belis, Mike Flynn, Pierre Wellner and Philippe Baudrion, in: 6th International Conference on Language Resources and Evaluation, Marrakech, Morocco, 2008

attachment

Topickr: Flickr Groups and Users Reloaded, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008

Towards Audio-Visual On-line Diarization Of Participants In Group Meetings, Hayley Hung and Gerald Friedland, in: European Conference on Computer Vision Workshop on Multi-camera and Multi-modal Sensor Fusion, 2008

attachment

ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIONS USING SPEAKER DIARIZATION STRATEGIES, Hayley Hung, Yan Huang, Gerald Friedland and Daniel Gatica-Perez, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007

Enabling speech applications using Ad-Hoc Microphone Arrays, Mohammad J. Taghizadeh, École Polytechnique Fédérale de Lausanne, 2015

attachment

Modeling Users’ Information Needs in a Document Recommender for Meetings, Maryam Habibi, EPFL, 2015

attachment

Speaker diarization of spontaneous meeting room conversations, Sree Harsha Yella, EPFL, 2015

attachment

Mining Conversational Social Video, Joan-Isaac Biel, EPFL, 2013

attachment

Multilingual speech recognition A posterior based approach, David Imseng, École Polytechnique Fédérale de Lausanne (EPFL), 2013

attachment

Alternative search techniques for face detection using location estimation and binary features, Venkatesh Bala Subburaman, ECOLE POLYTECHNIQUE FEDERALE DE LAUSANNE, 2012

attachment

Boosting Localized Features for Speaker and Speech Recognition, Anindya Roy, Ecole Polytechnique Federale de Lausanne (EPFL), 2011

attachment

An Information Theoretic Approach to Speaker Diarization of Meeting Recordings, Deepu Vijayasenan, Ecole polytechnique fédérale de Lausanne, 2010

attachment

Methods for Asynchronous and Non-Invasive EEG-Based Brain-Computer Interfaces. Towards Intelligent Brain-Actuated Wheelchairs, Ferran Galán, University of Barcelona, 2008

attachment

| 1 | 2 | 3 | 4 | 5 | 6 |

processing time: 3.8967 seconds.