logo Idiap Research Institute        
All book chapters sorted by author
| 1 | 2 | 3 |


A

Classifying the Social Media Author Profile Through a Multimodal Representation, Miguel Á. Álvarez-Carmona, Esaú Villatoro-Tello, Luis Villaseñor Pineda and Manuel Montes-y-Gómez, in: Intelligent Technologies: Concepts, Applications, and Future Directions. Studies in Computational Intelligence, Springer, 2022
[DOI]
[URL]
Anti-Spoofing: Face Databases, André Anjos, Ivana Chingovska and Sébastien Marcel, in: Encyclopedia of Biometrics, Springer US, 2014
[DOI]
[URL]
Face Anti-spoofing: Visual Approach, André Anjos, Jukka Komulainen, Sébastien Marcel, Abdenour Hadid and Matti Pietikainen, in: Handbook of Biometric Anti-Spoofing, pages 65-82, Springer-Verlag, 2014
[DOI]
An Introduction to Vein Presentation Attacks and Detection, André Anjos, Pedro Tome and Sébastien Marcel, in: Handbook of Biometric Anti-Spoofing, Springer International Publishing, 2019
[DOI]
[URL]
Otomatik İşaret Dili Tanıma ve Türk İşaret Dili için Bilgisayar Uygulamaları, Oya Aran, Ismail Ari, Alp Kindiroglu, Pinar Santemiz and Lale Akarun, in: Ellerle Konusmak: Turk Isaret Dili Arastirmalari / Research on Turkish Sign Language, pages 471-498, Koc University Press, 2016
Analysis of Group Conversations: Modeling Social Verticality, Oya Aran and Daniel Gatica-Perez, in: Computer Analysis of Human Behavior, pages 293-322, Springer London, 2011

B

Neural Networks in Automatic Speech Recognition, F. Beaufays, Hervé Bourlard, H. Franco and Nelson Morgan, in: to be published in The Handbook of Brain Theory and Neural Networks, Bradford Books, The MIT Press, 2000
Interactive Generation of Calligraphic Trajectories from Gaussian Mixtures, D. Berio, F. F. Leymarie and Sylvain Calinon, in: Mixture Models and Applications, pages 23-38, Springer, 2019
[DOI]
Learning From Humans, A. G. Billard, Sylvain Calinon and R. Dillmann, in: Handbook of Robotics, pages 1995-2014, Springer, 2016
[DOI]
[URL]
Intuitive Robot Programming, C. Blanc, Julius Jankowski, A. Sonderegger, Sylvain Calinon and S. Dégallier Rochat, in: Ergonomics in Robotics: Advances and Innovations, Springer, 2025
Hidden Markov Models and other Finite State Automata for Sequence Processing, Hervé Bourlard and Samy Bengio, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002
attachment
Towards Robust and Adaptive Speech Recognition Models, Hervé Bourlard, Samy Bengio and Katrin Weber, in: Mathematical Foundations of Speech Processing and Recognition, Springer-Verlag, 2002
attachment
Connectionist Techniques, Hervé Bourlard and Nelson Morgan, in: Survey of the State of the Art in Human Language Technology, Cambridge University Press, 1998
Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, Hervé Bourlard and Nelson Morgan, in: Adaptive Processing of Sequences and Data Structures, Springer Verlag, 1998
Applying Handwriting Recognition to US Census Forms, Thomas M. Breuel, in: Recent Developments in Computer Vision, Springer, 1995
attachment
Handwriting Recognition, Thomas M. Breuel, in: Recent Developments in Computer Vision, Springer, 1995
Two Simple and Domain-independent Approaches for Early Detection of Anorexia, Sergio Burdisso, Leticia Cagnina, Marcelo Errecalde and Manuel Montes-y-Gómez, in: Early Detection of Mental Health Disorders by Social Media Monitoring: The First Five Years of the eRisk Project, pages 159-182, Springer International Publishing, 2022
attachment
[DOI]
[URL]

C

Learning from Demonstration (Programming by Demonstration), Sylvain Calinon, in: Encyclopedia of Robotics, Springer, 2019
attachment
[DOI]
[URL]
Robot Learning with Task-Parameterized Generative Models, Sylvain Calinon, in: Robotics Research, pages 111-126, Springer, 2018
attachment
[DOI]
[URL]
Mixture Models for the Analysis, Edition, and Synthesis of Continuous Time Series, Sylvain Calinon, in: Mixture Models and Applications, pages 39-57, Springer, 2019
attachment
[DOI]
Programming industrial robots from few demonstrations., Sylvain Calinon, in: Human-Robot Collaboration: Unlocking the potential for industrial applications, pages 9-37, Institution of Engineering and Technology (IET), 2023
Learning Control, Sylvain Calinon and D. Lee, in: Humanoid Robotics: a Reference, pages 1261-1312, Springer, 2019
attachment
[DOI]
[URL]
Medical image annotation, Barbara Caputo, in: Interactive Multimodal Information Management, EPFL Press, 2013
attachment
Evaluation Methodologies, Ivana Chingovska, André Anjos and Sébastien Marcel, in: Handbook of Biometric Antispoofing, Springer, 2014
Anti-spoofing: Evaluation Methodologies, Ivana Chingovska, André Anjos and Sébastien Marcel, in: Encyclopedia of Biometrics, Springer US, 2014
[DOI]
Face Recognition Systems Under Spoofing Attacks, Ivana Chingovska, Nesli Erdogmus, André Anjos and Sébastien Marcel, in: Face Recognition Systems Under Spoofing Attacks, pages 165-194, Springer International Publishing, 2016
[DOI]
[URL]
Evaluation Methodologies for Biometric Presentation Attack Detection, Ivana Chingovska, Amir Mohammadi, André Anjos and Sébastien Marcel, in: Handbook of Biometric Anti-Spoofing, Springer International Publishing, 2019
attachment
[DOI]
[URL]
Les domaines d'application des technologies vocales, Gérard Chollet, in: Fondements et perspectives en traitement automatique de la parole, GDR-PRC Communication Homme-Machine, 1995
Assessment of speaker verification systems, Gérard Chollet and Frédéric Bimbot, in: Spoken Language Ressources and Assessment, EAGLES Handbook, 1995
Implementing Neural Networks Efficiently, Ronan Collobert, Koray Kavukcuoglu and Clément Farabet, in: Neural Networks: Tricks of the Trade, Springer, 2012
attachment

D

How does a dictation machine recognize speech ?, T. Dutoit, L. Couvreur and Hervé Bourlard, in: Applied Signal Processing--A MATLAB approach, Springer MA, 2008
attachment

E

Unsupervised methods for activity analysis and detection of abnormal events, Remi Emonet and Jean-Marc Odobez, in: Intelligent Video Surveillance Systems (ISTE), Wiley, 2013
attachment
[DOI]

F

Error-Related EEG Potentials in Brain-Computer Interfaces, Pierre W. Ferrez and José del R. Millán, in: Towards Brain-Computer Interfacing, The MIT Press, 2007
Supervised Ontogenic Networks, Emile Fiesler and K. Cios, in: Handbook of Neural Computation, 1996
Neural Network Topologies, Emile Fiesler, in: Handbook of Neural Computation, 1996
Re-Identification for Improved People Tracking, Francois Fleuret, Horesh Ben Shitrit and Pascal Fua, in: Person Re-Identification, pages 311-336, Springer, 2014

G

Modeling interest in face-to-face conversations from multimodal nonverbal behavior, Daniel Gatica-Perez, in: In J.-P. Thiran, H. Bourlard, and F. Marques, (Eds.,',','), Multimodal Signal Processing, Academic Press, Academic Press, 2009
attachment
Analysis of Small Groups, Daniel Gatica-Perez, Oya Aran and Dinesh Babu Jayagopi, in: Social Signal Processing, pages 349-367, Cambridge University Press. Editors J. Burgoon, N. Magnenat-Thalmann, M. Pantic, and A. Vinciarelli, 2017
[DOI]
MAAYA: Multimedia Methods to Support Maya Epigraphic Analysis, Daniel Gatica-Perez, Gulcan Can, Rui Hu, Stephane Marchand-Maillet, Jean-Marc Odobez, Carlos Pallan Gayol and Edgar Roman-Rangel, in: Arqueologia computacional: Nuevos enfoques para el analisis y la difusion del patrimonio cultural, INAH-RedTDPC, 2017
attachment
Visual Attention, Speaking Activity, and Group Conversational Analysis in Multi-Sensor Environments, Daniel Gatica-Perez and Jean-Marc Odobez, in: In H. Nakashima, J. Augusto, H. Aghajan (Eds.,',','), Handbook of Ambient Intelligence and Smart Environments, Springer, 2010
Multi-channel Face Presentation Attack Detection Using Deep Learning, Anjith George and Sébastien Marcel, in: Deep Learning-Based Face Analytics, Springer International Publishing, 2021
attachment
Sequential Design of Computer Experiments, David Ginsbourger, in: Wiley StatsRef: Statistics Reference Online, Wiley, 2018
Design of Computer Experiments Using Competing Distances Between Set-Valued Inputs, David Ginsbourger, Jean Baccou, Clément Chevalier and Frédéric Perales, in: mODa 11 - Advances in Model-Oriented Design and Analysis, pages 123-131, Springer International Publishing, 2016
[DOI]
On ANOVA Decompositions of Kernels and Gaussian Random Field Paths, David Ginsbourger, Olivier Roustant, Dominic Schuhmacher, Nicolas Durrande and Nicolas Lenz, in: Monte Carlo and Quasi-Monte Carlo Methods, pages 315-330, Springer International Publishing, 2016
[DOI]
Reactive Anticipatory Robot Skills with Memory, Hakan Girgin, Julius Jankowski and Sylvain Calinon, in: Robotic Research, pages 436-451, Springer, 2023
attachment
Discriminative Keyword Spotting, David Grangier, Joseph Keshet and Samy Bengio, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

H

Remote Blood Pulse Analysis for Face Presentation Attack Detection, Guillaume Heusch and Sébastien Marcel, in: Handbook of Biometric Anti-Spoofing, Springer, 2019
[URL]

I

Compositionality in English deverbal compounds:The role of the head, Gianina Iordachioaia, Lonneke van der Plas and Glorianna Jagfeld, in: The role of constituents in multiword expressions. Phraseology and Multiword Expressions, Language Science Press, Berlin, 2020

K

A Kernel Wrapper for Phoneme Sequence Recognition, Joseph Keshet and Dan Chazan, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009
A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition, Joseph Keshet, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009
A Large Margin Algorithm for Forced Alignment, Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009
Hand Gesture Analysis, Cem Keskin, Oya Aran and Lale Akarun, in: Computer Analysis of Human Behavior,, pages 125-149, Springer London, 2011
A Cross-database Study of Voice Presentation Attack Detection, Pavel Korshunov and Sébastien Marcel, in: Handbook of Biometric Anti-Spoofing: Presentation Attack Detection, 2nd Edition, Springer, 2018
Presentation attack detection in voice biometrics, Pavel Korshunov and Sébastien Marcel, in: User-Centric Privacy and Security in Biometrics, The Institution of Engineering and Technology, 2017
attachment
Global Optimization with Sparse and Local Gaussian Process Models, Tipaluck Krityakierne and David Ginsbourger, in: Machine Learning, Optimization, and Big Data, pages 185-196, Springer International Publishing, 2015
[DOI]
On the Recognition Performance of BioHash-Protected Finger Vein Templates, Vedrana Krivokuca and Sébastien Marcel, in: Handbook of Vascular Biometrics, pages 465-480, Springer Open, 2019
attachment

L

User Requirements for Meeting Support Technology, Denis Lalanne and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 210-221, Cambridge University Press, 2012
DNN-based Speech Synthesis: Importance of input features and training data, Alexandros Lazaridis, Blaise Potard and Philip N. Garner, in: International Conference on Speech and Computer , SPECOM, pages 193-200, Springer Berlin Heidelberg, 2015
attachment
[DOI]
Rehabilitation of Count-based Models for Word Vector Representations, Rémi Lebret and Ronan Collobert, in: Computational Linguistics and Intelligent Text Processing, pages 417-429, Springer International Publishing, 2015
Speech Reading, Juergen Luettin, in: Modern Interface Technology: The Leading Edge, Research Studies Press Ltd., 1999
Active Shape Models for Visual Speech Feature Extraction, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Speechreading by Humans and Machines, Springer Verlag, 1996
attachment
Machine Recognition and Applications, Juergen Luettin, Michael Vogt and Christoph Bregler, in: Speechreading by Humans and Machines, Springer Verlag, 1996

M

Speech Processing, Mathew Magimai.-Doss, in: Interactive Multimodal Information Management, pages 221--245, EPFL Press, 2013
Differentiating the Multipoint Expected Improvement for Optimal Batch Design, Sébastien Marmin, Clément Chevalier and David Ginsbourger, in: Machine Learning, Optimization, and Big Data, pages 37-48, Springer International Publishing, 2015
[DOI]
Who Sees What? Examining Urban Impressions in Global South Cities, Luis Emmanuel Medina Rios, Salvador Ruiz-Correa, Darshan Santani and Daniel Gatica-Perez, in: Human Perception of Visual Information: Psychological and Computational Perspectives, Springer, 2022
attachment
Brain-Computer Interfaces, José del R. Millán, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002
attachment
Robot Navigation, José del R. Millán, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002
attachment
Tapping the Mind or Resonating Minds?, José del R. Millán, in: European Visions for the Knowledge Age, Cheshire Henbury, 2007
Non-Invasive Brain-Actuated Control of a Mobile Robot by Human EEG, José del R. Millán, F. Renkens, J. Mouriño and W. Gerstner, in: 2006 IMIA Yearbook of Medical Informatics, Schattauer Verlag, 2006
Neural Network Adaptations to Hardware Implementations, Perry Moerland and Emile Fiesler, in: Handbook of Neural Computation, Institute of Physics Publishing and Oxford University Publishing, 1997
attachment
Semantic Behavior Analysis of COVID-19 Patients: A Collaborative Framework, Amlan Mohanty, Debasish Kumar Mallick, Shantipriya Parida and Satya Ranjan Dash, in: Machine Learning for Healthcare Applications, John Wiley & Sons, Inc. USA and Scrivener Publishing LLC, USA, 2021
[URL]
Automatic Speech Recognition: an Auditory Perspective, Nelson Morgan, Hervé Bourlard and Hynek Hermansky, in: Speech Processing in the Auditory System, Springer Verlag, New York, 2000

N

Learning to learn new models of human activities in indoor settings1, Fabian Nater, Tatiana Tommasi, Luc Van Gool and Barbara Caputo, in: Interactive Multimodal Information Management, EPFL Press, 2013
Learning to learn new models of human activities in indoor settings1, Fabian Nater, Tatiana Tommasi, Luc Van Gool and Barbara Caputo, in: Interactive Multimodal Information Management, EPFL Press, 2013
attachment
Reconnaissance et compréhension de la parole: évaluation et applications, F. Néel, Gérard Chollet, F. Lamel, W. Minker and Andrei Constantinescu, in: Fondements et perspectives en traitement automatique de la parole, AUPELF -- UREF, 1996
Flickr Groups: Multimedia Communities for Multimedia Analysis, Radu-Andrei Negoescu and Daniel Gatica-Perez, in: Internet Multimedia Search and Mining, Bentham Science Publishers, 2011

O

Sampling techniques for audio-visual tracking and head pose estimation, Jean-Marc Odobez and Oswald Lanz, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 84-102, Cambridge University Press, 2012
attachment

P

Social Signal Processing: The Research Agenda, Maja Pantic, R. Cowie, F. D'Errico, Dirk Heylen, M. Mehu, C. Pelachaud, I. Poggi, M. Schroeder and Alessandro Vinciarelli, in: "Visual Analysis of Humans" by T.B.Moeslund, A.Hilton, V.Krueger and L.Sigal (eds.), pages 511-538, Springer Verlag, 2011
Interactive Multimodal Information Management: Shaping the Vision, Andrei Popescu-Belis and Hervé Bourlard, in: Interactive Multimodal Information Management, pages 1-17, EPFL Press, 2013
attachment
Accessing a Large Multimodal Corpus using an Automatic Content Linking Device, Andrei Popescu-Belis, Jean Carletta, Jonathan Kilgour and Peter Poller, in: Multimodal Corpora: From Models of Natural Interaction to Systems and Applications, Springer-Verlag, 2009
attachment
[DOI]
Multimodal Signal Processing for Meetings: an Introduction, Andrei Popescu-Belis and Jean Carletta, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 1-11, Cambridge University Press, 2012
attachment
Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions, Andrei Popescu-Belis, in: Multimodal Signal Processing for Human-Computer Interaction, Elsevier / Academic Press, 2009

S

More than Words: Inference of Socially Relevant Information from Nonverbal Vocal Cues in Speech, Hugues Salamin, Gelareh Mohammadi, Khiet Truong and Alessandro Vinciarelli, in: Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues, A.Esposito (ed.), LNCS,Springer, 2010
attachment
Introduction to Sequence Analysis for Human Behavior Understanding, Hugues Salamin and Alessandro Vinciarelli, in: "Computer Analysis of Human Behavior" by A.Salah and T.Gevers (eds.), pages 21-40, Springer Verlag, 2011
An All-Optical Forward Propagation Multilayer Neural Network, Indu Saxena and Emile Fiesler, in: From Natural to Artificial Neural Computation, Springer Verlag, 1995
Ellipsometry, Indu Saxena, in: Optical Metrology, Artech House, 1997

T

Neural Network Initialization, Georg Thimm and Emile Fiesler, in: From Natural to Artificial Neural Computation, Springer Verlag, 1995
A Hybrid Approach to Continuous Speech Recognition, Kari Torkkola and Teuvo Kohonen, in: The handbook of brain theory and neural networks, The MIT Press, 1995
Evaluation of Meeting Support Technology, Simon Tucker and Andrei Popescu-Belis, in: Multimodal Signal Processing: Human Interactions in Meetings, pages 237-252, Cambridge University Press, 2012

V

Data-driven extraction of spectral-dynamics based posteriors, Fabio Valente, in: Handbook of Natural Language Processing and Machine Translation Handbook of Natural Language Processing and Machine Translation, Springer, 2011
[URL]
Speaker Diarization, Fabio Valente and Gerald Friedland, in: Multimodal Signal Processing: Human Interactions in Meetings, Cambridge University Press, 2012
[URL]
Sparsity in Topic Models, Jagannadan Varadarajan, Remi Emonet and Jean-Marc Odobez, in: Practical Applications of Sparse Modeling: Biology, Signal Processing and Beyond, MIT Press, 2012
attachment
Towards a Technology of Nonverbal Communication: Vocal Behavior in Social and Affective Phenomena, Alessandro Vinciarelli and Gelareh Mohammadi, in: "Affective Computing and Interaction: Psychological, Cognitive and Neuroscientific Perspectives" by D. Gokcay & G. Yildirim (eds.), igi-global, 2010
attachment

W

Deep Learning via Semi-Supervised Embedding, Jason Weston, Frédéric Ratle, Hossein Mobahi and Ronan Collobert, in: In Neural Networks: Tricks of the Trade, Springer, 2012
attachment

Y

Multi-Person Bayesian Tracking with Multiple Cameras, Jian Yao and Jean-Marc Odobez, in: Multi-camera networks: principles and applications, pages 363-388, Academic Press, 2009
attachment

Z

Evaluation Databases, Stan Z.Li, Javier Galbally, André Anjos and Sébastien Marcel, in: Handbook of Biometric Anti-Spoofing, pages 247-278, Springer-Verlag, 2014
[DOI]
| 1 | 2 | 3 |