Idiap Publications
USE CTRL-F to search text, document download comming soon

[1] Samaneh Abbasi-Sureshjani, Behdad Dasht Bozorg, Bart ter Haar Romeny, and Francois Fleuret. Boosted exudate segmentation in retinal images using residual nets. In Proceedings of the MICCAI Workshop on Ophthalmic Medical Image Analysis, 2017.
[2] Samaneh Abbasi-Sureshjani, Behdad Dasht Bozorg, Bart ter Haar Romeny, and Francois Fleuret. Exploratory study on direct prediction of diabetes using deep residual networks. In Proceedings of the thematic conference on computational vision and medical image processing, 2017.
[3] Vinayak Abrol, S. Pavankumar Dubagunta, and Mathew Magimai.-Doss. Understanding raw waveform based cnn through low-rank spectro-temporal decoupling. Idiap-RR Idiap-RR-11-2019, Idiap, 10 2019. [ .pdf ]
[4] Hamid Reza Abutalebi and Hossein Momenzadeh. Performance improvement of tdoa-based speaker localization in joint noisy and reverberant conditions. EURASIP Journal on Advances in Signal Processing, 2011. [ DOI | .pdf ]
[5] Hamid Reza Abutalebi, Hedieh Heli, Danil Korchagin, and Hervé Bourlard. A bss-based approach for localization of simultaneous speakers in reverberant conditions. In Proceedings of the 19th European Signal Processing Conference (EUSIPCO), August 2011. [ .pdf ]
[6] Hamid Reza Abutalebi, Mehdi Rashidinejad, Hervé Bourlard, and Ali Akbar Tadaion. Speech enhancement using beta-order mmse spectral amplitude estimator with laplacian prior. Idiap-RR Idiap-RR-24-2011, Idiap, 7 2011. [ .pdf ]
[7] M. Acheroy et al. Multi-modal person verification tools using speech and images. In European Conference on Multimedia Applications, Services and Techniques, Louvain-neuve, Belgium, 1996.
[8] IDIAP. Activity report 2005. Idiap-Com Idiap-Com-01-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[9] Corinne Fredouille, Johnny Mariéthoz, Cédric Jaboulet, Jean Hennebert, Chafic Mokbel, and Frédéric Bimbot. Behavior of a bayesian adaptation method for incremental enrollment in speaker verification. In ICASSP2000 - IEEE International Conference on Acoustics, Speech, and Signal Processing [2883]. IDIAP-RR 00-02. [ .ps.gz | .pdf ]
[10] Aniruddha Adiga, Mathew Magimai.-Doss, and Chandra Sekhar Seelamantula. Gammatone wavelet cepstral coefficients for robust speech recognition. In Proceedings of IEEE TENCON, October 2013. [ .pdf ]
[11] Niederberger Adi. Modeling and optimal control of the open torque-controlled quadruped robot solo-12. Idiap-Com Idiap-Com-02-2022, Idiap, 7 2022. [ .pdf ]
[12] Mikhail Kanevski and Stéphane Canu. Spatial data mapping with support vector regression. Idiap-RR Idiap-RR-09-2000, IDIAP, 2000. [ .pdf ]
[13] Mikhail Kanevski, Alexei Pozdnoukhov, Stéphane Canu, and Michel Maignan. Advanced spatial data analysis and modelling with support vector machines. Idiap-RR Idiap-RR-31-2000, IDIAP, 2000. [ .pdf ]
[14] Mikhail Kanevski, Alexei Pozdnoukhov, Stéphane Canu, Michel Maignan, Patrick Wong, and S. Shibli. Support vector machines for classification and mapping of reservoir data. Idiap-RR Idiap-RR-04-2001, IDIAP, 2001. [ .pdf ]
[15] Jitendra Ajmera, Hervé Bourlard, and I. Lapidot. Improved unknown-multiple speaker clustering using hmm. Idiap-RR Idiap-RR-23-2002, IDIAP, Martigny, Switzerland, 2002. [ .ps.gz | .pdf ]
[16] Jitendra Ajmera and Charles Wooters. A robust speaker clustering algorithm. In IEEE Automatic Speech Recognition Understanding Workshop [2884]. IDIAP-RR 03-38. [ .ps | .pdf ]
[17] Jitendra Ajmera, Iain A. McCowan, and Hervé Bourlard. An online audio indexing system. [2885]. IDIAP RR 03-39. [ .ps.gz | .pdf ]
[18] Jitendra Ajmera, Guillaume Lathoud, and Iain A. McCowan. Clustering and segmenting speakers and their locations in meetings. In ICASSP [2886]. IDIAP-RR 03-55. [ .ps.gz | .pdf ]
[19] Jitendra Ajmera, Iain A. McCowan, and Hervé Bourlard. Robust speaker change detection. In IEEE Signal Processing Letters (to appear) [2887]. IDIAP-RR 02-39. [ .ps.gz | .pdf ]
[20] Jitendra Ajmera, Iain A. McCowan, and Hervé Bourlard. Robust Audio Segmentation. Idiap-rr, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 6 2004. thesis #3022. [ .ps.gz | .pdf ]
[21] Jitendra Ajmera, Iain A. McCowan, and Hervé Bourlard. Robust hmm-based speech/music segmentation. In ICASSP [2889]. IDIAP-RR 01-33. [ .ps | .pdf ]
[22] Jitendra Ajmera, Hervé Bourlard, I. Lapidot, and Iain A. McCowan. Unknown-multiple speaker clustering using hmm. In ICSLP [2890]. IDIAP-RR 02-07. [ .ps | .pdf ]
[23] Jitendra Ajmera, Iain A. McCowan, and Hervé Bourlard. Speech/music discrimination using entropy and dynamism features in a hmm classification framework. In Speech Communication [2891]. IDIAP-RR 01-26. [ .ps.gz | .pdf ]
[24] Zahid Akhtar, Abdenour Hadid, Mark Nixon, Massimo Tistarelli, Jean-Luc Dugelay, and Sébastien Marcel. Biometrics: In search of identity and security (q & a). IEEE MultiMedia, PP, 2017. [ DOI | .pdf ]
[25] Xavier Alameda-Pineda, Vasil Khalidov, Radu Horaud, and Florence Forbes. Finding audio-visual events in informal social gatherings. In IEEE/ACM 13th International Conference on Multimodal Interaction, 2011. Oustanding paper award. [ .pdf ]
[26] Marcel Alcoverro, Xavier Suau, Adolfo Lopez-Mendez, Josep R. Morros, Javier Ruiz-Hidalgo, Albert Gil, and Josep R. Casas. Gesture control interface for immersive panoramic displays. Multimedia Tools and Applications, 1380-7501:1--27, July 2013. [ DOI ]
[27] T. Alizadeh, Sylvain Calinon, and D. G. Caldwell. Learning from demonstrations with partially observable task parameters. In Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 3309 -- 3314. IEEE, June 2014. [ DOI | .pdf ]
[28] Karim Ali, David Hasler, and Francois Fleuret. Flowboost - appearance learning from sparsely annotated video. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2011.
[29] Karim Ali, Francois Fleuret, David Hasler, and Pascal Fua. Joint pose estimator and feature learning for object detection. In Proceedings of the IEEE International Conference on Computer Vision, 2009.
[30] Karim Ali, Francois Fleuret, David Hasler, and Pascal Fua. A real-time deformable detector. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012. [ .pdf ]
[31] David Alonso del Barrio and Daniel Gatica-Perez. How did europe’s press cover covid-19 vaccination news? a five-country analysis. In MAD '22: Proceedings of the 1st International Workshop on Multimedia AI against Disinformation, June 2022. [ DOI | http | .pdf ]
[32] Andrew Lovitt. Truncation confusion patterns in onset consonants. Idiap-RR Idiap-RR-05-2007, IDIAP, 2007. Submitted for publication. [ .ps.gz | .pdf ]
[33] Ethem Alpaydin. Combined 5x2cv f-test for comparing supervised classification learning algorithms. Idiap-RR Idiap-RR-04-1998, IDIAP, 1998. Submitted for publication. [ .ps.gz | .pdf ]
[34] Ethem Alpaydin and Eddy Mayoraz. Combining linear dichomotizers to construct nonlinear polychotomizers. Idiap-RR Idiap-RR-05-1998, IDIAP, 1998. [ .ps.gz | .pdf ]
[35] Miguel Á. Álvarez-Carmona, Esaú VILLATORO-TELLO, Manuel Montes-y Gómez, and Luis Villaseñor Pineda. Author profiling in social media with multimodal information. In Journal of Computación y Sistemas (CyS), 24(3), April 2020. [ http ]
[36] Miguel Á. Álvarez-Carmona, Esaú VILLATORO-TELLO, Luis Villaseñor Pineda, and Manuel Montes-y Gómez. Classifying the social media author profile through a multimodal representation. In Intelligent Technologies: Concepts, Applications, and Future Directions. Studies in Computational Intelligence, volume 1028 of 7092. Springer, May 2022. [ DOI | http ]
[37] Ajay Srinivasamurthy, Petr Motlicek, Ivan Himawan, Gyorgy Szaszak, Youssef Oualil, and Hartmut Helmke. Semi-supervised learning with semantic knowledge extraction for improved speech recognition in air traffic control. In Proceedings of Interspeech 2017 [2892], pages 2406--2410. [ DOI | .pdf ]
[38] Johan M. Andersen. Baseline system for hybrid speech recognition on french (experiments on bref). Idiap-Com Idiap-Com-07-1998, IDIAP, 1998. [ .ps.gz | .pdf ]
[39] Catia Andreassi, Raphaelle Luisier, Hamish Crerar, Marousa Darsinou, Sasja Blokzijl-Franke, Lenn Tchern, Nicholas M. Luscombe, Giovanni Cuda, Marco Gaspari, Adolfo Saiardi, and Antonella Riccio. Cytoplasmic cleavage of impa1 3' utr is necessary for maintaining axon integrity. Cell Reports, 2021.
[40] Andrei Constantinescu and Gérard Chollet. Swiss polyphone and polyvar: Building databases for speech recognition and speaker verification. In Proceedings of The 3rd Slovenian-German and 2nd SDRV Workshop, Speech and Image Understanding, 1996.
[41] Joern Anemueller, Joerg-Henrik Back, Barbara Caputo, Jie Luo, Frank Ohl, Francesco Orabona, Rufin Vogels, Daphna Weinshall, and Alon Zweig. Biologically motivated audio-visual cue integration for object. In Proceedings of the first Internatinal Conference on Cognitive Systems, 2008. [ .pdf ]
[42] Joern Anemueller, Joerg-Henrik Back, Barbara Caputo, michal havlena, Jie Luo, Hendrik Kayser, Bastian Leibe, Petr Motlicek, Tomas Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Alon Zweig, and Hynek Hermansky. The dirac awear audio-visual platform for detection of unexpected and incongruent events. In Proceedings of the International Conference on Multimodal Interfaces [2893]. [ .pdf ]
[43] André Anjos, Laurent El Shafey, Roy Wallace, Manuel Günther, Chris McCool, and Sébastien Marcel. Bob: a free signal processing and machine learning toolbox for researchers. In Proceedings of the ACM Multimedia Conference [2894]. Submitted to the ACM MM 2012 Open Source Software Competition. [ http | .pdf ]
[44] André Anjos, Manuel Günther, Tiago de Freitas Pereira, Pavel Korshunov, Amir Mohammadi, and Sébastien Marcel. Continuously reproducing toolchains in pattern recognition and machine learning experiments. In Thirty-fourth International Conference on Machine Learning, August 2017. https://openreview.net/group?id=ICML.cc/2017/RML. [ http | .pdf ]
[45] André Anjos, Laurent El Shafey, and Sébastien Marcel. Beat: An open-science web platform. In Thirty-fourth International Conference on Machine Learning, August 2017. https://openreview.net/group?id=ICML.cc/2017/RML. [ http | .pdf ]
[46] André Anjos and Sébastien Marcel. Scoretoolkit documentation. Idiap-Com Idiap-Com-02-2012, Idiap, 4 2012. [ .pdf ]
[47] André Anjos, Laurent El Shafey, and Sébastien Marcel. Beat: An open-source web-based open-science platform. Idiap-RR Idiap-RR-14-2017, Idiap, 4 2017. [ .pdf ]
[48] André Anjos, Murali Mohan Chakka, and Sébastien Marcel. Motion-based counter-measures to photo attacks in face recognition. Institution of Engineering and Technology Journal on Biometrics, July 2013. [ http | .pdf ]
[49] André Anjos and Sébastien Marcel. Counter-measures to photo attacks in face recognition: a public database and a baseline. In International Joint Conference on Biometrics 2011, October 2011. [ http | .pdf ]
[50] André Anjos, Ivana Chingovska, and Sébastien Marcel. Anti-spoofing: Face databases. In Stan Z.Li and Anil Jain, editors, Encyclopedia of Biometrics. Springer US, second edition edition, 2014. [ DOI | http ]
[51] André Anjos, Jukka Komulainen, Sébastien Marcel, Abdenour Hadid, and Matti Pietikainen. Face anti-spoofing: Visual approach. In Sébastien Marcel, Mark Nixon, and Stan Z.Li, editors, Handbook of Biometric Anti-Spoofing, chapter 4, pages 65--82. Springer-Verlag, 2014. [ DOI ]
[52] André Anjos, Pedro Tome, and Sébastien Marcel. An introduction to vein presentation attacks and detection. In Sébastien Marcel, Mark Nixon, Julian Fierrez, and Nicholas Evans, editors, Handbook of Biometric Anti-Spoofing, chapter 18. Springer International Publishing, 2nd edition, 2019. [ DOI | http ]
[53] Niccolò Antonello and Philip N. Garner. A t-distribution based operator for enhancing out of distribution robustness of neural network classifiers. IEEE Signal Processing Letters, 27:1070--1074, June 2020. [ DOI | .pdf ]
[54] Niccolò Antonello, Enzo De Sena, Marc Moonen, A. Patrick Naylor, and Toon van Waterschoot. Joint acoustic localization and dereverberation through plane wave decomposition and sparse regularization. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(12):1893--1905, December 2019. [ DOI | http | .pdf ]
[55] IDIAP. Activity report 1999. Idiap-Com Idiap-Com-01-2000, IDIAP, 2000. [ .ps | .pdf ]
[56] IDIAP. Activity report 2000. Idiap-Com Idiap-Com-01-2001, IDIAP, 2001. [ .ps.gz | .pdf ]
[57] IDIAP. Activity report 2001. Idiap-Com Idiap-Com-01-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[58] IDIAP. Activity report 2002. Idiap-Com Idiap-Com-01-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[59] IDIAP. Activity report 2003. Idiap-Com Idiap-Com-01-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[60] IDIAP. Activity report 2004. Idiap-Com Idiap-Com-01-2005, IDIAP, 2005. [ .pdf ]
[61] Guillermo Aradilla, John Dines, and Sunil Sivadas. Using rasta in task independent tandem feature extraction. In Proceedings of ICSLP, 2004 [2895]. IDIAP-RR 04-22. [ .ps.gz | .pdf ]
[62] Guillermo Aradilla, Jithendra Vepa, and Hervé Bourlard. Using pitch as prior knowledge in template-based speech recognition. In Proceedings of ICASSP, 2006 [2896]. IDIAP-RR 05-65. [ .ps.gz | .pdf ]
[63] Guillermo Aradilla, Jithendra Vepa, and Hervé Bourlard. Improving speech recognition using a data-driven approach. In Proceedings of Interspeech, 2005 [2897]. IDIAP-RR 05-66. [ .ps.gz | .pdf ]
[64] Guillermo Aradilla, Jithendra Vepa, and Hervé Bourlard. An acoustic model based on kullback-leibler divergence for posterior features. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) [2898]. IDIAP-RR 06-60. [ .ps.gz | .pdf ]
[65] Guillermo Aradilla, Jithendra Vepa, and Hervé Bourlard. Using posterior-based features in template matching for speech recognition. In International Conference on Spoken Language Processing [2899]. IDIAP-RR 06-23. [ .ps.gz | .pdf ]
[66] Guillermo Aradilla and Hervé Bourlard. Posterior-based features and distances in template matching for speech recognition. In 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI) [2900]. IDIAP-RR 07-41. [ .ps.gz | .pdf ]
[67] Guillermo Aradilla, Hervé Bourlard, and Mathew Magimai.-Doss. Using kl-based acoustic models in a large vocabulary recognition task. Idiap-RR Idiap-RR-14-2008, IDIAP, 2008. [ .ps.gz | .pdf ]
[68] Guillermo Aradilla, Hervé Bourlard, and Mathew Magimai.-Doss. Posterior features applied to speech recognition tasks with limited training data. Idiap-RR Idiap-RR-15-2008, IDIAP, 2008. [ .ps.gz | .pdf ]
[69] Guillermo Aradilla and Jitendra Ajmera. Detection and recognition of number sequences in spoken utterances. In 2nd Workshop on Speech in Mobile and Pervasive Environments (SiMPE) [2901]. IDIAP-RR 07-42. [ .ps.gz | .pdf ]
[70] Guillermo Aradilla, Hervé Bourlard, and Mathew Magimai.-Doss. Posterior features applied to speech recognition tasks with user-defined vocabulary. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009. [ .pdf ]
[71] Guillermo Aradilla. Acoustic Models for Posterior Features in Speech Recognition. Idiap-rr, Ecole Polytechnique Fédérale de Lausanne, Lausanne , Switzerland, 9 2008. Thèse Ecole polytechnique fédérale de Lausanne EPFL, no 4164 (2008,',','), Faculté des sciences et techniques de l'ingénieur STI, Section de génie électrique et électronique, Institut de génie électrique et électronique IEL (Laboratoire de l'IDIAP LIDIAP). Dir.: Hervé Bourlard. [ .pdf ]
[72] Oya Aran, Dairazalia Sanchez-Cortes, Trinh-Minh-Tri Do, and Daniel Gatica-Perez. Anomaly detection in elderly daily behavior in ambient sensing environments. In Proceedings of the 7th Int. Workshop on Human Behavior Understanding, ACM Multimedia, 2016, 2016. [ .pdf ]
[73] Oya Aran and Daniel Gatica-Perez. One of a kind: Inferring personality impressions in meetings. In 15th ACM International Conference on Multimodal Interaction, 2013. [ .pdf ]
[74] Oya Aran and Daniel Gatica-Perez. Cross-domain personality prediction: From video blogs to small group meetings. In 15th ACM International Conference on Multimodal Interaction, 2013. [ .pdf ]
[75] Oya Aran and Daniel Gatica-Perez. Fusing audio-visual nonverbal cues to detect dominant people in conversations. In 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010 [2903]. [ .pdf ]
[76] Oya Aran, Joan-Isaac Biel, and Daniel Gatica-Perez. Broadcasting oneself: Visual discovery of vlogging styles. IEEE Transactions on Multimedia, 16(1):201--215, 2014. [ DOI | .pdf ]
[77] Oya Aran, Ismail Ari, Alp Kindiroglu, Pinar Santemiz, and Lale Akarun. Otomatik İşaret dili tanima ve türk İşaret dili için bilgisayar uygulamalari. In Ellerle Konusmak: Türk İşaret Dili Araştirmalari / Research on Turkish Sign Language, pages 471--498. Koc University Press, 2016. in Turkish.
[78] Oya Aran, Hayley Hung, and Daniel Gatica-Perez. A multimodal corpus for studying dominance in small group conversations. In LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010. [ .pdf ]
[79] Oya Aran and Lale Akarun. A multi-class classification strategy for fisher scores: Application to signer independent sign language recognition. Pattern Recognition, 43(5), 5 2010. [ DOI | .pdf ]
[80] Oya Aran and Daniel Gatica-Perez. Analysis of group conversations: Modeling social verticality. In Albert Ali Salah and Theo Gevers, editors, Computer Analysis of Human Behavior, pages 293--322. Springer London, 2011.
[81] Kari Torkkola and Teuvo Kohonen. A hybrid approach to continuous speech recognition. In Michael A. Arbib, editor, The handbook of brain theory and neural networks. The MIT Press, 1995.
[82] Diego Armentano, Jean-Marc Azaïs, David Ginsbourger, and Jose R. León. Conditions for the finiteness of the moments of the volume of level sets. Electronic Communications in Probability, 24(17), 2019. [ DOI | http ]
[83] Afsaneh Asaei, Nasser Mohammadiha, Mohammad J. Taghizadeh, Simon Doclo, and Hervé Bourlard. On application of non-negative matrix factorization for ad hoc microphone array calibration from incomplete noisy distances. In IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2694--2698. IEEE, April 2015. [ DOI | .pdf ]
[84] Afsaneh Asaei, Benjamin Picart, and Hervé Bourlard. Analysis of phone posterior feature space exploiting class specific sparsity and mlp-based similarity measure. In 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010. [ .pdf ]
[85] Afsaneh Asaei, Hervé Bourlard, and Volkan Cevher. Model-based compressive sensing for multi-party distant speech recognition. In 2011 IEEE International Conference on Acoustics, Speech and Signal Processing [2904]. [ .pdf ]
[86] Afsaneh Asaei, Michael E. Davies, Hervé Bourlard, and Volkan Cevher. Computational methods for structured sparse component analysis of convolutive speech mixtures. In Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, March 2012. [ .pdf ]
[87] Afsaneh Asaei, Hervé Bourlard, Mohammad J. Taghizadeh, and Volkan Cevher. Model-based sparse component analysis for reverberant speech localization. In 2014 IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1439 -- 1443. IEEE, May 2014. [ DOI | .pdf ]
[88] Afsaneh Asaei, Hervé Bourlard, and Benjamin Picart. Investigation of knn classifier on posterior features towards application in automatic speech recognition. Idiap-RR Idiap-RR-11-2010, Idiap, 6 2010. [ .pdf ]
[89] Afsaneh Asaei, Mohammad J. Taghizadeh, Hervé Bourlard, and Volkan Cevher. Multi-party speech recovery exploiting structured sparsity models. Idiap-RR Idiap-RR-22-2011, Idiap, 7 2011. [ .pdf ]
[90] Afsaneh Asaei, Milos Cernak, and Hervé Bourlard. Information theoretic analysis of production-perception efficiency: Case study of speech pathology. Idiap-RR Idiap-RR-30-2016, Idiap, 12 2016. [ .pdf ]
[91] Afsaneh Asaei, Mohammad J. Taghizadeh, Saeid Haghighatshoar, Bhiksha Raj, Hervé Bourlard, and Volkan Cevher. Binary sparse coding of convolutive mixtures for sound localization and separation via spatialization. IEEE Transactions on Signal Processing, 64(3):567--579, 2016. [ DOI | .pdf ]
[92] Afsaneh Asaei, Hervé Bourlard, and Philip N. Garner. Sparse component analysis for speech recognition in multi-speaker environment. In Proceedings of Interspeech, 9 2010. [ .pdf ]
[93] Afsaneh Asaei, Mohammad J. Taghizadeh, Hervé Bourlard, and Volkan Cevher. Multi-party speech recovery exploiting structured sparsity models. In Proceedings of Interspeech, 2011. [ .pdf ]
[94] Afsaneh Asaei, Milos Cernak, and Hervé Bourlard. On compressibility of neural network phonological features for low bit rate speech coding. In Proceeding of Interspeech, pages 418--422. ISCA, 2015. [ .pdf ]
[95] Afsaneh Asaei, Gil Luyet, Milos Cernak, and Hervé Bourlard. Efficient posterior exemplar search space hashing exploiting class-specific sparsity structures. In Interspeech [2905]. [ .pdf ]
[96] Afsaneh Asaei, Hervé Bourlard, and Volkan Cevher. A method, apparatus and computer program for determining the location of a plurality of speech source. 2012US-13/654055, 2012. [ http ]
[97] Afsaneh Asaei, Dhananjay Ram, and Hervé Bourlard. Phonological posterior hashing for query by example spoken term detection. In Proceedings of Interspeech [2906]. [ .pdf ]
[98] Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard, and Volkan Cevher. Structured sparse coding for microphone array location calibration. In SAPA-SCALE Conference, The 5th ISCA workshop on statistical and perceptual audition, September 2012. [ .pdf ]
[99] Afsaneh Asaei, Mohammad J. Taghizadeh, Marjan Bahrololum, and Mohammed Ghanbari. Verified speaker localization utilizing voicing level in split-bands. Signal Processing, 89(6):1038--1049, June 2009. [ .pdf ]
[100] Afsaneh Asaei, Milos Cernak, and Marina Laganaro. Paos markers: Trajectory analysis of selective phonological posteriors for assessment of progressive apraxia of speech. In Proceeding on the 7th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), 2016. [ .pdf ]
[101] Afsaneh Asaei, Mohammad Golbabaee, Hervé Bourlard, and Volkan Cevher. Structured sparse acoustic modeling for speech separation. In Signal Processing with Adaptive Sparse Structured Representations SPARS. SPARS, 2013. Abstracts for Communication: http://spars2013.epfl.ch/index.php/Program. [ .pdf ]
[102] Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard, and Volkan Cevher. A multipath sparse beamfroming method. In Signal Processing with Adaptive Sparse Structured Representations SPARS, 2013. Abstracts for Communiation: http://spars2013.epfl.ch/index.php/Program. [ .pdf ]
[103] Afsaneh Asaei, Milos Cernak, Hervé Bourlard, and Dhananjay Ram. Sparse pronunciation codes for perceptual phonetic information assessment. In Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS), 2017. Proceeding of Abstracts for Communication. [ .pdf ]
[104] Afsaneh Asaei, Hervé Bourlard, Mohammad J. Taghizadeh, and Volkan Cevher. Computational methods for underdetermined convolutive speech localization and separation via model-based sparse component analysis. Speech Communication, 76:201--217, 2016. [ .pdf ]
[105] Milos Cernak, Afsaneh Asaei, and Hervé Bourlard. On structured sparsity of phonological posteriors for linguistic parsing. In Speech Communication [2907], pages 36--45. [ DOI | http | .pdf ]
[106] Afsaneh Asaei, Milos Cernak, and Hervé Bourlard. Perceptual information loss due to impaired speech production. In IEEE/ACM Transactions on Audio, Speech, and Language Processing [2908]. [ .pdf ]
[107] Afsaneh Asaei, Mohammad Golbabaee, Hervé Bourlard, and Volkan Cevher. Structured sparsity models for reverberant speech separation. IEEE/ACM Transaction on Audio, Speech and Language Processing, 2014. [ .pdf ]
[108] Afsaneh Asaei. Model-based Sparse Component Analysis for Multiparty Distant Speech Recognition. PhD thesis, École Polytechnique Fédérale de Lausanne, 2013. [ .pdf ]
[109] Astrid Hagen, Andrew Morris, and Hervé Bourlard. From multi-band full combination to multi-stream full combination processing in robust asr. In ISCA ITRW ASR2000 [2909]. IDIAP-RR 00-20. [ .ps.gz | .pdf ]
[110] Astrid Hagen and Andrew Morris. Comparison of hmm experts with mlp experts in the full combination multi-band approach to robust asr. In ICSLP [2910]. IDIAP-RR 00-21. [ .ps.gz | .pdf ]
[111] Astrid Hagen and Hervé Bourlard. Using multiple time scales in the framework of multi-stream speech recognition. In ICSLP [2911]. IDIAP-RR 00-22. [ .ps.gz | .pdf ]
[112] Astrid Hagen, Hervé Bourlard, and Andrew Morris. Adaptive ml-weighting in multi-band recombination of gaussian mixture asr. In ICASSP [2912]. Published in: ICASSP, Salt Lake City, Utah, USA, May 2001. [ .ps.gz | .pdf ]
[113] Astrid Hagen and Hervé Bourlard. Error correcting posterior combination for robust multi-band speech recognition. In EUROSPEECH [2913]. [ .ps.gz | .pdf ]
[114] Cosmin Atanasoaei, Chris McCool, and Sébastien Marcel. Face detection using boosted jaccard distance-based regression. Idiap-RR Idiap-RR-02-2012, Idiap, 1 2012. Submitted to CVPR 2011. [ .pdf ]
[115] Cosmin Atanasoaei, Chris McCool, and Sébastien Marcel. On improving face detection performance by modelling contextual information. Idiap-RR Idiap-RR-43-2010, Idiap, 12 2010. [ .pdf ]
[116] Cosmin Atanasoaei. Multivariate Boosting with Look-up Tables for Face Processing. PhD thesis, EPFL, 2012. [ .pdf ]
[117] Manfredo Atzori, Arjan Gijsberts, Simone Heynen, Anne-Gabrielle Mittaz Hager, Olivier Deriaz, Patrick van der Smagt, Claudio Castellini, Barbara Caputo, and Henning Müller. Building the ninapro database: a resource for the biorobotics community. In Proceedings of the Fourth IEEE RAS/EMBS International Conference on Biomedical Robotics and Biomechatronics, 2012. [ .pdf ]
[118] Manfredo Atzori, Arjan Gijsberts, Simone Heynen, Anne-Gabrielle Mittaz Hager, Claudio Castellini, Barbara Caputo, and Henning Müller. Experiences in the creation of an electromyography database to help hand amputated persons. In Proceedings of the 24th European Medical Informatics Conference, 2012. [ .pdf ]
[119] Manfredo Atzori, Arjan Gijsberts, Ilja Kuzborskij, Simone Heynen, Anne-Gabrielle Mittaz Hager, Olivier Deriaz, Claudio Castellini, Henning Müller, and Barbara Caputo. Characterization of a benchmark database for myoelectric movement classification. Transactions on Neural Systems and Rehabilitation Engineering, 23:73--83, June 2014. [ DOI ]
[120] Alice Aubert, Romain Tavenard, Simon Malinowski, Thomas Guyet, René Quiniou, Jean-Marc Odobez, Remi Emonet, and Chantal Gascuel. Discovering temporal patterns in water quality time series, focusing on floods with the lda method. In European Geosciences Union, 2013. [ .pdf ]
[121] Alice Aubert, Romain Tavenard, Remi Emonet, A. de Lavenne, Simon Malinowski, Thomas Guyet, René Quiniou, Jean-Marc Odobez, Philippe Merot, and Chantal Gascuel. Clustering flood events from water quality time-series using latent dirichlet allocation model. Water Resources Research, 2013. Online published version before inclusion in an issue. [ DOI ]
[122] Umut Avci and Oya Aran. Predicting the performance in decision-making tasks: From individual cues to group interaction. IEEE Transactions on Multimedia, 18(4):643--658, 2016. [ DOI | http | .pdf ]
[123] Umut Avci and Oya Aran. Effect of nonverbal behavioral patterns on the performance of small groups. In ICMI Workshop on Understanding and Modeling Multiparty Multimodal Interactions, 2014. [ .pdf ]
[124] C. Neti, G. Potamianos, Juergen Luettin, I. Matthews, Hervé Glotin, D. Vergyri, J. Sison, and A. Mashari. Audio visual speech recognition. [2914].
[125] Minja Axelsson, Raquel Oliveira, Mattia Racca, and V. Kyrki. Social robot co-design canvases: A participatory design framework. ACM Transactions on Human-Robot Interaction, 11(1), 2022. [ DOI | http | .pdf ]
[126] Dario Azzimonti and David Ginsbourger. Estimating orthant probabilities of high dimensional gaussian vectors with an application to set estimation. Journal of Computational and Graphical Statistics, 27(2):255--267, 2018. [ DOI | http ]
[127] Dario Azzimonti, Julien Bect, Clément Chevalier, and David Ginsbourger. Quantifying uncertainties on excursion sets under a gaussian random field prior. SIAM/ASA J. Uncertainty Quantification, 4(1):850--874, 2016. [ DOI | .pdf ]
[128] Dario Azzimonti, David Ginsbourger, Jérémy Rohmer, and Déborah Idier. Profile extrema for visualizing and quantifying uncertainties on excursion regions. application to coastal flooding. Technometrics, 61(4):474--493, 2019. [ DOI | http ]
[129] Dario Azzimonti, David Ginsbourger, Clément Chevalier, Julien Bect, and Yann Richet. Adaptive design of experiments for conservative estimation of excursion sets. Technometrics, 2019. [ DOI | http ]
[130] Silèye O. Ba and Jean-Marc Odobez. A probabilistic framework for joint head tracking and pose estimation. In 17th Int. Conf. Pattern Recognition (ICPR) [2915]. Similar to RR-03-78. [ .ps.gz | .pdf ]
[131] Silèye O. Ba. Joint Head Tracking and Pose Estimation for Visual Focus of Attention Recognition. PhD thesis, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 3 2007. Thèse sciences Ecole polytechnique fédérale de Lausanne EPFL, no 3764 (2007,',','), Faculté des sciences et techniques de l'ingénieur STI, Section de génie électrique et électronique, Institut de génie électrique et électronique IEL (Laboratoire de l'IDIAP LIDIAP). Dir.: Hervé Bourlard, Jean-Marc Odobez. [ .ps.gz | .pdf ]
[132] Silèye O. Ba and Jean-Marc Odobez. Probabilistic head pose tracking evaluation in single and multiple camera setups. In Classification of Events, Activities and Relationship Evaluation and Workshop [2916]. IDIAP-RR 07-21. [ .ps.gz | .pdf ]
[133] Silèye O. Ba and Jean-Marc Odobez. Multi-party focus of attention recognition in meetings from head pose and multimodal contextual cues. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) [2917]. IDIAP-RR 07-50. [ .ps.gz | .pdf ]
[134] Silèye O. Ba and Jean-Marc Odobez. Visual focus of attention estimation from head pose posterior probability distributions. In International Conference on Multi-media & Expo [2918]. IDIAP-RR 07-75. [ .ps.gz | .pdf ]
[135] Timur Bagautdinov, Francois Fleuret, and Pascal Fua. Probability occupancy maps for occluded depth images. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 2829--2837, June 2015.
[136] Timur Bagautdinov, Alexandre Alahi, Francois Fleuret, Pascal Fua, and Sylvio Savarese. Social scene understanding: End-to-end multi-person action localization and collective activity recognition. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017.
[137] Sara Bahaadini, Afsaneh Asaei, David Imseng, and Hervé Bourlard. Posterior-based sparse representation for automatic speech recognition. In Proceeding of Interspeech [2919]. [ .pdf ]
[138] E. Bailly-Baillière, Samy Bengio, Frédéric Bimbot, M. Hamouz, J. Kittler, Johnny Mariéthoz, J. Matas, K. Messer, Vlad Popovici, F. Porée, B. Ruiz, and Jean-Philippe Thiran. The BANCA database and evaluation protocol. In 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA [2920]. [ .ps.gz | .pdf ]
[139] Mehdi Banitalebi Dehkordi, Hamid Reza Abutalebi, and Hossein Ghanei. A compressive sensing based compressed neural network for sound source localization. In Proceedings of International Symposium on Artificial Intelligence and Signal Processing, June 2011. [ .pdf ]
[140] Pierre Baqué, Timur Bagautdinov, Francois Fleuret, and Pascal Fua. Principled parallel mean-field inference for discrete random fields. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016.
[141] Pierre Baqué, Francois Fleuret, and Pascal Fua. Multi-modal mean-fields via cardinality-based clamping. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2017.
[142] Pierre Baqué, Francois Fleuret, and Pascal Fua. Deep occlusion reasoning for multi-camera multi-target detection. In Proceedings of the IEEE International Conference on Computer Vision, 2017.
[143] Pierre Baqué, Edoardo Remelli, Francois Fleuret, and Pascal Fua. Geodesic convolutional shape optimization. In Proceedings of the International Conference on Machine Learning, 2018.
[144] Felix Agakov and David Barber. Variational information maximization in gaussian channels. Idiap-RR Idiap-RR-88-2004, IDIAP, Rue de Simplon 4, Martigny, CH-1920, Switerland, 4 2004. IDIAP-RR 04-88. [ .ps.gz | .pdf ]
[145] Felix Agakov and David Barber. An auxiliary variational method. Idiap-RR Idiap-RR-86-2004, IDIAP, Rue de Simplon 4, Martigny, CH-1920, Switerland, 6 2004. IDIAP-RR 04-86. [ .ps.gz | .pdf ]
[146] David Barber. Are two classifiers performing equally? a treatment using bayesian hypothesis testing. Idiap-RR Idiap-RR-57-2004, IDIAP, Rue de Simplon 4, Martigny, CH-1920, Switerland, 5 2004. IDIAP-RR 04-57. [ .ps.gz | .pdf ]
[147] David Barber. The auxiliary variable trick for deriving kalman smoothers. Idiap-RR Idiap-RR-87-2004, IDIAP, Rue de Simplon 4, Martigny, CH-1920, Switerland, 12 2004. IDIAP-RR 04-87. [ .ps.gz | .pdf ]
[148] David Barber and Bertrand Mesot. Construction and comparison of approximations for switching linear gaussian state space models. Idiap-RR Idiap-RR-06-2005, IDIAP, Rue de Simplon 4, Martigny, CH-1920, Switerland, 2 2005. IDIAP-RR 05-06. [ .ps.gz | .pdf ]
[149] David Barber. Variational information maximization for population coding. Idiap-RR Idiap-RR-85-2004, IDIAP, Rue de Simplon 4, Martigny, CH-1920, Switerland, 6 2004. IDIAP-RR 04-85. [ .ps.gz | .pdf ]
[150] David Barber. A stable switching kalman smoother. Idiap-RR Idiap-RR-89-2004, IDIAP, Rue de Simplon 4, Martigny, CH-1920, Switerland, 12 2004. IDIAP-RR 04-89. [ .ps.gz | .pdf ]
[151] David Barber. Construction and comparison of approximations for switching linear gaussian state space models. Idiap-RR Idiap-RR-71-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[152] David Barber and Peter Sollich. Stable directed belief propagation in gaussian dags using the auxiliary variable trick. Idiap-RR Idiap-RR-72-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[153] Felix Agakov and David Barber. Kernelized infomax clustering. Idiap-RR Idiap-RR-73-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[154] David Barber. Efficient kalman smoothing for harmonic state-space models. Idiap-RR Idiap-RR-87-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[155] Mark Barnard, Jean-Marc Odobez, and Samy Bengio. Multi-modal audio-visual event recognition for football analysis. In Proc. IEEE Workshop on Neural Networks for Signal Processing (NNSP) [2921]. IDIAP-RR 03-12. [ .ps.gz | .pdf ]
[156] Mark Barnard and Jean-Marc Odobez. Robust playfield segmentation using map adaptation. In Proc. 17th International Conference on Pattern Recognition (ICPR 2004), Cambridge, United Kingdom, 8 2004. IDIAP-RR 03-77. [ .ps.gz | .pdf ]
[157] Mark Barnard and Jean-Marc Odobez. Sports event recognition using layered hmms. Idiap-RR Idiap-RR-07-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[158] Mark Barnard. Multimedia event modelling and recognition. PhD thesis, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 2005. thesis #3370.
[159] Kevin Bascol, Remi Emonet, Elisa Fromont, and Jean-Marc Odobez. Unsupervised interpretable pattern discovery in time series using autoencoders. In IAPR Int. Workshops on Structural and Syntactic Pattern Recognition (SSPR), November 2016. [ .pdf ]
[160] Chantal Basurto, Oliver Paul, and Jérôme Kämpf. Machine learning techniques for the daylight and electric lighting performance predictions. In Proceedings of Building Simulation 2021, September 2021.
[161] Chantal Basurto and Jérôme Kämpf. An integrated and strategic evaluation of automatic blind controls to achieve energy and occupant's comfort objectives. In Proceedings of the 5th IBPSA-England Conference on Building Simulation and Optimization (Virtual), September 2020. [ .pdf ]
[162] Chantal Basurto, Roberto Boghetti, Moreno Colombo, Michael Pappinutto, Julien Nembrini, and Jérôme Kämpf. Implementation of machine learning techniques for the quasi real-time blind and electric lighting optimization in a controlled experimental facility. In Journal of Physics: Conference Series, volume 2021 of 2042. IOP Publishing, September 2021. [ DOI | http | .pdf ]
[163] Silèye O. Ba, Hayley Hung, and Jean-Marc Odobez. Visual activity context for focus of attention estimation in dynamic meetings. In International Conference on Multimedia & Expo [2922]. idiap-rr. [ .pdf ]
[164] Dinesh Babu Jayagopi, Silèye O. Ba, Jean-Marc Odobez, and Daniel Gatica-Perez. Predicting two facets of social verticality in meetings from five-minute time slices and nonverbal cues. In Proceedings - ICMI 2008, 2008. [ .pdf ]
[165] Silèye O. Ba and Jean-Marc Odobez. Recognizing human visual focus of attention from head pose in meetings. IEEE Transactions on Systems, Man, Cybernetics, Part-B, Vol. 39(No. 1), 2 2009. [ .pdf ]
[166] Silèye O. Ba and Jean-Marc Odobez. Multi-person visual focus of attention from head pose and meeting contextual cues. In IEEE Trans. on Pattern Analysis and Machine Intelligence [2923], pages 101--116. IDIAP-RR 08-47. [ .pdf ]
[167] F. Beaufays, Hervé Bourlard, H. Franco, and Nelson Morgan. Neural networks in automatic speech recognition. In Arbib [2924]. Published in The Handbook of Brain Theory and Neural Networks, second edition, M.A. Arbib (Ed.,',','), Bradford Books, The MIT Press, 2000.
[168] Julien Bect, François Bachoc, and David Ginsbourger. A supermartingale approach to gaussian process based sequential design of experiments. Bernoulli, 25(4A):2883--2919, 2019.
[169] Melika Behjati and James Henderson. Inducing meaningful units from character sequences with slot attention. arXiv, February 2021. [ http | .pdf ]
[170] Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Elie Khoury, Tommi Kurki, Tomi Kinnunen, and Chin-Hui Lee. Dialect levelling in finnish: A universal speech attribute approach. In The 15th Annual Conference of the International Speech Communication Association, 2014.
[171] I Bellido and Emile Fiesler. Do backpropagation trained neural networks have normal weight distributions? In International Conference on Artificial neural Networks, 1993. [ .pdf ]
[172] Yoshua Bengio and Jean-Sébastien Senécal. Adaptive importance sampling to accelerate training of a neural probabilistic language model. Idiap-RR Idiap-RR-35-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[173] Samy Bengio and Yoshua Bengio. Taking on the curse of dimensionality in joint distributions using neural networks. In IEEE Transaction on Neural Networks special issue on data mining and knowledge discovery [2925]. IDIAP-RR 00-01. [ .ps.gz | .pdf ]
[174] Samy Bengio, Hervé Bourlard, and Katrin Weber. An em algorithm for hmms with emission distributions represented by hmms. Idiap-RR Idiap-RR-11-2000, IDIAP, 2000. [ .ps.gz | .pdf ]
[175] Samy Bengio and Johnny Mariéthoz. Learning the decision function for speaker verification. In IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP [2926]. IDIAP-RR 00-40. [ .ps.gz | .pdf ]
[176] Samy Bengio, Johnny Mariéthoz, and Sébastien Marcel. Evaluation of biometric technology on XM2VTS. Idiap-RR Idiap-RR-21-2001, IDIAP, 2001. also available as the deliverable D71 of the European Project BANCA. [ .ps.gz | .pdf ]
[177] Samy Bengio and Johnny Mariéthoz. Comparison of client model adaptation schemes. Idiap-RR Idiap-RR-25-2001, IDIAP, 2001. also available as the deliverable D22 of the European Project BANCA. [ .ps.gz | .pdf ]
[178] Ronan Collobert, Samy Bengio, and Johnny Mariéthoz. Torch: a modular machine learning software library. Idiap-RR Idiap-RR-46-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[179] Samy Bengio, Christine Marcel, Sébastien Marcel, and Johnny Mariéthoz. Confidence measures for multimodal identity verification. In Information Fusion [2927]. [ .ps.gz | .pdf ]
[180] Samy Bengio. Multimodal authentication using asynchronous HMMs. In 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA [2928]. [ .ps.gz | .pdf ]
[181] Samy Bengio. An asynchronous hidden markov model for audio-visual speech recognition. In Becker et al. [2929]. [ .ps.gz | .pdf ]
[182] Samy Bengio. Multimodal speech processing using asynchronous hidden markov models. Information Fusion, 5(2), 2004. [ .ps.gz | .pdf ]
[183] Samy Bengio and Johnny Mariéthoz. The expected performance curve: a new assessment measure for person authentication. In Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop [2930]. [ .ps.gz | .pdf ]
[184] Samy Bengio and Johnny Mariéthoz. A statistical significance test for person authentication. In Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop [2931]. [ .ps.gz | .pdf ]
[185] Samy Bengio, Johnny Mariéthoz, and Mikaela Keller. The expected performance curve. In International Conference on Machine Learning, ICML, Workshop on ROC Analysis in Machine Learning [2932]. [ .ps.gz | .pdf ]
[186] Samy Bengio and Hervé Bourlard. Multi channel sequence processing. Idiap-RR Idiap-RR-04-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[187] Samy Bengio. Joint training of multi-stream HMMs. Idiap-RR Idiap-RR-22-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[188] Machine learning for multimodal interaction: First international workshop, MLMI'2004. volume 3361 of Lecture Notes in Computer Science. Springer-Verlag Heidelberg, 2005.
[189] Samy Bengio and Johnny Mariéthoz. Biometric person authentication is a multiple classifier problem. In 7th International Workshop on Multiple Classifier Systems, MCS [2933]. IDIAP-RR 07-03. [ .ps.gz | .pdf ]
[190] Yassir Benkhedda, Darshan Santani, and Daniel Gatica-Perez. Venues in social media: Examining ambiance perception through scene semantics. In Proceedings of the 25th ACM International Conference on Multimedia, ACM, 2017, 2017. [ .pdf ]
[191] Horesh Ben Shitrit, Jerome Berclaz, Francois Fleuret, and Pascal Fua. Tracking multiple objects under global appearance constraints. In Proceedings of the IEEE International Conference on Computer Vision, 2011.
[192] Horesh Ben Shitrit, Jerome Berclaz, Francois Fleuret, and Pascal Fua. Multi-commodity network flow for tracking multiple people. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013.
[193] Mohamed Faouzi BenZeghiba, Hervé Bourlard, and Johnny Mariéthoz. Speaker Verification Based on user-Customized Password. Idiap-RR Idiap-RR-13-2001, IDIAP, 2001. [ .ps.gz | .pdf ]
[194] Mohamed Faouzi BenZeghiba and Hervé Bourlard. User Customized HMM/ANN based Speaker Verification. Idiap-RR Idiap-RR-32-2001, IDIAP, 2001. [ .ps.gz | .pdf ]
[195] Mohamed Faouzi BenZeghiba and Hervé Bourlard. Confidence Measures in Multiple pronunciations Modeling For Speaker Verification. In Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04) [2934]. IDIAP-RR 03-53. [ .ps.gz | .pdf ]
[196] Mohamed Faouzi BenZeghiba and Hervé Bourlard. Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition. In International Conference on Spoken Language Processing (ICSLP 2004) [2935]. IDIAP-RR 04-23. [ .ps.gz | .pdf ]
[197] Mohamed Faouzi BenZeghiba. Joint Speech and Speaker Recognition. Idiap-rr, École Polytechnique Fédérale de Lausanne, Computer Science Department, Lausanne, Switzerland, 2005. thesis #3193. [ .ps.gz | .pdf ]
[198] Mohamed Faouzi BenZeghiba and Hervé Bourlard. User-customized password speaker verification using multiple reference and background models. In Speech Communication [2937]. IDIAP-RR 04-41. [ .ps.gz | .pdf ]
[199] Mohamed Faouzi BenZeghiba and Hervé Bourlard. User-Customized Password HMM Based Speaker Verification. In Proceedings of the COST275 Workshop on the Advent of Biometrics on the Internet [2938]. IDIAP-RR 02-35. [ .ps.gz | .pdf ]
[200] Mohamed Faouzi BenZeghiba and Hervé Bourlard. On the Combination of Speech and Speaker Recognition. In European Conference On Speech, Communication and Technology (EUROSPEECH'03) [2939]. IDIAP-RR 03-19. [ .ps.gz | .pdf ]
[201] Mohamed Faouzi BenZeghiba and Hervé Bourlard. Hybrid HMM/ANN and GMM Combination for user-Customized Password Speaker Verification. In Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03) [2940]. IDIAP-RR 02-45. [ .ps.gz | .pdf ]
[202] Mohamed Faouzi BenZeghiba and Hervé Bourlard. User-Customized Password Speaker Verification based on HMM/ANN and GMM Models. In International Conference on Spoken Language Processing (ICSLP 2002) [2941]. IDIAP-RR 02-10. [ .ps.gz | .pdf ]
[203] Jerome Berclaz, Francois Fleuret, and Pascal Fua. Multi-camera tracking and atypical motion detection with behavioral maps. In proceedings of the European Conference on Computer Vision, 2008.
[204] Jerome Berclaz, Francois Fleuret, and Pascal Fua. Multiple object tracking using flow linear programming. Idiap-RR Idiap-RR-10-2009, Idiap, 6 2009. [ .pdf ]
[205] Jerome Berclaz, Ali Shahrokni, Francois Fleuret, James Ferryman, and Pascal Fua. Evaluation of probabilistic occupancy map people detection for surveillance systems. In Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009.
[206] Jerome Berclaz, Engin Turetken, Francois Fleuret, and Pascal Fua. Multiple object tracking using k-shortest paths optimization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011.
[207] Jerome Berclaz, Francois Fleuret, and Pascal Fua. Principled detection-by-classification from multiple views. In proceedings of the International Conference on Computer Vision Theory and Applications, volume 2, 2008.
[208] D. Berio, Sylvain Calinon, and F. F. Leymarie. Generating calligraphic trajectories with model predictive control. In Proc. 43rd Conf. on Graphics Interface, pages 132--139, May 2017. [ DOI | .pdf ]
[209] D. Berio, Sylvain Calinon, and F. F. Leymarie. Learning dynamic graffiti strokes with a compliant robot. In Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 3981--3986, October 2016. [ http | .pdf ]
[210] D. Berio, Sylvain Calinon, and F. F. Leymarie. Dynamic graffiti stylisation with stochastic optimal control. In Intl Workshop on movement and computing (MOCO), pages 1--8. ACM, June 2017. [ DOI | http | .pdf ]
[211] D. Berio, F. F. Leymarie, and Sylvain Calinon. Interactive generation of calligraphic trajectories from gaussian mixtures. In N. Bouguila and W. Fan, editors, Mixture Models and Applications, pages 23--38. Springer, 2019. [ DOI ]
[212] Giulia Bernardis and Hervé Bourlard. Improving posterior based confidence measures in hybrid HMM/ANN speech recognition systems. In Proceedings of International Conference on Spoken Language Processing (ICSLP'98) Sydney, Australia [2942]. IDIAP-RR 98-11. [ .ps.gz | .pdf ]
[213] Giulia Bernardis and Hervé Bourlard. Confidence measures in hybrid HMM/ANN speech recognition. In Proceedings of Workshop on Text, Speech and Dialog (TSD'98) Brno, Czech Republic, 9 1998.
[214] Frédéric Berthommier and Hervé Glotin. A measure of speech and pitch reliability from voicing. In F. Klassner, editor, Proc. Int. Joint Conf. on Artificial Intelligence (IJCAI), Computational Auditory Scene Analysis (CASA) workshop, Stockholm, 7 1999. Scandinavian AI Society.
[215] Frédéric Berthommier and Hervé Glotin. A new snr-feature mapping for robust multistream speech recognition. In Berkeley University Of California, editor, Proc. Int. Congress on Phonetic Sciences (ICPhS), volume 1 of XIV, San Francisco, 8 1999.
[216] Laurent Besacier, Juergen Luettin, Gilbert Maître, and E. Meurville. Experimental evaluation of text-dependent speaker verification on laboratory and field test databases in the M2VTS project. In Proceedings of the European Conference on Speech Communication and Technology, 1999. [ .pdf ]
[217] Jean-Luc Beuchat. Optimisation de réseaux de neurones. Master's thesis, EPFL, Lausanne, Switzerland, 1995. Report of a student project performed at IDIAP, supervised by Prof. Nicoud (EPFL, Lausanne) and G. Thimm (IDIAP, Martigny).
[218] Jean-Luc Beuchat. Reconnaissance de caractères manuscrits à l'aide de réseaux neuromimétiques. Idiap-RR Idiap-RR-18-1997, IDIAP, 1997. [ .ps.gz | .pdf ]
[219] Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh, and Petr Motlicek. Hierarchical multi-task learning framework for isometric-speech language translation. In ACL, 2022. [ .pdf ]
[220] Sushil Bhattacharjee and Sébastien Marcel. What you can't see can help you -- extended-range imaging for 3d-mask presentation attack detection. In Proceedings of the 16th International Conference on Biometrics Special Interest Group. Gesellschaft fuer Informatik e.V. (GI), 2017. [ .pdf ]
[221] Sushil Bhattacharjee, Amir Mohammadi, and Sébastien Marcel. Spoofing deep face recognition with custom silicone masks. In Proceedings of BTAS2018, October 2018. [ .pdf ]
[222] Sushil Bhattacharjee, Amir Mohammadi, André Anjos, and Sébastien Marcel. Recent advances in face presentation attack detection. In Sébastien Marcel, Mark Nixon, Julian Fierrez, and Nicholas Evans, editors, Handbook of Biometric Anti-Spoofing, Advances in Computer Vision and Pattern Recognition, chapter 10. Springer, 2nd edition, April 2019. [ http | .pdf ]
[223] Chidansh A. Bhatt, Andrei Popescu-Belis, and Matthew Cooper. Audiovisual summarization of lectures and meetings using a segment similarity graph. In Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR). ACM, ACM Press, 2016.
[224] Chidansh A. Bhatt and Andrei Popescu-Belis. Topic-level extractive summarization of lectures and meetings using a snippet similarity graph. Idiap-RR Idiap-RR-09-2014, Idiap, 6 2014. [ .pdf ]
[225] Chidansh A. Bhatt, Nikolaos Pappas, Maryam Habibi, and Andrei Popescu-Belis. Idiap at mediaeval 2013: Search and hyperlinking task. In MediaEval 2013 Workshop, CEUR Workshop Proceedings. CEUR-WS.org, October 2013. [ .pdf ]
[226] Chidansh A. Bhatt, Andrei Popescu-Belis, Maryam Habibi, Sandy Ingram, Stefano Masneri, Fergus McInnes, Nikolaos Pappas, and Oliver Schreer. Multi-factor segmentation for topic visualization and recommendation: the must-vis system. In Proceedings of the 21st ACM International Conference on Multimedia, pages 365--368. ACM, October 2013. [ DOI | http | .pdf ]
[227] Joan-Isaac Biel and Daniel Gatica-Perez. The youtube lens: Crowdsourced personality impressions and audiovisual analysis of vlogs. IEEE Transactions on Multimedia, 2012. [ .pdf ]
[228] Joan-Isaac Biel and Daniel Gatica-Perez. Wearing a youtube hat: directors, comedians, gurus, and user aggregated behavior. In Proceedings of the 17th ACM International Conference on Multimedia. ACM, 10 2009. [ .pdf ]
[229] Joan-Isaac Biel and Daniel Gatica-Perez. Vlogcast yourself: Nonverbal behavior and attention in social media. In Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), November 2010. [ .pdf ]
[230] Joan-Isaac Biel, Lucia Teijeiro-Mosquera, and Daniel Gatica-Perez. Facetube: predicting personality from facial expressions of emotion in online conversational video. In Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2012. [ .pdf ]
[231] Joan-Isaac Biel, Daniel Gatica-Perez, John Dines, and Vagia Tsminiaki. Hi youtube! personality impressions and verbal content in social video. 15th ACM International Conference on Multimodal Interaction, Sydney, Australia, ACM, 2013, 2013. [ .pdf ]
[232] Joan-Isaac Biel, Oya Aran, and Daniel Gatica-Perez. You are known by how you vlog: Personality impressions and nonverbal behavior in youtube. In Proceedings of AAAI International Conference on Weblogs and Social Media, 2011. [ .pdf ]
[233] Joan-Isaac Biel and Daniel Gatica-Perez. Voices of vlogging. In Proceedings of AAAI International Conference on Weblogs and Social Media, Washington DC, 5 2010. [ .pdf ]
[234] Joan-Isaac Biel and Daniel Gatica-Perez. The good, the bad, and the angry: Analyzing crowdsourced impressions of vloggers. In Proceedings of AAAI International Conference on Weblogs and Social Media, 2012. [ .pdf ]
[235] Joan-Isaac Biel, Nathalie Martin, David Labbe, and Daniel Gatica-Perez. Bites'n'bits: Inferring eating behavior from contextual mobile data. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (PACM IMWUT), 1(4):125--157, December 2017. article 125. [ .pdf ]
[236] Joan-Isaac Biel and Daniel Gatica-Perez. Call me guru: user categories and large-scale behavior in youtube. In Social Media Computing. Springer, 2011. [ .pdf ]
[237] Joan-Isaac Biel. Mining Conversational Social Video. PhD thesis, EPFL, 2013. [ .pdf ]
[238] Joan-Isaac Biel and Daniel Gatica-Perez. Mining crowdsourced first impressions in online social video. IEEE Transactions on Multimedia, 16(7), 2014. [ .pdf ]
[239] Joan-Isaac Biel and Daniel Gatica-Perez. Vlogsense: Conversational behavior and social attention in youtube. Transactions on Multimedia Computing, Communications and Applications, 2011. [ .pdf ]
[240] A. G. Billard, Sylvain Calinon, and R. Dillmann. Learning from humans. In B. Siciliano and O. Khatib, editors, Handbook of Robotics, chapter 74, pages 1995--2014. Springer, Secaucus, NJ, USA, 2nd edition edition, 2016. [ DOI | http ]
[241] Mickaël Binois, David Ginsbourger, and Olivier Roustant. On the choice of the low-dimensional domain for global optimization via random embeddings. Journal of Global Optimization, 2019. [ DOI | http ]
[242] A. Birk, T. Fromm, C. A. Mueller, T. Luczynski, A. Gomez Chavez, D. Koehntopp, A. Kupcsik, Sylvain Calinon, A. K. Tanwani, G. Antonelli, P. Di Lillo, E. Simetti, G. Casalino, G. Indiveri, L. Ostuni, A. Turetta, A. Caffaz, P. Weiss, T. Gobert, B. Chemisky, J. Gancet, T. Siedel, S. Govindaraj, X. Martinez, and P. Letier. Dexterous underwater manipulation from distant onshore locations. IEEE Robotics and Automation Magazine, 2018. [ .pdf ]
[243] Alexandre Bittar and Philip N. Garner. A bayesian interpretation of the light gated recurrent unit. In Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, June 2021. [ DOI | .pdf ]
[244] David Barber and Silvia Chiappa. Unified inference for variational bayesian linear gaussian state-space models. In NIPS [2943]. IDIAP-RR 06-50. [ .ps.gz | .pdf ]
[245] Jan Blom, Daniel Gatica-Perez, and N. Kiukkonen. People-centric mobile sensing with a pragmatic twist: from behavioral data points to active user involvement. In International Conference on Human-Computer Interaction with Mobile Devices and Services, 2011. [ .pdf ]
[246] Alex Bogatu, Norman Paton, Mark Douthwaite, Stuart Davie, and Andre Freitas. Cost–effective variational active entity resolution. In 37th IEEE International Conference on Data Engineering (ICDE), 2021. [ .pdf ]
[247] Alex Bogatu, Norman Paton, Mark Douthwaite, and Andre Freitas. Voyager: Data discovery for onboarding in data science. In 37th IEEE International Conference on Data Engineering (ICDE), 2022.
[248] Raducanu Bogdan, Vitria J., and Daniel Gatica-Perez. You are fired! nonverbal role analysis in competitive meetings. In Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP,',','), Taiwan., 4 2009. [ .pdf ]
[249] Bogdan Vlasenko and Andreas Wendemuth. Annotators' agreement and spontaneous emotion classification performance. In Proceedings of Interspeech, pages 1546--1550, 2015. [ .pdf ]
[250] Raducanu Bogdan and Daniel Gatica-Perez. Inferring competitive role patterns in reality tv show through nonverbal analysis. Multimedia Tools and Applications, Special issue on Social Media, 2010. [ .pdf ]
[251] Roberto Boghetti, Fabio Fantozzi, Jérôme Kämpf, Guglielmina Mutani, Giacomo Salvadori, and Valeria Todeschi. Building energy models with morphological urban-scale parameters: a case study in turin. In Proceedings of 4th Building Simulation Applications Conference - BSA 2019, June 2019. [ .pdf ]
[252] Roberto Boghetti, Fabio Fantozzi, Jérôme Kämpf, and Giacomo Salvadori. Understanding the performance gap: a machine learning approach on residential buildings in turin, italy. In Journal of Physics: Conference Series, volume 1343. IOP Publishing Ltd, November 2019. [ DOI ]
[253] R. Boite, Hervé Bourlard, T. Dutoit, J. Hancq, and H. Leich. Traitement de la Parole. Presses Polytechniques Universitaires Romandes, 2000.
[254] Antoine Bordes, Jason Weston, Ronan Collobert, and Yoshua Bengio. Learning structured embeddings of knowledge bases. In Conference on Artificial Intelligence, 2011. [ .pdf ]
[255] Antoine Bordes, Léon Bottou, Ronan Collobert, Dan Roth, Jason Weston, and Luke Zettlemoyer. Introduction to the special issue on learning semantics. Machine Learning, June 2013. [ DOI ]
[256] Florian Salamin, François Corthay, Olivier Bornet, and Jean-Luc Cochard. Datapump full-duplex. Idiap-Com Idiap-Com-02-1996, EIV / IDIAP, 8 1996. [ .ps.gz | .pdf ]
[257] Florian Salamin, François Corthay, Olivier Bornet, and Jean-Luc Cochard. Annulation d'écho sur une ligne téléphonique. Idiap-Com Idiap-Com-06-1996, EIV / IDIAP, 12 1996. [ .ps.gz | .pdf ]
[258] Samuel Vannay. Réalisation d'un majordome vocal. Idiap-Com Idiap-Com-04-1997, EPFL / IDIAP, 1997. [ .ps.gz | .pdf ]
[259] Olivier Bornet, Gérard Chollet, Jean-Luc Cochard, Andrei Constantinescu, and Dominique Genoud. Secured vocal access to telephone servers. In Proceedings of IVTTA 1996 IEEE Third Workshop Interactive Voice Technology for Telecommunications Applications [2944].
[260] Elizabeth Boschee, Joel Barry, Jayadev Billa, Marjorie Freedman, Thamme Gowda, Constantine Lignos, Chester Palen-Michel, Michael Pust, Banriskhem Khonglah, Srikanth Madikeri, Jonathan May, and Scott Miller. Saral: A low-resource cross-lingual domain-focused information retrieval system for effective rapid document triage. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. 2019, pages 19--24, 2019.
[261] Z. Boulkenafet, J. Komulainen, Zahid Akhtar, A. Benlamoudi, SE. Bekhouche, F. Dornaika, A. Ouafi, Amir Mohammadi, Sushil Bhattacharjee, and Sébastien Marcel. A competition on generalized software-based face presentation attack detection in mobile scenarios. In Proceedings of the International Joint Conference on Biometrics, 2017, October 2017. [ .pdf ]
[262] Nicolas Bourdaud, Ricardo Chavarriaga, Ferran Galán, and José del R. Millán. Characterizing the eeg correlates of exploratory behavior. In IEEE Transactions on Neural Systems & Rehabilitation Engineering [2945]. IDIAP-RR 08-28. [ .ps.gz | .pdf ]
[263] Stéphane Dupont, Hervé Bourlard, and Christophe Ris. Robust speech recognition based on multi-stream features. In Proc. of the ESCA-NATO Workshop on Robust Speech Recognition for Unknown Communication Channels [2946]. IDIAP-RR 97-01. [ .ps.gz | .pdf ]
[264] Jean Hennebert, Christophe Ris, Hervé Bourlard, and Steve Renals. Estimation of global posteriors and forward-backward training of hybrid HMM/ANN systems. In EUROSPEECH'97, 9 1997. [ .ps.gz | .pdf ]
[265] Stéphane Dupont and Hervé Bourlard. Using multiple time scales in a multi-stream speech recognition system. In EUROSPEECH'97, 9 1997. [ .ps.gz | .pdf ]
[266] Hervé Bourlard. State-of-the-art and recent progress in hybrid HMM/ANN speech recognition. In W. Gerstner, A. Germond, M. Hasler, and J. D. Nicoud, editors, Proceedings of the International Conference on Artificial Neural Networks (ICANN'97), number 1327 in Lecture Notes in Computer Science. Springer-Verlag, 1997.
[267] Vincent Fontaine and Hervé Bourlard. Speaker-dependent speech recognition based on phone-like unit model -- application to voice dialing. In IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 4 1997. [ .ps.gz | .pdf ]
[268] Hervé Bourlard and Stéphane Dupont. Subband-based speech recognition. In IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 4 1997. [ .ps.gz | .pdf ]
[269] Stéphane Dupont, Hervé Bourlard, O. Deroo, Vincent Fontaine, and J. M. Boite. Hybrid HMM/ANN systems for training independent tasks: Experiments on 'phonebook' and related improvements. In IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 4 1997. [ .ps.gz | .pdf ]
[270] Hervé Bourlard and Nelson Morgan. Hybrid HMM/ANN systems for speech recognition: Overview and new research directions. In International School on Neural Nets: Adaptive Processing of Temporal Information. Springer Verlag, 1997. [ .ps.gz | .pdf ]
[271] Hervé Bourlard, Samy Bengio, and Katrin Weber. Towards robust and adaptive speech recognition models. Idiap-RR Idiap-RR-47-2002, IDIAP, Martigny, Switzerland, 2002. Published: Mathematical Foundations of Speech Processing and Recognition, IMA, Eds. R. Rosenfeld and M. Ostendorf. [ .ps.gz | .pdf ]
[272] Hervé Bourlard, Samy Bengio, Mathew Magimai.-Doss, Qifeng Zhu, Bertrand Mesot, and Nelson Morgan. Towards using hierarchical posteriors for flexible automatic speech recognition systems. Idiap-RR Idiap-RR-58-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[273] Hervé Bourlard, Stéphane Dupont, and Christophe Ris. Multi-stream speech recognition. Idiap-RR Idiap-RR-07-1996, IDIAP, 1996. [ .ps.gz | .pdf ]
[274] Vincent Fontaine and Hervé Bourlard. Speaker-dependent speech recognition based on phone-like units models --- application to voice dialing. Idiap-RR Idiap-RR-09-1996, IDIAP, 1996. [ .ps.gz | .pdf ]
[275] Hervé Bourlard. Non-stationary multi-channel (multi-stream) processing towards robust and adaptive asr. In Proc. of the ESCA Workshop on Robust Methods for Speech Recognition in Adverse Conditions, 1999.
[276] Hervé Bourlard, Samy Bengio, and Katrin Weber. New approaches towards robust and adaptive speech recognition. In Leen et al. [2947]. IDIAP-RR 01-01. [ .ps.gz | .pdf ]
[277] Hervé Bourlard and Samy Bengio. Hidden markov models and other finite state automata for sequence processing. In Arbib [2948]. [ .ps.gz | .pdf ]
[278] Hervé Bourlard, Samy Bengio, and Katrin Weber. Towards robust and adaptive speech recognition models. In Ostendorf et al. [2949]. [ .ps.gz | .pdf ]
[279] Hervé Bourlard, T. Adali, Samy Bengio, J. Larsen, and S. Douglas, editors. Proceedings of the Twelfth IEEE Workshop on Neural Networks for Signal Processing (NNSP). IEEE Press, 2002.
[280] Hervé Bourlard and Steve Renals. Recognition and understanding of meetings overview of the european ami and amida projects. In LangTech 2008 [2950]. IDIAP-RR 08-27. [ .pdf ]
[281] Hervé Bourlard. Connectionist speech recognition. In Proceedings of IK'98, Interdisziplinäres Kolleg, Spring Scholl, Günne am Möhnessee, Germany, March 7--14, 1998.
[282] Pere Pujol, Susagna Pol, Climent Nadeu, Astrid Hagen, and Hervé Bourlard. Comparison and Combination of Features in a Hybrid hmm/mlp and a hmm/gmm Speech Recognition System. In to be published in IEEE Transactions on Speech and Audio Processing [2951]. IDIAP-RR 03-48. [ .pdf ]
[283] Hervé Bourlard. Auto-association by multilayer perceptrons and singular value decomposition. Idiap-RR Idiap-RR-16-2000, IDIAP, 2000. [ .ps.gz | .pdf ]
[284] Hervé Bourlard and Selen Hande Kabil. Autoencoders reloaded. Springer Biological Cybernetics, June 2022. [ DOI | http ]
[285] Hervé Bourlard and Andrei Popescu-Belis. Interactive Multimodal Information Management. EPFL Press, Lausanne, 2013.
[286] Hervé Bourlard and Nelson Morgan. CONNECTIONIST SPEECH RECOGNITION - A Hybrid Approach. KLUWER ACADEMIC PUBLISHERS, 1994. [ .pdf ]
[287] Hervé Bourlard, John Dines, Mathew Magimai.-Doss, Philip N. Garner, David Imseng, Petr Motlicek, Hui Liang, Lakshmi Saheer, and Fabio Valente. Current trends in multilingual speech processing. Sadhana, 36(5):885–915, October 2011. [ DOI | .pdf | .pdf ]
[288] Hervé Bourlard, Marc Ferras, Nikolaos Pappas, Andrei Popescu-Belis, Steve Renals, Fergus McInnes, Peter Bell, Sandy Ingram, and Maël Guillemot. Processing and linking audio events in large multimedia archives: The eu inevent project. In Workshop on Speech, Language and Audio in Multimedia, July 2013. [ .pdf ]
[289] T. Dutoit, L. Couvreur, and Hervé Bourlard. How does a dictation machine recognize speech ? In Applied Signal Processing--A MATLAB approach [2952], idiap-rr 4. [ .pdf ]
[290] Hervé Bourlard and Nelson Morgan. Connectionist techniques. In R. Cole et al., editor, Survey of the State of the Art in Human Language Technology. Cambridge University Press, 1998.
[291] Hervé Bourlard and Nelson Morgan. Hybrid HMM/ANN systems for speech recognition: Overview and new research directions. In C. L. Giles and M. Gori, editors, Adaptive Processing of Sequences and Data Structures, Lecture Notes in Artificial Intelligence (1387). Springer Verlag, 1998.
[292] Rudolf Braun, Srikanth Madikeri, and Petr Motlicek. A comparison of methods for oov-word recognition on a new public dataset. In 2021 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE Signal Processing Society, June 2021. [ .pdf ]
[293] Herve Bredin, Ruiqing Yin, Juan Manuel Coria, Pavel Korshunov, Marvin Lavechin, Diego Fustes, Hadrien Titeux, Wassim Bouaziz, and Marie-Philippe Gill. pyannote.audio: neural building blocks for speaker diarization. In IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2020. [ .pdf ]
[294] Thomas M. Breuel. Higher-order statistics in visual object recognition. Idiap-RR Idiap-RR-02-1993, IDIAP, 1993. Published in Proc. IEEE Conf. on Computer Vision and Pattern Recognition. [ .ps.gz | .pdf ]
[295] Thomas M. Breuel. Recognition of handprinted digits. Idiap-RR Idiap-RR-06-1993, IDIAP, 1993. [ .ps.gz | .pdf ]
[296] Thomas M. Breuel. Geometric matching in computer vision--algorithms and open problems. Idiap-RR Idiap-RR-07-1993, IDIAP, 1993. [ .pdf ]
[297] Thomas M. Breuel. The 3d indexing problem. Idiap-RR Idiap-RR-08-1993, IDIAP, 1993. [ .ps.gz | .pdf ]
[298] Thomas M. Breuel. View-based recognition. Idiap-RR Idiap-RR-09-1993, IDIAP, 1993. [ .ps.gz | .pdf ]
[299] Thomas M. Breuel. An rbf network that learns some aspects of perceptual organization. Idiap-RR Idiap-RR-10-1993, IDIAP, 1993. [ .ps.gz | .pdf ]
[300] Thomas M. Breuel. Finding lines under bounded error. Idiap-RR Idiap-RR-11-1993, IDIAP, 1993. [ .ps.gz | .pdf ]
[301] Thomas M. Breuel. Handwriting recognition. In Second Asian Conference on Computer Vision (ACCV'95,',','), Singapore, 12 1995.
[302] Thomas M. Breuel. Higher-order statistics in visual object recognition. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, 1993. IDIAP-RR 93-02. [ .ps.gz | .pdf ]
[303] Thomas M. Breuel. Design and implementation of a system for the recognition of handwritten responses on us census forms. In IAPR Workshop on Document Analysis Systems, Kaiserslautern, 1994.
[304] Thomas M. Breuel. Recognition of handprinted digits using optimal bounded error matching. In International Conference on Document Analysis and Retrieval (ICDAR,',','), Tsukuba Science City, Japan, 1993.
[305] Thomas M. Breuel. A system for the off-line recognition of handwritten text. In International Conference on Pattern Recognition (ICPR,',','), Jerusalem [2953].
[306] Thomas M. Breuel. Handwriting recognition. In S. Z. Li, D. P. Mital, E. K. Teoh, and Haiyan Wang, editors, Recent Developments in Computer Vision. Springer, Berlin,, 1995.
[307] Thomas M. Breuel. Applying handwriting recognition to us census forms. In S. Z. Li, D. P. Mital, E. K. Teoh, and Haiyan Wang, editors, Recent Developments in Computer Vision. Springer, Berlin,, 1995. [ .pdf ]
[308] Victor Bros, Ketan Kotwal, and Sébastien Marcel. Vein enhancement with deep auto-encoders to improve finger vein recognition. In Biometrics Special Interest Group (BIOSIG 2021), 2021. [ .pdf ]
[309] Lara Brudermuller, Teguh Santoso Lembono, Suhan Shetty, and Sylvain Calinon. Trajectory prediction with compressed 3d environment representation using tensor train decomposition. In International Conference on Advanced Robotics, December 2021. [ .pdf ]
[310] Stéphane Brunet. Apprentissage de prototypes de caractères à partir de l'image d'un texte manuscrit et avec l'aide d'un opérateur. Idiap-RR Idiap-RR-01-1995, IDIAP, 1995. [ .ps.gz | .pdf ]
[311] D. Bruno, Sylvain Calinon, and D. G. Caldwell. Learning autonomous behaviours for the body of a flexible surgical robot. Autonomous Robots, 41(2):333--347, February 2017. [ DOI | http | .pdf ]
[312] D. Bruno, Sylvain Calinon, and D. G. Caldwell. Learning adaptive movements from demonstration and self-guided exploration. In Proc. IEEE Intl Conf. on Development and Learning and on Epigenetic Robotics (ICDL-EpiRob), pages 160--165, November 2014. [ .pdf ]
[313] D. Bruno, Sylvain Calinon, and D. G. Caldwell. Null space redundancy learning for a flexible surgical robot. In Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 2443 -- 2448. IEEE, June 2014. [ DOI | .pdf ]
[314] D. Bruno, Sylvain Calinon, M. S. Malekzadeh, and D. G. Caldwell. Learning the stiffness of a continuous soft manipulator from multiple demonstrations. In Intelligent Robotics and Applications, volume 9246 of Lecture Notes in Computer Science, pages 185--195. Springer, liu, h. and kubota, n. and zhu, x. and dillmann, r. and zhou, d. edition, 2015. Best Paper Award Finalist at ICIRA'2015. [ DOI | http | .pdf ]
[315] Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Fang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, Claudia Soria, and Traum. David. Towards a standard for dialogue act annotation. In 7th International Conference on Language Resources and Evaluation, 5 2010. [ .html | .pdf ]
[316] Hannah Burke, Anna Freeman, Paul O'Reagan, Oskar Wysocki, Andre Freitas, and et al. Biomarker identification using dynamic time warping analysis: a longitudinal cohort study of covid-19 patients in a uk tertiary hospital. BMJ Open, 2022.
[317] Anna Buttfield, Pierre W. Ferrez, and José del R. Millán. Online classifier adaptation in high frequency eeg. In Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, Graz, Austria, 9 2006. [ .pdf ]
[318] Anna Buttfield, Pierre W. Ferrez, and José del R. Millán. Towards a robust BCI: Error potentials and online learning. IEEE Trans. on Neural Systems and Rehabilitation Engineering, 14(2), 2006. [ .pdf ]
[319] Anna Buttfield and José del R. Millán. Online classifier adaptation in brain-computer interfaces. Idiap-RR Idiap-RR-16-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[320] Holger Caesar. Integrating language identification to improve multilingual speech recognition. Idiap-RR Idiap-RR-24-2012, Idiap, 7 2012. [ .pdf ]
[321] Sylvain Calinon. Stochastic learning and control in multiple coordinate systems. In Intl Workshop on Human-Friendly Robotics, pages 1--5, 2016. [ .pdf ]
[322] Sylvain Calinon, D. Bruno, and D. G. Caldwell. A task-parameterized probabilistic model with minimal intervention control. In Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 3339 -- 3344. IEEE, June 2014. [ DOI | .pdf ]
[323] Sylvain Calinon. Robot learning with task-parameterized generative models. In Bicchi and Burgard [2954], pages 111--126. [ DOI | http | .pdf ]
[324] Sylvain Calinon. A tutorial on task-parameterized movement learning and retrieval. Intelligent Service Robotics, 9(1):1--29, January 2016. [ DOI | http | .pdf ]
[325] Sylvain Calinon. Gaussians on riemannian manifolds for robot learning and adaptive control. IEEE Robotics and Automation Magazine (RAM), 2020. [ .pdf ]
[326] Sylvain Calinon. Mixture models for the analysis, edition, and synthesis of continuous time series. In N. Bouguila and W. Fan, editors, Mixture Models and Applications, pages 39--57. Springer, 2019. [ DOI | .pdf ]
[327] Sylvain Calinon. Learning from demonstration (programming by demonstration). In M. H. Ang, O. Khatib, and B. Siciliano, editors, Encyclopedia of Robotics. Springer, 2019. [ DOI | http | .pdf ]
[328] Sylvain Calinon and D. Lee. Learning control. In P. Vadakkepat and A. Goswami, editors, Humanoid Robotics: a Reference, pages 1261--1312. Springer, 2019. [ DOI | http | .pdf ]
[329] Sylvain Calinon. Skills learning in robots by interaction with users and environment. In In Proc. of the Intl Conf. on Ubiquitous Robots and Ambient Intelligence (URAI), pages 161--162, November 2014. [ http | .pdf ]
[330] Francesco Camastra and Alessandro Vinciarelli. Intrinsic dimension estimation of data: an approach based on Grassberger-Procaccia's algorithm. In Neural Processing Letters [2955]. to appear. [ .ps.gz | .pdf ]
[331] Francesco Camastra and Alessandro Vinciarelli. Cursive character recognition by Learning Vector Quantization. In Pattern Recognition Letters [2956]. IDIAP-RR 00-47. [ .ps.gz | .pdf ]
[332] Francesco Camastra and Alessandro Vinciarelli. Machine Learning for Audio, Image and Video Analysis. Springer Verlag, 2008.
[333] G. Canal, E. Pignat, G. Alenya, Sylvain Calinon, and C. Torras. Joining high-level symbolic planning with low-level motion primitives in adaptive hri: application to dressing assistance. In Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), 2018.
[334] Olivier Canévet, Leonidas Lefakis, and Francois Fleuret. Sample distillation for object detection and image classification. In Proceedings of the 6th Asian Conference on Machine Learning (ACML), volume 29 of JMLR: Workshop and Conference Proceedings, November 2014. [ .pdf ]
[335] Olivier Canévet and Francois Fleuret. Efficient sample mining for object detection. In Proceedings of the 6th Asian Conference on Machine Learning (ACML), volume 29 of JMLR: Workshop and Conference Proceedings, November 2014. [ .pdf ]
[336] Olivier Canévet and Francois Fleuret. Large scale hard sample mining with monte carlo tree search. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2016. [ .pdf ]
[337] Olivier Canévet, Cijo Jose, and Francois Fleuret. Importance sampling tree for large-scale empirical expectation. In Proceedings of the International Conference on Machine Learning (ICML), June 2016.
[338] Olivier Canévet, Weipeng He, Petr Motlicek, and Jean-Marc Odobez. The mummer data set for robot perception in multi-party hri scenarios. In Proceedings of the 29th IEEE International Conference on Robot & Human Interactive Communication, September 2020. [ .pdf ]
[339] Olivier Canévet. Object Detection with Active Sample Harvesting. PhD thesis, École Polytechnique Fédérale de Lausanne, February 2017. [ .pdf ]
[340] Gulcan Can, Jean-Marc Odobez, and Daniel Gatica-Perez. How to tell ancient signs apart? recognizing and visualizing maya glyphs with cnns. ACM Journal on Computing and Cultural Heritage (JOCCH), 11(4):20, May 2018. [ DOI | .pdf ]
[341] Gulcan Can, Jean-Marc Odobez, and Daniel Gatica-Perez. Shape representations for maya codical glyphs: Knowledge-driven or deep? In 15th International Workshop on Content-Based Multimedia Indexing, June 2017. [ .pdf ]
[342] Gulcan Can, Jean-Marc Odobez, and Daniel Gatica-Perez. Is that a jaguar? segmenting ancient maya glyphs via crowdsourcing. In Proc. ACM Int. Workshop on Crowdsourcing for Multimedia, pages 37--40. ACM New York, November 2014. [ DOI | .pdf ]
[343] Gulcan Can, Jean-Marc Odobez, Carlos Pallan Gayol, and Daniel Gatica-Perez. Ancient maya writings as high-dimensional data: a visualization approach. In Digital Humanities (DH), July 2016. [ .pdf ]
[344] Gulcan Can, Yassir Benkhedda, and Daniel Gatica-Perez. Ambiance in social media venues: Visual cue interpretation by machines and crowds. In IEEE CVPR Workshop on Visual Understanding of Subjective Attributes, June 2018. [ .pdf ]
[345] Gulcan Can, Jean-Marc Odobez, and Daniel Gatica-Perez. Maya codical glyph segmentation: A crowdsourcing approach. In IEEE Transactions on Multimedia [2957], pages 711--725. published online. [ DOI | http | .pdf ]
[346] Gulcan Can, Jean-Marc Odobez, and Daniel Gatica-Perez. Evaluating shape representations for maya glyph classification. ACM Journal on Computing and Cultural Heritage (JOCCH), 9(3), September 2016.
[347] Gulcan Can. Visual Analysis of Maya Glyphs via Crowdsourcing and Deep Learning. PhD thesis, École Polytechnique Fédérale de Lausanne, September 2017. [ DOI | .pdf ]
[348] Yuanzhouhan Cao, Olivier Canévet, and Jean-Marc Odobez. Leveraging convolutional pose machines for fast and accurate head pose estimation. In 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 1089--1094. IEEE, October 2018. [ .pdf ]
[349] Barbara Caputo, Eric Hayman, Mario Fritz, and Jan-Olof Eklhund. Classifying Materials in the Real World. Idiap-RR Idiap-RR-69-2007, IDIAP, Martigny, Switzerland, 2007. [ .ps.gz | .pdf ]
[350] Tatiana Tommasi, Elisabetta La Torre, and Barbara Caputo. Kernel methods for melanoma recognition. In Proceedings of Workshop on Computer Vision Approaches to Medical Image Analysis (CVAMIA) 2006), 2006. [ .ps.gz | .pdf ]
[351] Andrzej Pronobis and Barbara Caputo. The more you learn, the less you store: Memory-controlled incremental svm. In Proceedings of International Cognitive Vision Workshop (ICVW) 2006), 2006. [ .ps.gz | .pdf ]
[352] Barbara Caputo. Spin glass models of markov random fields. International Journal on Image, Systems and Technology, 16(5), 2006. [ .ps.gz | .pdf ]
[353] Barbara Caputo and Novi Patricia. Overview of the imageclef 2014 domain adaptation task. In ImageCLEF 2014: Overview and analysis of the results, 2014. [ .pdf ]
[354] Barbara Caputo. Class specific object recognition using kernel gibbs distributions. ELectronic Letters on Computer vision and Image Analysis, 7(2), 2008. Special Issue on Computational Modelling of Objects Represented in Images. [ .pdf ]
[355] Barbara Caputo. Medical image annotation. In Hervé Bourlard and Andrei Popescu-Belis, editors, Interactive Multimodal Information Management. EPFL Press, 2013. [ .pdf ]
[356] Barbara Caputo, Eric Hayman, Mario Fritz, and J-O Ekluhnd. Classifying material in the real world. Image and vision Computing, accepted for pub, 2009.
[357] Fabien Cardinaux and Sébastien Marcel. Face verification using MLP and SVM. In XI Journees NeuroSciences et Sciences pour l'Ingenieur (NSI 2002) [2958]. [ .ps.gz | .pdf ]
[358] Fabien Cardinaux, Conrad Sanderson, and Sébastien Marcel. Comparison of MLP and GMM classifiers for face verification on XM2VTS. Idiap-RR Idiap-RR-10-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[359] Fabien Cardinaux, Conrad Sanderson, and Sébastien Marcel. Comparison of MLP and GMM classifiers for face verification on XM2VTS. In 4th International Conference on AUDIO- and VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION [2958]. [ .ps.gz | .pdf ]
[360] Fabien Cardinaux. Local Features and 1D-HMMs for Fast and Robust Face authentication. Idiap-RR Idiap-RR-17-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[361] Fabien Cardinaux, Conrad Sanderson, and Samy Bengio. Face Verification using adapted Generative Models. In The 6th International Conference on Automatic Face and Gesture Recognition, FG2004 [2959]. Published in IEEE International Conference on Automatic Face and Gesture Recognition (FG2004). [ .ps.gz | .pdf ]
[362] Fabien Cardinaux, Conrad Sanderson, and Samy Bengio. User Authentication via adapted Statistical Models of Face images. In IEEE Transaction on Signal Processing [2960]. IDIAP-RR 04-38. [ .ps.gz | .pdf ]
[363] Fabien Cardinaux. Face Authentication Based on Local Features and Generative Models. Idiap-rr, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 2005. thesis #3410. [ .ps.gz | .pdf ]
[364] C. Carincotte, Xavier Naturel, M. Hick, Jean-Marc Odobez, Jian Yao, A. Bastide, and B. Corbucci. Understanding metro station usage using closed circuit television cameras analysis. In 11th International IEEE Conference on Intelligent Transportation Systems (ITSC) [2962]. [ .pdf ]
[365] Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Maël Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain A. McCowan, Wilfried Post, Dennis Reidsma, and Pierre Wellner. The ami meeting corpus: a pre-announcement. In Machine Learning for Multimodal Interaction: Second International Workshop, MLMI'2005 [2963]. [ .ps.gz | .pdf ]
[366] Jesús Roberto Enrique León Carmona, Samuel González-López, Esaú VILLATORO-TELLO, and Jesús Miguel García-Gorrostieta. Analysis of vector representations in maintenance logs in the industry: Towards an information retrieval system. Journal of Research in Computing Science, May 2021.
[367] Daniel Carron. Deep learning of charisma. Idiap-Com Idiap-Com-03-2020, Idiap, 8 2020. [ .pdf ]
[368] Bruno Cartoni, Sandrine Zufferey, and Thomas Meyer. Using the europarl corpus for cross-linguistic research. Belgian Journal of Linguistics, (27):23 – 42, December 2013. [ http ]
[369] Bruno Cartoni, Sandrine Zufferey, Thomas Meyer, and Andrei Popescu-Belis. How comparable are parallel corpora? measuring the distribution of general vocabulary and connectives. In Proceedings of 4th Workshop on Building and Using Comparable Corpora, pages 78--86. ACL, June 2011. [ .pdf ]
[370] Bruno Cartoni and Thomas Meyer. Building 'directional corpora' for unbiased contrastive analysis. In Proceedings of Corpus Linguistics Conference, pages 29--30, July 2011. [ .pdf ]
[371] Bruno Cartoni, Sandrine Zufferey, and Thomas Meyer. Annotating the meaning of discourse connectives by looking at their translation: The translation-spotting technique. Dialogue & Discourse, 4(2):65--86, April 2013. [ DOI | .pdf ]
[372] Bruno Cartoni and Thomas Meyer. Extracting directional and comparable corpora from a multilingual corpus for translation studies. In Proceedings of the eighth international conference on Language Resources and Evaluation (LREC), page 6, May 2012. [ .pdf ]
[373] Claudio Castellini, Tatiana Tommasi, Nicoletta Noceti, Francesca Odone, and Barbara Caputo. Using object affordances to improve object recognition. IEEE Transaction on Autonomous Mental Development, 2011. [ .pdf ]
[374] J. B. Pierrot, Johan Lindberg, Johan Koolwaaij, H. P. Hutter, Dominique Genoud, Mats Blomberg, and Frédéric Bimbot. A comparison of a priori threshold setting procedures for speaker verification in the CAVE project. In ICASSP 98, 1998.
[375] Frédéric Bimbot, H. P. Hutter, Cédric Jaboulet, Johan Koolwaaij, Johan Lindberg, and J. B. Pierrot. An overview of the cave project research activities in speaker verification. In Reconnaissance du locuteur et ses applications commerciales et criminalistiques, 1998.
[376] Fabio Celli, Bruno Lepri, Joan-Isaac Biel, Giuseppe Riccardi, Daniel Gatica-Perez, and Fabio Pianesi. The workshop on computational personality recognition 2014. In Proceedings of the ACM International Conference on Multimedia, 2014. [ .pdf ]
[377] A. T. Cemgil, B. Kappen, and David Barber. A Generative Model for Music Transcription. In IEEE Transactions on Speech and Audio Processing [2964]. Accepted for publication. [ .ps.gz | .pdf ]
[378] Aleksandra Cerekovic, Oya Aran, and Daniel Gatica-Perez. Rapport with virtual agents: What do human social cues and personality explain? IEEE Transactions on Affective Computing, 8(3):382--395, Jul-Sep 2017. [ DOI | .pdf ]
[379] Milos Cernak, Štefan Beňuš, and Alexandros Lazaridis. Speech vocoding for laboratory phonology. In Computer Speech and Language [2965]. [ .pdf ]
[380] Milos Cernak, Juan Rafael Orozco-Arroyave, Frank Rudzicz, Heidi Christensen, Juan Camilo Vasquez-Correa, and Elmar Nöth. Characterisation of voice quality of parkinson's disease using differential phonological posterior features. In Computer Speech and Language [2966]. [ .pdf ]
[381] Milos Cernak, Blaise Potard, and Philip N. Garner. Phonological vocoding using artificial neural networks. In IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP) [2967], pages 4844--4848. [ DOI | .pdf ]
[382] Milos Cernak, Petr Motlicek, and Philip N. Garner. On the (un)importance of the contextual factors in hmm-based speech synthesis. In Proceedings of the IEEE Intl. Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 8140 -- 8143, May 2013. [ .pdf ]
[383] Milos Cernak, Elmar Nöth, Frank Rudzicz, Heidi Christensen, Juan Rafael Orozco-Arroyave, Raman Arora, Tobias Bocklet, Hamidreza Chinaei, Julius Hannink, Phani Sankar Nidadavolu, Juan Camilo Vasquez, Maria Yancheva, Alyssa Vann, and Nikolai Vogler. On the impact of non-modal phonation on phonological features. In Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017) [2968]. [ .pdf ]
[384] Milos Cernak and Sibo Tong. Nasal speech sounds detection using connectionist temporal. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2018. [ .pdf ]
[385] Milos Cernak, David Imseng, and Hervé Bourlard. Robust triphone mapping for acoustic modeling. Idiap-RR Idiap-RR-02-2013, Idiap, 1 2013. [ .pdf ]
[386] Milos Cernak, Petr Motlicek, and Philip N. Garner. On the (un)importance of the contextual factors in hmm-based speech synthesis and coding. Idiap-RR Idiap-RR-06-2013, Idiap, 3 2013. [ .pdf ]
[387] Milos Cernak, Philip N. Garner, and Petr Motlicek. Progress report of a project in very low bit-rate speech coding. Idiap-RR Idiap-RR-08-2012, Idiap, 2 2012. [ .pdf ]
[388] Milos Cernak and Sibo Tong. Nasal speech sounds detection using connectionist temporal classification. Idiap-RR Idiap-RR-28-2017, Idiap, 10 2017. [ .pdf ]
[389] Milos Cernak and Philip N. Garner. Phonvoc: A phonetic and phonological vocoding toolkit. In Interspeech, September 2016. [ .pdf ]
[390] Milos Cernak, Xingyu Na, and Philip N. Garner. Syllable-based pitch encoding for low bit rate speech coding with recognition/synthesis architecture. In Proc. of Interspeech 2013 [2970]. [ .pdf ]
[391] Milos Cernak and Pierre-Edouard Honnet. An empirical model of emphatic word detection. In Proc. of Interspeech [2971], pages 573--577. [ .pdf ]
[392] Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner, and Hervé Bourlard. Sound pattern matching for automatic prosodic event detection. In Interspeech [2972]. [ .pdf ]
[393] Milos Cernak, Alain Komaty, Amir Mohammadi, André Anjos, and Sébastien Marcel. Bob speaks kaldi. In Proc. of Interspeech, August 2017. [ .pdf ]
[394] Milos Cernak, Alexandros Lazaridis, Philip N. Garner, and Petr Motlicek. Stress and accent transmission in hmm-based syllable-context very low bit rate speech coding. In Interspeech [2973]. [ .pdf ]
[395] Milos Cernak, Afsaneh Asaei, and Alexandre Hyafil. Cognitive speech coding: Examining the impact of cognitive speech processing on speech compression. In IEEE Signal Processing Magazine [2974], pages 97--109. [ DOI | .pdf ]
[396] Milos Cernak, Philip N. Garner, Alexandros Lazaridis, Petr Motlicek, and Xingyu Na. Incremental syllable-context phonetic vocoding. In IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING [2975]. [ http | .pdf ]
[397] Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei, and Philip N. Garner. Composition of deep and spiking neural networks for very low bit rate speech coding. In IEEE/ACM Trans. on Audio, Speech and Language Processing [2976]. [ .pdf ]
[398] Nikhil Chacko, Kevin G. Chan, and Michael Liebling. Intensity-based point-spread-function-aware registration for multi-view applications in optical microscopy. In Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, pages 306--309. IEEE, April 2015. [ DOI | .pdf ]
[399] Murali Mohan Chakka, André Anjos, Sébastien Marcel, Roberto Tronci, Daniele Muntoni, Gianluca Fadda, Maurizio Pili, Nicola Sirena, Gabriele Murgia, Marco Ristori, Fabio Roli, Junjie Yan, Dong Yi, Zhen Lei, Zhiwei Zhang, Stan Z.Li, William Robson Schwartz, Anderson Rocha, Helio Pedrini, Javier Lorenzo-Navarro, Modesto Castrillón-Santana, Jukka Maatta, Abdenour Hadid, and Matti Pietikainen. Competition on counter measures to 2-d facial spoofing attacks. In Proceedings of IAPR IEEE International Joint Conference on Biometrics (IJCB), Washington DC, USA, October 2011. [ .pdf ]
[400] Kevin G. Chan and Michael Liebling. Direct inversion algorithm for focal plane scanning optical projection tomography. Biomedical Optics Express, 2017. [ .pdf ]
[401] Kevin G. Chan, Sebastian J. Streichan, Le A. Trinh, and Michael Liebling. Simultaneous temporal superresolution and denoising for cardiac fluorescence microscopy. IEEE Transactions on Computational Imaging, 2016. in press. [ DOI | http | .pdf ]
[402] Kevin G. Chan and Michael Liebling. A point-spread-function-aware filtered backprojection algorithm for focal-plane-scanning optical projection tomography. In 2016 IEEE International Symposium on Biomedical Imaging, April 2016.
[403] Kevin G. Chan and Michael Liebling. Estimation of divergence-free 3d cardiac blood flow in a zebrafish larva using multi-view microscopy. In Biomedical Imaging (ISBI), 2015 IEEE 12th International Symposium on, pages 385--388. IEEE, April 2015. [ DOI | .pdf ]
[404] K. Chatzilygeroudis, A. Vassiliades, F. Stulp, Sylvain Calinon, and J. B. Mouret. A survey on policy search algorithms for learning robot controllers in a handful of trials. IEEE Trans. on Robotics, 32(2):328--347, April 2020. [ DOI | http | .pdf ]
[405] Ricardo Chavarriaga, Ferran Galán, and José del R. Millán. Asynchronous detection and classification of oscillatory brain activity. In 16 European Signal Processing Conference [2977]. IDIAP-RR 08-36. [ .ps.gz | .pdf ]
[406] Xavier Perrin, Ricardo Chavarriaga, Céline Ray, Roland Siegwart, and José del R. Millán. A comparative psychophysical and eeg study of different feedback modalities for hri. In 3rd ACM/IEEE Conf on Human-Robot Interaction (HRI08) [2978]. IDIAP-RR 07-78. [ .ps.gz | .pdf ]
[407] Ricardo Chavarriaga, Pierre W. Ferrez, and José del R. Millán. To err is human: Learning from error potentials in brain-computer interfaces. In 1st International Conference on Cognitive Neurodynamics (ICCN 2007) [2979]. IDIAP-RR 07-37. [ .ps.gz | .pdf ]
[408] Laurent Dollé, Mehdi Khamassi, Benoît Girard, Agnès Guillot, and Ricardo Chavarriaga. Analyzing interactions between navigation strategies using a computational model of action selection. In Int Conf Spatial Cognition 2008 [2980]. IDIAP-RR 08-48. [ .ps.gz | .pdf ]
[409] Tatjana Chavdarova, Pierre Baqué, Andrii Maksai, Stéphane Bouquet, Cijo Jose, Louis Lettry, Francois Fleuret, Pascal Fua, and Luc Van Gool. Wildtrack: A multi-camera hd dataset for dense unscripted pedestrian detection. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 5030--5039, June 2018. [ DOI ]
[410] Tatjana Chavdarova and Francois Fleuret. Sgan: An alternative training of generative adversarial networks. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, pages 9407--9415. IEEE, 2018. [ DOI ]
[411] Tatjana Chavdarova and Francois Fleuret. Deep multi-camera people detection. In Proceedings of the IEEE International Conference on Machine Learning and Applications, 2017.
[412] Tatjana Chavdarova, Sebastian Stich, Martin Jaggi, and Francois Fleuret. Stochastic variance reduced gradient optimization of generative adversarial networks. In International Conference on Machine Learning (ICML) workshop on Theoretical Foundations and Applications of Deep Generative Models, July 2018.
[413] Tatjana Chavdarova, Matteo Pagliardini, Martin Jaggi, and Francois Fleuret. Taming gans with lookahead. Idiap-RR Idiap-RR-20-2020, Idiap, 9 2020. [ http | .pdf ]
[414] Tatjana Chavdarova, Gauthier Gidel, Francois Fleuret, and Simon Lacoste-Julien. Reducing noise in gan training with variance reduced extragradient. In Proceedings of the international conference on Neural Information Processing Systems, 2019.
[415] Tatjana Chavdarova and Francois Fleuret. Deep Generative Models and Applications. PhD thesis, EPFL, July 2020. [ DOI | http | .pdf ]
[416] Gilberto Chávez-Martínez, Salvador Ruiz-Correa, and Daniel Gatica-Perez. International conference on mobile and ubiquitous multimedia. In Happy and Agreeable? Multi-Label Classification of Impressions in Social Video, MUM '15, pages 109--120, New York, NY, USA, December 2015. ACM. [ DOI | http | .pdf ]
[417] Datong Chen and Jean-Marc Odobez. Comparison of Support Vector Machine and neural network for Text Texture Verification. Idiap-RR Idiap-RR-19-2002, IDIAP, Martigny, 4 2002. [ .ps.gz | .pdf ]
[418] Datong Chen and Juergen Luettin. Multiple Hypotheses Video OCR. In Proceedings of the 4th International Workshop on Document Analysis System [2981]. Published in Proceedings of the 4th International Workshop on Document Analysis System,. [ .ps.gz | .pdf ]
[419] Datong Chen, Kim Shearer, and Hervé Bourlard. Text enhancement with asymmetric Filter for Video OCR. In Proceedings of the 11th International Conference on Image Analysis and Processing [2982]. Published in Int. Conf. Image Analysis and Processing, Palermo Italy, Sep. 26-28, 2001, IEEE Computer Society. [ .ps.gz | .pdf ]
[420] Jean-Marc Odobez and Datong Chen. Video Text Recognition based on Markov Random Field and Grayscale Consistency Constraint. In Int. Conf. Image Processing 2002 [2983]. Published in Proceedings of the Int. Conf. Image Processing 2002. [ .ps.gz | .pdf ]
[421] Datong Chen and Jean-Marc Odobez. Sequential Monte Carlo Video Text Segmentation. In ICIP, 2003. [ .ps.gz | .pdf ]
[422] Datong Chen, Jean-Marc Odobez, and Hervé Bourlard. Text Segmentation and Recognition in Complex Background Based on Markov Random Field. In Int. Conf. Pattern Recognition 2002 [2984]. Published in Proceedings of the Int. Conf. Pattern Recognition 2002. [ .ps.gz | .pdf ]
[423] Datong Chen, Kim Shearer, and Hervé Bourlard. Video OCR for Sport Video annotation and Retrieval. In Proceedings of the 8th IEEE International Conference on Mechatronics and Machine Vision in Practice [2985]. Published in International Conference on Mechatronics and Machine Vision in Practice, Hong Kong SAR, China, Aug. 27-29, 2001. [ .ps.gz | .pdf ]
[424] Datong Chen, Jean-Marc Odobez, and Hervé Bourlard. Text Detection and Recognition in images and Videos. Pattern Recognition, 37(3), 3 2004.
[425] Datong Chen and Kim Shearer. Asymmetric filter for text recognition in video. Idiap-RR Idiap-RR-37-2000, IDIAP, 2000. [ .ps.gz | .pdf ]
[426] Datong Chen and Juergen Luettin. A survey of text detection and recognition in images and videos. Idiap-RR Idiap-RR-38-2000, IDIAP, 2000. [ .ps.gz | .pdf ]
[427] Datong Chen, Hervé Bourlard, and Jean-Philippe Thiran. Text identification in Complex Background using SVM. In Proceedings of the Int. Conf. on computer vision and pattern recognition [2986]. Published in Proceeding of the Int. Conf. on computer vision and pattern recognition, 2001.
[428] Datong Chen and Jean-Marc Odobez. A New Method of Contrast normalization for Verification of extracted Video Text having Complex Backgrounds. Idiap-RR Idiap-RR-16-2002, IDIAP, 4 2002. [ .ps.gz | .pdf ]
[429] Datong Chen, Jean-Marc Odobez, and Hervé Bourlard. Text Detection and Recognition in images and Videos. Idiap-RR Idiap-RR-61-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[430] Datong Chen and Jean-Marc Odobez. Monte Carlo Video Text Segmentation. Idiap-RR Idiap-RR-07-2003, IDIAP, 1 2003. [ .ps.gz | .pdf ]
[431] Datong Chen, Jean-Marc Odobez, and Jean-Philippe Thiran. A localization/verification scheme for finding text in images and video frames based on contrast independent features and machine learning methods. In Signal Processing: Image Communication [2987]. Similar to RR-03-42. [ .ps.gz | .pdf ]
[432] Datong Chen. Text detection and recognition in images and video sequences. Idiap-rr, École Polytechnique Fédérale de Lausanne, 2003. Thèse EPFL, n° 2863 (2003). [ .ps.gz | .pdf ]
[433] Octavian Cheng, John Dines, and Mathew Magimai.-Doss. A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition. Idiap-RR Idiap-RR-62-2006, IDIAP, 2006. Submitted for publication. [ .ps.gz | .pdf ]
[434] Cheng Chen, Alexandre Heili, and Jean-Marc Odobez. Combined estimation of location and body pose in surveillance video. In AVSS, 2011. [ .pdf ]
[435] Cheng Chen, Yi Yang, Feiping Nie, and Jean-Marc Odobez. 3d human pose recovery from image by efficient visual feature selection. Computer Vision and Image Understanding, 115(3), 2011. [ .pdf ]
[436] Cheng Chen and Jean-Marc Odobez. We are not contortionists: Coupled adaptive learning for head and body orientation estimation in surveillance video. In IEEE International Conference on Computer Vision and Pattern Recognition, 2012. [ .pdf ]
[437] Cheng Chen, Alexandre Heili, and Jean-Marc Odobez. A joint estimation of head and body orientation cues in surveillance video. In IEEE International Workshop on Socially Intelligent Surveillance and Monitoring, 2011. [ .pdf ]
[438] Yiqiang Chen, Yu Yu, and Jean-Marc Odobez. Head nod detection from a full 3d model. In Proceedings of the ICCV 2015, pages 528--536, December 2015. [ .pdf ]
[439] Cheng Chen. Learning a 3d human pose distance metric from geometric pose descriptor. IEEE Transactions on Visualization and Computer Graphics, 17(11):1676--1689, 2011. [ .pdf ]
[440] Ivana Chingovska, André Anjos, and Sébastien Marcel. Anti-spoofing in action: joint operation with a verification system. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Workshop on Biometrics [2989]. [ .pdf ]
[441] Ivana Chingovska, Jinwei Yang, Zhen Lei, Dong Yi, Stan Z.Li, Olga Kähm, Naser Damer, Christian Glaser, Arjan Kuijper, Alexander Nouak, Jukka Komulainen, Tiago de Freitas Pereira, Shubham Gupta, Shubham Bansal, Shubham Khandelwal, Ayush Rai, Tarun Krishna, Dushyant Goyal, Muhammad-Adeel Waris, Honglei Zhang, Iftikhar Ahmad, Serkan Kiranyaz, Moncef Gabbouj, Roberto Tronci, Maurizio Pili, Nicola Sirena, Fabio Roli, Javier Galbally, Julian Fierrez, Allan Pinto, Helio Pedrini, William Robson Schwartz, Anderson Rocha, André Anjos, and Sébastien Marcel. The 2nd competition on counter measures to 2d face spoofing attacks. In International Conference of Biometrics 2013 [2990]. [ .pdf ]
[442] Ivana Chingovska, André Anjos, and Sébastien Marcel. On the effectiveness of local binary patterns in face anti-spoofing. In Proceedings of the 11th International Conference of the Biometrics Special Interes Group [2991]. [ .pdf ]
[443] Ivana Chingovska and André Anjos. On the use of client identity information for face anti-spoofing. IEEE Transactions on Information Forensics and Security, Special Issue on Biometric Anti-spoofing, 10(4):787--796, 2015. [ .pdf ]
[444] Ivana Chingovska, André Anjos, and Sébastien Marcel. Biometrics evaluation under spoofing attacks. In IEEE Transactions on Information Forensics and Security [2992], pages 2264--2276. IEEE Transactions of Information Forensics and Security: in minor revision. [ DOI | .pdf ]
[445] Ivana Chingovska, André Anjos, and Sébastien Marcel. Anti-spoofing: Evaluation methodologies. In Stan Z.Li and Anil Jain, editors, Encyclopedia of Biometrics. Springer US, 2nd edition edition, 2014. [ DOI ]
[446] Ivana Chingovska, André Anjos, and Sébastien Marcel. Evaluation methodologies. In Sébastien Marcel, Mark Nixon, and Stan Z.Li, editors, Handbook of Biometric Antispoofing. Springer, 2014.
[447] Ivana Chingovska, Nesli Erdogmus, André Anjos, and Sébastien Marcel. Face recognition systems under spoofing attacks. In Face Recognition Systems Under Spoofing Attacks [2993], idiap-rr 8, pages 165--194. Submitted for as a book-chapter for: Face Recognition Across the Electromagnetic Spectrum (Springer). [ DOI | http ]
[448] Ivana Chingovska, Amir Mohammadi, André Anjos, and Sébastien Marcel. Evaluation methodologies for biometric presentation attack detection. In Sébastien Marcel, Mark Nixon, Julian Fierrez, and Nicholas Evans, editors, Handbook of Biometric Anti-Spoofing, chapter 20. Springer International Publishing, 2nd edition, 2019. [ DOI | http | .pdf ]
[449] Ivana Chingovska. Trustworthy Biometric Verification under Spoofing Attacks: Application to the Face Mode. PhD thesis, École Polytechnique Fédérale de Lausanne, November 2015. Thèse EPFL, n° 6879 (2016). [ http | .pdf ]
[450] Gokul Chittaranjan, Oya Aran, and Daniel Gatica-Perez. Exploiting observers' judgements for nonverbal group interaction analysis. In IEEE Conference on Automatic Face and Gesture Recognition, page 6. IEEE, March 2011. [ .pdf ]
[451] Gokul Chittaranjan and Hayley Hung. Are you a werewolf? detecting deceptive roles and outcomes in a conversational role-playing game. In IEEE International Conference on Acoustics, Speech and Signal Processing, 3 2010. [ .pdf ]
[452] Gokul Chittaranjan, Jan Blom, and Daniel Gatica-Perez. Who's who with big-five: Analyzing and classifying personality traits with smartphones. In International Symposium on Wearable Computing, page 8, June 2011. [ .pdf ]
[453] Gokul Chittaranjan, Oya Aran, and Daniel Gatica-Perez. Inferring truth from multiple annotators for social interaction analysis. In Neural Information Processing Systems (NIPS) Workshop on Modeling Human Communication Dynamics (HCD), page 4, 2011. [ .pdf ]
[454] Gokul Chittaranjan, Jan Blom, and Daniel Gatica-Perez. Mining large-scale smartphone data for personality studies. Personal and Ubiquitous Computing, 2012. [ .pdf ]
[455] Seunjin Choi, Youngki Lyu, Frédéric Berthommier, Hervé Glotin, and Andrzej Cichocki. Blind separation of delayed and superimposed acoustic sources : learning algorithms an experimental study. In Proc. IEEE Int. Conference on Speech Processing (ICSP), Seoul, 9 1999. IEEE.
[456] Gérard Chollet, Jean-Luc Cochard, Andrei Constantinescu, Cédric Jaboulet, and Philippe Langlais. Swiss french polyphone and polyvar: telephone speech databases to model inter- and intra-speaker variability. Idiap-RR Idiap-RR-01-1996, IDIAP, 1996. [ .ps.gz | .pdf ]
[457] F. Néel, Gérard Chollet, F. Lamel, W. Minker, and Andrei Constantinescu. Reconnaissance et compréhension de la parole: évaluation et applications. In Henri Méloni, editor, Fondements et perspectives en traitement automatique de la parole. AUPELF -- UREF, 1996.
[458] Christos Dimitrakakis. Ensembles for Sequence Learning. PhD thesis, École Polytechnique Fédérale de Lausanne, 2006. [ .ps.gz | .pdf ]
[459] Jean-Luc Cochard. Un environnement d'analyse linguistique robuste: Cpd, version 1.7. Idiap-RR Idiap-RR-03-1992, IDIAP, 1992. [ .ps.gz | .pdf ]
[460] Jean-Luc Cochard. Une technique efficace de traitement en prolog de la morphologie flexionnelle du français. Idiap-RR Idiap-RR-04-1992, IDIAP, 1992. [ .ps.gz | .pdf ]
[461] Jean-Luc Cochard. Un interface d'indexation documentaire: I d'i, version 1.4. Idiap-RR Idiap-RR-01-1993, IDIAP, 1993. [ .ps.gz | .pdf ]
[462] Jean-Luc Cochard. Un interface d'indexation documentaire: I d'i, version 2.0. Idiap-RR Idiap-RR-03-1993, IDIAP, 1993. [ .ps.gz | .pdf ]
[463] Jean-Luc Cochard. Un interface de recherche documentaire: I de r, version 2.0. Idiap-RR Idiap-RR-04-1993, IDIAP, 1993. [ .ps.gz | .pdf ]
[464] Murielle Vial and Jean-Luc Cochard. Towards a multi-agents approach for understanding speech. Idiap-Com Idiap-Com-05-1996, IDIAP, 1996. [ .pdf ]
[465] Jean-Luc Cochard and Murielle Vial. Etc_vérif : un environnement multi-agents de reconnaissance automatique de la parole en continu. In Proceedings of JEP'96: XXIèmes Journées d'étude sur la Parole, 6 1996. [ .ps.gz | .pdf ]
[466] Philippe Langlais, Henri Méloni, and Jean-Luc Cochard. Un système prédictif de la structuration syntaxico-rythmique d'un énoncé à l'aide d'informations prosodiques. In Proceedings of JEP'96: XXIèmes Journées d'étude sur la Parole, 6 1996. [ .ps.gz | .pdf ]
[467] Andrzej Drygajlo, Jean-Luc Cochard, Gérard Chollet, Olivier Bornet, and Philippe Renevey. Sun workstation and swissnet platform for speech recognition and speaker verification over the telephone. In Proceedings of Workstations und ihre Anwendungen, SIWORK'96, 5 1996. [ .ps.gz | .pdf ]
[468] Laurent Colbois, Tiago de Freitas Pereira, and Sébastien Marcel. On the use of automatically generated synthetic image datasets for benchmarking face recognition. In International Joint Conference on Biometrics (IJCB 2021), 2021. Accepted for Publication in IJCB2021. [ .pdf ]
[469] Thierry Collado. Developement d'un systeme de demande interactif via le telephone (infovox). Idiap-Com Idiap-Com-08-2001, IDIAP, 2001. [ .ps.gz | .pdf ]
[470] Ronan Collobert. Support vector machines, théorie et application. Idiap-Com Idiap-Com-03-2000, IDIAP, 2000. [ .ps.gz | .pdf ]
[471] Ronan Collobert and Samy Bengio. Support vector machines for large-scale regression problems. Idiap-RR Idiap-RR-17-2000, IDIAP, 2000. [ .ps.gz | .pdf ]
[472] Ronan Collobert and Samy Bengio. On the convergence of svmtorch, an algorithm for large-scale regression problems. Idiap-RR Idiap-RR-24-2000, IDIAP, 2000. [ .ps.gz | .pdf ]
[473] Ronan Collobert and Samy Bengio. SVMTorch: Support vector machines for large-scale regression problems. Journal of Machine Learning Research, 1, 2001. [ .ps.gz | .pdf ]
[474] Ronan Collobert, Samy Bengio, and Yoshua Bengio. A parallel mixture of SVMs for very large scale problems. In Neural Computation [2994]. [ .ps.gz | .pdf ]
[475] Ronan Collobert, Samy Bengio, and Yoshua Bengio. A parallel mixture of SVMs for very large scale problems. In Dietterich et al. [2994]. [ .ps.gz | .pdf ]
[476] Ronan Collobert, Yoshua Bengio, and Samy Bengio. Scaling large learning problems with hard parallel mixtures. In International Workshop on Pattern Recognition with Support Vector Machines, SVM'2002, 2002. [ .ps.gz | .pdf ]
[477] Ronan Collobert, Yoshua Bengio, and Samy Bengio. Scaling large learning problems with hard parallel mixtures. International Journal on Pattern Recognition and Artificial Intelligence (IJPRAI), 17(3), 2003. [ .pdf ]
[478] Ronan Collobert and Samy Bengio. A gentle hessian for efficient gradient descent. In IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP [2995]. Published in “ICASSP, 2004”. [ .ps.gz | .pdf ]
[479] Ronan Collobert and Samy Bengio. Links between perceptrons, MLPs and SVMs. In International Conference on Machine Learning, ICML [2996]. [ .ps.gz | .pdf ]
[480] Ronan Collobert. Large Scale Machine Learning. Idiap-rr, Université de Paris VI, 2004. [ .ps.gz | .pdf ]
[481] Ronan Collobert. Deep learning for efficient discriminative parsing. In International Conference on Artificial Intelligence and Statistics, 2011. [ .pdf ]
[482] Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. Natural language processing (almost) from scratch. Journal of Machine Learning Research, 12:2493--2537, 2011. [ .pdf ]
[483] Ronan Collobert, Koray Kavukcuoglu, and Clément Farabet. Torch7: A matlab-like environment for machine learning. In BigLearn, NIPS Workshop, 2011. [ .pdf ]
[484] Ronan Collobert, Koray Kavukcuoglu, and Clément Farabet. Implementing neural networks efficiently. In Grégoire Montavon, Geneviève Orr, and K. R. Müller, editors, Neural Networks: Tricks of the Trade. Springer, second edition, 2012. [ .pdf ]
[485] Christine Marcel. Multimodal identity verification at idiap. Idiap-Com Idiap-Com-04-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[486] David Grangier, Alessandro Vinciarelli, and Hervé Bourlard. Information retrieval on noisy text. Idiap-Com Idiap-Com-08-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[487] Mike Flynn and Pierre Wellner. In search of a good bet. Idiap-Com Idiap-Com-11-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[488] Andrei Constantinescu, Olivier Bornet, Gilles Caloz, and Gérard Chollet. Validating different flexible vocabulary approaches on the swiss french polyphone and polyvar databases. In Proceedings of ICSLP 96, 10 1996.
[489] Megan Cook, Sandra Kuntsche, Florian Labhart, and Emmanuel Kuntsche. Do different drinks make you feel different emotions? examination of young adolescents' beverage-specific alcohol expectancies using the alcohol expectancy task. Addictive Behaviors, 2020. [ DOI | http ]
[490] Artur Costa-Pazo, Sushil Bhattacharjee, Esteban Vazquez-Fernandez, and Sébastien Marcel. The replay-mobile face presentation-attack database. In Proceedings of the International Conference on Biometrics Special Interests Group, September 2016. [ .pdf ]
[491] Alessandro Costa. Using synthetic fingerprint images to test the performance of an AFIS system. PhD thesis, Université de Lausanne, July 2022. [ .pdf ]
[492] Dijana Petrovska-Delacretaz, Jean Hennebert, Dominique Genoud, and Gérard Chollet. Semi-automatic hmm-based annotation of the polycost database. In Application of speaker recognition techniques in telephony. COST250, 1996.
[493] Evann Courdier and Francois Fleuret. Borrowing from yourself: Faster future video segmentation with partial channel update. In International Conference on Pattern Recognition, 2022. [ .pdf ]
[494] Andrew Morris. Data utility modelling for mismatch reduction. In Proc. CRAC (workshop on Consistent & Reliable Acoustic Cues for sound analysis) [2998]. [ .ps.gz | .pdf ]
[495] M. Cristani, G. Paggetti, Alessandro Vinciarelli, L. Bazzani, G. Menegaz, and V. Murino. Towards computational proxemics: Inferring social relations from interpersonal distances. In Proceedings of the IEEE International Conference on Social Computing, pages 290--297, 2011.
[496] M. Cristani, A. Pesarin, Alessandro Vinciarelli, M. Crocco, and V. Murino. Look at who's talking. In Proceedings of International Conference on Ambient Intelligence, pages 68--76, 2011.
[497] S. Cuche and Emile Fiesler. Ontogenic high order cauchy machines. In Nicolas Droux, editor, Proceedings of the SIPAR Workshop '95: Parallel and Distributed Systems, Biel, Switzerland, 1995. Biel School of Engineering.
[498] S. Cuche and Emile Fiesler. Generalized cauchy machines. Neurocomputing, 1996. submitted.
[499] S. Cuche and Emile Fiesler. Extended cauchy machines. In Proceedings of the International Conference on Neural Information Processing, volume 1, 1996.
[500] Sébastien Cuendet. Model adaptation for sentence unit segmentation from speech. Idiap-RR Idiap-RR-64-2006, IDIAP, Martigny, Switzerland, 2006. [ .ps.gz | .pdf ]
[501] Nicholas Cummins, Yilin Pan, Zhao Ren, Julian Fritsch, Venkata Srikanth Nallanthighal, Heidi Christensen, Daniel Blackburn, Björn Schuller, Mathew Magimai.-Doss, Helmer Strik, and Aki Härmä. A comparison of acoustic and linguistics methodologies for alzheimer's dementia recognition. In Proceedings of Interspeech, pages 2182--2186, 2020. [ .pdf ]
[502] Gilles Curtois, Vincent Grimaldi, Hervé Lissek, Ina Kodrasi, and Eleftheria Georganti. Experimental evaluation of speech enhancement methods in remote microphone systems for hearing aids. In Proc. EuroNoise 2018, pages 351--358, May 2018. [ .pdf ]
[503] J. Czyz, Samy Bengio, Christine Marcel, and L. Vandendorpe. Scalability analysis of audio-visual person identity verification. In 4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA [2999]. [ .ps.gz | .pdf ]
[504] Corentin Dancette, Remi Cadene, Damien Teney, and Matthieu Cord. Beyond question-based biases: Assessing multimodal shortcut learning in visual question answering. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021. [ .pdf ]
[505] Ewen Dantec, Rohan Budhiraja, Adria Roig, Teguh Santoso Lembono, Guilhem Saurel, Olivier Stasse, Pierre Fernbach, Steve Tonneau, Sethu Vijayakumar, Sylvain Calinon, Michel Taix, and Nicolas Mansard. Whole body model predictive control with a memory of motion:experiments on a torque-controlled talos. In IEEE International Conference on Robotics and Automation, 2021. [ .pdf ]
[506] Priyanka Das, Joseph McGrath, Zhaoyuan Fang, Aidan Boyd, Ganghee Jang, Amir Mohammadi, Sandip Purnapatra, David Yambay, Sébastien Marcel, Mateusz Trokielewicz, Piotr Maciejewicz, Kevin Bowyer, Adam Czajka, and Stephanie Schuckers. Iris liveness detection competition (livdet-iris) – the 2020 edition. In INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2020), 2020. [ http | .pdf ]
[507] D. Vandromme. Harmonic plus noise model for concatenative speech synthesis. Idiap-RR Idiap-RR-37-2005, IDIAP, 2005. [ .pdf ]
[508] Nauman Dawalatabad, Srikanth Madikeri, Hema A Murthy, and C Chandra Sekhar. Incremental transfer learning in two-pass information bottleneck based speaker diarization system for meetings. In Proceedings of ICASSP 2019, pages 6291--6295, May 2019.
[509] Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar, and Hema A Murthy. Novel architectures for unsupervised information bottleneck based speaker diarization of meetings. In IEEE/ACM Transactions on Audio Speech and Language Processing [3000].
[510] Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar, and Hema A Murthy. Two-pass ib based speaker diarization system using meeting-specific ann based features. In Proceedings of Interspeech 2016 [3001], pages 2199--2203.
[511] Yannick Dayer. Face recognition systems: performance evaluation and bias analysis. Idiap-Com Idiap-Com-04-2020, Idiap, Rue Marconi 19, 1920 Martigny, 8 2020. [ .pdf ]
[512] Tiago de Freitas Pereira and Sébastien Marcel. Fairness in biometrics: a figure of merit to assess biometric verification systems. arXiv, November 2020. [ .pdf ]
[513] Tiago de Freitas Pereira and Sébastien Marcel. Periocular biometrics in mobile environment. In IEEE Seventh International Conference on Biometrics: Theory, Applications and Systems, pages 1--7. IEEE, September 2015. [ DOI | .pdf ]
[514] Tiago de Freitas Pereira, Jukka Komulainen, André Anjos, José Mario De Martino, Abdenour Hadid, Matti Pietikainen, and Sébastien Marcel. Face liveness detection using dynamic texture. EURASIP Journal on Image and Video Processing, 2, January 2014. [ DOI | http | .pdf ]
[515] Tiago de Freitas Pereira, André Anjos, José Mario De Martino, and Sébastien Marcel. Can face anti-spoofing countermeasures work in a real world scenario? In International Conference on Biometrics, June 2013. [ http | .pdf ]
[516] Tiago de Freitas Pereira and Sébastien Marcel. Heterogeneous face recognition using inter-session variability modelling. In IEEE Computer Society Workshop on Biometrics. IEEE, June 2016. [ .pdf ]
[517] Tiago de Freitas Pereira, André Anjos, and Sébastien Marcel. Heterogeneous face recognition using domain specific units. IEEE Transactions on Information Forensics and Security, page 13, February 2019. [ DOI | .pdf ]
[518] Tiago de Freitas Pereira, André Anjos, José Mario De Martino, and Sébastien Marcel. Lbp-top based countermeasure against face spoofing attacks. In International Workshop on Computer Vision With Local Binary Pattern Variants - ACCV, page 12, November 2012. [ .pdf ]
[519] Tiago de Freitas Pereira and Sébastien Marcel. Fairness in biometrics: a figure of merit to assess biometric verification systems. IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021. [ DOI | .pdf ]
[520] Tiago de Freitas Pereira. Learning How To Recognize Faces in Heterogeneous Environments. PhD thesis, Ecole Polytechnique Federale de Lausanne, February 2019. [ DOI | http | .pdf ]
[521] De Greve Zacharie and Joel Praveen Pinto. Keyword spotting on word lattices. Idiap-RR Idiap-RR-22-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[522] María del Río Carral, Lucia Volpato, Chloé Michoud, Thanh-Trung Phan, and Daniel Gatica-Perez. Professional youtubers’ health videos as research material: Formulating a multi-method design in health psychology. Methods in Psychology, Special Issue on Innovations in Qualitative Research, 5, December 2021. [ .pdf ]
[523] F. de Wet, Katrin Weber, Louis Boves, B. Cranen, Samy Bengio, and Hervé Bourlard. Evaluation of formant-like features for automatic speech recognition. Journal of the Acoustical Society of America (JASA), 116(3), 2004. [ .ps.gz | .pdf ]
[524] Subhadeep Dey, Srikanth Madikeri, and Petr Motlicek. Information theoretic clustering for unsupervised domain-adaptation. In Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016) [3002], pages 5580--5584. [ .pdf ]
[525] Subhadeep Dey, Petr Motlicek, Srikanth Madikeri, and Marc Ferras. Exploiting sequence information for text-dependent speaker verification. In Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing [3003], pages 5370--5374. [ .pdf ]
[526] Subhadeep Dey, Takafumi Koshinaka, Petr Motlicek, and Srikanth Madikeri. Dnn based speaker embedding using content information for text-dependent speaker verification. In Proceedings of 2018 IEEE International Conference on Acoustics, Speech, and Signal Processing [3004]. [ .pdf ]
[527] Subhadeep Dey, Srikanth Madikeri, Marc Ferras, and Petr Motlicek. Deep neural network based posteriors for text-dependent speaker verification. In Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016) [3005], pages 5050--5054. [ .pdf ]
[528] Subhadeep Dey, Marc Ferras, Petr Motlicek, and Srikanth Madikeri. Content normalization for text-independent speaker verification. Idiap-RR Idiap-RR-31-2017, Idiap, 12 2017. [ .pdf ]
[529] Subhadeep Dey, Srikanth Madikeri, and Petr Motlicek. End-to-end text-dependent speaker verification using novel distance measures. In Proceedings of Interspeech 2018, volume 1-6, pages 3598--3602, 2018. [ DOI ]
[530] Subhadeep Dey, Petr Motlicek, Trung Bui, and Franck Dernoncourt. Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition. In Proc. of Interspeech 2019, 2019.
[531] Subhadeep Dey, Srikanth Madikeri, Petr Motlicek, and Marc Ferras. Content normalization for text-dependent speaker verification. In Proc. of Interspeech, 2017. [ .pdf ]
[532] Subhadeep Dey, Petr Motlicek, Srikanth Madikeri, and Marc Ferras. Template-matching for text-dependent speaker verification. In Speech Communication [3006]. [ .pdf ]
[533] Subhadeep Dey. Phonetic aware techniques for Speaker Verification. PhD thesis, EPFL, 2018. [ .pdf ]
[534] V. Demyanov, Nicolas Gilardi, Mikhail Kanevski, Michel Maignan, and V. Polishchuk. Decision-oriented environmental mapping with radial basis function neural networks. In Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999.
[535] Dhiraj Joshi and Daniel Gatica-Perez. Finding groups of people in google news. In ACM Int. Conf. on Human-Centered Multimedia (HCM) [3007]. IDIAP-RR 05-68. [ .ps.gz | .pdf ]
[536] Alfred Dielmann, Giulia Garau, and Hervé Bourlard. Floor holder detection and end of speaker turn prediction in meetings. In International Conference on Speech and Language Processing, Interspeech. ISCA, 9 2010. [ .pdf ]
[537] Pranay Dighe, Afsaneh Asaei, and Hervé Bourlard. Low-rank and sparse soft targets to learn better dnn acoustic models. In Proceedings of 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 2017. [ .pdf ]
[538] Pranay Dighe, Gil Luyet, Afsaneh Asaei, and Hervé Bourlard. Exploiting low-dimensional structures to enhance dnn based acoustic modeling in speech recognition. In Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), pages 5690--5694. IEEE, March 2016. [ .pdf ]
[539] Pranay Dighe, Afsaneh Asaei, and Hervé Bourlard. Sparse hidden markov models for exemplar-based speech recognition using deep neural network posterior features. Idiap-RR Idiap-RR-19-2016, Idiap, 8 2016. [ .pdf ]
[540] Pranay Dighe, Hervé Bourlard, and Afsaneh Asaei. Far-field asr using low-rank and sparse soft targets from parallel data. In IEEE Workshop on Spoken Language Technology, pages 581--587. IEEE, December 2018. [ .pdf ]
[541] Pranay Dighe, Marc Ferras, and Hervé Bourlard. Detecting and labeling speakers on overlapping speech using vector taylor series. In INTERSPEECH, 2014. [ .pdf ]
[542] Pranay Dighe, Afsaneh Asaei, and Hervé Bourlard. Exploiting eigenposteriors for semi-supervised training of dnn acoustic models with sequence discrimination. In Proceedings of Interspeech, 2017. [ .pdf ]
[543] Pranay Dighe, Marc Ferras, and Hervé Bourlard. Modeling overlapping speech using vector taylor series. In Odyssey: The Speaker and Language Recognition Workshop, June 2014. [ .pdf ]
[544] Pranay Dighe, Afsaneh Asaei, and Hervé Bourlard. Dictionary learning for sparse representation of neural network exemplars in speech recognition. In Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, 2015, page 1093, 2015. abstract. [ .pdf ]
[545] Pranay Dighe, Afsaneh Asaei, and Hervé Bourlard. Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition. In Proceedings of SPARS 2015: Workshop on Signal Processing with Adaptive Sparse Structured Representations, June 2015. abstract. [ .pdf ]
[546] Pranay Dighe, Afsaneh Asaei, and Hervé Bourlard. On quantifying the quality of acoustic models in hybrid dnn-hmm asr. Speech Communication, 119:24--35, May 2020. [ DOI ]
[547] Pranay Dighe, Afsaneh Asaei, and Hervé Bourlard. Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition. Speech Communication: Special Issue on Advances in Sparse Modeling and Low-rank Modeling for Speech Processing, 76:230–244, February 2016. [ DOI | .pdf ]
[548] Pranay Dighe. Sparse and Low-rank Modeling for Automatic Speech Recognition. PhD thesis, EPFL, 2019. [ DOI | .pdf ]
[549] Christos Dimitrakakis and Samy Bengio. Estimates of parameter distributions for optimal action selection. Idiap-RR Idiap-RR-72-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[550] Christos Dimitrakakis and Samy Bengio. Boosting word error rates. In IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP [3008]. IDIAP-RR 04-49.
[551] Christos Dimitrakakis. Nearly optimal exploration-exploitation decision thresholds. In Int. Conf. on Artificial Neural Networks (ICANN) [3009]. IDIAP-RR 06-12. [ .ps.gz | .pdf ]
[552] Christos Dimitrakakis and Samy Bengio. Online policy adaptation for ensemble classifiers. In Neurocomputing [3010]. IDIAP-RR 03-69. [ .ps.gz | .pdf ]
[553] Christos Dimitrakakis and Samy Bengio. Gradient estimates of return distributions. In PASCAL Workshop on Principled Methods of Trading Exploration and Exploitation [3011]. IDIAP-RR 05-29. [ .ps.gz | .pdf ]
[554] Christos Dimitrakakis. Online statistical estimation for vehicle control. Idiap-RR Idiap-RR-13-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[555] Christos Dimitrakakis and Samy Bengio. Online policy adaptation for ensemble algorithms. Idiap-RR Idiap-RR-28-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[556] K. Dimitropoulos, P. Daras, S. Manitsaris, F. F. Leymarie, and Sylvain Calinon. Editorial: Artificial intelligence and human movement in industries and creation. Frontiers in Robotics and AI, 8:712521, 2021. [ .pdf ]
[557] John Dines, Jithendra Vepa, and Thomas Hain. The segmentation of multi-channel meeting recordings for automatic speech recognition. In Int. Conf. on Spoken Language Processing (Interspeech ICSLP) [3012]. IDIAP-RR 06-22. [ .ps.gz | .pdf ]
[558] John Dines and Mathew Magimai.-Doss. A study of phoneme and grapheme based context-dependent asr systems. Idiap-RR Idiap-RR-12-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[559] John Dines and Jithendra Vepa. Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics. Idiap-RR Idiap-RR-13-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[560] John Dines, Hui Liang, Lakshmi Saheer, Matthew Gibson, William Byrne, Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Simon King, Mirjam Wester, Teemu Hirsimäki, Reima Karhila, and Mikko Kurimo. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for hmm-based speech synthesis. Computer Speech and Language, 2011. [ DOI | http | .pdf ]
[561] John Dines, Junichi Yamagishi, and Simon King. Measuring the gap between hmm-based asr and tts. Idiap-RR Idiap-RR-34-2010, Idiap, 10 2010. [ .pdf ]
[562] John Dines, Lakshmi Saheer, and Hui Liang. Speech recognition with speech synthesis models by marginalising over decision tree leaves. In Proceedings of Interspeech [3014]. [ .pdf ]
[563] John Dines, Junichi Yamagishi, and Simon King. Measuring the gap between hmm-based asr and tts. In Proceedings of Interspeech [3015]. [ .pdf ]
[564] V. Demyanov, Mikhail Kanevski, Michel Maignan, E. Savelieva, V. Timonin, S. Chernov, and G. Piller. Indoor radon risk assessment with geostatistics and artificial neural networks. In Geostatistical congress 2000, 2000.
[565] V. Demyanov, Mikhail Kanevski, E. Savelieva, V. Timonin, and S. Chernov. Neural network residual stochastic co-simulation for environmental data analysis. In Neural Computation 2000, 2000.
[566] Trinh-Minh-Tri Do and Thierry Artieres. Neural conditional random fields. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, volume 9. JMLR: W&CP, 5 2010. [ .pdf ]
[567] Trinh-Minh-Tri Do, Jan Blom, and Daniel Gatica-Perez. Smartphone usage in the wild: a large-scale analysis of applications and context. In 13th International Conference on Multimodal Interaction, November 2011. [ .pdf ]
[568] Trinh-Minh-Tri Do, Kyriaki Kalimeri, Bruno Lepri, Fabio Pianesi, and Daniel Gatica-Perez. Inferring social activities with mobile sensor networks. In 15th ACM International Conference on Multimodal Interaction, December 2013. [ .pdf ]
[569] Cong-Thanh Do, Mohammad J. Taghizadeh, and Philip N. Garner. Improving microphone array speech recognition with cochlear implant-like spectrally reduced speech. Idiap-RR Idiap-RR-40-2011, Idiap, 12 2011. [ .pdf ]
[570] Trinh-Minh-Tri Do and Daniel Gatica-Perez. Groupus: Smartphone proximity data and human interaction type mining. In 15th annual International Symposium on Wearable Computers, June 2011. [ .pdf ]
[571] Trinh-Minh-Tri Do and Thierry Artieres. Regularized bundle methods for convex and non-convex risks. Journal of Machine Learning Research, 13:3539--3583, December 2012. [ .pdf ]
[572] Trinh-Minh-Tri Do and Daniel Gatica-Perez. Contextual grouping: discovering real-life interaction types from longitudinal bluetooth data. In 12th International Conference on Mobile Data Management, June 2011. [ .pdf ]
[573] Trinh-Minh-Tri Do and Daniel Gatica-Perez. By their apps you shall understand them: mining large-scale patterns of mobile phone usage. In The 9th International Conference on Mobile and Ubiquitous Multimedia, 12 2010. [ .pdf ]
[574] Trinh-Minh-Tri Do and Daniel Gatica-Perez. Where and what: Using smartphones to predict next locations and applications in daily life. Pervasive and Mobile Computing, May 2013. [ .pdf ]
[575] Trinh-Minh-Tri Do, O. Dousse, Markus Miettinen, and Daniel Gatica-Perez. A probabilistic kernel method for human mobility prediction with smartphones. Pervasive and Mobile Computing, 2014. [ .pdf ]
[576] Trinh-Minh-Tri Do and Daniel Gatica-Perez. Human interaction discovery in smartphone proximity networks. Personal and Ubiquitous Computing, 2012. [ .pdf ]
[577] Cong-Thanh Do, Mohammad J. Taghizadeh, and Philip N. Garner. Combining cepstral normalization and cochlear implant-like speech processing for microphone array-based speech recognition. In Proceedings of the IEEE Workshop on Spoken Language Technology, December 2012. [ .pdf ]
[578] Cong-Thanh Do, Dominique Pastor, and André Goalic. A novel framework for noise robust asr using cochlear implant-like spectrally reduced speech. Speech Communication, 2011. [ DOI | .pdf ]
[579] Trinh-Minh-Tri Do and Daniel Gatica-Perez. The places of our lives: Visiting patterns and automatic labeling from longitudinal smartphone data. IEEE Transactions on Mobile Computing, 2013. [ .pdf ]
[580] Trinh-Minh-Tri Do and Daniel Gatica-Perez. Contextual conditional models for smartphone-based human mobility prediction. In Proceedings of the 14th ACM International Conference on Ubiquitous Computing, September 2012. [ .pdf ]
[581] Clément Dromart, Loïc Puthod, Jérôme Kämpf, and Diane von Gunten. District heating network modelling for future integration of solar thermal energy. In Journal of Physics: Conference Series, volume 2042, page 012089. IOP Publishing, November 2021. [ DOI ]
[582] S. Pavankumar Dubagunta, Rob J. J. H. van Son, and Mathew Magimai.-Doss. Adjustable deterministic pseudonymization of speech. In Computer, Speech & Language [3016]. Open Access. [ DOI ]
[583] S. Pavankumar Dubagunta, Bogdan Vlasenko, and Mathew Magimai.-Doss. Learning voice source related information for depression detection. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019. [ .pdf ]
[584] S. Pavankumar Dubagunta, Selen Hande Kabil, and Mathew Magimai.-Doss. Improving children speech recognition through feature learning from raw speech signal. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019. [ .pdf ]
[585] S. Pavankumar Dubagunta and Mathew Magimai.-Doss. Segment-level training of anns based on acoustic confidence measures for hybrid hmm/ann speech recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019. [ .pdf ]
[586] S. Pavankumar Dubagunta, Edoardo Moneta, Eleni Theocharopoulos, and Mathew Magimai.-Doss. Towards automatic prediction of non-expert perceived speech fluency ratings. Idiap-RR Idiap-RR-11-2021, Idiap, 8 2021. [ .pdf ]
[587] S. Pavankumar Dubagunta and Mathew Magimai.-Doss. Using speech production knowledge for raw waveform modelling based styrian dialect identification. In Proceedings of Interspeech, 2019. [ .pdf ]
[588] S. Pavankumar Dubagunta. Novel Methods for Incorporating Prior Knowledge for Automatic Speech Assessment. PhD thesis, École polytechnique fédérale de Lausanne (EPFL), September 2021. [ .pdf ]
[589] Charles Dubout and Francois Fleuret. Deformable part models with individual part scaling. In British Machine Vision Conference, 2013. [ .pdf ]
[590] Charles Dubout and Francois Fleuret. Accelerated training of linear object detectors. In CVPR 2013 Workshop on Structured Prediction, 2013. [ www: | .pdf ]
[591] Charles Dubout and Francois Fleuret. Exact acceleration of linear object detectors. In Proceedings of the European Conference on Computer Vision, 2012. [ .pdf ]
[592] Charles Dubout and Francois Fleuret. Tasting families of features for image classification. In International Conference on Computer Vision, November 2011. [ .pdf ]
[593] Charles Dubout and Francois Fleuret. Adaptive sampling for large scale boosting. Journal of Machine Learning Research, 15:1431--1453, 2014. [ .pdf ]
[594] Charles Dubout and Francois Fleuret. Boosting with maximum adaptive sampling. In Proceedings of the Neural Information Processing Systems Conference, 2011.
[595] Charles Dubout. Object Classification and Detection in High Dimensional Feature Space. PhD thesis, Programme doctoral en Informatique, Communications et Information, December 2013. [ .pdf ]
[596] Benoît Duc, Gilbert Maître, Stefan Fischer, and Josef Bigün. Person authentication by fusing face and speech information. In Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97), Lecture Notes in Computer Science. Springer Verlag, 1997.
[597] Benoît Duc, Elizabeth Saers Bigün, Josef Bigün, Gilbert Maître, and Stefan Fischer. Fusion of audio and video information for multi modal person authentication. Pattern Recognition Letters, 18(9), 1997.
[598] Stefan Duffner, Jean-Marc Odobez, and Elisa Ricci. Dynamic partitioned sampling for tracking with discriminative features. In Proceedings of the British Maschine Vision Conference, 9 2009. [ .pdf ]
[599] Stefan Duffner and Jean-Marc Odobez. Exploiting long-term observations for track creation and deletion in online multi-face tracking. In IEEE Conference on Automatic Face and Gesture Recognition [3017], pages 525--530. [ .pdf ]
[600] Stefan Duffner, Petr Motlicek, and Danil Korchagin. The ta2 database - a multi-modal database from home entertainment. In International Conference on Signal Acquisition and Processing [3018]. [ .pdf ]
[601] Stefan Duffner and Jean-Marc Odobez. A track creation and deletion framework for long-term online multi-face tracking. IEEE Transactions on Image Processing, March 2013. [ .pdf ]
[602] Stefan Duffner, Petr Motlicek, and Danil Korchagin. The ta2 database – a multi-modal database from home entertainment. International Journal of Computer and Electrical Engineering, 4(5):670--673, December 2012. [ http | .pdf ]
[603] Stefan Duffner and Jean-Marc Odobez. Leveraging colour segmentation for upper-body detection. Pattern Recognition, 47(6):2222--2230, 2014. [ .pdf ]
[604] Cédric Dufour. Havc-ii - idiap private cloud (technical inside-out). Idiap-Com Idiap-Com-01-2015, Idiap, 7 2015. [ .pdf ]
[605] Joël Dumoulin, Olivier Canévet, Michael Villamizar, Hugo Nunes, Omar Abou Khaled, Elena Mugellini, Fabrice Moscheni, and Jean-Marc Odobez. Unicity: A depth maps database for people detection in security airlocks. In IEEE International Conference on Advanced Video and Signal-based Surveillance Workshop, 2018. [ .pdf ]
[606] Stéphane Dupont and Juergen Luettin. Using the multi-stream approach for continuous audio-visual speech recognition: Experiments on the M2VTS database. In Proc. 5th Int. Conf. on Spoken Language Processing, volume 4, 1998. [ .ps.gz | .pdf ]
[607] Stéphane Dupont and Juergen Luettin. Audio-visual speech modelling for continuous speech recognition. IEEE Transactions on Multimedia, 2000. to appear.
[608] Stéphane Dupont and Juergen Luettin. Using the multi-stream approach for continuous audio-visual speech recognition. Idiap-RR Idiap-RR-14-1997, IDIAP, 1997. [ .ps.gz | .pdf ]
[609] Abhishek Dutta, Manuel Günther, Laurent El Shafey, Sébastien Marcel, Raymond Veldhuis, and Luuk Spreeuwers. Impact of eye detection error on face recognition performance. IET Biometrics, January 2015. [ http | .pdf ]
[610] Gérard Chollet and Frédéric Bimbot. Assessment of speaker verification systems. In Spoken Language Ressources and Assessment. EAGLES Handbook, 1995.
[611] Sarah Ebling, Necati Cihan Camgoz, Penny Boyes Braem, Katja Tissi, Sandra Sidler-Miserez, Stephanie Stoll, Simon Hadfield, Tobias Haug, Richard Bowden, Sandrine Tornay, Marzieh Razavi, and Mathew Magimai.-Doss. Smile swiss german sign language dataset. In Language Resources and Evaluation Conference, 2018.
[612] B. Nedic, Guillaume Gravier, Jamal Kharroubi, Gérard Chollet, Dijana Petrovska-Delacretaz, G. Durou, Frédéric Bimbot, Raphaël Blouet, M. Seck, Jean-François Bonastre, Corinne Fredouille, Teva Merlin, I. Magrin-Chagnolleau, S. Pigeon, Patrick Verlinde, and Jan Cernocky. The Elisa'99 speaker recognition and tracking systems. In IEEE Workshop on Automatic Advanced Technologies, 1999.
[613] B. Nedic, Frédéric Bimbot, Raphaël Blouet, Jean-François Bonastre, Gilles Caloz, Jan Cernocky, Gérard Chollet, G. Durou, Corinne Fredouille, Dominique Genoud, Guillaume Gravier, Jean Hennebert, Jamal Kharroubi, I. Magrin-Chagnolleau, Teva Merlin, Chafic Mokbel, Dijana Petrovska-Delacretaz, S. Pigeon, M. Seck, Patrick Verlinde, and M. Zouhal. The ELISA systems for the NIST'99 evaluation in speaker detection and tracking. DSP Journal (Special Issue on the Nist Speaker Recognition Workshop), 1999.
[614] D. Elizondo, Emile Fiesler, and Jerzy Korczak. Non-ontogenic sparse neural networks. In Proceedings of the International Conference on Neural Networks, volume 1, Piscataway, NJ, 1995. IEEE, IEEE.
[615] M. Tajine, D. Elizondo, Emile Fiesler, and Jerzy Korczak. Adapting the 2-class recursive deterministic perceptron neural network to m classes. In Proceedings of the International Conference on Neural Networks. IEEE, IEEE, 1997.
[616] Laurent El Shafey, Roy Wallace, and Sébastien Marcel. Face verification using gabor filtering and adapted gaussian mixture models. In Proceedings of the 11th International Conference of the Biometrics Special Interest Group [3019], pages 397--408. [ .pdf ]
[617] Laurent El Shafey and Sébastien Marcel. Scalable probabilistic models: Applied to face identification in the wild. In 8th European Biometrics Research and Industry Awards. European Association for Biometrics, September 2014. Research paper supporting my application to the EAB Awards. [ http | .pdf ]
[618] Laurent El Shafey, Elie Khoury, and Sébastien Marcel. Audio-visual gender recognition in uncontrolled environment using variability modeling techniques. In International Joint Conference on Biometrics, pages 1 -- 8. IEEE, October 2014. [ DOI | http | .pdf ]
[619] Laurent El Shafey. Scalable Probabilistic Models for Face and Speaker Recognition. PhD thesis, École Polytechnique Fédérale de Lausanne (EPFL), April 2014. [ http | .pdf ]
[620] Laurent El Shafey, Chris McCool, Roy Wallace, and Sébastien Marcel. A scalable formulation of probabilistic linear discriminant analysis: Applied to face recognition. In IEEE Transactions on Pattern Analysis and Machine Intelligence [3020], pages 1788--1794. Accepted for publication. [ DOI | http | .pdf ]
[621] Remi Emonet. Environment - application - adaptation: a community architecture for ambient intelligence. In International Conference on Ambient Computing, Applications, Services and Technologies, October 2011.
[622] Remi Emonet, Jagannadan Varadarajan, and Jean-Marc Odobez. Multi-camera open space human activity discovery for anomaly detection. In 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, August 2011. [ .pdf ]
[623] Remi Emonet, Jagannadan Varadarajan, and Jean-Marc Odobez. Extracting and locating temporal motifs in video scenes using a hierarchical non parametric bayesian model. In IEEE Conference on Computer Vision and Pattern Recognition, June 2011. [ .pdf ]
[624] Remi Emonet and Jean-Marc Odobez. Analyse non supervisée d'activités en vidéo surveillance pour l'analyse de scène et la détection d'événements anormaux. Idiap-RR Idiap-RR-20-2013, Idiap, 5 2013. [ http | .pdf ]
[625] Remi Emonet, Jagannadan Varadarajan, and Jean-Marc Odobez. Temporal analysis of motif mixtures using dirichlet processes. IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), 36(1), January 2014. [ .pdf ]
[626] Remi Emonet, E. Oberzaucher, and Jean-Marc Odobez. What to show? automatic stream selection among multiple sensors. In International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, January 2014. [ .pdf ]
[627] Remi Emonet and Jean-Marc Odobez. Unsupervised methods for activity analysis and detection of abnormal events. In Dufour [3021]. [ DOI | .pdf ]
[628] Nesli Erdogmus and Sébastien Marcel. Spoofing attacks to 2d face recognition systems with 3d masks. In International Conference of the Biometrics Special Interes Group [3022]. [ .pdf ]
[629] Nesli Erdogmus and Sébastien Marcel. Spoofing in 2d face recognition with 3d masks and anti-spoofing with kinect. In Biometrics: Theory, Applications and Systems [3023]. [ .pdf ]
[630] Nesli Erdogmus and Sébastien Marcel. Spoofing face recognition with 3d masks. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, pages 1084--1097, July 2014. [ DOI | .pdf ]
[631] Nesli Erdogmus, Matthias Vanoni, and Sébastien Marcel. Within- and cross- database evaluations for gender classification via befit protocols. In International Workshop on Multimedia Signal Processing, pages 1--6, September 2014. [ DOI | http | .pdf ]
[632] Christos Dimitrakakis and Samy Bengio. Online policy adaptation for ensemble classifiers. In 12th European Symposium on Artificial Neural Networks, ESANN 04 [3024]. IDIAP-RR 03-69.
[633] Jaume Escofet and Todd Andrew Stephenson. Automatic speech recognition using dynamic Bayesian networks with the energy as an auxiliary variable. Idiap-RR Idiap-RR-18-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[634] Paula Estrella, Andrei Popescu-Belis, and Margaret King. The femti guidelines for contextual mt evaluation: principles and tools. Linguistica Antverpiensia New Series, 8, 2009.
[635] Paula Estrella, Andrei Popescu-Belis, and Margaret King. Improving contextual quality models for mt evaluation based on evaluators' feedback. In 6th International Conference on Language Resources and Evaluation, 2008.
[636] Gérard Chollet and M. Homayounpour. Discrimination of the voices of twins and siblings for speaker verification. In 4th European Conference on Speech Communication and Technology, Madrid, Spain, 9 1995.
[637] Philippe Langlais. Microprosodic study of isolated French word corpora. In 4th European Conference on Speech Communication and Technology, Madrid, Spain, 1995.
[638] Jean-Luc Cochard and Olivier Oppizzi. Reliability in a multi-agent spoken language recognition system. In 4th European Conference on Speech Communication and Technology, Madrid, Spain, 9 1995.
[639] Andrew Morris, Astrid Hagen, and Hervé Bourlard. Map combination of multi-stream hmm or hmm/ann experts. In Proc. Eurospeech [3025]. [ .ps.gz | .pdf ]
[640] Andrew Morris, Astrid Hagen, and Hervé Bourlard. The full combination sub-bands approach to noise robust hmm/ann based asr. In 6th European Conference on Speech Communication and Technology --- Eurospeech'99, Budapest, Hungary, 1999. [ .ps.gz | .pdf ]
[641] Marco Ewerton, Oleg Arenz, and Jan Peters. Assisted teleoperation in changing environments with a mixture of virtual guides. Advanced Robotics, 34(18):1157--1170, July 2020. [ DOI | http ]
[642] Marco Ewerton, Oleg Arenz, Guilherme Maeda, Dorothea Koert, Zlatko Kolev, Masaki Takahashi, and Jan Peters. Learning trajectory distributions for assisted teleoperation and path planning. Frontiers in Robotics and AI, 6:89, 2019. [ DOI | http ]
[643] Marco Ewerton, Sylvain Calinon, and Jean-Marc Odobez. An attention mechanism for deep q-networks with applications in robotic pushing. In Proc. of Workshop on Emerging paradigms for robotic manipulation: from the lab to the productive world, ICRA [3026].
[644] Marco Ewerton, Guilherme Maeda, Dorothea Koert, Zlatko Kolev, Masaki Takahashi, and Jan Peters. Reinforcement learning of trajectory distributions: Applications in assisted teleoperation and motion planning. In IEEE International Conference on Intelligent Robots and Systems, 2019.
[645] Marco Ewerton, Angel Martínez-González, and Jean-Marc Odobez. An efficient image-to-image translation hourglass-based architecture for object pushing policy learning. In IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021. [ .pdf ]
[646] Mael Fabien, Esaú VILLATORO-TELLO, Petr Motlicek, and Shantipriya Parida. Bertaa: Bert fine-tuning for authorship attribution. In Proceedings of the 17th International Conference on Natural Language Processing, 2020. [ .pdf ]
[647] Mael Fabien, Shantipriya Parida, Dawei Zhu, Petr Motlicek, Aravind Krishnan, and Hoang H. Nguyen. Roxanne research platform: Automate criminal investigations. In Interspeech Show and Tell 2021, June 2021. [ .pdf ]
[648] Mael Fabien and Petr Motlicek. Open-set speaker identification pipeline in live criminal investigations. In 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021. [ .pdf ]
[649] Katayoun Farrahi and Daniel Gatica-Perez. What did you do today? discovering daily routines from large-scale mobile data. In ACM International Conference on Multimedia (ACMMM) [3027]. IDIAP-RR 08-49. [ .ps.gz | .pdf ]
[650] Katayoun Farrahi and Daniel Gatica-Perez. Discovering human routines from cell phone data with topic models. In IEEE International Symposium on Wearable Computers (ISWC) [3028]. IDIAP-RR 08-32. [ .ps.gz | .pdf ]
[651] Katayoun Farrahi and Daniel Gatica-Perez. Daily routine classification from mobile phone data. In Workshop on Machine Learning and Multimodal Interaction (MLMI08) [3029]. IDIAP-RR 07-62. [ .ps.gz | .pdf ]
[652] Katayoun Farrahi and Daniel Gatica-Perez. Discovering routines from large-scale human locations using probabilistic topic models. ACM Transactions on Intelligent Systems and Technology, 2(1), 2011. [ .pdf ]
[653] Katayoun Farrahi and Daniel Gatica-Perez. Learning and predicting multimodal daily life patterns from cell phones. In ICMI-MLMI, 2009. [ .pdf ]
[654] Katayoun Farrahi and Daniel Gatica-Perez. Mining human location-routines using a multi-level topic model. Idiap-RR Idiap-RR-28-2010, Idiap, 8 2010. [ .pdf ]
[655] Katayoun Farrahi and Daniel Gatica-Perez. Probabilistic mining of socio-geographic routines from mobile phone data. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 4(4), 8 2010. [ .pdf ]
[656] Katayoun Farrahi, Remi Emonet, and Alois Ferscha. Socio-technical network analysis from wearable interactions. In International Symposium on Wearable Computers, June 2012. [ .pdf ]
[657] Katayoun Farrahi and Daniel Gatica-Perez. Extracting mobile behavioral patterns with the distant n-gram topic model. In Proceedings of the IEEE International Symposium on Wearable Computers, June 2012. [ .pdf ]
[658] Katayoun Farrahi and Daniel Gatica-Perez. A probabilistic approach to mining mobile phone data sequences. Personal and Ubiquitous Computing, December 2012. [ .pdf ]
[659] Katayoun Farrahi and Daniel Gatica-Perez. Mining human location-routines using a multi-level approach to topic modeling. In 2010 IEEE Second International Conference on Social Computing, SIN Symposium, 8 2010. [ .pdf ]
[660] Katayoun Farrahi. A Probabilistic Approach to Socio-Geographic Reality Mining. PhD thesis, Ecole Polytechnique Fédérale de Lausanne, 2011. [ .pdf ]
[661] B. Fasel. Fast multi-scale face detection. Idiap-Com Idiap-Com-04-1998, IDIAP, 1998. [ .ps.gz | .pdf ]
[662] B. Fasel and Juergen Luettin. Recognition of Asymmetric Facial Action Unit activities and intensities. In Proceedings of the International Conference on Pattern Recognition (ICPR 2000) [3030]. IDIAP-RR 99-22. [ .ps.gz | .pdf ]
[663] B. Fasel. Robust Face Analysis using Convolutional neural networks. In Proceedings of the International Conference on Pattern Recognition (ICPR 02) [3031]. IDIAP-RR 01-48. [ .ps.gz | .pdf ]
[664] B. Fasel. Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional neural networks. In International IEEE Workshop on Neural Networks for Signal Processing (NNSP 02) [3032]. IDIAP-RR 01-49. [ .ps.gz | .pdf ]
[665] B. Fasel. Head-Pose Invariant Facial Expression Recognition using Convolutional neural networks. In International IEEE Conference on Multimodal Interfaces (ICMI 02) [3033]. IDIAP-RR 02-51. [ .ps.gz | .pdf ]
[666] B. Fasel. Mutliscale Facial Expression Recognition using Convolutional neural networks. In Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 02) [3034]. IDIAP-RR 02-52. [ .ps.gz | .pdf ]
[667] B. Fasel and Juergen Luettin. Automatic Facial Expression analysis: A Survey. In Pattern Recognition [3035]. IDIAP-RR 99-19. [ .ps.gz | .pdf ]
[668] Sarah Favre. Social network analysis in multimedia indexing: Making sense of people in multiparty recordings. In Proceedings of the Doctoral Consortium of the International Conference on Affective Computing & Intelligent Interaction (ACII), 2009. [ .pdf ]
[669] Sarah Favre, Alfred Dielmann, and Alessandro Vinciarelli. Automatic role recognition in multiparty recordings using social networks and probabilistic sequential models. In ACM International Conference on Multimedia, 2009. [ .pdf ]
[670] Sarah Favre, Hugues Salamin, Alessandro Vinciarelli, Dilek Hakkani Tür, and N. P. Garg. Role recognition for meeting participants: an approach based on lexical information and social network analysis. In ACM International Conference on Multimedia [3036]. To appear in Proceedings of ACM International Conference on Multimedia (2008). [ .pdf ]
[671] Sarah Favre, Hugues Salamin, John Dines, and Alessandro Vinciarelli. Role recognition in multiparty recordings using social affiliation networks and discrete distributions. In International Conference on Multimodal Interfaces [3037]. To appear in Proceedings of ICMI International Conference on Multimodal Interfaces (2008). [ .pdf ]
[672] Sarah Favre. Social Network Analysis for Automatic Role Recognition. PhD thesis, Ecole Polytechnique Fédérale de Lausanne, 12 2010. [ .pdf ]
[673] Marc Ferras and Hervé Bourlard. Mlp-based factor analysis for tandem speech recognition. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 2013. [ .pdf ]
[674] Marc Ferras, Srikanth Madikeri, Petr Motlicek, and Hervé Bourlard. System fusion and speaker linking for longitudinal diarization of tv shows. In Proceedings of 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016), pages 5495--5499. IEEE, March 2016. [ .pdf ]
[675] Marc Ferras, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek, and Hervé Bourlard. A large-scale open-source acoustic simulator for speaker recognition. IEEE Signal Processing Letters, 23(4):527 -- 531, 2016. [ .pdf ]
[676] Marc Ferras, Srikanth Madikeri, and Hervé Bourlard. Speaker diarization and linking of meeting data. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(11):1935--1945, November 2016.
[677] Marc Ferras and Hervé Bourlard. Multi-source posteriors for speech activity detection on public talks. In INTERSPEECH, 2014. [ .pdf ]
[678] Marc Ferras, Stefano Masneri, Oliver Schreer, and Hervé Bourlard. Diarizing large corpora using multi-modal speaker linking. In INTERSPEECH 2014, 2014. [ .pdf ]
[679] Marc Ferras, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek, and Hervé Bourlard. Inter-task system fusion for speaker recognition. In Proceeedings of the INTERSPEECH, 2016. [ .pdf ]
[680] Marc Ferras and Hervé Bourlard. Speaker diarization and linking of large corpora. In Proceedings of the IEEE Workshop on Spoken Language Technology, 2012. [ .pdf ]
[681] Ana Cláudia Barbosa Honório Ferreira, Danton Diego Ferreira, Henrique Ceretta Oliveira, Igor Carvalho de Resende, André Anjos, and Maria Helena Baena de Moraes Lopes. Competitive neural layer-based method to identify people with high risk for diabetic foot. Computers in Biology and Medicine, 120, May 2020. [ DOI | http | .pdf ]
[682] Pierre W. Ferrez and José del R. Millán. You are wrong!---automatic detection of interaction errors from brain waves. In Proceedings of the 19th International Joint Conference on Artificial Intelligence, Edinburgh, UK, 8 2005. [ .pdf ]
[683] Pierre W. Ferrez, Ferran Galán, Anna Buttfield, S. L. González Andino, R. Grave de Peralta, and José del R. Millán. High frequency bands and estimated local field potentials to improve single-trial classification of electroencephalographic signals. In Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, Graz, Austria, 9 2006. [ .pdf ]
[684] Pierre W. Ferrez and José del R. Millán. Error-related eeg potentials generated during simulated brain-computer interaction. IEEE Trans. on Biomedical Engineering, 55(3), 2008. [ .pdf ]
[685] Pierre W. Ferrez and José del R. Millán. Simultaneous real-time detection of motor imagery and error-related potentials for improved bci accuracy. In Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008. [ .pdf ]
[686] Pierre W. Ferrez and José del R. Millán. Eeg-based brain-computer interaction: Improved accuracy by automatic single-trial error detection. In Advances in Neural Information Processing Systems 21, 2007. [ .pdf ]
[687] Pierre W. Ferrez. Error-related EEG potentials in brain-computer interfaces. PhD thesis, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 2007. PhD Thesis #3928 at the École Polytechnique Fédérale de Lausanne. [ .pdf ]
[688] Jason Ferris, Cheneal Puljević, Florian Labhart, Adam Winstock, and Emmanuel Kuntsche. The role of sex and age on pre-drinking: An exploratory international comparison of 27 countries. Alcohol and Alcoholism, 54(4):378–385, July 2019. [ DOI ]
[689] F. Ficuciello, P. Falco, and Sylvain Calinon. A brief survey on the role of dimensionality reduction in manipulation learning and control. IEEE Robotics and Automation Letters (RA-L), 3(3):2608--2615, 2018. [ DOI | http | .pdf ]
[690] Emile Fiesler. Neural network formalization. Idiap-RR Idiap-RR-01-1992, IDIAP, 1992. [ .ps.gz | .pdf ]
[691] Emile Fiesler. Neural network classification and formalization. Computer Standards & Interfaces, 16(03), 6 1994.
[692] Emile Fiesler and R. Beale, editors. Handbook of Neural Computation. The Computational Intelligence Library. Institute of Physics and Oxford University Press, New York, New York, 1996. The electronic version is expected in early 1997.
[693] Emile Fiesler. Neural network topologies. In Emile Fiesler and R. Beale, editors, Handbook of Neural Computation, The Computational Intelligence Library, chapter B2. New York, New York, 1996.
[694] Emile Fiesler and K. Cios. Supervised ontogenic networks. In Emile Fiesler and R. Beale, editors, Handbook of Neural Computation, The Computational Intelligence Library, chapter C1.7. New York, New York, 1996.
[695] Emile Fiesler. CRC Comprehensive Dictionary of Electrical Engineering. CRC Press, Boca Raton, Florida, 1997. Contributing Author: E. Fiesler.
[696] Emile Fiesler and Michel Maignan. A connectionist system for two-dimensional representation of multivariate location data. In Proceedings of the Fifth International Workshop on Artificial Intelligence for High Energy Physics, Amsterdam, The Netherlands, 1997. AIHENP, Elsevier Science.
[697] Ailbhe Finnerty, Skanda Muralidhar, Laurent Son Nguyen, Fabio Pianesi, and Daniel Gatica-Perez. Stressful first impressions in job interviews. In Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 325--332, November 2016. [ .pdf ]
[698] Evann Courdier and Francois Fleuret. Real-time segmentation networks should be latency aware. In Asian Conference on Computer Vision, 2020. [ .pdf ]
[699] Francois Fleuret, Philip Abbet, Charles Dubout, and Leonidas Lefakis. The mash project. In European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011. [ .pdf ]
[700] Suraj Srinivas and Francois Fleuret. Rethinking the role of gradient-based attribution methods for model interpretability. In International Conference on Learning Representations, 2021. [ .pdf ]
[701] Francois Fleuret and Donald Geman. Stationary features and cat detection. In Journal of Machine Learning Research [3038].
[702] Emtiyaz Khan, Pierre Baqué, Francois Fleuret, and Pascal Fua. Kullback-leibler proximal variational inference. In Proceedings of the international conference on Neural Information Processing Systems, pages 3402--3410, 2015. [ .pdf ]
[703] Francois Fleuret, Ting Li, Charles Dubout, Emma K. Wampler, Steven Yantis, and Donald Geman. Comparing machines and humans on a visual categorization test. Proceedings of the National Academy of Sciences, 2011.
[704] Francois Fleuret. Multi-layer boosting for pattern recognition. In Pattern Recognition Letter [3039].
[705] Francois Fleuret, Horesh Ben Shitrit, and Pascal Fua. Re-identification for improved people tracking. In Person Re-Identification, pages 311--336. Springer, 2014.
[706] Francois Fleuret, Jerome Berclaz, Richard Lengagne, and Pascal Fua. Multi-camera people tracking with a probabilistic occupancy map. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(2), 2 2008.
[707] Pietro Florio, Giuseppe Peronato, A. T. D. Perera, Anthony Di Blasi, Kin Ho Poon, and Jérôme Kämpf. Designing and assessing solar energy neighborhoods from visual impact. Sustainable Cities and Society, 2021. [ DOI | http ]
[708] Michele Focchi, Andrea del Prete, I. Havoutis, Roy Featherstone, D. G. Caldwell, and Claudio Semini. High-slope terrain locomotion for torque-controlled quadruped robots. Autonomous Robots, 2016. [ DOI | http ]
[709] Marco Fornoni, Barbara Caputo, and Francesco Orabona. Multiclass latent locally linear support vector machines. In Cheng Soon Ong and Tu-Bao Ho, editors, JMLR W&CP, Volume 29: Asian Conference on Machine Learning, pages 229--244, 2013. [ http | .pdf ]
[710] Marco Fornoni and Barbara Caputo. Indoor scene recognition using task and saliency-driven feature pooling. In Proceedings of the British Machine Vision Conference, September 2012. [ .pdf ]
[711] Marco Fornoni, Jesus Martinez-Gomez, and Barbara Caputo. A multi cue discriminative approach to semantic place classification. In CLEF 2010 Notebook Papers/LABs/Workshops, 9 2010. [ .pdf ]
[712] Marco Fornoni and Barbara Caputo. Scene recognition with naive bayes non-linear learning. In Proceedings of the 22nd International Conference on Pattern Recognition, pages 3404 -- 3409. IEEE, August 2014. [ DOI | .pdf ]
[713] Marco Fornoni. Saliency-based Representations and Multi-component Classifiers for Visual Scene Recognition. PhD thesis, École Polytechnique Fédérale de Lausanne (EPFL), October 2014. [ .pdf ]
[714] Cecile Fougeron, Nicolas Audibert, Ina Kodrasi, Parvaneh Janbakhshi, Michaela Pernon, Nathalie Leveque, Stephanie Borel, Marina Laganaro, Hervé Bourlard, and Frederic Assal. Comparison of 5 methods for the evaluation of intelligibility in mild to moderate french dysarthric speech. In Annual Conference of the International Speech Communication Association, September 2022.
[715] Petr Fousek and Hynek Hermansky. Towards asr based on hierarchical posterior-based keyword recognition. Idiap-RR Idiap-RR-64-2005, IDIAP, 2005. Submitted to ICASSP 2006. [ .ps.gz | .pdf ]
[716] Petr Fousek, Petr Svojanovsky, Frantisek Grezl, and Hynek Hermansky. New nonsense syllables database -- analyses and preliminary asr experiments. In Proceedings of International Conference on Spoken Language Processing (ICSLP) [3040]. IDIAP-RR 2004-29. [ .ps.gz | .pdf ]
[717] Frank Formaz, Manish Goyal, and Olivier Bornet. Development of a DTW based Speech Recognition System over the telephone line. Idiap-Com Idiap-Com-05-2001, IDIAP, 2001. [ .ps.gz | .pdf ]
[718] Frank Formaz and Norbert Crettol. The idiap multimedia file server. Idiap-Com Idiap-Com-05-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[719] Denise Frauendorfer, Marianne Schmid Mast, Dairazalia Sanchez-Cortes, and Daniel Gatica-Perez. Emergent power hierarchies and group performance. International Journal of Psychology, 50(5):392–396, October 2015. Published online: 7 OCT 2014. [ DOI | http | .pdf ]
[720] Denise Frauendorfer, Marianne Schmid Mast, Laurent Son Nguyen, and Daniel Gatica-Perez. Nonverbal social sensing in action: Unobtrusive recording and extracting of nonverbal behavior in social interactions illustrated with a research example. Journal of Nonverbal Behavior, 38(2):231--245, June 2014. [ DOI | .pdf ]
[721] Anna Freeman, Alastair Watson, Paul O'Reagan, Oskar Wysocki, Hannah Burke, Andre Freitas, and et al. Wave comparisons of clinical characteristics and outcomes of covid-19 admissions - exploring the impact of treatment and strain dynamics. Journal of Clinical Virology, January 2022.
[722] Gerald Friedland, Chuohao Yeo, and Hayley Hung. Visual speaker localization aided by acoustic models. In ACM Multimedia, 2009.
[723] Gerald Friedland, Hayley Hung, and Chuohao Yeo. Multi-modal speaker diarization of real-world meetings using compressed-domain video features. In International Conference on Audio, Speech and Signal Processing, 2009. [ .pdf ]
[724] Gerald Friedland, Adam Janin, David Imseng, Xavier Anguera, Luke Gottlieb, Marijn Huijbregts, Mary Tai Knox, and Oriol Vinyals. The icsi rt-09 speaker diarization system. IEEE Transactions on Audio, Speech, and Language Processing, 20(2):371--381, February 2012. [ DOI ]
[725] Julian Fritsch, Sebastian Wankerl, and Elmar Nöth. Automatic diagnosis of alzheimer's disease using neural network language models. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2019. [ .pdf ]
[726] Julian Fritsch, S. Pavankumar Dubagunta, and Mathew Magimai.-Doss. Estimating the degree of sleepiness by integrating articulatory feature knowledge in raw waveform based cnns. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP) [3041]. [ .pdf ]
[727] Julian Fritsch, Guillem Quer, and Mathew Magimai.-Doss. Probabilistic symbol sequence matching and its application to pathological speech intelligibility assessment. Idiap-RR Idiap-RR-01-2021, Idiap, 1 2021. [ .pdf ]
[728] Julian Fritsch and Mathew Magimai.-Doss. Utterance verification-based dysarthric speech intelligibility assessment using phonetic posterior features. IEEE Signal Processing Letters, 28:224 -- 228, 2021. [ DOI | .pdf ]
[729] Livia Fritz, Ulli Vilsmaier, Garance Clement, Laurie Daffe, Anna Pagani, Melissa Pang, Daniel Gatica-Perez, Vincent Kaufmann, Marie Santiago Delefosse, and Claudia R Binder. Explore, engage, empower: Methodological insights into a transformative mixed methods study tackling the covid-19 lockdown. Humanities and Social Sciences Communications, May 2022. [ .pdf ]
[730] Kenneth Alberto Funes Mora and Jean-Marc Odobez. 3d gaze tracking and automatic gaze coding from rgb-d cameras. In IEEE Conference in Computer Vision and Pattern Recognition, Vision Meets Cognition Workshop, June 2014. [ .pdf ]
[731] Kenneth Alberto Funes Mora and Jean-Marc Odobez. Geometric generative gaze estimation (g3e) for remote rgb-d cameras. In IEEE Computer Vision and Pattern Recognition Conference, pages 1773--1780. IEEE, June 2014. [ DOI | .pdf ]
[732] Kenneth Alberto Funes Mora, Florent Monay, and Jean-Marc Odobez. Eyediap: A database for the development and evaluation of gaze estimation algorithms from rgb and rgb-d cameras. In Proceedings of the ACM Symposium on Eye Tracking Research and Applications. ACM, March 2014. [ DOI | .pdf ]
[733] Kenneth Alberto Funes Mora, Laurent Son Nguyen, Daniel Gatica-Perez, and Jean-Marc Odobez. A semi-automated system for accurate gaze coding in natural dyadic interactions. In 15th ACM International Conference on Multimodal Interaction. ACM, December 2013. [ DOI | .pdf ]
[734] Kenneth Alberto Funes Mora. 3d head pose and gaze tracking and their application to diverse multimodal tasks. In Doctoral consortium of the 15th ACM International Conference on Multimodal Interaction, December 2013. [ DOI ]
[735] Kenneth Alberto Funes Mora, Florent Monay, and Jean-Marc Odobez. Eyediap database: Data description and gaze tracking evaluation benchmarks. Idiap-RR Idiap-RR-08-2014, Idiap, 5 2014. [ .pdf ]
[736] Kenneth Alberto Funes Mora and Jean-Marc Odobez. Gaze estimation in the 3d space using rgb-d sensors. towards head-pose and user invariance. International Journal of Computer Vision, 118(2):194--216, June 2016. First online: 13 November 2015. [ DOI | http | .pdf ]
[737] Kenneth Alberto Funes Mora. 3D Gaze Estimation from Remote RGB-D Sensors. PhD thesis, École Polytechnique Fédérale de Lausanne, October 2015. Thèse EPFL, n° 6680. [ DOI | .pdf ]
[738] Kenneth Alberto Funes Mora and Jean-Marc Odobez. Gaze estimation from multimodal kinect data. In IEEE Conference in Computer Vision and Pattern Recognition, Workshop on Gesture Recognition, June 2012. [ DOI | .pdf ]
[739] Kenneth Alberto Funes Mora and Jean-Marc Odobez. Person independent 3d gaze estimation from remote rgb-d cameras. In International Conference on Image Processing. IEEE, September 2013. [ DOI | .pdf ]
[740] L. Fusco, Kevin C. Smith, F. Benmansour, Riwal Lefort, Francois Fleuret, Pascal Fua, and O. Pertz. Morphodynamic profiling to explore spatio-temporal signaling networks regulating neurite outgrowth. In 1st International SystemsX.ch Conference on Systems Biology, 2011.
[741] Tamas Gabor Csapo, Geza Nemeth, Milos Cernak, and Philip N. Garner. Modeling unvoiced sounds in statistical parametric speech synthesis with a continuous vocoder. In Proc. of EUSIPCO, 2016. [ .pdf ]
[742] Ferran Galán, J. Palix, Ricardo Chavarriaga, Pierre W. Ferrez, Eileen Lew, C. A. Hauert, and José del R. Millán. Visuo-spatial attention frame recognition for brain-computer interfaces. In Proceedings of the 1st International Conference on Cognitive Neurodynamics, Shanghai, China, 11 2007. [ .pdf ]
[743] Ferran Galán, Pierre W. Ferrez, Francesc Oliva, Joan Guàrdia, and José del R. Millán. Feature extraction for multi-class bci using canonical variates analysis. In Proceedings of the IEEE International Symposium on Intelligent Signal Processing [3042]. Submitted for publication. [ .pdf ]
[744] Ferran Galán, Marnix Nuttin, Eileen Lew, Pierre W. Ferrez, G. Vanacker, Johan Philips, and José del R. Millán. A brain-actuated wheelchair: Asynchronous and non-invasive brain-computer interfaces for continuous control of robots. Clinical Neurophysiology, 2008. [ .pdf ]
[745] Ferran Galán, Marnix Nuttin, Dirk Vanhooydonck, Eileen Lew, Pierre W. Ferrez, Johan Philips, and José del R. Millán. Continuous brain-actuated control of an intelligent wheelchair by human eeg. In In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course [3043]. IDIAP-RR 08-53. [ .pdf ]
[746] Ferran Galán, Francesc Oliva, Joan Guàrdia, Pierre W. Ferrez, and José del R. Millán. Detecting intentional mental transitions in an asynchronous bci. Idiap-RR Idiap-RR-43-2006, IDIAP, 2006. Submitted for publication. [ .ps.gz | .pdf ]
[747] Ferran Galán. Methods for Asynchronous and Non-Invasive EEG-Based Brain-Computer Interfaces. Towards Intelligent Brain-Actuated Wheelchairs. PhD thesis, University of Barcelona, 2008. [ .pdf ]
[748] Javier Galbally, Chris McCool, Julian Fierrez, Sébastien Marcel, and Javier Ortega-Garcia. Hill-climbing attack to an eigenface-based face verification system. In Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009. [ .pdf ]
[749] Javier Galbally, Chris McCool, Julian Fierrez, Sébastien Marcel, and Javier Ortega-Garcia. On the vulnerability of face verification systems to hill-climbing attacks. Pattern Recognition, 2009.
[750] Adrian Galdran, André Anjos, José Dolz, Hadi Chakor, Hervé Lombaert, and Ismail Ben Ayed. The little w-net that could: State-of-the-art retinal vessel segmentation with minimalistic models. Cornell University Pre-print Server, 2020. Also submitted to IEEE TMI. [ http | .pdf ]
[751] Adrian Galdran, André Anjos, José Dolz, Hadi Chakor, Hervé Lombaert, and Ismail Ben Ayed. State-of-the-art retinal vessel segmentation with minimalistic models. Nature Scientific Reports, 12(6174), April 2022. [ DOI | .pdf ]
[752] Sriram Ganapathy, Petr Motlicek, Hynek Hermansky, and Harinath Garudadri. Autoregressive modelling of hilbert envelopes for wide-band audio coding. In AES 124th Convention, Audio Engineering Society [3044]. IDIAP-RR 08-40. [ .ps.gz | .pdf ]
[753] Sriram Ganapathy, Petr Motlicek, Hynek Hermansky, and Harinath Garudadri. Temporal masking for bit-rate reduction in audio codec based on frequency domain linear prediction. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) [3045]. IDIAP-RR 07-48. [ .ps.gz | .pdf ]
[754] Sriram Ganapathy, Petr Motlicek, Hynek Hermansky, and Harinath Garudadri. Spectral noise shaping: Improvements in speech/audio codec based on linear prediction in spectral domain. In INTERSPEECH 2008 [3046]. IDIAP-RR 08-16. [ .ps.gz | .pdf ]
[755] Sriram Ganapathy, Petr Motlicek, and Hynek Hermansky. Mdct for encoding residual signals in frequency domain linear prediction. In Audio Engineering Society (AES,',','), 127th Convention [3047]. [ http | .pdf ]
[756] Sriram Ganapathy, Samuel Thomas, Petr Motlicek, and Hynek Hermansky. Applications of signal analysis using autoregressive models for amplitude modulation. Idiap-RR Idiap-RR-35-2009, Idiap, Rue Marconi 19, 12 2009. [ .pdf ]
[757] Sriram Ganapathy, Petr Motlicek, and Hynek Hermansky. Modified discrete cosine transform for encoding residual signals in frequency domain linear prediction. Idiap-RR Idiap-RR-74-2008, Idiap, 12 2008. [ .pdf ]
[758] Sriram Ganapathy, Petr Motlicek, and Hynek Hermansky. Low-delay error resilient speech coding using sub-band hilbert envelopes. Idiap-RR Idiap-RR-75-2008, Idiap, 12 2008. [ .pdf ]
[759] Sriram Ganapathy, Petr Motlicek, and Hynek Hermansky. Autoregressive models of amplitude modulations in audio compression. In IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING [3048]. [ http | .pdf ]
[760] Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky. Modulation frequency features for phoneme recognition in noisy speech. In Journal of Acoustical Society of America - Express Letters [3049]. [ .pdf ]
[761] Sriram Ganapathy, Petr Motlicek, and Hynek Hermansky. Error resilient speech coding using sub-band hilbert envelopes. In 12th International Conference on Text, Speech and Dialogue, TSD 2009 [3050]. [ .pdf ]
[762] Sriram Ganapathy, Samuel Thomas, Petr Motlicek, and Hynek Hermansky. Applications of signal analysis using autoregressive models for amplitude modulation. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09. IEEE, 10 2009. Digital Object Identifier 10.1109/ASPAA.2009.534649. [ http | .pdf ]
[763] J. Gancet, P. Weiss, G. Antonelli, M. F. Pfingsthorn, Sylvain Calinon, A. Turetta, C. Walen, D. Urbina, S. Govindaraj, P. Letier, X. Martinez, J. Salini, B. Chemisky, G. Indiveri, G. Casalino, P. Di Lillo, E. Simetti, D. De Palma, A. Birk, A. K. Tanwani, I. Havoutis, A. Caffaz, and L. Guilpain. Dexterous undersea interventions with far distance onshore supervision: the dexrov project. In IFAC Conference on Control Applications in Marine Systems (CAMS), pages 414--419, September 2016. [ DOI | http | .pdf ]
[764] J. Gancet, D. Urbina, P. Letier, M. Ilzokvitz, P. Weiss, F. Gauch, G. Antonelli, G. Indiveri, G. Casalino, A. Birk, M. F. Pfingsthorn, Sylvain Calinon, Ajay Kumar Tanwani, A. Turetta, C. Walen, and L. Guilpain. Dexrov: Dexterous undersea inspection and maintenance in presence of communication latencies. In IFAC Workshop on Navigation, Guidance and Control of Underwater Vehicles (NGCUV), volume 48, pages 218--223, 2015.
[765] Gangadhar Garipelli, Ricardo Chavarriaga, and José del R. Millán. Recognition of anticipatory behavior from human eeg. In In proceedings, 4th Intl. Brain-Computer Interface Workshop and Training Course [3051]. IDIAP-RR 08-52. [ .ps.gz | .pdf ]
[766] Gangadhar Garipelli, Ferran Galán, Ricardo Chavarriaga, Pierre W. Ferrez, Eileen Lew, and José del R. Millán. The use of brain-computer interfacing for ambient intelligence. In In the book, Constructing Ambient Intelligence: AmI-07 Workshops Proceedings, Max Muhlhauser, Alois Ferscha, and Erwin Aitenbichler (Eds.,',','), LNCS, Springer Verlag, 2008. [3052]. IDIAP-RR 07-61. [ .ps.gz | .pdf ]
[767] Xiao Gao, J. Silverio, Sylvain Calinon, Miao Li, and Xiaohui Xiao. Bilateral teleoperation with object-adaptive mapping. Complex & Intelligent Systems, 2021. [ .pdf ]
[768] Xiao Gao, J. Silverio, E. Pignat, Sylvain Calinon, Miao Li, and Xiaohui Xiao. Motion mappings for continuous bilateral teleoperation. IEEE Robotics and Automation Letters, 6(3):5048--5055, 2021. [ .pdf ]
[769] Giulia Garau, Silèye O. Ba, Hervé Bourlard, and Jean-Marc Odobez. Investigating the use of visual focus of attention for audio-visual speaker diarisation. In Proceedings of the ACM International Conference on Multimedia, 10 2009. [ .pdf ]
[770] Giulia Garau and Hervé Bourlard. Using audio and visual cues for speaker diarisation initialisation. In International Conference on Acoustics, Speech and Signal Processing, 3 2010. [ .pdf ]
[771] Giulia Garau, Alfred Dielmann, and Hervé Bourlard. Audio–visual synchronisation for speaker diarisation. In International Conference on Speech and Language Processing, Interspeech, 9 2010. [ .pdf ]
[772] Nikhil Garg, Benoit Favre, Korbinian Reidhammer, and Dilek Hakkani Tür. Clusterrank: A graph based method for meeting summarization. Idiap-RR Idiap-RR-09-2009, Idiap, P.O. Box 592, CH-1920 Martigny, Switzerland, 6 2009. [ .pdf ]
[773] Nikhil Garg and Daniel Gatica-Perez. Tagging and retrieving images with co-occurrence models: from corel to flickr. Idiap-RR Idiap-RR-21-2009, Idiap, 8 2009. [ .pdf ]
[774] Nikhil Garg. Co-occurrence models for image annotation and retrieval. Idiap-RR Idiap-RR-22-2009, Idiap, 8 2009. Ecole Polytechnique Fédérale de Lausanne - Master Thesis. [ .pdf ]
[775] Gangadhar Garipelli, Ricardo Chavarriaga, and José del R. Millán. Fast recognition of anticipation related potentials. IEEE Transactions on Biomedical Engineering, 2008. In press. [ .pdf ]
[776] Philip N. Garner. Snr features for automatic speech recognition. In Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding [3053]. [ .pdf ]
[777] Philip N. Garner, Milos Cernak, and Blaise Potard. A simple continuous excitation model for parametric vocoding. Idiap-RR Idiap-RR-03-2015, Idiap, 1 2015. [ .pdf ]
[778] Philip N. Garner. A map approach to noise compensation of speech. Idiap-RR Idiap-RR-08-2009, Idiap, 6 2009. [ .pdf ]
[779] Philip N. Garner and David Imseng. Statistical models for hmm/ann hybrids. Idiap-RR Idiap-RR-11-2013, Idiap, 4 2013. [ .pdf ]
[780] Philip N. Garner. Combining the snr spectrum with a cochlear model. Idiap-RR Idiap-RR-14-2018, Idiap, 9 2018. [ .pdf ]
[781] Philip N. Garner and John Dines. Tracter: A lightweight dataflow framework. In Proceedings of Interspeech [3056]. [ .pdf ]
[782] Philip N. Garner, David Imseng, and Thomas Meyer. Automatic speech recognition and translation of a swiss german dialect: Walliserdeutsch. In Proceedings of Interspeech, Singapore, September 2014. [ .pdf ]
[783] Philip N. Garner. Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition. In Speech Communication [3057], pages 991--1001. [ DOI | .pdf ]
[784] Philip N. Garner, Milos Cernak, and Petr Motlicek. A simple continuous pitch estimation algorithm. IEEE Signal Processing Letters, 20(1):102--105, January 2013. [ http | .pdf ]
[785] Philip N. Garner, Rob Clark, Jean-Philippe Goldman, Pierre-Edouard Honnet, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli, and Junichi Yamagishi. Translation and prosody in swiss languages. In Nouveaux cahiers de linguistique française, 2014. [ .pdf ]
[786] Philip N. Garner. Bayesian Approaches to Uncertainty in Speech Processing. PhD thesis, School of Computing Sciences, University of East Anglia, September 2011. [ .pdf ]
[787] Philip N. Garner and Sibo Tong. A bayesian approach to recurrence in neural networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(8):2527--2537, 2021. [ DOI | .pdf ]
[788] Huseyn Gasimov, Petr Motlicek, and Hervé Bourlard. Who wants to be a millionaire? (ii). Idiap-Com Idiap-Com-02-2013, Idiap, Rue Marocni 19, Martigny, Switzerland, 2 2013. [ .pdf ]
[789] Daniel Gatica-Perez. Modeling interest in face-to-face conversations from multimodal nonverbal behavior. In In J.-P. Thiran, H. Bourlard, and F. Marques, (Eds.,',','), Multimodal Signal Processing, Academic Press. Academic Press, 2009. [ .pdf ]
[790] Daniel Gatica-Perez, Oya Aran, and Dinesh Babu Jayagopi. Analysis of small groups. In Social Signal Processing, pages 349--367. Cambridge University Press. Editors J. Burgoon, N. Magnenat-Thalmann, M. Pantic, and A. Vinciarelli, 2017. [ DOI ]
[791] Daniel Gatica-Perez, Carlos Pallan Gayol, Stephane Marchand-Maillet, Jean-Marc Odobez, Edgar Roman-Rangel, Guido Krempel, and Nikolai Grube. The maaya project: Multimedia analysis and access for documentation and decipherment of maya epigraphy. In Proc. Digital Humanities Conference, July 2014. [ .pdf ]
[792] Daniel Gatica-Perez, Salvador Ruiz-Correa, and Darshan Santani. What tripadvisor can't tell: Crowdsourcing urban impressions for whole cities. In Alessia de Biase, Nancy Ottaviano, and Ornella Zaza, editors, Digital Polis. L'Oeil d'Or (translated to French.), 2018. [ .pdf ]
[793] Daniel Gatica-Perez, Darshan Santani, Joan-Isaac Biel, and Thanh-Trung Phan. Social multimedia, diversity, and global south cities: A double blind side. In Proc. ACM Workshop on Fairness, Accountability, and Transparency in Multimedia (FAT/MM), October 2019. [ .pdf ]
[794] Daniel Gatica-Perez and Jean-Marc Odobez. Visual attention, speaking activity, and group conversational analysis in multi-sensor environments. In In H. Nakashima, J. Augusto, H. Aghajan (Eds.,',','), Handbook of Ambient Intelligence and Smart Environments. Springer, 2010.
[795] Daniel Gatica-Perez. Signal processing in the workplace. IEEE Signal Processing Magazine, 32(1):121--125, January 2015. [ .pdf ]
[796] Daniel Gatica-Perez, Gulcan Can, Rui Hu, Stephane Marchand-Maillet, Jean-Marc Odobez, Carlos Pallan Gayol, and Edgar Roman-Rangel. Maaya: Multimedia methods to support maya epigraphic analysis. In Diego Jimenez-Badillo, editor, Arqueologia computacional: Nuevos enfoques para el analisis y la difusion del patrimonio cultural. INAH-RedTDPC, 2017. [ .pdf ]
[797] Daniel Gatica-Perez, Joan-Isaac Biel, David Labbe, and Nathalie Martin. Discovering eating routines in context with a smartphone app. In Ubicomp/Iswc'19 Adjunct: Proceedings Of The 2019 Acm International Joint Conference On Pervasive And Ubiquitous Computing And Proceedings Of The 2019 Acm International Symposium On Wearable Computers, pages 422--429, September 2019. [ DOI | .pdf ]
[798] Daniel Gatica-Perez. Automatic nonverbal analysis of social interaction in small groups: A review. Image and Vision Computing, Special Issue on Human Behavior, 27(12), 12 2009. [ .pdf ]
[799] Daniel Gatica-Perez, Edgar Roman-Rangel, Jean-Marc Odobez, and Carlos Pallan. New world, new worlds: Visual analysis of pre-columbian pictorial collections. In Proceedings of the International Workshop on Multimedia for Cultural Heritage. Springer CCIS series book, April 2011. [ .pdf ]
[800] Daniel Gatica-Perez, Dairazalia Sanchez-Cortes, Trinh-Minh-Tri Do, Dinesh Babu Jayagopi, and Kazuhiro Otsuka. Vlogging over time: Longitudinal impressions and behavior in youtube. In Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, pages 37--46, November 2018. [ DOI | .pdf ]
[801] Daniel Gatica-Perez, Ming-Ting Sun, and Alexander Loui. Probabilistic home video structuring: Feature selection and performance evaluation. In IEEE International Conference on Image Processing [3059]. [ .ps.gz | .pdf ]
[802] Daniel Gatica-Perez and Ming-Ting Sun. Linking objects in videos by importance sampling. In IEEE International Conference on Multimedia and Expo [3060]. [ .ps.gz | .pdf ]
[803] Daniel Gatica-Perez, Alexander Loui, and Ming-Ting Sun. Finding structure in home videos by probabilistic hierarchical clustering. In IEEE Transactions on Circuits and Systems for Video Technology [3061]. IDIAP-RR 02-22. [ .ps.gz | .pdf ]
[804] Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan, Jean-Marc Odobez, and Darren Moore. Audio-visual speaker tracking with importance particle filters. In IEEE International Conference on Image Processing (ICIP) [3062]. [ .ps.gz | .pdf ]
[805] Daniel Gatica-Perez, Iain A. McCowan, Mark Barnard, Samy Bengio, and Hervé Bourlard. On automatic annotation of meeting databases. In IEEE International Conference on Image Processing (ICIP) [3063]. [ .ps.gz | .pdf ]
[806] Daniel Gatica-Perez and Ming-Ting Sun. Object localization in metric spaces for video linking. In IEEE Workshop on Motion and Video Computing [3064]. [ .ps.gz | .pdf ]
[807] Daniel Gatica-Perez, Guillaume Lathoud, Iain A. McCowan, and Jean-Marc Odobez. A mixed-state i-particle filter for multi-camera speaker tracking. In IEEE Int. Conf. on Computer Vision Workshop on Multimedia Technologies for E-Learning and Collaboration (ICCV-WOMTEC) [3065]. [ .ps.gz | .pdf ]
[808] Daniel Gatica-Perez, Napat Triroj, Jean-Marc Odobez, Alexander Loui, and Ming-Ting Sun. Assessing scene structuring in consumer videos. In Int. Conf. on Image and Video Retrieval (CIVR) [3066]. [ .ps.gz | .pdf ]
[809] Daniel Gatica-Perez, Iain A. McCowan, Dong Zhang, and Samy Bengio. Detecting group interest-level in meetings. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) [3067]. Submitted for publication. [ .ps.gz | .pdf ]
[810] Daniel Gatica-Perez, Jean-Marc Odobez, Silèye O. Ba, Kevin C. Smith, and Guillaume Lathoud. Tracking people in meetings with particles. In Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS,',','), invited paper [3068]. in Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS,',','), invited paper, Montreux, Apr. 2005. [ .ps.gz | .pdf ]
[811] Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez, and Iain A. McCowan. Multimodal multispeaker probabilistic tracking in meetings. In Proc. Int. Conf. on Multimodal Interfaces (ICMI) [3069]. in Proc. Int. Conf. on Multimodal Interfaces (ICMI,',','), Trento, Oct. 2005. [ .ps.gz | .pdf ]
[812] Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez, and Iain A. McCowan. Audio-visual probabilistic tracking of multiple speakers in meetings. In IEEE Trans. on Audio, Speech, and Language Processing, accepted for publication. [3070]. IDIAP-RR 05-27. [ .ps.gz | .pdf ]
[813] Daniel Gatica-Perez. Analyzing group interactions in conversations: a review. In IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI) [3071]. IDIAP-RR 06-63. [ .ps.gz | .pdf ]
[814] Cédric Gaudard, Guillermo Aradilla, and Hervé Bourlard. Speech recognition based on template matching and phone posterior probabilities. Idiap-Com Idiap-Com-02-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[815] Paul Gay, Gregor Dupuy, Jean-Marc Odobez, Sylvain Meignier, and Paul Deleglise. Comparison of two methods for unsupervised person identification in tv shows. In 12th International Workshop on Content-Based Multimedia Indexing, 2014. [ .pdf ]
[816] Paul Gay, Sylvain Meignier, Paul Deleglise, and Jean-Marc Odobez. Crf-based context modeling for person identification in broadcast videos. Frontiers in ICT: Computer Image Analysis, 3, 2016. [ .pdf ]
[817] Paul Gay, Elie Khoury, Sylvain Meignier, Jean-Marc Odobez, and Paul Deleglise. A conditional random field approach for audio-visual people diarization. In Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 116 -- 120. IEEE, May 2014. [ DOI | .pdf ]
[818] Paul Gay, Elie Khoury, Sylvain Meignier, Jean-Marc Odobez, and Paul Deleglise. Face identification from overlaid texts using local face recurrent patterns and crf models. In IEEE International Conference on Image Processing 2014. IEEE, October 2014. [ .pdf ]
[819] Gérard Chollet. Les domaines d'application des technologies vocales. In Fondements et perspectives en traitement automatique de la parole. GDR-PRC Communication Homme-Machine, 1995.
[820] Dominique Genoud, Miguel Moreira, and Eddy Mayoraz. Text dependent speaker verification using binary classifiers. In Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing --- ICASSP'98 [3072]. IDIAP-RR 97-08. [ .ps.gz | .pdf ]
[821] Dominique Genoud, Guillaume Gravier, Frédéric Bimbot, and Gérard Chollet. Combining methods to improve speaker verification decision. Idiap-RR Idiap-RR-02-1996, IDIAP, 1996. [ .ps.gz | .pdf ]
[822] Dominique Genoud, Guillaume Gravier, Frédéric Bimbot, and Gérard Chollet. Amelioration des performances de verification du locuteur par combinaison de methodes. In JEP, editor, Journees d'etudes sur la parole, Avignon, 6 1996. JEP.
[823] Dominique Genoud, Frédéric Bimbot, Guillaume Gravier, and Gérard Chollet. Combining methods to improve speaker verification decision. In ICSLP, editor, Proceedings of The Fourth International Conference on Spoken Language Processing, Philadelphia, 1996. ICSLP, ICSLP. [ .ps.gz | .pdf ]
[824] Dominique Genoud and Gilles Caloz. 1997 NIST evaluation: Text independent speaker detection (verification). Idiap-Com Idiap-Com-03-1997, IDIAP, 1997. [ .ps.gz | .pdf ]
[825] Frédéric Bimbot and Dominique Genoud. Likelihood ratio adjustment for the compensation of model mismatch in speaker verification. In Eurospeech 97 [3073]. IDIAP-RR 97-05. [ .ps.gz | .pdf ]
[826] Dominique Genoud. Reconnaissance et Transformation de Locuteurs. PhD thesis, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 1 1999. [ .pdf ]
[827] Dominique Genoud and Gérard Chollet. Deliberate imposture: a challenge for automatic speaker verification systems. In Proceedings of the European Conference on Speech Communication and Technology, 1999.
[828] Dominique Genoud and Gérard Chollet. Speech pre-processing against intentional imposture in speaker recognition. In Proceedings of ICSLP, Sidney, 1998.
[829] Dominique Genoud and Gérard Chollet. Voice transformation, a tool for imposture of speaker verification. In Proceedings of International Phonetic Science conference IPS98, Washington, 1998.
[830] Anjith George and Sébastien Marcel. Cross modal focal loss for rgbd face anti-spoofing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [ .pdf ]
[831] Anjith George and Sébastien Marcel. Deep pixel-wise binary supervision for face presentation attack detection. In International Conference on Biometrics, 2019. [ .pdf ]
[832] Anjith George, David Geissenbuhler, and Sébastien Marcel. A comprehensive evaluation on multi-channel biometric face presentation attack detection. Idiap-RR Idiap-RR-02-2022, Idiap, 2 2022. [ .pdf ]
[833] Anjith George and Sébastien Marcel. Robust face presentation attack detection with multi-channel neural networks. Idiap-RR Idiap-RR-03-2022, Idiap, 3 2022. [ .pdf ]
[834] Anjith George and Sébastien Marcel. Can your face detector do anti-spoofing? face presentation attack detection with a multi-channel face detector. Idiap-RR Idiap-RR-12-2020, Idiap, 6 2020. [ .pdf ]
[835] Anjith George and Sébastien Marcel. Learning one class representations for presentation attack detection using multi-channel convolutional neural networks. Idiap-RR Idiap-RR-15-2020, Idiap, 7 2020. [ .pdf ]
[836] Anjith George and Sébastien Marcel. On the effectiveness of vision transformers for zero-shot face anti-spoofing. In International Joint Conference on Biometrics (IJCB 2021) [3074]. [ .pdf ]
[837] Anjith George and Sébastien Marcel. Multi-channel face presentation attack detection using deep learning. In Deep Learning-Based Face Analytics. Springer International Publishing, 2021. [ .pdf ]
[838] Anjith George, Zohreh Mostaani, David Geissenbuhler, Olegs Nikisins, André Anjos, and Sébastien Marcel. Biometric face presentation attack detection with multi-channel convolutional neural network. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019. [ .pdf ]
[839] Anjith George and Sébastien Marcel. Learning one class representations for face presentation attack detection using multi-channel convolutional neural networks. IEEE Transactions on Information Forensics and Security, 2020. [ .pdf ]
[840] Branislav Gerazov, Aleksandar Gjoreski, Aleksandar Melov, Pierre-Edouard Honnet, Zoran Ivanovski, and Philip N. Garner. Unified prosody model based on atom decomposition for emphasis detection. In Proceedings of ETAI, Struga, Macedonia, September 2016. [ .pdf ]
[841] Branislav Gerazov, Pierre-Edouard Honnet, Aleksandar Gjoreski, and Philip N. Garner. Weighted correlation based atom decomposition intonation modelling. In Proceedings of Interspeech, pages 1601--1605, September 2015. [ .pdf ]
[842] Branislav Gerazov, Gérard Bailly, Omar Mohammed, Yi Xu, and Philip N. Garner. Embedding context-dependent variations of prosodic contours using variational encoding for decomposing the structure of speech prosody. In Workshop on Prosody and Meaning: Information Structure and Beyond, November 2018. [ http | .pdf ]
[843] Branislav Gerazov and Philip N. Garner. An agonist-antagonist pitch production model. In Lecture Notes in Artificial Intelligence: 18th International Conference, SPECOM 2016, volume 9811, pages 84--91, August 2016. [ .pdf ]
[844] Branislav Gerazov and Philip N. Garner. An investigation of muscle models for physiologically based intonation modelling. In Proceedings of the 23rd Telecommunications Forum, pages 468--471, Belgrade, Serbia, November 2015. [ DOI | .pdf ]
[845] Sucheta Ghosh, Milos Cernak, Sarbani Palit, and B. B. Chaudhuri. An analysis of rhythmic staccato-vocalization based on frequency demodulation for laughter detection in conversational meetings. Idiap-RR Idiap-RR-02-2016, Idiap, 1 2016. [ .pdf ]
[846] Arjan Gijsberts, Rashida Bohra, David Sierra González, Alexander Werner, Markus Nowak, Barbara Caputo, Maximo A. Roa, and Claudio Castellini. Stable myoelectric control of a hand prosthesis using non-linear incremental learning. Frontiers in Neurorobotics, 8, February 2014. [ DOI ]
[847] Arjan Gijsberts and Barbara Caputo. Exploiting accelerometers to improve movement classification for prosthetics. In International Conference on Rehabilitation Robotics, 2013. [ .pdf ]
[848] Arjan Gijsberts and Giorgio Metta. Real-time model learning using incremental sparse spectrum gaussian process regression. Neural Networks, August 2012.
[849] Arjan Gijsberts, Manfredo Atzori, Claudio Castellini, Henning Müller, and Barbara Caputo. The movement error rate for evaluation of machine learning methods for semg-based hand movement classification. Transactions on Neural Systems and Rehabilitation Engineering, pages 735 -- 744, July 2014. [ DOI ]
[850] Nicolas Gilardi, Tom Melluish, and Michel Maignan. Confidence evaluation for risk prediction. In 2001 Annual Conference of the IAMG [3075]. IDIAP-RR 01-22. [ .ps.gz | .pdf ]
[851] Nicolas Gilardi, Samy Bengio, and Mikhail Kanevski. Estimation of conditional distributions using gaussian mixture models. Idiap-RR Idiap-RR-03-2002, IDIAP, 2002. Submitted to ICANN 2002. [ .ps.gz | .pdf ]
[852] Nicolas Gilardi and Samy Bengio. Local machine learning models for spatial data analysis. In Journal of Geographic Information and Decision Analysis [3076]. IDIAP-RR 00-34. [ .ps | .pdf ]
[853] Nicolas Gilardi, Samy Bengio, and Mikhail Kanevski. Conditional gaussian mixture models for environmental risk mapping. In IEEE International Workshop on Neural Networks for Signal Processing (NNSP) [3077]. [ .ps.gz | .pdf ]
[854] David Ginsbourger, Olivier Roustant, and Nicolas Durrande. On degeneracy and invariances of random fields paths with applications in gaussian process modelling. Journal of Statistical Planning and Inference, 170:117--128, March 2016. [ DOI ]
[855] David Ginsbourger, Olivier Roustant, Dominic Schuhmacher, Nicolas Durrande, and Nicolas Lenz. On anova decompositions of kernels and gaussian random field paths. In Monte Carlo and Quasi-Monte Carlo Methods, volume 163 of Springer Proceedings in Mathematics & Statistics, pages 315--330. Springer International Publishing, 2016. [ DOI ]
[856] David Ginsbourger, Jean Baccou, Clément Chevalier, and Frédéric Perales. Design of computer experiments using competing distances between set-valued inputs. In mODa 11 - Advances in Model-Oriented Design and Analysis, Contributions to Statistics, pages 123--131. Springer International Publishing, 2016. [ DOI ]
[857] David Ginsbourger. Sequential design of computer experiments. In Wiley StatsRef: Statistics Reference Online. Wiley, 2018. Accepted.
[858] Hakan Girgin, Teguh Santoso Lembono, Radu Cirligeanu, and Sylvain Calinon. Optimization of robot configurations for motion planning in industrial riveting. In Proc. IEEE Intl Conf. on Advanced Robotics (ICAR), 2021. [ .pdf ]
[859] Hakan Girgin, E. Pignat, N. Jaquier, and Sylvain Calinon. Active improvement of control policies with bayesian gaussian mixture model. In Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020.
[860] Hakan Girgin, Julius Jankowski, and Sylvain Calinon. Reactive anticipatory robot skills with memory. In The International Symposium on Robotics Research, 2022. [ .pdf ]
[861] A. Giusti, M. Zeestraten, E. Icer, A. Pereira, D. G. Caldwell, Sylvain Calinon, and M. Althoff. Flexible automation driven by demonstration: Leveraging strategies that simplify robotics. IEEE Robotics and Automation Magazine (RAM), 25(2):18--27, June 2018. [ DOI | http | .pdf ]
[862] Nicolas Gilardi, Mikhail Kanevski, Michel Maignan, and Eddy Mayoraz. Environmental and pollution spatial data classification with support vector machines and geostatistics. In Intelligent techniques for Spatio-Temporal Data Analysis in Environmental Applications. Workshop W07, 1999.
[863] Nicolas Gilardi, Mikhail Kanevski, Michel Maignan, and Eddy Mayoraz. Environmental and pollution spatial data classification with support vector machines and geostatistics. In Geostatistical congress 2000, 2000.
[864] Oren Glickman, Ido Dagan, Mikaela Keller, Samy Bengio, and Walter Daelemans. Investigating lexical substitution scoring for subtitle generation. In Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL). [3078]. IDIAP-RR 06-36. [ .ps.gz | .pdf ]
[865] Frédéric Berthommier, Hervé Glotin, Emmanuel Tessier, and Hervé Bourlard. Interfacing of CASA and partial recognition based on a multistream technique. In ICSLP'98, volume 4. Sidney, 1998. [ .ps.gz | .pdf ]
[866] Hervé Glotin, Emmanuel Tessier, Hervé Bourlard, and Frédéric Berthommier. Reconnaissance multi-bandes de la parole bruitée par couplage entre les niveaux primitifs et d'identification. In Journées Etude Parole - Martigny, 1998. [ .ps.gz | .pdf ]
[867] Hervé Glotin, Emmanuel Tessier, Hervé Bourlard, and Frédéric Berthommier. Reconnaissance robuste de la parole par segmentation signal/bruit en sous-bandes. In Neurosciences et Sciences de l'Ingénieur'98 - Munster, CNRS, 1998. [ .ps.gz | .pdf ]
[868] Hervé Glotin, Frédéric Berthommier, Emmanuel Tessier, and Hervé Bourlard. Interfacing of CASA and multistream recognition. In TSD'98-Text, Speech and Dialog International Workshop. BRNO-Czech Republic, 1998. [ .ps.gz | .pdf ]
[869] Hervé Glotin, Frédéric Berthommier, and Emmanuel Tessier. A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition. In Proc. European Conf. on Speech Communication and Technology (EUROSPEECH), volume 5, 9 1999.
[870] Hervé Glotin and Frédéric Berthommier. Test of several external posterior weighting functions for multiband full combination asr. In Int. Conf. on Spoken Language Processing (ICSLP) [3079]. published in ICSLP 2000.
[871] Hervé Glotin. Various adaptive weighting schemes for large vocabulary robust audio-visual asr, with particular reference to the cocktail party effect. Idiap-Com Idiap-Com-04-2000, IDIAP, 2000.
[872] Hervé Glotin. Robust multi-stream speech recognition based on the combined reliabilities of the speech signal and phonemes estimates. Idiap-RR Idiap-RR-36-2000, IDIAP, 2000. accepted to WISP IEEE 2001 Budapest.
[873] Hervé Glotin, D. Vergyri, C. Neti, G. Potamianos, and Juergen Luettin. Weighting schemes for audio-visual fusion in speech recognition. Idiap-RR Idiap-RR-44-2000, IDIAP, 2000. published in IEEE International Conference on Acoustic, Speech, and Signal Processing. [ .ps.gz | .pdf ]
[874] Frédéric Berthommier and Hervé Glotin. Reconnaissance de la parole dans le bruit après renforcement fondé sur l'harmonicité. In Proceedings of JEP'2000, Aussois, 2000. no IDIAP RR, see RESPITE www.
[875] Jean-Philippe Goldman, Pierre-Edouard Honnet, Rob Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli, and Junichi Yamagishi. The siwis database: a multilingual speech database with acted emphasis. In Proceedings of Interspeech [3080], pages 1532--1535. [ DOI | .pdf ]
[876] Alejandro Gomez-Alanis, Jose A. Gonzalez-Lopez, S. Pavankumar Dubagunta, Antonio M. Peinado, and Mathew Magimai.-Doss. On joint optimization of automatic speaker verification and anti-spoofing in the embedding space. IEEE Transactions on Information Forensics and Security, 16:1579--1593, 2021. [ DOI | .pdf ]
[877] Andreé R. Goncalves, Pavel Korshunov, Ricardo P. V. Violato, Flávio O. Simões, and Sébastien Marcel. On the generalization of fused systems in voice presentation attack detection. In A. Brömme, Christoph Busch, A. Dantcheva, C. Rathgeb, and A. Uhl, editors, 16th International Conference of the Biometrics Special Interest Group, Darmstadt, Germany, September 2017. [ .pdf ]
[878] German Gonzalez, Francois Fleuret, and Pascal Fua. Learning rotational features for filament detection. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2009.
[879] German Gonzalez, Engin Turetken, Francois Fleuret, and Pascal Fua. Delineating trees in noisy 2d images and 3d image stacks. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, page 2799–2806, 2010.
[880] German Gonzalez, Francois Fleuret, and Pascal Fua. Automated delineation of dendritic networks in noisy image stacks. In proceedings of the European Conference on Computer Vision, 2008.
[881] German Gonzalez, L. Fusco, Riwal Lefort, F. Benmansour, Pascal Fua, and Kevin C. Smith. Automated quantification of morphodynamics for high-throughput live cell imaging datasets. In 1st International SystemsX.ch Conference on Systems Biology, 2011.
[882] German Gonzalez, Francois Aguet, Francois Fleuret, Michael Unser, and Pascal Fua. Steerable features for statistical 3d dendrite detection. In Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention, 2009.
[883] Gábor Gosztolya, Tamás Grósz, László Tóth, and David Imseng. Building context-dependent dnn acoustic models using kullback-leibler divergence-based state tying. In Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2015. [ .pdf ]
[884] Baran Gözcü, Afsaneh Asaei, and Volkan Cevher. Manifold sparse beamforming. In IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, pages 113--116. IEEE, December 2013. [ DOI | .pdf ]
[885] Baran Gözcü, rabeeh karimi mahabadi, Yen-Huan Li, Efe Ilicak, Tolga Çukur, Jonathan Scarlett, and Volkan Cevher. Learning-based compressive mri. IEEE Transactions on Medical Imaging, 2018. [ .pdf ]
[886] Eric Grand. Handwritten digits recognition. Idiap-RR Idiap-RR-07-2000, IDIAP, 2000. Travail de diplome 1999 de l'Ecole d'Ingénieurs du Valais à Sion. [ .ps.gz | .pdf ]
[887] Alain Rakotomamonjy, Francis Bach, Stéphane Canu, and Yves Grandvalet. More efficiency in multiple kernel learning. In International Conference on Machine Learning (ICML) [3081]. IDIAP-RR 07-18. [ .ps.gz | .pdf ]
[888] Romain Hérault and Yves Grandvalet. Sparse probabilistic classifiers. In International Conference on Machine Learning (ICML) [3082]. IDIAP-RR 07-19. [ .ps.gz | .pdf ]
[889] Yves Grandvalet, Johnny Mariéthoz, and Samy Bengio. A probabilistic interpretation of svms with an application to unbalanced classification. In Advances in Neural Information Processing Systems, NIPS 15 [3083]. IDIAP-RR 05-26. [ .ps.gz | .pdf ]
[890] Marie Szafranski, Yves Grandvalet, and Pierre Morizet-Mahoudeaux. Hierarchical penalization. In Advances in Neural Information Processing Systems 21 [3084]. IDIAP-RR 07-76. [ .ps.gz | .pdf ]
[891] Marie Szafranski, Yves Grandvalet, and Alain Rakotomamonjy. Composite kernel learning. In McCallum and Roweis [3085]. IDIAP-RR 08-59. [ .pdf ]
[892] Yves Grandvalet, Alain Rakotomamonjy, Joseph Keshet, and Stéphane Canu. Support vector machines with a reject option. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems [3086]. [ .pdf ]
[893] Marcel Granero-Moya, Thanh-Trung Phan, and Daniel Gatica-Perez. Zurich like new: Analyzing open urban multimodal data. In Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data, October 2021. [ .pdf ]
[894] David Grangier and Alessandro Vinciarelli. Making retrieval faster through document clustering. Idiap-RR Idiap-RR-02-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[895] David Grangier and Alessandro Vinciarelli. Noisy text clustering. Idiap-RR Idiap-RR-31-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[896] David Grangier and Alessandro Vinciarelli. Effect of recognition errors on text clustering. Idiap-RR Idiap-RR-82-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[897] David Grangier and Samy Bengio. Inferring document similarity from hyperlinks. In ACM Conference on Information and Knowledge Management [3087]. [ .ps.gz | .pdf ]
[898] David Grangier and Alessandro Vinciarelli. Effect of segmentation method on video retrieval performance. In Proceedings of the 2005 IEEE International Conference on Multimedia and Expo (ICME-05) [3088]. [ .ps.gz | .pdf ]
[899] David Grangier and Samy Bengio. A discriminative decoder for the recognition of phoneme sequences. Idiap-RR Idiap-RR-67-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[900] David Grangier and Samy Bengio. Exploiting hyperlinks to learn a retrieval model. In NIPS Workshop on Learning to Rank [3087]. [ .ps.gz | .pdf ]
[901] David Grangier, Florent Monay, and Samy Bengio. Learning to retrieve images from text queries with a discriminative model. In International Workshop on Adaptive Multimedia Retrieval (AMR) [3089]. [ .ps.gz | .pdf ]
[902] David Grangier, Florent Monay, and Samy Bengio. A discriminative approach for the retrieval of images from text queries. In European Conference on Machine Learning (ECML), 2006. [ .ps.gz | .pdf ]
[903] David Grangier and Samy Bengio. A neural network to retrieve images from text queries. In International Conference on Artificial Neural Networks (ICANN) [3090]. [ .ps.gz | .pdf ]
[904] David Grangier, Florent Monay, and Samy Bengio. A discriminative approach for the retrieval of images from text queries. Idiap-RR Idiap-RR-15-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[905] David Grangier and Samy Bengio. Learning the inter-frame distance for discriminative template-based keyword detection. In International Conference on Speech Communication and Technology (INTERSPEECH), 2007. [ .ps.gz | .pdf ]
[906] David Grangier and Samy Bengio. Learning the inter-frame distance for discriminative template-based keyword detection. Idiap-RR Idiap-RR-15-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[907] David Grangier and Samy Bengio. A discriminative kernel-based model to rank images from text queries. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) [3091]. [ .ps.gz | .pdf ]
[908] David Grangier. Machine Learning for Information Retrieval. Idiap-rr, Ecole Polytechnique Fédérale de Lausanne, 6 2008. Thèse Ecole polytechnique fédérale de Lausanne EPFL, no 4088 (2008,',','), Faculté des sciences et techniques de l'ingénieur STI, Section de génie électrique et électronique, Institut de génie électrique et électronique IEL (Laboratoire de l'IDIAP LIDIAP). Dir.: Hervé Bourlard, Sami Bengio. [ .pdf ]
[909] David Grangier, Joseph Keshet, and Samy Bengio. Discriminative keyword spotting. In Joseph Keshet and Samy Bengio, editors, Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods. John Wiley and Sons, 2009.
[910] Malo Grisard, Petr Motlicek, Wissem Allouchi, Michael Baeriswyl, Alexandros Lazaridis, and Qingran Zhan. Spoken language identification using language bottleneck features. In Proceedings of TSD [3093]. [ .pdf ]
[911] Cristina Grisot and Thomas Meyer. Cross-linguistic annotation of narrativity for english/french verb tense disambiguation. In 9th Edition of the Language Resources and Evaluation Conference, March 2014. [ .pdf ]
[912] Gérard Chollet, Jean-Luc Cochard, Philippe Langlais, and R. van Kommer. Swiss-french polyphone: a telephone speech database to develop interactive voice servers. In Linguistic Databases, Gröningen, 1995.
[913] Etienne Grossmann, José António Gaspar, and Francesco Orabona. Calibration from statistical properties of the visual world. In European Conf. on Computer Vision [3094]. IDIAP-RR 08-63. [ .ps.gz | .pdf ]
[914] Maël Guillemot and Bastien Crettol. From meeting recordings to web distribution: Description of the process. Idiap-Com Idiap-Com-05-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[915] Maël Guillemot, Pierre Wellner, Daniel Gatica-Perez, and Jean-Marc Odobez. A hierarchical keyframe user interface for browsing video over the internet. In Rauterberg et al. [3095]. [ .ps.gz | .pdf ]
[916] Maël Guillemot, Jean-Marc Odobez, and Daniel Gatica-Perez. Algorithms for video structuring. Idiap-Com Idiap-Com-05-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[917] Ait-Hassou Aissa. Hmm inference towards flexible speech recognition. Idiap-Com Idiap-Com-03-2003, IDIAP, 2003.
[918] Maël Guillemot, Jean-Marc Odobez, Alessandro Vinciarelli, and Sandy Ingram. Klewel webcast: from research to growing company. IEEE Multimedia, 22(4):94--99, December 2015. [ .pdf ]
[919] Liane Guillou, Christian Hardmeier, Preslav Nakov, Sara Stymne, Jorg Tiedemann, Yannick Versley, Mauro Cettolo, Bonnie Webber, and Andrei Popescu-Belis. Findings of the 2016 wmt shared task on cross-lingual pronoun prediction. In Proceedings of WMT 2016 (First Conference on Machine Translation), page 525–542. Association for Computational Linguistics, 2016. [ http ]
[920] Manuel Günther, Roy Wallace, and Sébastien Marcel. An open source framework for standardized comparisons of face recognition algorithms. In Fusiello et al. [3096], pages 547--556. The source code to re-generate the results of this paper can be downloaded from the URL below. [ DOI | http | .pdf ]
[921] Manuel Günther, Dennis Haufe, and Rolf P. Würtz. Face recognition with disparity corrected Gabor phase differences. In Alessandro E. P. Villa, Wlodzislaw Duch, Péter Érdi, Francesco Masulli, and Günther Palm, editors, Artificial Neural Networks and Machine Learning, volume 7552 of Lecture Notes in Computer Science, pages 411--418. Springer Berlin, September 2012. [ DOI | .pdf ]
[922] Manuel Günther, Artur Costa-Pazo, Changxing Ding, Elhocine Boutellaa, Giovani Chiachia, Honglei Zhang, Marcus de Assis Angeloni, Vitomir Struc, Elie Khoury, Esteban Vazquez-Fernandez, Dacheng Tao, Messaoud Bengherabi, David Cox, Serkan Kiranyaz, Tiago de Freitas Pereira, Jerneja Zganec-Gros, Enrique Argones-Rúa, Nicolas Pinto, Moncef Gabbouj, Flávio Simões, Simon Dobrisek, Daniel González-Jiménez, Anderson Rocha, Mário Uliani Neto, Nikola Pavesic, Alexandre Falcão, Ricardo Violato, and Sébastien Marcel. The 2013 face recognition evaluation in mobile environment. In The 6th IAPR International Conference on Biometrics [3097]. [ .pdf ]
[923] Manuel Günther, Laurent El Shafey, and Sébastien Marcel. 2d face recognition: An experimental and reproducible research survey. Idiap-RR Idiap-RR-13-2017, Idiap, 4 2017. [ .pdf ]
[924] Manuel Günther, Stefan Böhringer, Dagmar Wieczorek, and Rolf P. Würtz. Reconstruction of images from gabor graphs with applications in facial image processing. Journal of Wavelets, Multiresolution and Information Processing, 13(4):25, July 2015. [ DOI | .pdf ]
[925] Manuel Günther, Laurent El Shafey, and Sébastien Marcel. Face recognition in challenging environments: An experimental and reproducible research survey. In Thirimachos Bourlai, editor, Face Recognition Across the Imaging Spectrum. Springer, 1 edition, February 2016. [ .pdf ]
[926] Anshul Gupta, Samy Tafasca, and Jean-Marc Odobez. A modular multimodal architecture for gaze target prediction: Application to privacy-sensitive settings. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.
[927] E. Gysels, José del R. Millán, Silvia Chiappa, and P. Celka. Studying phase synchrony for classification of mental tasks in brain machine interfaces. In Proceedings of the Conference of the International Society for Brain Electromagnetic Topography, Santa Fe, USA, 11 2003.
[928] Maryam Habibi and Andrei Popescu-Belis. Diverse keyword extraction from conversations. In Proceedings of the ACL 2013 (51th Annual Meeting of the Association for Computational Linguistics ), Short Papers, pages 651--657. ACL, August 2013. [ .pdf ]
[929] Maryam Habibi and Andrei Popescu-Belis. Using crowdsourcing to compare document recommendation strategies for conversations. In RecSys, Recommendation Utility Evaluation (RUE 2012) [3098], pages 15--20. [ .pdf ]
[930] Maryam Habibi and Andrei Popescu-Belis. Enforcing topic diversity in a document recommender for conversations. In Proceedings of the Coling 2014 (25th International Conference on Computational Linguistics), pages 746--759. IEEE, August 2014. [ .pdf ]
[931] Maryam Habibi, Parvaz Mahdabi, and Andrei Popescu-Belis. Question answering in conversations: Query refinement using contextual and semantic information. In Data & Knowledge Engineering Journal [3099]. accepted version by the journal (before copy-editing).
[932] Maryam Habibi, Nikolaos Pappas, and Andrei Popescu-Belis. Topic and sentiment in phrase-based statistical machine translation. Idiap-RR Idiap-RR-10-2017, Idiap, 3 2017. [ .pdf ]
[933] Maryam Habibi and Andrei Popescu-Belis. Keyword extraction and clustering for document recommendation in conversations. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(4):746 -- 759, February 2015. [ DOI | .pdf ]
[934] Maryam Habibi and Andrei Popescu-Belis. Query refinement using conversational context: a method and an evaluation resource. In Proceedings of NLDB 2015 (20th International Conference on Applications of Natural Language to Information Systems), volume 9103 of Lecture Notes in Computer Science, pages 89--102. Springer-Verlag Berlin, 2015. [ DOI | .pdf ]
[935] Maryam Habibi. Modeling Users’ Information Needs in a Document Recommender for Meetings. PhD thesis, EPFL, November 2015. [ .pdf ]
[936] Abdenour Hadid, Nicholas Evans, Sébastien Marcel, and Julian Fierrez. Biometrics systems under spoofing attack: an evaluation methodology and lessons learned. IEEE Signal Processing Magazine, 2015. [ .pdf ]
[937] Cathleen Hagemann, Giulia E. Tyzack, Doaa M. Taha, Helen Devine, Linda Greensmith, Jia Newcombe, Rickie Patani, Andrea Serio, and Raphaelle Luisier. Automated and unbiased discrimination of als from control tissue at single cell resolution. Brain Pathology, 2021.
[938] Astrid Hagen and Hervé Glotin. Etudes comparatives des robustesses au bruit de l'approche 'full combination' et de son approximation. In Journee d'Etudes sur la Parole, Aussois, Aussois, France, 2000. IDIAP-RR 00-04. [ .ps.gz | .pdf ]
[939] Astrid Hagen. Robust speech recognition based on multi-stream processing. Idiap-rr, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 12 2001. thesis # (IDIAP-RR 01-41). [ .ps.gz | .pdf ]
[940] Astrid Hagen, Andrew Morris, and Hervé Bourlard. Subband-based speech recognition in noisy conditions: The full combination approach. Idiap-RR Idiap-RR-15-1998, IDIAP, 1998. IDIAP-RR 98-15. [ .ps.gz | .pdf ]
[941] Astrid Hagen, Andrew Morris, and Hervé Bourlard. Different weighting schemes in the full combination subbands approach for noise robust asr. In Robust Methods for Speech Recognition in Adverse Conditions, Tampere, Finland, 5 1999. IDIAP-RR 99-11. [ .ps.gz | .pdf ]
[942] Saeid Haghighatshoar, Mohammad J. Taghizadeh, and Afsaneh Asaei. A new identity for the least-square solution of overdetermined set of linear equations. Idiap-RR Idiap-RR-35-2015, Idiap, 12 2015. [ .pdf ]
[943] Thomas Hain, Lukas Burget, John Dines, Philip N. Garner, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiat, Mike Lincoln, and Vincent Wan. The amida 2009 meeting transcription system. In Proceedings of Interspeech, 9 2010. [ .pdf ]
[944] Thomas Hain, Lukas Burget, John Dines, Philip N. Garner, Frantisek Grezl, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiat, Mike Lincoln, and Vincent Wan. Transcribing meetings with the amida systems. IEEE Transactions on Audio, Speech, and Language Processing, 20(2):486--498, February 2012. [ DOI | http | .pdf ]
[945] Najeh Hajlaoui. Are act's scores increasing with better translation quality? In Are ACT's scores increasing with better translation quality?, page 6, July 2013. [ .pdf ]
[946] Najeh Hajlaoui and Andrei Popescu-Belis. Translating english discourse connectives into arabic: a corpus-based analysis and an evaluation metric. In Fourth Workshop on Computational Approaches to Arabic Script-based Languages at Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), October 2012. [ .pdf ]
[947] Najeh Hajlaoui and Andrei Popescu-Belis. Assessing the accuracy of discourse connective translations: Validation of an automatic metric. In 14th International Conference on Intelligent Text Processing and Computational Linguistics, volume 7817, pages 236--247. University of the Aegean, Springer, March 2013. [ DOI | .pdf ]
[948] Hamed Ketabdar and Hervé Bourlard. Hierarchical integration of phonetic and lexical knowledge in phone posterior estimation. In ICASSP'08, 2008. [ .ps | .pdf ]
[949] Hamed Ketabdar and Hervé Bourlard. In-context phone posteriors as complementary features for tandem asr. In ICSLP'08, 2008. [ .ps | .pdf ]
[950] Hamed Ketabdar and Hervé Bourlard. Enhanced phone posteriors for improving speech recognition systems. Idiap-RR Idiap-RR-39-2008, IDIAP, 2008. [ .ps | .pdf ]
[951] Bence Halpern, Julian Fritsch, Enno Hermann, Rob Van Son, Odette Scharenborg, and Mathew Magimai.-Doss. An objective evaluation framework for pathological speech synthesis. In Proceedings of ITG Conference on Speech Communication, 2021. [ .pdf ]
[952] Hamed Ketabdar, Jithendra Vepa, Samy Bengio, and Hervé Bourlard. Developing and enhancing posterior based speech recognition systems. In Proceedings of Interspeech [3101]. IDIAP-RR 05-23. [ .ps.gz | .pdf ]
[953] Hamed Ketabdar, Hervé Bourlard, and Samy Bengio. Hierarchical multi-stream posterior based speech recognition system. In Proceedings MLMI workshop [3102]. IDIAP-RR 05-25. [ .ps.gz | .pdf ]
[954] Fahad Haneef, Giovanni Pernigotto, Andrea Gasparella, and Jérôme Kämpf. Application of urban scale energy modelling and multi-objective optimization techniques for building energy renovation at district scale. Sustainability, 13(20), 2021. [ DOI | http ]
[955] Hans Jongebloed. Voicephone: An interactive vocal server for telephone numbers. Idiap-Com Idiap-Com-04-1996, IDIAP and CSE, University of Groningen, 12 1996. [ .pdf ]
[956] Iain A. McCowan, Maganti Hari Krishna, Daniel Gatica-Perez, Darren Moore, and Silèye O. Ba. Speech acquisition in meetings with an audio-visual sensor array. In Pro. IEEE ICME [3103]. IDIAP-RR 05-03. [ .ps.gz | .pdf ]
[957] Hari Krishna Maganti, Jithendra Vepa, and Hervé Bourlard. Continuous microphone array speech recognition on wall street journal corpus. Idiap-RR Idiap-RR-47-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[958] Mike Lincoln, Iain A. McCowan, Jithendra Vepa, and Hari Krishna Maganti. The multi-channel wall street journal audio visual corpus (mc-wsj-av): Specification and initial experiments. Idiap-RR Idiap-RR-69-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[959] Hari Krishna Maganti, Daniel Gatica-Perez, and Iain A. McCowan. Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array. Idiap-RR Idiap-RR-24-2006, IDIAP, Martigny, Switzerland, 2006. [ .ps.gz | .pdf ]
[960] Hari Krishna Maganti and Daniel Gatica-Perez. Speaker localization for microphone array-based asr: The effects of accuracy on overlapping speech. In Proc. Int. Conf. on Multimodal Interfaces (ICMI) [3104]. IDIAP-RR 06-29. [ .ps.gz | .pdf ]
[961] Hari Krishna Maganti, Petr Motlicek, and Daniel Gatica-Perez. Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms. Idiap-RR Idiap-RR-57-2006, IDIAP, Martigny, Switzerland, 2006. [ .ps.gz | .pdf ]
[962] I. Havoutis and Sylvain Calinon. Learning from demonstration for semi-autonomous teleoperation. Autonomous Robots, 43(3):713--726, 2019. [ DOI | http | .pdf ]
[963] Carlos Mastalli, I. Havoutis, Michele Focchi, Claudio Semini, and D. G. Caldwell. Hierarchical planning of dynamic movements without scheduled contact sequences. In Proceedings of the IEEE International Conference of Robotics and Automation, 2016.
[964] I. Havoutis and Sylvain Calinon. Supervisory teleoperation with online learning and optimal control. In Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 1534--1540. IEEE, May 2017. [ http | .pdf ]
[965] I. Havoutis and Sylvain Calinon. Learning assistive teleoperation behaviors from demonstration. In Proc. IEEE International Symposium on Safety, Security and Rescue Robotics, pages 258--263, October 2016. [ .pdf ]
[966] Aleksandra Cerekovic, Oya Aran, and Daniel Gatica-Perez. How do you like your virtual agent?: Human-agent interaction experience through nonverbal features and personality traits. In Human Behavior Understanding, volume 8749 of Lecture Notes in Computer Science, pages 1--15. Springer, 2014. [ .pdf ]
[967] Alexandre Heili, Cheng Chen, and Jean-Marc Odobez. Detection-based multi-human tracking using a crf model. In The Eleventh IEEE International Workshop on Visual Surveillance, 2011. [ .pdf ]
[968] Alexandre Heili, Jagannadan Varadarajan, Bernard Ghanem, Narendra Ahuja, and Jean-Marc Odobez. Improving head and body pose estimation through semi-supervised manifold alignment. In International Conference on Image Processing, 2014. [ .pdf ]
[969] Alexandre Heili, Adolfo Lopez-Mendez, and Jean-Marc Odobez. Exploiting long-term connectivity and visual motion in crf-based multi-person tracking. Idiap-RR Idiap-RR-05-2014, Idiap, 4 2014. [ .pdf ]
[970] Alexandre Heili, Adolfo Lopez Mendez, and Jean-Marc Odobez. Exploiting long-term connectivity and visual motion in crf-based multi-person tracking. Idiap-RR Idiap-RR-06-2014, Idiap, 4 2014. [ .pdf ]
[971] Alexandre Heili and Jean-Marc Odobez. Parameter estimation and contextual adaptation for a multi-object tracking crf model. In IEEE Workshop on Performance Evaluation of Tracking and Surveillance, 2013. [ .pdf ]
[972] Alexandre Heili. Human Tracking and Pose Estimation in Open Spaces. PhD thesis, École Polytechnique Fédérale de Lausanne (EPFL), 2014. [ .pdf ]
[973] Alexandre Heili, Adolfo Lopez-Mendez, and Jean-Marc Odobez. Exploiting long-term connectivity and visual motion in crf-based multi-person tracking. Transactions on Image Processing, 2014. [ .pdf ]
[974] Hartmut Helmke, Matthias Kleinert, Shruthi Shetty, Oliver Ohneiser, heiko Ehr, Hörðdur Arilíusson, Teodor S. Simiganoschi, Amrutha Prasad, Petr Motlicek, Karel Vesely, Karel Ondřej, Pavel Smrz, Julia Harfmann, and Christian Windisch. Readback error detection by automatic speech recognition to increase atm safety. In Fourteenth USA/Europe Air Traffic Management Research and Development Seminar (ATM2021), page 10. The United States Federal Aviation Administration (FAA), EUROCONTROL, September 2021. [ http | .pdf ]
[975] Hartmut Helmke, Shruthi Shetty, Matthias Kleinert, Oliver Ohneiser, heiko Ehr, Amrutha Prasad, Petr Motlicek, Cerna Aneta, and Christian Windisch. Measuring speech recognition and understanding performance in air traffic control domain beyond word error rates. In 11th SESAR Innovation Days, 2021. [ .pdf ]
[976] Coralie Hemptinne. Master thesis: Integration of the harmonic plus noise model (hnm) into the hidden markov model-based speech synthesis system (hts). Idiap-RR Idiap-RR-69-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[977] James Henderson. The unstoppable rise of computational linguistics in deep learning. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 6294--6306. Association for Computational Linguistics, Association for Computational Linguistics, July 2020. [ DOI | http ]
[978] Enno Hermann, Herman Kamper, and Sharon Goldwater. Multilingual and unsupervised subword modeling for zero-resource languages. Computer Speech and Language, 65, January 2021. [ DOI | http ]
[979] Enno Hermann and Mathew Magimai.-Doss. Dysarthric speech recognition with lattice-free mmi. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6109--6113, 2020. [ DOI | http | .pdf ]
[980] Enno Hermann and Sharon Goldwater. Multilingual bottleneck features for subword modeling in zero-resource languages. In Proc. Interspeech, pages 2668--2672, September 2018. [ DOI | .pdf ]
[981] Enno Hermann and Mathew Magimai.-Doss. Handling acoustic variation in dysarthric speech recognition systems through model combination. In Proceedings of Interspeech, 2021. [ .pdf ]
[982] Guillaume Heusch, Fabien Cardinaux, and Sébastien Marcel. Lighting normalization algorithms for face verification. Idiap-Com Idiap-Com-03-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[983] Guillaume Heusch, Fabien Cardinaux, and Sébastien Marcel. Efficient diffusion-based illumination normalization for face verification. Idiap-RR Idiap-RR-46-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[984] Guillaume Heusch, Yann Rodriguez, and Sébastien Marcel. Local binary patterns as an image preprocessing for face authentication. In IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR) [3105]. IDIAP-RR 05-76. [ .ps.gz | .pdf ]
[985] Guillaume Heusch and Sébastien Marcel. Face authentication with Salient Local Features and Static Bayesian network. In IEEE / IAPR Intl. Conf. On Biometrics (ICB) [3106]. IDIAP-RR 07-04. [ .ps.gz | .pdf ]
[986] Guillaume Heusch, André Anjos, and Sébastien Marcel. A reproducible study on remote heart rate measurement. arXiv, September 2017. [ http | .pdf ]
[987] Guillaume Heusch and Sébastien Marcel. Pulse-based features for face presentation attack detection. In Proceedings of BTAS 2018, special session on Image and Video Forensics in Biometrics, 2018. [ .pdf ]
[988] Guillaume Heusch and Sébastien Marcel. Bayesian networks to combine intensity and color information in face recognition. In International Conference on Biometrics [3107]. [ .pdf ]
[989] Guillaume Heusch, Tiago de Freitas Pereira, and Sébastien Marcel. A comprehensive experimental and reproducible study on selfie biometrics in multistream and heterogeneous settings. In IEEE Transactions on Biometrics, Behavior and Identity Science [3108]. [ DOI | http ]
[990] Guillaume Heusch and Sébastien Marcel. A novel statistical generative model dedicated to face recognition. In Image & Vision Computing [3109]. in press. [ .pdf ]
[991] Guillaume Heusch and Sébastien Marcel. Remote blood pulse analysis for face presentation attack detection. In Sébastien Marcel, Mark Nixon, Julian Fierrez, and Nicholas Evans, editors, Handbook of Biometric Anti-Spoofing, Advances in Computer Vision and Pattern Recognition, chapter 10. Springer, 2nd edition, April 2019. [ http ]
[992] Guillaume Heusch, Anjith George, David Geissenbuhler, Zohreh Mostaani, and Sébastien Marcel. Deep models and shortwave infrared information to detect face presentation attacks. IEEE Transactions on Biometrics, Behavior, and Identity Science, 2020. [ .pdf ]
[993] Guillaume Heusch. Bayesian Networks as Generative Models for Face Recognition. PhD thesis, EPFL, 2009. [ .pdf ]
[994] Weipeng He, Petr Motlicek, and Jean-Marc Odobez. Adaptation of multiple sound source localization neural networks with weak supervision and domain-adversarial training. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) [3110], pages 770--774. [ DOI | .pdf ]
[995] Weipeng He, Lu Lu, Biqiao Zhang, Jay Mahadeokar, Kaustubh Kalgaonkar, and Christian Fuegen. Spatial attention for far-field speech recognition with deep beamforming neural networks. In 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 7499--7503, 2020. [ DOI ]
[996] Weipeng He, Petr Motlicek, and Jean-Marc Odobez. Deep neural networks for multiple speaker detection and localization. In 2018 IEEE International Conference on Robotics and Automation (ICRA) [3111], pages 74--79. [ DOI | .pdf ]
[997] Weipeng He, Petr Motlicek, and Jean-Marc Odobez. Neural network adaptation and data augmentation for multi-speaker direction-of-arrival estimation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:1303--1317, 2021. [ DOI | http ]
[998] Weipeng He, Petr Motlicek, and Jean-Marc Odobez. Joint localization and classification of multiple sound sources using a multi-task neural network. In Proceedings of Interspeech [3112], pages 312--316. [ DOI | .pdf ]
[999] Weipeng He, Petr Motlicek, and Jean-Marc Odobez. Multi-task neural network for robust multiple speaker embedding extraction. In Proceedings of Interspeech 2021, 2021.
[1000] Weipeng He. Deep Learning Approaches for Auditory Perception in Robotics. PhD thesis, École polytechnique fédérale de Lausanne, March 2021. [ .pdf ]
[1001] Ivan Himawan, Petr Motlicek, Marc Ferras, and Srikanth Madikeri. Towards utterance-based neural network adaptation in acoustic modeling. In IEEE Automatic Speech Recognition and Understanding Workshop, pages 289--295, December 2015. [ .pdf ]
[1002] Ivan Himawan, Petr Motlicek, David Imseng, Blaise Potard, Namhoon Kim, and Jaewon Lee. Learning feature mapping using deep neural network bottleneck features for distant large vocabulary speech recognition. In IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4540--4544, April 2015. [ DOI | .pdf ]
[1003] Ivan Himawan, Petr Motlicek, Sridha Sridharan, David Dean, and Dian Tjondronegoro. Channel selection in the short-time modulation domain for distant speech recognition. In Proceedings of Interspeech [3113], pages 741--745. [ .pdf ]
[1004] Ivan Himawan, Petr Motlicek, David Imseng, and Sridha Sridharan. Feature mapping using far-field microphones for distant speech recognition. In Speech Communication [3114], pages 1--9. A publication of the European Association for Signal Processing (EURASIP) and of the International Speech Communication Association (ISCA). [ DOI | http | .pdf ]
[1005] Ivan Himawan, Srikanth Madikeri, Petr Motlicek, Milos Cernak, Sridha Sridharan, and Clinton Fookes. Voice presentation attack detection using convolutional neural networks. In Handbook of Biometric Anti-Spoofing, Advances in Computer Vision and Pattern Recognition, chapter 17, pages 391--415. Springer, 2nd edition, April 2019. [ http ]
[1006] Pierre-Edouard Honnet, Branislav Gerazov, and Philip N. Garner. Atom decomposition-based intonation modelling. In IEEE 40th International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 4744--4748. IEEE, April 2015. [ DOI | .pdf ]
[1007] Pierre-Edouard Honnet, Alexandros Lazaridis, Philip N. Garner, and Junichi Yamagishi. The siwis french speech synthesis database – design and recording of a high quality french database for speech synthesis. Idiap-RR Idiap-RR-03-2017, Idiap, 2 2017. [ .pdf ]
[1008] Pierre-Edouard Honnet and Philip N. Garner. Intonation atom based emphasis transfer. Idiap-RR Idiap-RR-14-2016, Idiap, 5 2016. [ .pdf ]
[1009] Pierre-Edouard Honnet, Alexandros Lazaridis, Jean-Philippe Goldman, and Philip N. Garner. Prosody in swiss french accents: Investigation using analysis by synthesis. In Speech Prosody [3115]. [ .pdf ]
[1010] Pierre-Edouard Honnet, Branislav Gerazov, Aleksandar Gjoreski, and Philip N. Garner. Intonation modelling using a muscle model and perceptually weighted matching pursuit. Speech Communication, 2018. [ DOI | http | .pdf ]
[1011] Pierre-Edouard Honnet and Philip N. Garner. Emphasis recreation for tts using intonation atoms. In 9th ISCA Speech Synthesis Workshop, pages 14--20, Sunnyvale, CA, USA, September 2016. [ DOI | .pdf ]
[1012] Pierre-Edouard Honnet and Philip N. Garner. Importance of prosody in swiss french accent for speech synthesis. In Nouveaux cahiers de linguistique française, 2014. [ .pdf ]
[1013] Pierre-Edouard Honnet. Intonation Modelling for Speech Synthesis and Emphasis Preservation. PhD thesis, École Polytechnique Fédérale de Lausanne, January 2017. [ DOI | .pdf ]
[1014] Ya-Ping Hsieh, Yu-Chun Kao, rabeeh karimi mahabadi, Alp Yurtsever, Anastasios Kyrillidis, and Volkan Cevher. A non-euclidean gradient descent framework for non-convex matrix factorization. IEEE Transactions on Signal Processing, 2018. [ .pdf ]
[1015] Alethia Hume, Luca Cernuzzi, José Luis Zarza, Ivano Bison, and Daniel Gatica-Perez. Analysis of the big-five personality traits in the chatbot "uc - paraguay". CLEI electronic journal, 25(2), May 2022. [ .pdf ]
[1016] Hayley Hung, Dinesh Babu Jayagopi, Chuohao Yeo, Gerald Friedland, Silèye O. Ba, Jean-Marc Odobez, Kannan Ramchandran, Nikki Mirghafori, and Daniel Gatica-Perez. Using audio and video features to classify the most dominant person in a group meeting. [3116]. IDIAP-RR 07-29. [ .ps.gz | .pdf ]
[1017] Hayley Hung and Gokul Chittaranjan. The wolf corpus: Exploring group behaviour in a competitive role-playing game. In ACM Multimedia, 10 2010. [ .pdf ]
[1018] Hayley Hung and Gerald Friedland. Towards audio-visual on-line diarization of participants in group meetings. In European Conference on Computer Vision Workshop on Multi-camera and Multi-modal Sensor Fusion, 10 2008. [ .pdf ]
[1019] Hayley Hung and Daniel Gatica-Perez. Identifying dominant people in meetings from audio-visual sensors. In International Conference on Automatic Face and Gesture Recognition [3118]. [ .pdf ]
[1020] Hayley Hung, Yan Huang, Gerald Friedland, and Daniel Gatica-Perez. Estimating the dominant person in multi-party conversations using speaker diarization strategies. In IEEE International Conference on Acoustics, Speech, and Signal Processing [3119].
[1021] Hayley Hung, Dinesh Babu Jayagopi, Silèye O. Ba, Jean-Marc Odobez, and Daniel Gatica-Perez. Investigating automatic dominance estimation in groups from visual attention and speaking activity. In International Conference on Multi-modal Interfaces, 2008. [ .pdf ]
[1022] Hayley Hung and Silèye O. Ba. Speech/non-speech detection in meetings from automatically extracted low resolution visual features. Idiap-RR Idiap-RR-20-2009, Idiap, 7 2009. submitted to icmi-mlmi. [ .pdf ]
[1023] Hayley Hung, Yan Huang, Gerald Friedland, and Daniel Gatica-Perez. Estimating dominance in multi-party meetings using speaker diarization. IEEE Transactions on Audio, Speech, and Language Processing, 19(4):847--860, may 2011. [ .pdf ]
[1024] Hayley Hung and Daniel Gatica-Perez. Estimating cohesion in small groups using audio-visual nonverbal behavior. In IEEE Trans. on Multimedia, Special Issue on Multimodal Affective Interaction [3121], pages 563 -- 575. [ .pdf ]
[1025] Rui Hu, Jean-Marc Odobez, and Daniel Gatica-Perez. Assessing a shape descriptor for analysis of mesoamerican hieroglyphics: A view towards practice in digital humanities. In Digital Humanities Conference (DH), July 2016. [ .pdf ]
[1026] Rui Hu, Carlos Pallan Gayol, Jean-Marc Odobez, and Daniel Gatica-Perez. Analyzing and visualizing ancient maya hieroglyphics using shape: from computer vision to digital humanities. Digital Scholarship in the Humanities, 32:179--194, December 2017. [ .pdf ]
[1027] Rui Hu, Jean-Marc Odobez, and Daniel Gatica-Perez. Extracting maya glyphs from degraded ancient documents via image segmentation. Journal on Computing and Cultural Heritage, 10, April 2017. [ .pdf ]
[1028] Rui Hu, Hieu Pham, Philipp Buluschek, and Daniel Gatica-Perez. Elderly people living alone: Detecting home visits with ambient and wearable sensing. In In Proceedings of MMHealth, October 2017. [ .pdf ]
[1029] Rui Hu, Gulcan Can, Carlos Pallan Gayol, Guido Krempel, Jakub Spotak, Gabrielle Vail, Stephane Marchand-Maillet, Jean-Marc Odobez, and Daniel Gatica-Perez. Multimedia analysis and access of ancient maya epigraphy. Signal processing magazine, 2015. [ .pdf ]
[1030] Alexandre Hyafil and Milos Cernak. Neuromorphic based oscillatory device for incremental syllable boundary detection. In Proc. of Interspeech [3122], pages 1191--1195. [ .pdf ]
[1031] Hynek Hermansky. TRAP-TANDEM: Data-driven extraction of temporal features from speech. In large part published in Proceedings of ASRU-2003 [3123]. IDIAP-RR 03-50. [ .ps.gz | .pdf ]
[1032] Hynek Hermansky and Nelson Morgan. Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research. Idiap-RR Idiap-RR-81-2003, IDIAP, Martigny, Switzerland, 2003. Submitted to Journal of Negative Results in Speech and Audio Sciences. [ .ps.gz | .pdf ]
[1033] Hynek Hermansky and Hervé Bourlard. Some emerging concepts in speech recognition. Idiap-RR Idiap-RR-82-2003, IDIAP, Martigny, Switzerland, 2003. Submitted to SWIM 2004. [ .ps.gz | .pdf ]
[1034] Hynek Hermansky. Stochastic techniques in deriving perceptual knowledge. Idiap-RR Idiap-RR-84-2004, IDIAP, Martigny, Switzerland, 2004. in Proceedings SAPA-2004, Jeju Island, Korea. [ .ps.gz | .pdf ]
[1035] Hynek Hermansky and Petr Fousek. Multi-resolution rasta filtering for tandem-based asr. In Proceedings of Interspeech 2005 [3124]. IDIAP-RR 2005-18. [ .ps.gz | .pdf ]
[1036] Hynek Hermansky, Petr Fousek, and Mikko Lehtonen. The role of speech in multimodal human-computer interaction (towards reliable rejection of non-keyword input). In Proceedings of 8th International Conference on Text, Speech and Dialogue - TSD 2005 [3125]. IDIAP-RR 2005-63. [ .ps.gz | .pdf ]
[1037] Miranti I. Mandasari, Manuel Günther, Roy Wallace, Rahim Saedi, Sébastien Marcel, and David Van Leeuwen. Score calibration in face recognition. In IET Biometrics [3126], pages 1--11. [ DOI | http | .pdf ]
[1038] Jean-Luc Cochard and Philippe Froidevaux. Environnement multi-agents de reconnaissance automatique de la parole en continu. In Actes des 3èmes Journées Francophones sur l'Intelligence Artificielle Distribuée et les Systèmes Multi-agents, 1995.
[1039] Gérard Chollet and M. Homayounpour. Neural nets approaches to speaker verification: comparison with second order statistical measure. In ICASSP, Detroit, 1995.
[1040] Christos Dimitrakakis and Samy Bengio. Boosting hmms with an application to speech recognition. In IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP [3127]. IDIAP-RR 03-41. [ .ps.gz | .pdf ]
[1041] Chidansh A. Bhatt, Nikolaos Pappas, Maryam Habibi, and Andrei Popescu-Belis. Multimodal reranking of content-based recommendations for hyperlinking video snippets. In ACM International Conference on Multimedia Retrieval, 2014. [ .pdf ]
[1042] H. Hong, Seunjin Choi, Hervé Glotin, and Frédéric Berthommier. Blind acoustic source separation for cocktail party speech recognition. In IEEE, editor, ICONIP, 7th IEEE Int. Conf. on Neural Information Processing, no IDIAP RR, see RESPITE www, Korea, 11 2000.
[1043] Gérard Chollet and M. Homayounpour. A study of intra- and inter-speaker variability in the voices of twins for speaker verification. In International Congress of Phonetic Sciences, Stockholm, 8 1995.
[1044] Philippe Langlais and Jean-Luc Cochard. The use of prosodic agents in a cooperative automatic speech recognition system. In International Congress of Phonetic Sciences, Stockholm, Sweden, 1995.
[1045] Frédéric Béchet, Philippe Langlais, and Henri Méloni. Lexical filtrering by means of prosodic information. In International Congress of Phonetic Sciences, Stockholm, Sweden, 1995.
[1046] Frédéric Berthommier, Hervé Glotin, and Emmanuel Tessier. A front-end using the harmonicity cue for speech enhancement in loud noise. In Int. Conf. on Spoken Language Processing (ICSLP), 2000. no IDIAPRR, see RESPITE www.
[1047] Andrew Morris, Ljubomir Josifovski, Hervé Bourlard, Martin Cooke, and Phil Green. A neural network for classification with incomplete data: application to robust asr. In Proc. ICSLP [3128]. [ .ps.gz | .pdf ]
[1048] Andrew Morris, Simon Payne, and Hervé Bourlard. Low cost duration modelling for noise robust speech recognition. In Proc. ICSLP [3129]. [ .ps.gz | .pdf ]
[1049] Seunjin Choi, H. Hong, Hervé Glotin, and Frédéric Berthommier. Multichannel signal separation for cocktail party speech recognition: a dynamic recurrent network. In Int. Conf. on Spoken Language Processing (ICSLP). no IDIAP RR, see RESPITE www, 2000.
[1050] Hervé Bourlard, Jean-Luc Cochard, Emile Fiesler, Gilbert Maître, and Eddy Mayoraz. Activity report 1996. Idiap-Com Idiap-Com-01-1997, IDIAP, 1997. [ .ps.gz | .pdf ]
[1051] Silèye O. Ba and Jean-Marc Odobez. A video database for head pose tracking evaluation. Idiap-Com Idiap-Com-04-2005, IDIAP, Martigny, Switzerland, 2005. [ .ps.gz | .pdf ]
[1052] Johan M. Andersen, Gilles Caloz, and Hervé Bourlard. Swisscom “AVIS” project (no. 392) advanced vocal interfaces services. Idiap-Com Idiap-Com-06-1997, IDIAP, 1997. [ .ps.gz | .pdf ]
[1053] Mikhail Kanevski, Patrick Wong, and Stéphane Canu. Environmental data mapping with support vector regression and geostatistics. Idiap-RR Idiap-RR-10-2000, IDIAP, 2000. [ .pdf ]
[1054] Mikhail Kanevski and Nicolas Gilardi. Numerical experiments with support vector machines. Idiap-RR Idiap-RR-15-1999, IDIAP, 1999. [ .ps | .pdf ]
[1055] Kim Shearer and Svetha Venkatesh. Artifacts of the colour coherence vector and an alternative similarity measure. Idiap-RR Idiap-RR-02-2001, IDIAP, 2001. [ .ps.gz | .pdf ]
[1056] Kim Shearer, Chitra Dorai, and Svetha Venkatesh. Detection of narrative structure for annotation of news broadcasts. Idiap-RR Idiap-RR-03-2001, IDIAP, 2001. [ .ps.gz | .pdf ]
[1057] Murielle Vial. Définition et évaluation d'un protocole de négociation dans un système multi-agents de reconnaissance de la parole. Idiap-RR Idiap-RR-02-1995, IDIAP, Martigny, Switzerland, 1995.
[1058] Gérard Chollet and Chafic Mokbel. Automatic word recognition in cars. IEEE Speech and Audio Processing, 1995.
[1059] Shajith Ikbal. Nonlinear Feature Transformations for Noise Robust Speech Recognition. Idiap-rr, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 6 2004. thesis # (IDIAP-RR 04-70). [ .ps.gz | .pdf ]
[1060] Shajith Ikbal, Hervé Bourlard, Samy Bengio, and Katrin Weber. IDIAP HMM/HMM2 System: Theoretical Basis and Software Specifications. Idiap-RR Idiap-RR-27-2001, IDIAP, Martigny, Switzerland, 2001. [ .ps.gz | .pdf ]
[1061] Shajith Ikbal, Katrin Weber, and Hervé Bourlard. Speaker Normalization using HMM2. In Proceedings of the 2002 IEEE International Workshop on Neural Networks for Signal Processing (NNSP-02) [3131]. [ .ps.gz | .pdf ]
[1062] Shajith Ikbal, Hemant Misra, and Hervé Bourlard. Phase AutoCorrelation (PAC) derived Robust Speech Features. In Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03) [3132]. [ .ps.gz | .pdf ]
[1063] Shajith Ikbal, Hynek Hermansky, and Hervé Bourlard. Nonlinear Spectral Transformations for Robust Speech Recognition. In Proceedings of the IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop 2003 [3133]. [ .ps.gz | .pdf ]
[1064] Shajith Ikbal, Hemant Misra, Hervé Bourlard, and Hynek Hermansky. Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition. In Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04) [3134]. [ .ps.gz | .pdf ]
[1065] Shajith Ikbal, Hemant Misra, Sunil Sivadas, Hynek Hermansky, and Hervé Bourlard. Entropy Based Combination of Tandem Representations for Noise Robust ASR. In Proceedings of the INTERSPEECH-ICSLP-04 [3135]. To appear. [ .ps.gz | .pdf ]
[1066] Shajith Ikbal, Mathew Magimai.-Doss, Hemant Misra, and Hervé Bourlard. Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR. In Proceedings of the INTERSPEECH-ICSLP-04 [3136]. To appear. [ .ps.gz | .pdf ]
[1067] Shajith Ikbal, Hemant Misra, Hervé Bourlard, and Hynek Hermansky. Phase AutoCorrelation (PAC) Features for Noise Robust ASR. Idiap-RR Idiap-RR-40-2004, IDIAP, Martigny, Switzerland, 2004. Submitted for publication.
[1068] Shajith Ikbal, Hervé Bourlard, and Mathew Magimai.-Doss. HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition. Idiap-RR Idiap-RR-50-2004, IDIAP, Martigny, Switzerland, 2004. Submitted for publication. [ .ps.gz | .pdf ]
[1069] Shajith Ikbal, Hemant Misra, Hynek Hermansky, and Mathew Magimai.-Doss. Phase autocorrelation (pac) features for noise robust speech recognition. Speech Communication, 54(7):867–880, September 2012. [ DOI ]
[1070] David Imseng and Gerald Friedland. Robust speaker diarization for short speech recordings. In Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding [3137], pages 432--437. [ .pdf ]
[1071] David Imseng, Ramya Rasipuram, and Mathew Magimai.-Doss. Fast and flexible kullback-leibler divergence based acoustic modeling for non-native speech recognition. In Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding [3138], pages 348--353. [ .pdf ]
[1072] David Imseng, Petr Motlicek, Philip N. Garner, and Hervé Bourlard. Impact of deep mlp architecture on different acoustic modeling techniques for under-resourced speech recognition. In Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, December 2013. [ .pdf ]
[1073] David Imseng and Gerald Friedland. An adaptive initialization method for speaker diarization based on prosodic features. In Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing [3139], pages 4946--4949. [ .pdf ]
[1074] David Imseng, Hervé Bourlard, Mathew Magimai.-Doss, and John Dines. Language dependent universal phoneme posterior estimation for mixed language speech recognition. In Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing [3140], pages 5012--5015. [ .pdf ]
[1075] David Imseng, Hervé Bourlard, and Philip N. Garner. Using kl-divergence and multilingual information to improve asr for under-resourced languages. In Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4869--4872, March 2012. [ .pdf ]
[1076] David Imseng and Hervé Bourlard. Speaker adaptive kullback-leibler divergence based hidden markov models. In Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, 2013. [ .pdf ]
[1077] David Imseng, Blaise Potard, Petr Motlicek, Alexandre Nanchen, and Hervé Bourlard. Exploiting un-transcribed foreign data for speech recognition in well-resourced languages. In Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 2322 -- 2326. IEEE, 2014. [ DOI | .pdf ]
[1078] David Imseng and John Dines. Decision tree clustering for kl-hmm. Idiap-Com Idiap-Com-01-2012, Idiap, 2 2012. [ .pdf ]
[1079] David Imseng, John Dines, Petr Motlicek, Philip N. Garner, and Hervé Bourlard. Comparing different acoustic modeling techniques for multilingual boosting. Idiap-RR Idiap-RR-01-2013, Idiap, 1 2013. [ .pdf ]
[1080] David Imseng. Novel initialization methods for speaker diarization. Idiap-RR Idiap-RR-07-2009, Idiap, 5 2009. Master's thesis. [ .pdf ]
[1081] David Imseng, Petr Motlicek, Hervé Bourlard, and Philip N. Garner. Using out-of-language data to improve an under-resourced speech recognizer. Idiap-RR Idiap-RR-09-2013, Idiap, 3 2013. [ .pdf ]
[1082] David Imseng, Hervé Bourlard, and Philip N. Garner. Boosting under-resourced speech recognizers by exploiting out of language data - case study on afrikaans. Idiap-RR Idiap-RR-15-2012, Idiap, 6 2012. [ .pdf ]
[1083] David Imseng, Hervé Bourlard, and Mathew Magimai.-Doss. Towards mixed language speech recognition systems. In Proceedings of Interspeech [3145], pages 278--281. [ .pdf ]
[1084] David Imseng, Mathew Magimai.-Doss, and Hervé Bourlard. Hierarchical multilayer perceptron based language identification. In Proceedings of Interspeech [3146], pages 2722--2725. [ .pdf ]
[1085] David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, and Mathew Magimai.-Doss. Improving non-native asr through stochastic multilingual phoneme space transformations. In Proceedings of Interspeech [3147], pages 537--540. [ .pdf ]
[1086] David Imseng and Gerald Friedland. Tuning-robust initialization methods for speaker diarization. In IEEE Transactions on Audio, Speech, and Language Processing [3149], pages 2028--2037. [ DOI | .pdf ]
[1087] David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, and Mathew Magimai.-Doss. Applying multi- and cross-lingual stochastic phone space transformations to non-native speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 2013. [ DOI | .pdf ]
[1088] David Imseng. Multilingual speech recognition A posterior based approach. PhD thesis, École Polytechnique Fédérale de Lausanne (EPFL), June 2013. [ .pdf ]
[1089] Nigmatulina Iuliia, Zuluaga-Gomez. Juan, Amrutha Prasad, Seyyed Saeed Sarfjoo, and Petr Motlicek. A two-step approach to leverage contextual data: speech recognition in air-traffic communications. In Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022. [ .pdf ]
[1090] Nigmatulina Iuliia, Rudolf Braun, Zuluaga-Gomez. Juan, and Petr Motlicek. Improving callsign recognition with air-surveillance data in air-traffic communication. Idiap-RR Idiap-RR-20-2021, Idiap, 11 2021. [ http ]
[1091] Malinka Ivanova, Sushil Bhattacharjee, Sébastien Marcel, Anna Rozeva, and Mariana Durcheva. Enhancing trust in eassessment - the tesla system solution. In Technology Enhanced Assessment Conference., December 2018. [ .pdf ]
[1092] Gilles Caloz, Cédric Jaboulet, Johnny Mariéthoz, A. Glaeser, and Dominique Genoud. Voice-b system. In IEEE 4th Workshop on Intercative Voice Technology for Telecommunications Applications (IVTTA'98) September 29--30, Torino, Italy, 1998.
[1093] Frédéric Bimbot, H. P. Hutter, Cédric Jaboulet, Johan Koolwaaij, Johan Lindberg, and J. B. Pierrot. Speaker verification in the telephone network : Research activities in the CAVE project. In Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH'97), 1997. [ .ps.gz | .pdf ]
[1094] Alejandro Jaimes, Daniel Gatica-Perez, Nicu Sebe, and Thomas S. Huang. Human-centered Computing: Toward a Human Revolution. IEEE Computer, 40(5), 5 2007. IDIAP-RR 07-57. [ .ps.gz | .pdf ]
[1095] Alejandro Jaimes, Daniel Gatica-Perez, Nicu Sebe, and Thomas S. Huang. Human-centered computing: Toward a human revolution. Idiap-RR Idiap-RR-57-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1096] Anubhav Jain, Pavel Korshunov, and Sébastien Marcel. Improving generalization of deepfake detection by training for attribution. In International Workshop on Multimedia Signal Processing, October 2021. [ .pdf ]
[1097] Parvaneh Janbakhshi, Ina Kodrasi, and Hervé Bourlard. Pathological speech intelligibility assessment based on the short-time objective intelligibility measure. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 6405--6409, May 2019. [ .pdf ]
[1098] Parvaneh Janbakhshi, Ina Kodrasi, and Hervé Bourlard. Synthetic speech references for automatic pathological speech intelligibility assessment. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020. [ .pdf ]
[1099] Parvaneh Janbakhshi and Ina Kodrasi. Experimental investigation on stft phase representations for deep learning-based dysarthric speech detection. In International Conference on Acoustics, Speech, and Signal Processing, 2022. [ .pdf ]
[1100] Parvaneh Janbakhshi, Ina Kodrasi, and Hervé Bourlard. Automatic dysarthric speech detection exploiting pairwise distance-based convolutional neural networks. In 45th International Conference on Acoustics, Speech, and Signal Processing [3150], page 7328–7332. Submitted. [ .pdf ]
[1101] Parvaneh Janbakhshi, Ina Kodrasi, and Hervé Bourlard. Spectral subspace analysis for automatic assessment of pathological speech intelligibility. In Proceedings of Interspeech, pages 3038--3042, September 2019. [ .pdf ]
[1102] Parvaneh Janbakhshi and Ina Kodrasi. Adversarial-free speaker identity-invariant representation learning for automatic dysarthric speech classification. In Annual Conference of the International Speech Communication Association, September 2022.
[1103] Parvaneh Janbakhshi, Ina Kodrasi, and Hervé Bourlard. Automatic pathological speech intelligibility assessment exploiting subspace-based analyses. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28:1717 -- 1728, May 2020. [ DOI | .pdf ]
[1104] Parvaneh Janbakhshi and Ina Kodrasi. Supervised speech representation learning for parkinson's disease classification. In ITG Conference on Speech Communication [3151]. accepted in ITG Conference on Speech Communication. [ .pdf ]
[1105] Parvaneh Janbakhshi, Ina Kodrasi, and Hervé Bourlard. Subspace-based learning for automatic dysarthric speech detection. IEEE Signal Processing Letters, 2020. In press.
[1106] Julius Jankowski, Hakan Girgin, and Sylvain Calinon. Probabilistic adaptive control for robust behavior imitation. IEEE Robotics and Automation Letters, January 2021. [ .pdf ]
[1107] Julius Jankowski, Mattia Racca, and Sylvain Calinon. From key positions to optimal basis functions for probabilistic adaptive control. IEEE Robotics and Automation Letters, January 2022. [ .pdf ]
[1108] Christian Jaques, Linda Bapst-Wicht, Daniel F. Schorderet, and Michael Liebling. Multi-spectral widefield microscopy of the beating heart through post-acquisition synchronization and unmixing. In 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), pages 1382--1385, April 2019. [ DOI | .pdf ]
[1109] Christian Jaques and Michael Liebling. Aliasing mitigation in optical microscopy of dynamic biological samples by use of temporally modulated color illumination and a standard rgb camera. Journal of Biomedical Optics, 25(10):106505, October 2020. [ DOI | http ]
[1110] Christian Jaques, Emmanuel Pignat, Sylvain Calinon, and Michael Liebling. Temporal super-resolution microscopy using a hue-encoded shutter. Optical Society of America Biomedical Optics Express, 10(09):4727--4741, September 2019. [ DOI | http ]
[1111] Christian Jaques, Alexander Ernst, Nadia Mercader, and Michael Liebling. Temporal resolution doubling in fluorescence light-sheet microscopy via a hue-encoded shutter and regularization. OSA Continuum, 3(8), August 2020. [ .pdf ]
[1112] Christian Jaques. Active Illumination and Computational Methods for Temporal and Spectral Super-Resolution Microscopy. PhD thesis, EPFL, 2020. [ DOI | .pdf ]
[1113] Christian Jaques and Michael Liebling. Generalized temporal sampling with active illumination in optical microscopy. In Proceeding of the SPIE Conference Optics and Photonics, Wavelets and Sparsity XVIII, volume 11138. SPIE, SPIE, August 2019. [ .pdf ]
[1114] N. Jaquier, David Ginsbourger, and Sylvain Calinon. Learning from demonstration with model-based gaussian process. In Conference on Robot Learning, October 2019. [ .pdf ]
[1115] N. Jaquier, L. Rozo, Sylvain Calinon, and M. Buerger. Bayesian optimization meets riemannian manifolds in robot learning. In Conference on Robot Learning, October 2019. [ .pdf ]
[1116] N. Jaquier, L. Rozo, D. G. Caldwell, and Sylvain Calinon. Geometry-aware manipulability learning, tracking and transfer. International Journal of Robotic Research, 40(2-3):624--650, 2021. [ .pdf ]
[1117] N. Jaquier and Sylvain Calinon. Gaussian mixture regression on symmetric positive definite matrices manifolds: Application to wrist motion estimation with semg. In Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, pages 59--64, September 2017. [ http | .pdf ]
[1118] N. Jaquier, L. Rozo, and Sylvain Calinon. Analysis and transfer of human movement manipulability in industry-like activities. In Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems, 2020. [ .pdf ]
[1119] N. Jaquier, C. Castellini, and Sylvain Calinon. Improving hand and wrist activity detection using tactile sensors and tensor regression methods on riemannian manifolds. In Proc. of the Myoelectric Control Symposium, August 2017. [ .html | .pdf ]
[1120] N. Jaquier and Sylvain Calinon. Improving the control of prosthetic hands with tactile sensing. Micro & Nano Magazine, Micronarc, pages 42--43, 2018. [ .pdf ]
[1121] N. Jaquier, L. Rozo, and Sylvain Calinon. Geometry-aware robot manipulability transfer. In R:SS Workshop on Learning and Inference in Robotics: Integrating Structure, Priors and Models, June 2018. [ .pdf ]
[1122] N. Jaquier and Sylvain Calinon. Geometry-aware control and learning in robotics. In R:SS Pioneers Workshop, June 2018.
[1123] N. Jaquier, L. Rozo, D. G. Caldwell, and Sylvain Calinon. Geometry-aware tracking of manipulability ellipsoids. In Robotics: Science and Systems, June 2018. [ .pdf ]
[1124] N. Jaquier, R. Haschke, and Sylvain Calinon. Tensor-variate mixture of experts for proportional myographic control of a robotic hand. Robotics and Autonomous Systems, 142:103812, 2021. [ .pdf ]
[1125] N. Jaquier, M. Connan, C. Castellini, and Sylvain Calinon. Combining electromyography and tactile myography to improve hand and wrist activity detection in prostheses. Technologies, 5(4), October 2017. [ .pdf ]
[1126] N. Jaquier. Robot skills learning with Riemannian manifolds : Leveraging geometry-awareness in robot learning, optimization and control. PhD thesis, Ecole Polytechnique Fédérale de Lausanne, July 2020. [ .pdf ]
[1127] Dinesh Babu Jayagopi, Hayley Hung, Chuohao Yeo, and Daniel Gatica-Perez. Predicting the dominant clique in meetings through fusion of nonverbal cues. In ACM MM 2008 [3152]. IDIAP-RR 08-08. [ .ps.gz | .pdf ]
[1128] Dinesh Babu Jayagopi and Jean-Marc Odobez. Given that, should i respond? contextual addressee estimation in multi-party human-robot interactions. In Proceedings of Human Robot Interaction (HRI) Conference, 2013. [ .pdf ]
[1129] Dinesh Babu Jayagopi, Samira Sheikhi, David Klotz, Johannes Wienke, Jean-Marc Odobez, Sebastian Wrede, Vasil Khalidov, Laurent Son Nguyen, Britta Wrede, and Daniel Gatica-Perez. The vernissage corpus: a conversational human-robot-interaction dataset. In Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction, 2013. [ .pdf ]
[1130] Dinesh Babu Jayagopi and Daniel Gatica-Perez. Discovering group nonverbal conversational patterns with topics. In Proceedings ICMI-MLMI, 11 2009. [ .pdf ]
[1131] Dinesh Babu Jayagopi, Dairazalia Sanchez-Cortes, Kazuhiro Otsuka, Junji Yamato, and Daniel Gatica-Perez. Linking speaking and looking behavior patterns with group composition, perception, and performance. In Proceedings of the International Conference on Multimodal Interaction (ICMI), Santa Monica, USA, 2012. [ .pdf ]
[1132] Dinesh Babu Jayagopi, Samira Sheikhi, David Klotz, Johannes Wienke, Jean-Marc Odobez, Sebastian Wrede, Vasil Khalidov, Laurent Son Nguyen, Britta Wrede, and Daniel Gatica-Perez. The vernissage corpus: A multimodal human-robot-interaction dataset. Idiap-RR Idiap-RR-33-2012, Idiap, 12 2012. [ .pdf ]
[1133] Dinesh Babu Jayagopi, Raducanu Bogdan, and Daniel Gatica-Perez. Characterising conversationsal group dynamics using nonverbal behaviour. In Proceedings ICME 2009, 6 2009. [ .pdf ]
[1134] Dinesh Babu Jayagopi and Daniel Gatica-Perez. Mining group nonverbal conversational patterns using probabilistic topic models. IEEE Transactions on Multimedia, 2010. [ .pdf ]
[1135] Dinesh Babu Jayagopi, Hayley Hung, Chuohao Yeo, and Daniel Gatica-Perez. Modeling dominance in group conversations using nonverbal activity cues. IEEE Transactions on Audio, Speech and Language Processing, 2008. [ .pdf ]
[1136] Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland, and Daniel Gatica-Perez. Privacy-sensitive recognition of group conversational context with sociometers. Springer Multimedia Systems Journal, 2011. [ .pdf ]
[1137] Dinesh Babu Jayagopi, Taemie Kim, Alex Pentland, and Daniel Gatica-Perez. Recognizing conversational context in group interaction using privacy-sensitive mobile sensors. In Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 12 2010. [ .pdf ]
[1138] Dinesh Babu Jayagopi. Computational modeling of face-to-face social interaction using nonverbal behavioral cues. PhD thesis, Ecole Polytechnique Fédérale de Lausanne, 2011. [ .pdf ]
[1139] Jérôme Kowalczyk. Une application de reconnaissance du locuteur :
le user-customized password speaker verification. Idiap-Com Idiap-Com-04-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[1140] Jie Luo, Barbara Caputo, and Vittorio Ferrari. Who's doing what: Joint modeling of names and verbs for simultaneous face and pose annotation. In Advances in Neural Information Processing Systems 22 (NIPS09). NIPS Foundation, MIT Press, 12 2009. [ .pdf ]
[1141] Joseph Keshet. Theoretical foundations for large-margin kernel-based continuous speech recognition. Idiap-RR Idiap-RR-44-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1142] Joseph Keshet, David Grangier, and Samy Bengio. Discriminatove keyword spotting. Idiap-RR Idiap-RR-31-2008, IDIAP, 2008. [ .ps.gz | .pdf ]
[1143] Niklas Johansson, Chris McCool, and Sébastien Marcel. On-line unsupervised adaptation for face verification using gaussian mixture models with multiple user models. Idiap-RR Idiap-RR-07-2011, Idiap, 3 2011. [ .pdf ]
[1144] Mohammad Mahdi Johari, Yann Lepoittevin, and Francois Fleuret. Geonerf: Generalizing nerf with geometry priors. In Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2022. [ http | .pdf ]
[1145] Mohammad Mahdi Johari, Camilla Carta, and Francois Fleuret. Depthinspace: Exploitation and fusion of multiple video frames for structured-light depth estimation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6039--6048, October 2021. [ .html ]
[1146] Cijo Jose and Francois Fleuret. Scalable metric learning via weighted approximate rank component analysis. In ECCV 2016, 2016. [ .pdf ]
[1147] Cijo Jose, Moustapha Cisse, and Francois Fleuret. Kronecker recurrent units. In Proceedings of the International Conference on Machine Learning, 2018.
[1148] Cijo Jose. Learning embeddings: efficient algorithms and applications. PhD thesis, École Polytechnique Fédérale de Lausanne, February 2018. [ DOI | .pdf ]
[1149] Pierre Jourlin, Juergen Luettin, Dominique Genoud, and Hubert Wassner. Acoustic-labial speaker verification. In Proceedings of the First International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'97) [3153]. IDIAP-RR 97-13.
[1150] Pierre Jourlin, Juergen Luettin, Dominique Genoud, and H. Wassner. Integrating acoustic and labial information for speaker identification and verification. In Proceedings of the European Conference on Speech Communication and Technology, 1997.
[1151] Pierre Jourlin, Juergen Luettin, Dominique Genoud, and H. Wassner. Acoustic-labial speaker verification. In Pattern Recognition Letters [3153]. IDIAP-RR 97-13. [ .ps.gz | .pdf ]
[1152] Brendan Jou, Tao Chen, Nikolaos Pappas, Miriam Redi, Mercan Topkara, and Shih-Fu Chang. Visual affect around the world: A large-scale multilingual visual sentiment ontology. In Proceedings of the ACM International Conference on Multimedia, pages 159--168, Brisbane, Australia, 2015. [ .pdf ]
[1153] Jonathan Rey and Frank Formaz. Managing idiap inventory (computers, components, software and licences). Idiap-Com Idiap-Com-04-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1154] Zuluaga-Gomez. Juan, Karel Vesely, Blatt Alexander, Petr Motlicek, Dietrich Klakow, Allan Tart, Igor Szoke, Amrutha Prasad, Seyyed Saeed Sarfjoo, Pavel Kolcarek, Martin Kocour, Honza Cernocky, Claudia Cevenini, Khalid Choukri, Mickael Rigault, and Fabian Landis. Automatic call sign detection: Matching air surveillance data with air traffic spoken communications. In Proceedings of 8th OpenSky Symposium 2020, volume 59 of 1, pages 1--10. OpenSky Network, MDPI, November 2020. [ DOI | http | .pdf ]
[1155] Zuluaga-Gomez. Juan, Seyyed Saeed Sarfjoo, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek, Oliver Ohneiser, and Hartmut Helmke. Bertraffic: A robust bert-based approach for speaker change detection and role identification of air-traffic communications. Idiap-RR Idiap-RR-15-2021, Idiap, 10 2021. Submitted to ICASSP 2022.
[1156] Zuluaga-Gomez. Juan, Nigmatulina Iuliia, Amrutha Prasad, Petr Motlicek, Karel Vesely, Martin Kocour, and Igor Szoke. Contextual semi-supervised learning: An approach to leverage air-surveillance and untranscribed atc data in asr systems. In Interspeech 2021 [3154]. [ http | .pdf ]
[1157] Agnès Just, Sébastien Marcel, O. Bernier, and J. E. Viallet. Reconnaissance de gestes 3d bi-manuels. Idiap-RR Idiap-RR-79-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[1158] Agnès Just and Sébastien Marcel. Two-handed gesture recognition. Idiap-RR Idiap-RR-24-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[1159] Agnès Just. Two-Handed Gestures for Human-Computer Interaction. Idiap-rr, École Polytechnique Fédérale de Lausanne, 2006. PhD Thesis #3683 at the École Polytechnique Fédérale de Lausanne. [ .ps.gz | .pdf ]
[1160] Agnès Just, O. Bernier, and Sébastien Marcel. Recognition of isolated complex mono- and bi-manual 3d hand gestures. In Proc. of the sixth International Conference on Automatic Face and Gesture Recognition [3156]. IDIAP-RR 03-63. [ .ps.gz | .pdf ]
[1161] Agnès Just, O. Bernier, and Sébastien Marcel. Hmm and iohmm for the recognition of mono- and bi-manual 3d hand gestures. Idiap-RR Idiap-RR-39-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[1162] Agnès Just, Yann Rodriguez, and Sébastien Marcel. Hand posture classification and recognition using the modified census transform. In IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR) [3157]. IDIAP-RR 06-02. [ .ps.gz | .pdf ]
[1163] Selen Hande Kabil, Hannah Muckenhirn, and Mathew Magimai.-Doss. On learning to identify genders from raw speech signal using cnns. In Proceedings of Interspeech, pages 287--291, September 2018. [ DOI | .pdf ]
[1164] Selen Hande Kabil and Hervé Bourlard. From undercomplete to sparse overcomplete autoencoders to improve lf-mmi speech recognition. In Proceedings of Interspeech Conference, 2022. [ .pdf ]
[1165] Kyriaki Kalimeri, Bruno Lepri, Oya Aran, Dinesh Babu Jayagopi, Daniel Gatica-Perez, and Fabio Pianesi. Modeling dominance effects on nonverbal behaviors using granger causality. In Proceedings of International Conference on Multimodal Interaction, ICMI 2012, Santa Monica, CA, 2012. [ .pdf ]
[1166] Kamand Kamangar. Unsupervised learning for information distillation. Idiap-RR Idiap-RR-47-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1167] Arthur Kantor, Milos Cernak, Jiri Havelka, Sean Huber, Jan Kleindienst, and Doris B. Gonzalez. Reading companion: The technical and social design of an automated reading tutor. In Workshop on Child, Computer and Interaction, September 2012. [ .pdf ]
[1168] Yunus Emre Kara, Gaye Genc, Oya Aran, and Lale Akarun. Modeling annotator behaviors for crowd labeling. Neurocomputing, 160:141–156, July 2015. [ DOI | .pdf ]
[1169] Angelos Katharopoulos and Francois Fleuret. Not all samples are created equal: Deep learning with importance sampling. In Proceedings of International Conference on Machine Learning [3158]. [ .pdf ]
[1170] Angelos Katharopoulos and Francois Fleuret. Processing megapixel images with deep attention-sampling models. In Proceedings of International Conference on Machine Learning [3159]. [ .html | .pdf ]
[1171] Angelos Katharopoulos, Apoorv Vyas, Nikolaos Pappas, and Francois Fleuret. Transformers are rnns: Fast autoregressive transformers with linear attention. In Proceedings of International Conference on Machine Learning, volume 19, 2020.
[1172] Katrin Keller, Souheil Ben-Yacoub, and Chafic Mokbel. Combining Wavelet-domain Hidden Markov Trees with Hidden Markov Models. Idiap-RR Idiap-RR-14-1999, IDIAP, 1999. [ .ps.gz | .pdf ]
[1173] Mikaela Keller and Samy Bengio. A neural network for Text Representation. In International Conference on Artificial Neural Networks, ICANN [3160]. IDIAP-RR 05-12. [ .ps.gz | .pdf ]
[1174] Mikaela Keller, Samy Bengio, and Siew Yeung Wong. Benchmarking non-parametric statistical tests. In Advances in Neural Information Processing Systems, NIPS 18. MIT Press [3161]. IDIAP-RR 05-38. [ .ps.gz | .pdf ]
[1175] Mikaela Keller and Samy Bengio. Theme Topic Mixture Model: A graphical model for document representation. In Pascal Workshop on Text Mining and Understanding [3162]. IDIAP-RR 04-05. [ .ps.gz | .pdf ]
[1176] Mikaela Keller. Machine Learning Approaches to Text Representation using Unlabeled Data. Idiap-rr, Ecole Polytechnique Fédérale de Lausanne, 2006. IDIAP-RR 06-76. [ .ps.gz | .pdf ]
[1177] Mikaela Keller and Samy Bengio. A multitask learning approach to document representation using unlabeled data. Idiap-RR Idiap-RR-44-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1178] Mikaela Keller, Johnny Mariéthoz, and Samy Bengio. Significance Tests for bizarre Measures in 2-Class Classification Tasks. Idiap-RR Idiap-RR-34-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[1179] Jean Keomany and Sébastien Marcel. Active shape models using local binary patterns. Idiap-RR Idiap-RR-07-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1180] Christopher Kermorvant and Chafic Mokbel. Towards introducing long-term statistics in muse for robust speech recognition. Idiap-RR Idiap-RR-18-1999, IDIAP, 1999. [ .ps.gz | .pdf ]
[1181] Christopher Kermorvant and Chafic Mokbel. Towards introducing long-term statistics in muse for robust speech recognition. In Automatic Speech Recognition and Understanding (ASRU) workshop, Keystone, Colorado, USA, 12 1999. [ .ps.gz | .pdf ]
[1182] Christopher Kermorvant and Andrew Morris. A comparison of two strategies for asr in additive noise : Missing data and spectral subtraction. In 6th European Conference on Speech Communication and Technology --- Eurospeech'99, Budapest, Hungary, 1999. [ .ps.gz | .pdf ]
[1183] Christopher Kermorvant and Andrew Morris. A comparison of two strategies for asr in additive noise : Missing data and spectral subtraction. Idiap-RR Idiap-RR-17-1999, IDIAP, 1999. [ .ps | .pdf ]
[1184] Christopher Kermorvant. A comparison of noise reduction techniques for robust speech recognition. Idiap-RR Idiap-RR-10-1999, IDIAP, 1999. IDIAP-RR 99-10. [ .ps.gz | .pdf ]
[1185] Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, Samy Bengio, and Dan Chazan. Discriminative kernel-based phoneme sequence recognition. In The 9th International Conference on Spoken Language Processing (INTERSPEECH) [3164]. [ .pdf ]
[1186] Joseph Keshet, David Grangier, and Samy Bengio. Discriminative keyword spotting. In Workshop on Non-Linear Speech Processing, 2007. [ .pdf ]
[1187] Joseph Keshet, David Grangier, and Samy Bengio. Discriminative keyword spotting. Speech Communication, 51(4), 4 2009. [ .pdf ]
[1188] Joseph Keshet and Dan Chazan. A kernel wrapper for phoneme sequence recognition. In Joseph Keshet and Samy Bengio, editors, Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods. John Wiley and Sons, 2009.
[1189] Joseph Keshet. A proposal for a kernel-based algorithm for large vocabulary continuous speech recognition. In Joseph Keshet and Samy Bengio, editors, Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods. John Wiley and Sons, 2009.
[1190] Joseph Keshet and Samy Bengio. Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods. John Wiley & Sons, 2008.
[1191] Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, and Dan Chazan. A large margin algorithm for forced alignment. In Joseph Keshet and Samy Bengio, editors, Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods. John Wiley and Sons, 3 2009.
[1192] Cem Keskin, Oya Aran, and Lale Akarun. Hand gesture analysis. In Albert Ali Salah and Theo Gevers, editors, Computer Analysis of Human Behavior,, pages 125--149. Springer London, 2011.
[1193] Hamed Ketabdar, Jithendra Vepa, Samy Bengio, and Hervé Bourlard. Using more informative posterior probabilities for speech recognition. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) [3165]. IDIAP-RR 05-91. [ .ps.gz | .pdf ]
[1194] Hamed Ketabdar, Jithendra Vepa, Samy Bengio, and Hervé Bourlard. Posterior based keyword spotting with a priori thresholds. In International Conference on Spoken Language Processing (ICSLP) [3166]. IDIAP-RR 06-67. [ .ps.gz | .pdf ]
[1195] Hamed Ketabdar and Hynek Hermansky. Identifying unexpected words using in-context and out-of-context phoneme posteriors. Idiap-RR Idiap-RR-68-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1196] Hamed Ketabdar. Enhancing posterior based speech recognition systems. PhD thesis, Ecole Polytechnique Fédérale de Lausanne, Lausanne , Switzerland, 2008. Thèse Ecole polytechnique fédérale de Lausanne EPFL, no 4218 (2008,',','), Faculté des sciences et techniques de l'ingénieur STI, Section de génie électrique et électronique, Institut de génie électrique et électronique IEL (Laboratoire de l'IDIAP LIDIAP). Dir.: Hervé Bourlard. [ .pdf ]
[1197] Vasil Khalidov and Jean-Marc Odobez. Real-time multiple head tracking using texture and colour cues. Idiap-RR Idiap-RR-02-2017, Idiap, 2 2017. [ .pdf ]
[1198] Vasil Khalidov, Florence Forbes, and Radu Horaud. Alignment of binocular-binaural data using a moving audio-visual target. In Proc. of IEEE Int. Workshop on Multimedia Signal Processing (MMSP), 2013. [ .pdf ]
[1199] Deepanshu Khanna, Muskaan Singh, and Petr Motlicek. Idiap_tiet@lt-edi-acl2022 : Hope speech detection in social media using contextualized bert with attention mechanism. In ACL, 2022. [ .pdf ]
[1200] Daniel Khashabi, Arman Cohan, Siamak Shakeri, Pedram Hosseini, Pouya Pezeshkpour, Marzieh Bitaab, Faeze Brahman, Sarik Ghazarian, Arman Kabiri, rabeeh karimi mahabadi, Omid Memarrast, Ahmadreza Mosallanezhad, Erfan Noury, Shahab Raji, Mohammad Sadegh Rasooli, Sepideh Sadeghi, Erfan Sadeqi Azer, Niloofar Safi Samghabadi, Mahsa Shafaei, Saber Sheybani, Ali Tazarv, and Yadollah Yaghoobzadeh. Parsinlu: A suite of language understanding challenges for persian. TACL, 2021. [ .pdf ]
[1201] Khaled Khelif, yann Mombrun, Gerhard Backfried, Farhan Sahito, Luca Scarpatto, Petr Motlicek, Damien Kelly, Gideon Hazzani, Emmanouil Chatzigavriil, and Srikanth Madikeri. Towards a breakthrough speaker identification approach for law enforcement agencies: Siip. In European Intelligence and Security Informatics Conference (EISIC) 2017 [3167], pages 32--39. [ DOI | http | .pdf ]
[1202] Khaled Khelif, yann Mombrun, Gideon Hazzani, Petr Motlicek, Srikanth Madikeri, Farhan Sahito, Damien Kelly, Luca Scarpatto, Emmanouil Chatzigavriil, and Gerhard Backfried. Siip: An innovative speaker identification approach for law enforcement agencies. In Big Data and Artificial Intelligence for Military Decision Making, pages PT--1 -- 1: PT--1 -- 14. http://www.sto.nato.int/, STO, May 2018. Meeting Proceedings RDP. [ DOI | .pdf ]
[1203] Banriskhem Khonglah, Srikanth Madikeri, Subhadeep Dey, Hervé Bourlard, Petr Motlicek, and Jayadev Billa. Incremental semi-supervised learning for multi-genre speech recognition. In Proceedings of ICASSP 2020, 2020. [ .pdf ]
[1204] Banriskhem Khonglah, Srikanth Madikeri, Navid Rekabsaz, Nikolaos Pappas, Petr Motlicek, and Hervé Bourlard. Stacked neural networks with parameter sharing for multilingual language modeling. Idiap-RR Idiap-RR-12-2019, Idiap, 10 2019. [ .pdf ]
[1205] Banriskhem Khonglah, Srikanth Madikeri, Petr Motlicek, and Hervé Bourlard. Investigating time delay neural network (tdnn) for language modeling in low resource automatic speech recognition. Idiap-RR Idiap-RR-13-2019, Idiap, 10 2019. [ .pdf ]
[1206] Abbas Khosravani, Philip N. Garner, and Alexandros Lazaridis. An evaluation benchmark for automatic speech recognition of german-english code-switching. In IEEE Automatic Speech Recognition and Understanding Workshop, December 2021. [ .pdf ]
[1207] Abbas Khosravani, Philip N. Garner, and Alexandros Lazaridis. Learning to translate low-resourced swiss german dialectal speech into standard german text. In IEEE Automatic Speech Recognition and Understanding Workshop. IEEE, December 2021. [ .pdf ]
[1208] Abbas Khosravani, Claudiu Musat, Philip N. Garner, and Alexandros Lazaridis. Comparison of subword segmentation methods for open-vocabulary asr using a difficulty metric. Technical report, April 2020. [ .pdf ]
[1209] Abbas Khosravani, Claudiu Musat, Philip N. Garner, and Alexandros Lazaridis. Comparison of subword segmentation methods for open-vocabularyend-to-end speech recognition. Idiap-RR Idiap-RR-34-2020, Idiap, 12 2020. Submitted to SLT 2021 conference, DAHL project. [ .pdf ]
[1210] Abbas Khosravani, Philip N. Garner, and Alexandros Lazaridis. Modeling dialectal variation for swiss german automatic speech recognition. In Proceedings of Interspeech, August 2021. [ DOI | .pdf ]
[1211] Elie Khoury, Manuel Günther, Laurent El Shafey, and Sébastien Marcel. On the improvements of uni-modal and bi-modal fusions of speaker and face recognition for mobile biometrics. In Biometric Technologies in Forensic Science [3168]. [ .pdf ]
[1212] Elie Khoury, Antoine Laurent, Sylvain Meignier, and Simon Petitrenaud. Combining transcription-based and acoustic-based speaker identifications for broadcast news. In IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012. [ .pdf ]
[1213] Elie Khoury, Laurent El Shafey, and Sébastien Marcel. Spear: An open source toolbox for speaker recognition based on bob. In Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 1655 -- 1659, May 2014. [ DOI | http | .pdf ]
[1214] Elie Khoury, Sébastien Marcel, and Manuel Günther. Icb 2013 - competition on speaker recognition in mobile environment using the mobio database: The evaluation plan. Idiap-Com Idiap-Com-04-2012, Idiap, 12 2012. [ .pdf ]
[1215] Elie Khoury, Paul Gay, and Jean-Marc Odobez. Fusing matching and biometric similarity measures for face diarization in video. Idiap-RR Idiap-RR-31-2013, Idiap, 11 2013. [ .pdf ]
[1216] Elie Khoury, Laurent El Shafey, Chris McCool, Manuel Günther, and Sébastien Marcel. Bi-modal biometric authentication on mobile phones in challenging conditions. In Image and Vision Computing [3172], pages 1147--1160. [ DOI | http ]
[1217] Elie Khoury, Tomi Kinnunen, Aleksandr Sizov, Zhizheng Wu, and Sébastien Marcel. Introducing i-vectors for joint anti-spoofing and speaker verification. In The 15th Annual Conference of the International Speech Communication Association, 2014. [ .pdf ]
[1218] Elie Khoury, Christine Sénac, and Philippe Joly. Audiovisual diarization of people in video content. Multimedia Tools and Applications, 2012. [ .pdf ]
[1219] Elie Khoury, Laurent El Shafey, and Sébastien Marcel. The idiap speaker recognition evaluation system at nist sre 2012. In NIST Speaker Recognition Conference. NIST, December 2012. [ .pdf ]
[1220] Elie Khoury, Laurent El Shafey, Marc Ferras, and Sébastien Marcel. Hierarchical speaker clustering methods for the nist i-vector challenge. In Odyssey: The Speaker and Language Recognition Workshop, 2014. [ .pdf ]
[1221] Samuel Kim, Maurizio Filippone, Fabio Valente, and Alessandro Vinciarelli. Predicting the conflict level in television political debates: an approach based on crowdsourcing, nonverbal communication and gaussian processes. In ACM Multimedia, 2012.
[1222] Samuel Kim, Fabio Valente, and Alessandro Vinciarelli. Automatic detection of conflicts in spoken conversations: ratings and analysis of broadcast political debates. In Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, March 2012. [ .pdf ]
[1223] Samuel Kim, Sree Harsha Yella, and Fabio Valente. Automatic detection of conflict escalation in spoken conversations. In INTERSPEECH. ISCA, 2012. [ .pdf ]
[1224] N. Kiukkonen, Blom J., O. Dousse, Daniel Gatica-Perez, and J. K. Laurila. Towards rich mobile phone datasets: Lausanne data collection campaign. In Proc. ACM Int. Conf. on Pervasive Services (ICPS,',','), Berlin., 7 2010. [ .pdf ]
[1225] Matthias Kleinert, Hartmut Helmke, Shruthi Shetty, Oliver Ohneiser, heiko Ehr, Amrutha Prasad, Petr Motlicek, and Julia Harfmann. Automated interpretation of air traffic control communication: The journey from spoken words to a deeper understanding of the meaning. In 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), pages 1--9. IEEE, October 2021. [ DOI | .pdf ]
[1226] Matthias Kleinert, Hartmut Helmke, Gerald Siol, heiko Ehr, Cerna Aneta, Kern Christian, Dietrich Klakow, Petr Motlicek, Youssef Oualil, Mittul Singh, and Ajay Srinivasamurthy. Semi-supervised adaptation of assistant based speech recognition models for different approach areas. In 37th AIAA/IEEE Digital Avionics Systems Conference. AIAA/IEEE, September 2018. The best paper award in cathegory "ST-B: Human Factors & Performance for Aerospace Applications" (http://2018.dasconline.org/pages/award-winners). [ http | .pdf ]
[1227] Matthias Kleinert, Hartmut Helmke, Gerald Siol, heiko Ehr, Dietrich Klakow, Mittul Singh, Petr Motlicek, Kern Christian, Cerna Aneta, and Hlousek Petr. Adaptation of assistant based speech recognition to new domains and its acceptance by air traffic controllers. In Proceedings of the 2nd International Conference on Intelligent Human Systems Integration (IHSI 2019): Integrating People and Intelligent Systems, pages 820 -- 826, February 2019. [ DOI ]
[1228] Matthias Kleinert, Hartmut Helmke, Gerald Siol, heiko Ehr, Michael Finke, Youssef Oualil, and Ajay Srinivasamurthy. Machine learning of controller command prediction models from recorded radar data and controller speech utterances. In Proceedings of the 7th SESAR Innovation Days (SID). University of Belgrade, November 2017. [ .pdf ]
[1229] Matthias Kleinert, Hartmut Helmke, heiko Ehr, Kern Christian, Dietrich Klakow, Petr Motlicek, Mittul Singh, and Gerald Siol. Building blocks of assistant based speech recognition for air traffic management applications. In Conference: SESAR Innovation Days 2018. European Union, Eurocontrol, SESARJU, December 2018. [ http | .pdf ]
[1230] David Klotz, Johannes Wienke, Britta Wrede, Sebastian Wrede, Samira Sheikhi, Dinesh Babu Jayagopi, Vasil Khalidov, and Jean-Marc Odobez. Robot-to-group interaction in a vernissage: Architecture & dataset for multi-party dialog. In Proceedings of 5th International Conference on Cognitive Systems, 2012. [ .pdf ]
[1231] David Klotz, Johannes Wienke, Julia Peltason, Britta Wrede, Sebastian Wrede, Vasil Khalidov, and Jean-Marc Odobez. Engagement-based multi-party dialog with a humanoid robot. In Proceedings of the SIGDIAL 2011: the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 341--343, 2011. [ .pdf ]
[1232] Martin Kocour, Karel Vesely, Igor Szoke, Santosh Kesiraju, Zuluaga-Gomez. Juan, Blatt Alexander, Amrutha Prasad, Nigmatulina Iuliia, Petr Motlicek, and et al. Automatic processing pipeline for collecting and annotating air-traffic voice communication data. In Proceedings of 9th OpenSky Symposium 2020, pages 1--9. OpenSky Network, MDPI, November 2021. [ .pdf ]
[1233] Martin Kocour, Karel Vesely, Blatt Alexander, Zuluaga-Gomez. Juan, Igor Szoke, Jan Cernocky, Dietrich Klakow, and Petr Motlicek. Boosting of contextual information in asr for air-traffic call-sign recognition. In Interspeech 2021, August 2021. [ .pdf ]
[1234] Ina Kodrasi, Michaela Pernon, Marina Laganaro, and Hervé Bourlard. Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech. In IEEE International Conference on Acoustics, Speech and Signal Processing, June 2021. [ .pdf ]
[1235] Ina Kodrasi and Simon Doclo. Joint late reverberation and noise power spectral density estimation in a spatially homogeneous noise field. In Proc. International Conference on Acoustics, Speech, and Signal Processing, pages 441--445, April 2018. [ .pdf ]
[1236] Ina Kodrasi and Hervé Bourlard. Super-gaussianity of speech spectral coefficients as a potential biomarker for dysarthric speech detection. In IEEE International Conference on Acoustics, Speech and Signal Processing, May 2019.
[1237] Ina Kodrasi. Temporal envelope and fine structure cues for dysarthric speech detection using convolutional neural networks. IEEE Signal Processing Letters, September 2021. [ .pdf ]
[1238] Ina Kodrasi and Hervé Bourlard. Single-channel late reverberation power spectral density estimation using denoising autoencoders. In Proc. Annual Conference of the International Speech Communication Association, September 2018. [ .pdf ]
[1239] Ina Kodrasi, Michaela Pernon, Marina Laganaro, and Hervé Bourlard. Automatic discrimination of apraxia of speech and dysarthria using a minimalistic set of handcrafted features. In Interspeech, October 2020. [ .pdf ]
[1240] Ina Kodrasi and Hervé Bourlard. Spectro-temporal sparsity characterization for dysarthric speech detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28:1210--1222, April 2020. [ .pdf ]
[1241] Ina Kodrasi and Hervé Bourlard. Statistical modeling of speech spectral coefficients in patients with parkinson's disease. In Proc. ITG conference on Speech Communication, October 2018. [ .pdf ]
[1242] Ina Kodrasi and Simon Doclo. Improving the conditioning of the optimization criterion in acoustic multi-channel equalization using shorter reshaping filters. EURASIP Journal on Advances in Signal Processing, (11), January 2018. [ .pdf ]
[1243] Ina Kodrasi and Simon Doclo. Analysis of eigenvalue decomposition-based late reverberation power spectral density estimation. IEEE Transaction on Acoustics, Speech and Language Processing, 26(6):1106--1118, June 2018. [ .pdf ]
[1244] Jukka Komulainen, Abdenour Hadid, Matti Pietikainen, André Anjos, and Sébastien Marcel. Complementary countermeasures for detecting scenic face spoofing attacks. In International Conference on Biometrics, June 2013. [ http | .pdf ]
[1245] Danil Korchagin. Out-of-scene av data detection. In Proceedings IADIS International Conference Applied Computing [3173]. [ .pdf ]
[1246] Danil Korchagin. Impact of excitation frequency on short-term recording synchronisation and confidence estimation. In Proceedings European Signal Processing Conference [3174]. [ .pdf ]
[1247] Danil Korchagin. Audio spatio-temporal fingerprints for cloudless real-time hands-free diarization on mobile devices. In Proceedings of the 3rd Joint Workshop on Hands-Free Speech Communication and Microphone Arrays [3175]. [ .pdf ]
[1248] Danil Korchagin, Philip N. Garner, and John Dines. Automatic temporal alignment of av data with confidence estimation. In Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing [3176]. [ .pdf ]
[1249] Danil Korchagin, Petr Motlicek, Stefan Duffner, and Hervé Bourlard. Just-in-time multimodal association and fusion from home entertainment. In Proceedings IEEE International Conference on Multimedia & Expo [3177]. [ .pdf ]
[1250] Danil Korchagin and Hamid Reza Abutalebi. Social focus of attention as a time function derived from multimodal signals. In Proceedings IEEE International Conference on Multimedia & Expo [3178]. [ .pdf ]
[1251] Danil Korchagin. Multimodal data flow controller. Idiap-Com Idiap-Com-01-2009, Idiap, P.O. Box 592, CH-1920 Martigny, Switzerland, 11 2009. [ .pdf ]
[1252] Danil Korchagin, Philip N. Garner, and John Dines. Automatic temporal alignment of av data. Idiap-RR Idiap-RR-39-2009, Idiap, 12 2009. [ .pdf ]
[1253] Danil Korchagin, Stefan Duffner, Petr Motlicek, and Carl Scheffler. Multimodal cue detection engine for orchestrated entertainment. In Proceedings International Conference on MultiMedia Modeling [3183]. [ .pdf ]
[1254] Danil Korchagin. Memoirs of togetherness from audio logs. In Proceedings International ICST Conference on User Centric Media [3184]. [ .pdf ]
[1255] Pavel Korshunov and Sébastien Marcel. Face anthropometry aware audio-visual age verification. In ACM Multimedia, October 2022. [ .pdf ]
[1256] Pavel Korshunov, Michael Halstead, Diego Castan, Martin Graciarena, Mitchell McLaren, Brian Burns, Aaron Lawson, and Sébastien Marcel. Tampered speaker inconsistency detection with phonetically aware audio-visual features. In International Conference on Machine Learning, Synthetic Realities: Deep Learning for Detecting AudioVisual Fakes, July 2019. Best paper award in ICML workshop "Synthetic Realities: Deep Learning for Detecting AudioVisual Fakes". [ .pdf ]
[1257] Pavel Korshunov and Sébastien Marcel. Joint operation of voice biometrics and presentation attack detection. In IEEE International Conference on Biometrics: Theory, Applications and Systems [3185]. Open source software for the paper: https://pypi.python.org/pypi/bob.paper.btas_j2016. [ http | .pdf ]
[1258] Pavel Korshunov, Sébastien Marcel, Hannah Muckenhirn, A. R. Gonçalves, A. G. Souza Mello, R. P. Velloso Violato, F. O. Simões, M. U. Neto, M. de Assis Angeloni, J. A. Stuchi, H. Dinkel, N. Chen, Y. Qian, D. Paul, G. Saha, and Md Sahidullah. Overview of btas 2016 speaker anti-spoofing competition. In IEEE International Conference on Biometrics: Theory, Applications and Systems [3186]. Open source software for the paper: https://pypi.python.org/pypi/bob.paper.btas_c2016. [ http | .pdf ]
[1259] Pavel Korshunov and Sébastien Marcel. Speaker inconsistency detection in tampered video. In European Signal Processing Conference, September 2018. [ .pdf ]
[1260] Pavel Korshunov and Sébastien Marcel. Subjective and objective evaluation of deepfake videos. In The international Conference on Acoustics, Speech, and Signal Processing, June 2021. [ .pdf ]
[1261] Pavel Korshunov, Anubhav Jain, and Sébastien Marcel. Custom attribution loss for improving generalization and interpretability of deepfake detection. In International Conference on Acoustics, Speech, & Signal Processing, May 2022. [ .pdf ]
[1262] Pavel Korshunov and Sébastien Marcel. Vulnerability of face recognition to deep morphing. In International Conference on Biometrics for Borders, October 2019. [ .pdf ]
[1263] Pavel Korshunov and Sébastien Marcel. Vulnerability assessment and detection of deepfake videos. In IAPR International Conference on Biometrics [3187]. [ .pdf ]
[1264] Pavel Korshunov and Sébastien Marcel. Deepfake detection: humans vs. machines. Idiap-RR Idiap-RR-36-2020, Idiap, 12 2020. [ .pdf ]
[1265] Pavel Korshunov and Sébastien Marcel. Presentation attack detection in voice biometrics. In Claus Vielhauer, editor, User-Centric Privacy and Security in Biometrics, chapter 7. The Institution of Engineering and Technology, Savoy Place, London WC2R 0BL, UK, 2017. [ .pdf ]
[1266] Pavel Korshunov and Sébastien Marcel. Cross-database evaluation of audio-based spoofing detection systems. In Interspeech [3188]. Open source software package for the paper: https://pypi.python.org/pypi/bob.paper.interspeech_2016. [ http | .pdf ]
[1267] Pavel Korshunov, Andreé R. Goncalves, Ricardo P. V. Violato, Flávio O. Simões, and Sébastien Marcel. On the use of convolutional neural networks for speech presentation attack detection. In International Conference on Identity, Security and Behavior Analysis, January 2018. [ .pdf ]
[1268] Pavel Korshunov and Sébastien Marcel. A cross-database study of voice presentation attack detection. In Sébastien Marcel, Mark Nixon, Julian Fierrez, and Nicholas Evans, editors, Handbook of Biometric Anti-Spoofing: Presentation Attack Detection, 2nd Edition, chapter 19. Springer, November 2018.
[1269] Pavel Korshunov and Sébastien Marcel. Impact of score fusion on voice biometrics and presentation attack detection in cross-database evaluations. IEEE Journal of Selected Topics in Signal Processing, 11(4):695 -- 705, June 2017. [ DOI | .pdf ]
[1270] Pavel Korshunov and Sébastien Marcel. Improving generalization of deepfake detection with data farming and few-shot learning. IEEE Transactions on Biometrics, Behavior, and Identity Science, December 2021. [ .pdf ]
[1271] Ketan Kotwal and Sébastien Marcel. Residual feature pyramid network for enhancement of vascular patterns. In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, June 2022. [ .pdf ]
[1272] Ketan Kotwal and Sébastien Marcel. Cnn patch pooling for detecting 3d mask presentation attacks in nir. In IEEE International Conference on Image Processing [3189]. [ .pdf ]
[1273] Ketan Kotwal, Sushil Bhattacharjee, Philip Abbet, Zohreh Mostaani, Huang Wei, Xu Wenkang, Zhao Yaxi, and Sébastien Marcel. Domain-specific adaptation of cnn for detecting face presentation attacks in nir. IEEE Transactions on Biometrics, Behavior, and Identity Science, 2022. [ .pdf ]
[1274] Ketan Kotwal, Zohreh Mostaani, and Sébastien Marcel. Detection of age-induced makeup attacks on face recognition systems using multi-layer deep features. IEEE Transactions on Biometrics, Behavior, and Identity Science, page 11, 2019. [ .pdf ]
[1275] Ketan Kotwal, Sushil Bhattacharjee, and Sébastien Marcel. Multispectral deep embeddings as a countermeasure to custom silicone mask presentation attacks. IEEE Transactions on Biometrics, Behavior, and Identity Science, 2019. [ .pdf ]
[1276] S. R. Krishnan, Mathew Magimai.-Doss, and C. S. Seelamantula. A savitzky-golay filtering perspective of dynamic feature computation. IEEE Signal Processing Letters, 20(3):281 -- 284, March 2013. [ DOI ]
[1277] Tipaluck Krityakierne and David Ginsbourger. Global optimization with sparse and local gaussian process models. In Panos Pardalos, Mario Pavone, Giovanni Maria Farinella, and Vincenzo Cutello, editors, Machine Learning, Optimization, and Big Data, volume 9432 of Lecture Notes in Computer Science, pages 185--196. Springer International Publishing, 2015. [ DOI ]
[1278] Vedrana Krivokuca and Sébastien Marcel. Towards protecting face embeddings in mobile face verification scenarios. In arXiv [3190]. Version 1 -- Submitted to IEEE T-BIOM. [ http | .pdf ]
[1279] Vedrana Krivokuca and Sébastien Marcel. Towards quantifying the entropy of fingervein patterns across different feature extractors. In 2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis (ISBA), 2018. [ .pdf ]
[1280] Vedrana Krivokuca, Marta Gomez-Barrero, Sébastien Marcel, Christian Rathgeb, and Christoph Busch. Towards measuring the amount of discriminatory information in finger vein biometric characteristics using a relative entropy estimator. In Andreas Uhl, Christoph Busch, Sébastien Marcel, and Raymond Veldhuis, editors, Handbook of Vascular Biometrics, chapter 17, pages 507--525. Springer Open, 2019. [ .pdf ]
[1281] Vedrana Krivokuca and Sébastien Marcel. On the recognition performance of biohash-protected finger vein templates. In Andreas Uhl, Christoph Busch, Sébastien Marcel, and Raymond Veldhuis, editors, Handbook of Vascular Biometrics, chapter 15, pages 465--480. Springer Open, 2019. [ .pdf ]
[1282] Sacha Krstulović. Relating LPC modeling to a factor-based articulatory model. In Proc. ICSLP 2000, 2000. [ .ps.gz | .pdf ]
[1283] Sacha Krstulović and Frédéric Bimbot. Inverse lattice filtering of speech with adapted non-uniform delays. In Proc. ICSLP 2000, 2000. [ .ps.gz | .pdf ]
[1284] Sacha Krstulović. LPC modeling with speech production constraints. In Proc. 5th Speech Production Seminar, 2000. [ .ps.gz | .pdf ]
[1285] Sacha Krstulović and Frédéric Bimbot. Signal modeling with Non Uniform Topology lattice filters. In Proc. ICASSP 2001, volume ii, 2001. [ .ps.gz | .pdf ]
[1286] Sacha Krstulović. Epfl lab session 1/2: Introduction to Gaussian statistics and pattern recognition. Idiap-Com Idiap-Com-06-2001, IDIAP, 2001. [ .ps.gz | .pdf ]
[1287] Sacha Krstulović. Epfl lab session 2/2: Introduction to hidden markov models. Idiap-Com Idiap-Com-07-2001, IDIAP, 2001. [ .ps.gz | .pdf ]
[1288] Sacha Krstulović. PhD Thesis: Speech Analysis with Production Constraints. Idiap-rr, École Polytechnique Fédérale de Lausanne, 2001. [ .ps.gz | .pdf ]
[1289] Sacha Krstulović. Présentation du modèle DRM. Idiap-Com Idiap-Com-03-1996, IDIAP, 4 1996. [ .ps.gz | .pdf ]
[1290] Sacha Krstulović. Investigation of a possible process identity between DRM and linear filtering. Idiap-RR Idiap-RR-19-1997, IDIAP, 1997. [ .ps.gz | .pdf ]
[1291] Sacha Krstulović. Acoustico-articulatory inversion of unequal-length tube models through lattice inverse filtering. Idiap-RR Idiap-RR-16-1998, IDIAP, 1998. [ .ps.gz | .pdf ]
[1292] Sacha Krstulović. LPC-based inversion of the DRM articulatory model. In Proc. Eurospeech'99, 1999. [ .ps.gz | .pdf ]
[1293] Serife Kucur Ergunay, Elie Khoury, Alexandros Lazaridis, and Sébastien Marcel. On the vulnerability of speaker verification to realistic voice spoofing. In IEEE International Conference on Biometrics: Theory, Applications and Systems, pages 1--8. IEEE, September 2015. [ DOI | http | .pdf ]
[1294] Thibaut Kulak and Sylvain Calinon. Intrinsically-motivated robot learning of bayesian probabilistic movement primitives. In ICRA workshop: "Towards Curious Robots: Modern Approaches for Intrinsically-Motivated Intelligent Behavior", 2021. [ .pdf ]
[1295] Thibaut Kulak, Hakan Girgin, Jean-Marc Odobez, and Sylvain Calinon. Active learning of bayesian probabilistic movement primitives. IEEE Robotic and Automation Letters, 2021. [ .pdf ]
[1296] Thibaut Kulak, J. Silverio, and Sylvain Calinon. Fourier movement primitives: an approach for learning rhythmic robot skills from demonstrations. In Robotics: Science and Systems, 2020. [ .pdf ]
[1297] Thibaut Kulak and Sylvain Calinon. Combining social and intrinsically-motivated learning for multi-task robot skill acquisition. IEEE Transactions on Cognitive and Developmental Systems, 2021. [ .pdf ]
[1298] Thibaut Kulak. Learning strategies and representations for intuitive robot learning from demonstration. PhD thesis, EPFL, December 2021. [ .pdf ]
[1299] D S Pavan Kumar, Bogdan Vlasenko, and Mathew Magimai.-Doss. Modelling glottal source information for depression detection. Idiap-RR Idiap-RR-13-2018, Idiap, 8 2018. [ .pdf ]
[1300] Kenichi Kumatani, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, John McDonough, and Matthias Wölfel. Minimum mutual information beamforming for simultaneous active speakers. Idiap-RR Idiap-RR-73-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1301] Kenichi Kumatani, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, John McDonough, and Matthias Wölfel. Adaptive beamforming with a minimum mutual information criterion. Idiap-RR Idiap-RR-74-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1302] Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip N. Garner, and Weifeng Li. Filter bank design for subband adaptive beamforming and application to speech recognition. Idiap-RR Idiap-RR-02-2008, IDIAP, 2008. [ .ps.gz | .pdf ]
[1303] Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner, and Weifeng Li. Maximum negentropy beamforming. Idiap-RR Idiap-RR-07-2008, IDIAP, 2008. [ .ps.gz | .pdf ]
[1304] Kenichi Kumatani, John McDonough, Barbara Rauch, Dietrich Klakow, Philip N. Garner, and Weifeng Li. Beamforming with a maximum negentropy criterion. In IEEE Transactions on Audio Speech and Language Processing [3196]. [ .pdf ]
[1305] Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner, and Weifeng Li. Adaptive beamforming with a maximum negentropy criterion. In Proceedings of the Joint Workshop on Hands-free Speech Communication and Microphone Arrays [3197]. [ .pdf ]
[1306] Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip N. Garner, and Weifeng Li. Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming. In Proceedings of ICASSP 2008 [3198]. [ .pdf ]
[1307] Kenichi Kumatani, John McDonough, Barbara Rauch, Philip N. Garner, Weifeng Li, and John Dines. Maximum kurtosis beamforming with the generalized sidelobe canceller. In Proceedings of INTERSPEECH, September 2008, 9 2008. [ .pdf ]
[1308] Mikko Kurimo and Chafic Mokbel. Latent semantic indexing by self-organizing map. In ESCA ETRW workshop on Accessing Information in Spoken Audio [3199]. IDIAP-RR 99-12. [ .ps.gz | .pdf ]
[1309] Mikko Kurimo. Indexing spoken audio by LSA and SOMs. In Proceedings of the European Signal Processing Conference EUSIPCO'2000 [3200]. IDIAP-RR 00-06.
[1310] Mikko Kurimo. Fast latent semantic indexing of spoken documents by using self-organizing maps. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'2000 [3201]. IDIAP-RR 99-20. [ .ps.gz | .pdf ]
[1311] Mikko Kurimo. Thematic indexing of spoken documents by using self-organizing maps. Idiap-RR Idiap-RR-05-2000, IDIAP, 2000. [ .ps.gz | .pdf ]
[1312] Mikko Kurimo. Indexing audio documents by using latent semantic analysis and som. In Oja and Kaski [3202]. IDIAP-RR 99-13. [ .ps.gz | .pdf ]
[1313] Mikko Kurimo, William Byrne, John Dines, Philip N. Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu, and Junichi Yamagishi. Personalising speech-to-speech translation in the emime project. In Proceedings of the ACL 2010 System Demonstrations. Association for Computational Linguistics, 7 2010. [ .pdf ]
[1314] Ilja Kuzborskij, Francesco Orabona, and Barbara Caputo. Scalable greedy algorithms for transfer learning. Computer Vision and Image Understanding, 2016.
[1315] Ilja Kuzborskij, Francesco Orabona, and Barbara Caputo. From n to n+1: Multiclass transfer incremental learning. In Proceedings of the Conference on Computer Vision and Pattern Recognition, June 2013. [ .pdf ]
[1316] Ilja Kuzborskij, Fabio M. Carlucci, and Barbara Caputo. When naïve bayes nearest neighbors meet convolutional neural networks. In Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, June 2016. [ .pdf ]
[1317] Ilja Kuzborskij, Arjan Gijsberts, and Barbara Caputo. On the challenge of classifying 52 hand movements from surface electromyography. In 34th Annual Conference of the IEEE Engineering in Medicine & Biology Society, 2012. [ .pdf ]
[1318] Ilja Kuzborskij, Francesco Orabona, and Barbara Caputo. Transfer learning through greedy subset selection. In Image Analysis and Processing - ICIAP 2015 [3203], pages 3--14. [ DOI | .pdf ]
[1319] Ilja Kuzborskij and Francesco Orabona. Stability and hypothesis transfer learning. In International Conference on Machine Learning, June 2013. [ .pdf ]
[1320] Ilja Kuzborskij and Francesco Orabona. Fast rates by transferring from auxiliary hypotheses. Machine Learning, 2016.
[1321] Ilja Kuzborskij. Theory and Algorithms for Hypothesis Transfer Learning. PhD thesis, EPFL, 2018. [ DOI | .pdf ]
[1322] Florian Labhart, Emmanuel Kuntsche, Michael Livingston, and Rutger Engels. After how many drinks does someone experience acute consequences-determining thresholds for binge drinking based on two event-level studies: Optimal thresholds for binge drinking. Addiction, 113(12):2235--2244, December 2018. [ DOI | http | .pdf ]
[1323] Florian Labhart, Thanh-Trung Phan, Daniel Gatica-Perez, and Emmanuel Kuntsche. Shooting shots: Estimating alcoholic drink sizes in real life using event-level reports and annotations of close-up pictures. Drug and Alcohol Review, 2020. [ DOI | http | .pdf ]
[1324] Florian Labhart, Flavio Tarsetti, Olivier Bornet, Darshan Santani, Jasmine Truong, Sara Landolt, Daniel Gatica-Perez, and Emmanuel Kuntsche. Capturing drinking and nightlife behaviours and their social and physical context with a smartphone application - investigation of users' experience and reactivity. Addiction Research and Theory, 28(1):62--75, January 2020. [ DOI | http ]
[1325] Florian Labhart, Flavio Tarsetti, Olivier Bornet, Darshan Santani, Jasmine Truong, Sara Landolt, Daniel Gatica-Perez, and Emmanuel Kuntsche. Development of the geographical proportional-to-size street-intercept sampling (gpsis) method for recruiting urban nightlife-goers in an entire city. International Journal of Social Research Methodology, 20(6):721--736, 2017. [ DOI ]
[1326] Florian Labhart, Emmanuel Kuntsche, and Rutger Engels. What reminds young people that they drank more than intended on weekend nights: An event-level study. Journal of Studies on Alcohol and Drugs, 79(4):644--648, July 2018. [ DOI | http | .pdf ]
[1327] Florian Labhart, Skanda Muralidhar, Benoit Massé, Lakmal Buddika Meegahapola, Emmanuel Kuntsche, and Daniel Gatica-Perez. Ten seconds of my nights: exploring methods to measure brightness, loudness and attendance and their associations with alcohol use from video clips. PLOS ONE, 2021. [ DOI | .pdf ]
[1328] Florian Labhart. Context is Everything: Using a Smartphone App to Capture Young People's Drinking Behaviours, Cognitions, Environments, and Consequences. PhD thesis, La Trobe University, Melbourne, Australia, October 2020. [ DOI | .pdf ]
[1329] Denis Lalanne and Andrei Popescu-Belis. User requirements for meeting support technology. In Steve Renals, Hervé Bourlard, Jean Carletta, and Andrei Popescu-Belis, editors, Multimodal Signal Processing: Human Interactions in Meetings, pages 210--221. Cambridge University Press, Cambridge, UK, 2012.
[1330] Inga Lang, Lonneke van der Plas, Malvina Nissim, and Albert Gatt. Visually grounded interpretation of noun-noun compounds in english. In Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics. Association for Computational Linguistics, 2022.
[1331] I. Lapidot and H. Guterman. Dichotomy between clustering performance and minimum distortion in piecewise-dependent-data (PDD) clustering. In to be published in IEEE Signal Processing Letters [3204]. IDIAP-RR 02-48. [ .ps.gz | .pdf ]
[1332] I. Lapidot and Andrew Morris. Extended BIC criterion for model selection. Idiap-RR Idiap-RR-42-2002, IDIAP, Martigny, Switzerland, 2002. [ .ps.gz | .pdf ]
[1333] I. Lapidot. What is better: GMM of two gaussians or two clusters with one gaussian? Idiap-RR Idiap-RR-56-2002, IDIAP, Martigny, Switzerland, 2002. [ .ps.gz | .pdf ]
[1334] I. Lapidot. Self-organizing-maps with BIC for speaker clustering. Idiap-RR Idiap-RR-60-2002, IDIAP, Martigny, Switzerland, 2002. [ .ps.gz | .pdf ]
[1335] Ivan Laptev, Barbara Caputo, and Tony Lindberg. Local velocity-adapted motion events for spatio-temporal recognition. Computer Vision and Image Undertanding, 108(3), 2007. [ .ps.gz | .pdf ]
[1336] Guillaume Lathoud, Mathew Magimai.-Doss, and Hervé Bourlard. Unsupervised Spectral Subtraction for noise-Robust ASR on unknown Transmission Channels. Idiap-RR Idiap-RR-09-2006, IDIAP, Martigny, Switzerland, 2006. [ .ps.gz | .pdf ]
[1337] Guillaume Lathoud. Further applications of Sector-Based Detection and Short-Term Clustering. Idiap-RR Idiap-RR-26-2006, IDIAP, Martigny, Switzerland, 2006. [ .ps.gz | .pdf ]
[1338] Guillaume Lathoud. Observations on Multi-Band asynchrony in Distant Speech Recordings. Idiap-RR Idiap-RR-74-2006, IDIAP, Martigny, Switzerland, 2006. [ .ps.gz | .pdf ]
[1339] Guillaume Lathoud. Spatio-Temporal Analysis of Spontaneous Speech with Microphone Arrays. Idiap-rr, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 12 2006. PhD Thesis #3689 at the Ecole Polytechnique Fédérale de Lausanne. [ .ps.gz | .pdf ]
[1340] Guillaume Lathoud and Iain A. McCowan. Location Based Speaker Segmentation. In Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03) [3206]. IDIAP-RR 02-43. [ .ps.gz | .pdf ]
[1341] Guillaume Lathoud, Iain A. McCowan, and Darren Moore. Segmenting Multiple Concurrent Speakers using Microphone arrays. In Proceedings of Eurospeech 2003 [3207]. IDIAP-RR 03-21. [ .ps.gz | .pdf ]
[1342] Guillaume Lathoud, Iain A. McCowan, and Jean-Marc Odobez. Unsupervised Location-Based Segmentation of Multi-Party Speech. In Proceedings of the 2004 ICASSP-NIST Meeting Recognition Workshop [3208]. IDIAP-RR 04-14. [ .ps.gz | .pdf ]
[1343] Guillaume Lathoud and Iain A. McCowan. A Sector-Based approach for Localization of Multiple Speakers with Microphone arrays. In Proceedings of the 2004 SAPA Workshop [3209]. IDIAP-RR 04-15. [ .ps.gz | .pdf ]
[1344] Guillaume Lathoud, Jean-Marc Odobez, and Daniel Gatica-Perez. AV16.3: an audio-Visual Corpus for Speaker Localization and Tracking. In Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag [3210]. IDIAP-RR 04-28. [ .ps.gz | .pdf ]
[1345] Guillaume Lathoud and Mathew Magimai.-Doss. A Sector-Based, Frequency-Domain approach to Detection and Localization of Multiple Speakers. In Proceedings of ICASSP 2005 [3211]. IDIAP-RR 04-54. [ .ps.gz | .pdf ]
[1346] Guillaume Lathoud, Julien Bourgeois, and Jürgen Freudenberger. Multichannel speech enhancement in cars: Explicit vs. implicit adaptation control. In Proceedings of HSCMA 2005 [3212]. IDIAP-RR 04-67. [ .ps.gz | .pdf ]
[1347] Guillaume Lathoud, Mathew Magimai.-Doss, and Bertrand Mesot. A Spectrogram Model for enhanced Source Localization and noise-Robust ASR. In Proceedings of INTERSPEECH 2005 [3213]. IDIAP-RR 05-13. [ .ps.gz | .pdf ]
[1348] Julien Bourgeois, Jürgen Freudenberger, and Guillaume Lathoud. Implicit Control of noise Canceller for Speech enhancement. In Proceedings of INTERSPEECH 2005, Lisbon, Portugal, 9 2005. [ .ps.gz | .pdf ]
[1349] Guillaume Lathoud, Mathew Magimai.-Doss, Bertrand Mesot, and Hervé Bourlard. Unsupervised Spectral Subtraction for noise-Robust ASR. In Proceedings of the 2005 IEEE ASRU Workshop [3214]. IDIAP RR 05-42. [ .ps.gz | .pdf ]
[1350] Guillaume Lathoud, Julien Bourgeois, and Jürgen Freudenberger. Sector-Based Detection for Hands-Free Speech enhancement in Cars. In EURASIP Journal on Applied Signal Processing, Special Issue on Advances in Multimicrophone Speech Processing [3212]. IDIAP RR 04-67. [ .ps.gz | .pdf ]
[1351] Guillaume Lathoud, Mathew Magimai.-Doss, and Hervé Bourlard. Threshold Selection for Unsupervised Detection, with an application to Microphone arrays. In Proceedings of ICASSP 2006 [3215]. IDIAP RR 05-52. [ .ps.gz | .pdf ]
[1352] Guillaume Lathoud and Jean-Marc Odobez. Short-Term Spatio-Temporal Clustering applied to Multiple Moving Speakers. IEEE Transactions on Audio, Speech and Language Processing, 15(5):15, July 2007. [ .pdf ]
[1353] Elisabetta La Torre, Tatiana Tommasi, and Barbara Caputo. Kernel methods for melanoma recognition. In Medical Informatics in Europe (MIE), Maastricht, The Netherlands, 2006. [ .ps.gz | .pdf ]
[1354] Elisabetta La Torre, Barbara Caputo, and Tatiana Tommasi. Melanoma recognition using kernel classifiers. Idiap-RR Idiap-RR-53-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1355] J. K. Laurila, Daniel Gatica-Perez, I. Aad, Blom J., Olivier Bornet, Trinh-Minh-Tri Do, O. Dousse, J. Eberle, and M. Miettinen. The mobile data challenge: Big data for mobile computing research. In Pervasive Computing, 2012. [ .pdf ]
[1356] J. K. Laurila, Daniel Gatica-Perez, Jan Blom, Olivier Bornet, Trinh-Minh-Tri Do, O. Dousse, Julien Eberle, and Markus Miettinen. From big smartphone data to worldwide research: The mobile data challenge. Pervasive and Mobile Computing, 9(6):752–771, December 2013. [ .pdf ]
[1357] Alexandros Lazaridis, Milos Cernak, and Philip N. Garner. Probabilistic amplitude demodulation features in speech synthesis for improving prosody. In Proceedings of Interspeech [3216]. [ .pdf ]
[1358] Alexandros Lazaridis, Ivan Himawan, Petr Motlicek, Iosif Mporas, and Philip N. Garner. Investigating cross-lingual multi-level adaptive networks: The importance of the correlation of source and target languages. In Proceedings of the International Workshop on Spoken Language Translation, December 2016. [ .pdf ]
[1359] Alexandros Lazaridis, Elie Khoury, Jean-Philippe Goldman, Mathieu Avanzi, Sébastien Marcel, and Philip N. Garner. Swiss french regional accent identification. In Odyssey: The Speaker and Language Recognition Workshop, 2014. [ .pdf ]
[1360] Alexandros Lazaridis, Pierre-Edouard Honnet, and Philip N. Garner. Svr vs mlp for phone duration modelling in hmm-based speech synthesis. In Speech Prosody [3217]. [ .pdf ]
[1361] Alexandros Lazaridis, Blaise Potard, and Philip N. Garner. Dnn-based speech synthesis: Importance of input features and training data. In A. Ronzhin, R. Potapova, and N. Fakotakis, editors, International Conference on Speech and Computer , SPECOM, volume 9319 of Lecture Notes in Computer Science, pages 193--200. Springer Berlin Heidelberg, 2015. [ DOI | .pdf ]
[1362] Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet, and Philip N. Garner. Investigating spectral amplitude modulation phase hierarchy features in speech synthesis. In 9th ISCA Speech Synthesis Workshop [3218]. [ .pdf ]
[1363] Alexandros Lazaridis, Jean-Philippe Goldman, Mathieu Avanzi, and Philip N. Garner. Syllable-based regional swiss french accent identification using prosodic features. In Nouveaux cahiers de linguistique francaise, 2014. [ .pdf ]
[1364] Le Chen, David Barber, and Jean-Marc Odobez. Dynamical dirichlet mixture model. Idiap-RR Idiap-RR-02-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1365] Rémi Lebret and Ronan Collobert. Word embeddings through hellinger pca. In 14th Conference of the European Chapter of the Association for Computational Linguistics [3219]. [ .pdf ]
[1366] Rémi Lebret and Ronan Collobert. N-gram-based low-dimensional representation for document classification. In International Conference on Learning Representations, April 2015. [ www: | .pdf ]
[1367] Rémi Lebret, Pedro H. O. Pinheiro, and Ronan Collobert. Phrase-based image captioning. In International Conference on Machine Learning (ICML) [3220], page 2085–2094. Under review by the International Conference on Machine Learning (ICML). [ .html | .pdf ]
[1368] Rémi Lebret, Pedro H. O. Pinheiro, and Ronan Collobert. Twitter sentiment analysis (almost) from scratch. Idiap-RR Idiap-RR-15-2016, Idiap, 5 2016. [ .pdf ]
[1369] Rémi Lebret and Ronan Collobert. "the sum of its parts": Joint learning of word and phrase representations with autoencoders. Idiap-RR Idiap-RR-21-2015, Idiap, 6 2015. In ICML Deep Learning Workshop. [ .pdf ]
[1370] Rémi Lebret, Pedro H. O. Pinheiro, and Ronan Collobert. Simple image description generator via a linear phrase-based model. Idiap-RR Idiap-RR-22-2015, Idiap, 6 2015. In the workshop session of the International Conference on Learning Representations. [ .pdf ]
[1371] Rémi Lebret, Joël Legrand, and Ronan Collobert. Is deep learning really necessary for word embeddings? Idiap-RR Idiap-RR-44-2013, Idiap, 12 2013. Accepted to NIPS Deep Learning Workshop. [ .pdf ]
[1372] Rémi Lebret and Ronan Collobert. Rehabilitation of count-based models for word vector representations. In Computational Linguistics and Intelligent Text Processing, volume 9041 of Lecture Notes in Computer Science, pages 417--429. Springer International Publishing, alexander gelbukh edition, 2015.
[1373] Rémi Lebret. Building Word Embeddings for Solving Natural Language Processing. PhD thesis, École Polytechnique Fédérale de Lausanne, July 2016. Thèse EPFL, n° 7148. [ DOI ]
[1374] Gwénolé Lecorvé, Petr Motlicek, and John Dines. Domain-specific language model adaptation: a case study. Idiap-Com Idiap-Com-01-2013, Idiap, 11 2011. [ .pdf ]
[1375] Gwénolé Lecorvé and Petr Motlicek. Conversion of recurrent neural network language models to weighted finite state transducers for automatic speech recognition. In Proceedings of Interspeech [3221], page to appear. [ .pdf ]
[1376] Gwénolé Lecorvé, John Dines, Thomas Hain, and Petr Motlicek. Supervised and unsupervised web-based language model domain adaptation. In Proceedings of Interspeech [3222], page to appear. [ .pdf ]
[1377] Gwénolé Lecorvé, John Dines, Thomas Hain, and Petr Motlicek. Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du web. In Actes de la conférence conjointe JEP-TALN-RECITAL 2012 [3223], pages 193--200. in French. [ .pdf ]
[1378] Rebecca Lee, Oskar Wysocki, Andre Freitas, and et al. Longitudinal characterisation of haematological and biochemical parameters in cancer patients prior to and during covid-19 reveals features associated with outcome. ESMO Open, February 2021.
[1379] Rebecca Lee, Oskar Wysocki, Andre Freitas, and et al. Establishment of coronet, covid-19 risk in oncology evaluation tool, to identify cancer patients at low versus high risk of severe complications of covid-19 infection upon presentation to hospital. Clinical Cancer Informatics, 2022.
[1380] Kong Aik Lee, Rahim Saedi, Tawfik Hasan, Tomi Kinnunen, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo Luis Sordo Martinez, Tharmarajah Thiruvaran, Changhuai You, Padmanabhan Rajan, David Van Leeuwen, Seyed Omid Sadjadi, Driss Matrouf, Laurent El Shafey, John Mason, Eliathamby Ambikairajah, Hanwu Sun, Anthony Larcher, Bin Ma, Ville Hautamäki, Cemal Hanilci, Billy Braithwaite, Gonzalez-Hautamäki Rosa, Gang Liu, Hynek Boril, Navid Shokouhi, John Hansen, Jean-François Bonastre, and Sébastien Marcel. The i4u submission to the 2012 nist speaker recognition evaluation. In NIST Speaker Recognition Conference, December 2012.
[1381] Leonidas Lefakis and Francois Fleuret. Jointly informative feature selection. In International Conference on Artificial Intelligence and Statistics, page 567–575, 2014. [ .pdf ]
[1382] Leonidas Lefakis and Francois Fleuret. Macro-action discovery based on change point detection and boosting. In International Conference on Machine Learning and Applications, 2012. [ .pdf ]
[1383] Leonidas Lefakis and Francois Fleuret. Dynamic programming boosting for discriminative macro-action discovery. In International Conference on Machine Learning, 2014. [ .pdf ]
[1384] Leonidas Lefakis and Francois Fleuret. Jointly informative feature selection. Journal of Machine Learning Research, 2016.
[1385] Leonidas Lefakis and Francois Fleuret. Joint cascade optimization using a product of boosted classifiers. In Proceedings of the Neural Information Processing Systems Conference, page 1315–1323, 2010.
[1386] Leonidas Lefakis and Francois Fleuret. Reservoir boosting : Between online and offline ensemble learning. In Proceedings of the international conference on Neural Information Processing Systems, 2013. [ .pdf ]
[1387] Leonidas Lefakis. Tractable Approaches to Learning and Planning in High Dimensions. PhD thesis, EPFL, 2014. [ DOI ]
[1388] Stéphanie Lefèvre and Jean-Marc Odobez. Structure and appearance features for robust 3d facial actions tracking. In IEEE Proc. Int. Conf. on Multimedia and Expo. IEEE, 2009. [ .pdf ]
[1389] Stéphanie Lefèvre and Jean-Marc Odobez. View-based appearance model online learning for 3d deformable face tracking. In Proc. Int. Conf. on Computer Vision Theory and Applications, May 2010. [ .pdf ]
[1390] Riwal Lefort and Francois Fleuret. A tree-based distance between distributions: application to classification of neurons. In ICASSP 2012 : IEEE International Conference on Acoustics, Speech and Signal Processing, 2012.
[1391] Riwal Lefort, L. Fusco, F. Benmansour, Kevin C. Smith, O. Pertz, and Francois Fleuret. Machine learning techniques to analyse complex, computer vision-extracted, dynamic cellular phenotypes. In 1st International SystemsX.ch Conference on Systems Biology, 2011.
[1392] Riwal Lefort, L. Fusco, O. Pertz, and Francois Fleuret. Machine learning-based tools to model and to remove the off-target effect. Pattern Analysis and Applications, 20(1):87--100, February 2017. first online: 1st April 2015. [ DOI ]
[1393] Riwal Lefort and Francois Fleuret. treekl: A distance between high dimension empirical distributions. Pattern Recognition Letters, 34(2):140--145, 2013. [ .pdf ]
[1394] Joël Legrand and Ronan Collobert. Deep neural networks for syntactic parsing of morphologically rich languages. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016. [ .pdf ]
[1395] Joël Legrand and Ronan Collobert. Recurrent greedy parsing with neural networks. In springer, editor, Proceedings of ECML 2014, volume 8725, pages 130--144. Springer Berlin Heidelberg, 2014. [ DOI | .pdf ]
[1396] Joël Legrand and Ronan Collobert. Joint rnn-based greedy parsing and word composition. In Proceedings of ICLR 2015, 2015. [ .pdf ]
[1397] Joël Legrand and Ronan Collobert. Syntactic parsing of morphologically rich languages using deep neural networks. Idiap-RR Idiap-RR-25-2015, Idiap, 6 2015. Accepted in SPMRL 2015. [ .pdf ]
[1398] Joël Legrand and Ronan Collobert. Phrase representations for multiword expressions. In Proceedings of the 12th Workshop on Multiword Expressions, 2016. [ .pdf ]
[1399] Joël Legrand. Word Sequence Modeling using Deep Learning: and End-to-end Approach and its Applications. PhD thesis, EPFL, August 2016. [ DOI ]
[1400] Joël Legrand, Michael Auli, and Ronan Collobert. Neural network-based word alignment through score aggregation. In Proceedings of the ACL 1st Conference on Machine Translation, 2016. [ .pdf ]
[1401] Mikko Lehtonen. Hierarchical approach for spotting keywords. Idiap-RR Idiap-RR-41-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[1402] V. Lemaire and F. Clérot. Som-based clustering for on-line fraud behavior classification: a case study. Idiap-RR Idiap-RR-30-2002, France Telecom Research and Development, 2002. [ .ps.gz | .pdf ]
[1403] V. Lemaire. Bagging using the vmse cost function. Idiap-RR Idiap-RR-27-2002, France Telecom Research and Development, 2002. [ .ps.gz | .pdf ]
[1404] Teguh Santoso Lembono, Carlos Mastalli, Pierre Fernbach, Nicolas Mansard, and Sylvain Calinon. Learning how to walk: Warm-starting optimal control solver with memory of motion. In International Conference on Robotics and Automation, 2020. [ .pdf ]
[1405] Teguh Santoso Lembono and Sylvain Calinon. Probabilistic iterative lqr for short time horizon mpc. In International Conference on Intelligent Robots and Systems, pages 579--585, 2021. [ DOI ]
[1406] Teguh Santoso Lembono, Antonio Paolillo, Emmanuel Pignat, and Sylvain Calinon. Memory of motion for warm-starting trajectory optimization. IEEE Robotics and Automation Letters, 5(2):2594--2601, 2020. [ DOI | .pdf ]
[1407] Teguh Santoso Lembono, Francisco Suarez-Ruiz, and Quang-Cuong Pham. Scalar - simultaneous calibration of 2d laser and robot's kinematic parameters using three planar constraints. In International Conference on Intelligent Robots, 2018. [ .pdf ]
[1408] Teguh Santoso Lembono, Emmanuel Pignat, Julius Jankowski, and Sylvain Calinon. Learning constrained distributions of robot configurations with generative adversarial network. IEEE Robotics and Automation Letters, 2021. [ .pdf ]
[1409] Teguh Santoso Lembono, Francisco Suarez-Ruiz, and Quang-Cuong Pham. Scalar: Simultaneous calibration of 2-d laser and robot kinematic parameters using planarity and distance constraints. IEEE Transactions on Automation Science and Engineering, 16(4):1971--1979, October 2019. [ DOI ]
[1410] Teguh Santoso Lembono. Memory of Motion for Initializing Optimization in Robotics. PhD thesis, École Polytechnique Fédérale de Lausanne, July 2022. [ .pdf ]
[1411] Eileen Lew, Marnix Nuttin, Pierre W. Ferrez, A. Degeest, Anna Buttfield, G. Vanacker, and José del R. Millán. Non-invasive brain computer interface for mental control of a simulated wheelchair. In Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, Graz, Austria, 9 2006. [ .pdf ]
[1412] Nam Le and Jean-Marc Odobez. Learning multimodal temporal representation for dubbing detection in broadcast media. In ACM Multimedia. ACM, October 2016. [ .pdf ]
[1413] Nam Le, Alexandre Heili, and Jean-Marc Odobez. Long-term time-sensitive costs for crf-based tracking by detection. In 2nd Workshop on Benchmarking Multi-target Tracking: MOTChallenge 2016, October 2016. [ .pdf ]
[1414] Nam Le, Jean-Marc Odobez, and et al. Towards large scale multimedia indexing: A case study on person discovery in broadcast news. In 15th International Workshop on Content-Based Multimedia Indexing, June 2017. [ .pdf ]
[1415] Nam Le and Jean-Marc Odobez. Improving speaker turn embedding by crossmodal transfer learning from face embedding. In ICCV Workshop on Computer Vision for Audio-Visual Media, October 2017. [ .pdf ]
[1416] Nam Le and Jean-Marc Odobez. A domain adaptation approach to improve speaker turn embedding using face representation. In ACM International Conference on Multimodal Interaction. ACM, November 2017. [ .pdf ]
[1417] Nam Le, Alexandre Heili, Di Wu, and Jean-Marc Odobez. Temporally subsampled detection for accurate and efficient face tracking and diarization. In International Conference on Pattern Recognition. IEEE, December 2016. [ .pdf ]
[1418] Quoc Anh Le and Andrei Popescu-Belis. Automatic vs. human question answering over multimedia meeting recordings. In 10th Annual Conference of the International Speech Communication Association [3224]. [ .pdf ]
[1419] Nam Le and Jean-Marc Odobez. Robust and discriminative speaker embedding via intra-class distance variance regularization. In Proceedings of Interspeech, pages 2257--2261, 2018. [ DOI | .pdf ]
[1420] Nam Le, Di Wu, Sylvain Meignier, and Jean-Marc Odobez. Eumssi team at the mediaeval person discovery challenge. In Working Notes Proceedings of the MediaEval 2015 Workshop, September 2015. [ .pdf | .pdf ]
[1421] Nam Le, Sylvain Meignier, and Jean-Marc Odobez. Eumssi team at the mediaeval person discovery challenge 2016. In MediaEval Benchmarking Initiative for Multimedia Evaluation, October 2016. [ .pdf ]
[1422] Nam Le and Jean-Marc Odobez. Improving speech embedding using crossmodal transfer learning with audio-visual data. Multimedia Tools and Applications, 78(11):15681--15704, January 2019. [ DOI ]
[1423] Nam Le. Multimodal Person Recognition in Audio-Visual Streams. PhD thesis, EPFL, 2019. [ DOI | .pdf ]
[1424] Weifeng Li and Hervé Bourlard. Non-linear spectral contrast stretching for in-car speech recognition. In Interspeech-Eurospeech # to appear in html [3225]. IDIAP-RR 07-53. [ .ps.gz | .pdf ]
[1425] Weifeng Li, Mathew Magimai.-Doss, John Dines, and Hervé Bourlard. Mlp-based log spectral energy mapping for robust overlapping speech recognition. Idiap-RR Idiap-RR-54-2007, IDIAP, 2007. Submitted for publication. [ .ps.gz | .pdf ]
[1426] Weifeng Li, John Dines, and Mathew Magimai.-Doss. Robust overlapping speech recognition based on neural networks. Idiap-RR Idiap-RR-55-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1427] Weifeng Li. Effective post-processing for single-channel frequency-domain speech enhancement. Idiap-RR Idiap-RR-71-2007, IDIAP, 2007. Submitted for publication. [ .ps.gz | .pdf ]
[1428] Weifeng Li, John Dines, Mathew Magimai.-Doss, and Hervé Bourlard. Neural network based regression for robust overlapping speech recognition using microphone arrays. Idiap-RR Idiap-RR-09-2008, IDIAP, 2008. Submitted for publication. [ .ps.gz | .pdf ]
[1429] Weifeng Li, Kenichi Kumatani, John Dines, Mathew Magimai.-Doss, and Hervé Bourlard. A neural network based regression approach for recognizing simultaneous speech. Idiap-RR Idiap-RR-10-2008, IDIAP, 2008. Submitted for publication. [ .ps.gz | .pdf ]
[1430] Hui Liang and John Dines. Enhancing state mapping-based cross-lingual speaker adaptation using phonological knowledge in a data-driven manner. Idiap-RR Idiap-RR-08-2013, Idiap, 3 2013. [ .pdf ]
[1431] Hui Liang and John Dines. An analysis of language mismatch in hmm state mapping-based cross-lingual speaker adaptation. In Proceedings of Interspeech [3230]. [ .pdf ]
[1432] Renars Liepins and et al. The summa platform prototype. In Proceedings of the EACL 2017 Software Demonstrations, pages 116--119, April 2017. [ http | .pdf ]
[1433] Niklas Linde, David Ginsbourger, James Iriving, Fabio Nobile, and Arnaud Doucet. On uncertainty quantification in hydrogeology and hydrogeophysics. Advances in Water Resources, 110:166–181, December 2017. [ DOI | http ]
[1434] David Lindner, Kyle Matoba, and Alexander Meulemans. Challenges for using impact regularizers to avoid negative side effects. In SafeAI 2021 - AAAI's Workshop on Artificial Intelligence Safety, 2021. [ .pdf ]
[1435] Julian Linke, Philip N. Garner, Gernot Kubin, and Barbara Schuppler. Conversational speech recognition needs data? experiments with austrian german. In Proceedings of the 13th Language Resources and Evaluation Conference, pages 4684--4691, Marseille, France, June 2022. European Language Resources Association. [ .pdf ]
[1436] Gang Liu, Yu Yu, Kenneth Alberto Funes Mora, and Jean-Marc Odobez. A differential approach for gaze estimation with calibration. In 29TH BRITISH MACHINE VISION CONFERENCE, 2018. [ .pdf ]
[1437] Zheyuan Liu, Cristian Rodriguez-Opazo, Damien Teney, and Stephen Gould. Image retrieval on real-life images with pre-trained vision-and-language models. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021. [ .pdf ]
[1438] Gang Liu, Yu Yu, and Jean-Marc Odobez. A differential approach for gaze estimation. IEEE Transaction on Pattern Analysis and Machine Intelligence, 43(3):1092--1098, 2021. [ DOI | http | .pdf ]
[1439] Marcus Liwicki, Andreas Schlapbach, Horst Bunke, Samy Bengio, Johnny Mariéthoz, and Jonas Richiardi. Writer identification for smart meeting room systems. In Seventh IAPR Workshop on Document Analysis Systems, DAS [3233]. IDIAP-RR 05-70. [ .ps.gz | .pdf ]
[1440] Jeevanthi Liyanapathirana and Andrei Popescu-Belis. Using the ted talks to evaluate spoken post-editing of machine translation. In Proceedings of the 10th Language Resources and Evaluation Conference (LREC), 2016. [ .pdf ]
[1441] Weifeng Li, John Dines, Mathew Magimai.-Doss, and Hervé Bourlard. Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009. [ .pdf ]
[1442] Weifeng Li, Longbiao Wang, Yicong Zhou, John Dines, Mathew Magimai.-Doss, Hervé Bourlard, and Qingmin Liao. Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array. Idiap-RR Idiap-RR-17-2014, Idiap, 10 2014. IEEE/ACM Trans. on Audio, Speech and Language Processing. [ .pdf ]
[1443] Weifeng Li and Hervé Bourlard. Sub-band based log-energy and its dynamic range stretching for robust in-car speech recognition. In Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech) [3234]. [ .pdf ]
[1444] Sharid Loaiciga, Thomas Meyer, and Andrei Popescu-Belis. English-french verb phrase alignment in europarl for tense translation modeling. In The Ninth Language Resources and Evaluation Conference, 2014. [ .pdf ]
[1445] Michele Loi, Eleonora Viganò, and Lonneke van der Plas. The societal and ethical relevance of computational creativity. In Proceedings of the International Conference on Computational Creativity, 2020.
[1446] Adolfo Lopez-Mendez, C. E. I Westling, Remi Emonet, M. Easteal, L. Lavia, H. J. Witchel, and Jean-Marc Odobez. Automated bobbing and phase analysis to measure walking entrainment. In IEEE International Conference on Image Processing (ICIP), Paris, October 2014. [ .pdf ]
[1447] Adolfo Lopez-Mendez, Florent Monay, and Jean-Marc Odobez. Exploiting scene cues for dropped object detection. In 9th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications., 2014. [ .pdf ]
[1448] Andrew Lovitt, Joel Praveen Pinto, and Hynek Hermansky. On confusions in a phoneme recognizer. [3235]. IDIAP-RR 07-10. [ .ps.gz | .pdf ]
[1449] Andrew Lovitt. Correcting confusion matrices for phone recognizers. Idiap-Com Idiap-Com-03-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1450] Tobias Löw, Jérémy Maceiras, and Sylvain Calinon. drozbot: Using ergodic control to draw portraits. IEEE Robotics and Automation Letters, page 7, 2022. [ DOI | http | .pdf ]
[1451] Tobias Löw, Tirthankar Bandyopadhyay, Jason Williams, and Paulo Borges. Prompt: Probabilistic motion primitives based trajectory planning. In Proceedings of Robotics: Science and Systems, Virtual, July 2021. [ DOI | .html | .pdf ]
[1452] Perruchoud Loise. The anterior cingulate cortex. Idiap-Com Idiap-Com-02-2008, IDIAP, 2008. [ .pdf ]
[1453] Juergen Luettin and Gilbert Maître. Evaluation protocol for the extended M2VTS database (XM2VTSDB). Idiap-Com Idiap-Com-05-1998, IDIAP, 1998. [ .ps.gz | .pdf ]
[1454] Juergen Luettin and Neil A. Thacker. Speechreading using probabilistic models. In Computer Vision and Image Understanding [3236]. IDIAP-RR 97-12.
[1455] Juergen Luettin and Stéphane Dupont. Continuous audio-visual speech recognition. In Proc. 5th European Conference on Computer Vision [3237]. IDIAP-RR 98-02. [ .ps.gz | .pdf ]
[1456] Juergen Luettin. Towards speaker independent continuous speechreading. In Proceedings of the European Conference on Speech Communication and Technology, 1997. [ .ps.gz | .pdf ]
[1457] Juergen Luettin and Souheil Ben-Yacoub. Robust person verification based on speech and facial images. In Proceedings of the European Conference on Speech Communication and Technology, 1999. [ .ps.gz | .pdf ]
[1458] Juergen Luettin, Neil A. Thacker, and Steve W. Beet. Statistical lip modelling for visual speech recognition. In Proceedings of the 8th European Signal Processing Conference (Eusipco'96), volume I, 1996. [ .ps.gz | .pdf ]
[1459] Juergen Luettin, Neil A. Thacker, and Steve W. Beet. Visual speech recognition using active shape models and hidden markov models. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96), volume 2, 1996. [ .ps.gz | .pdf ]
[1460] Juergen Luettin, Neil A. Thacker, and Steve W. Beet. Learning to recognise talking faces. In Proceedings of the International Conference on Pattern Recognition (ICPR'96), volume IV. IAPR, 1996. [ .ps.gz | .pdf ]
[1461] Juergen Luettin, Neil A. Thacker, and Steve W. Beet. Locating and tracking facial speech features. In Proceedings of the International Conference on Pattern Recognition (ICPR'96), volume I. IAPR, 1996. [ .ps.gz | .pdf ]
[1462] Juergen Luettin, Neil A. Thacker, and Steve W. Beet. Speaker identification by lipreading. In Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), volume 1, 1996. [ .ps.gz | .pdf ]
[1463] Juergen Luettin, Neil A. Thacker, and Steve W. Beet. Speachreading using shape and intensity information. In Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96), volume 1, 1996. [ .ps.gz | .pdf ]
[1464] Juergen Luettin, Neil A. Thacker, and Steve W. Beet. Active shape models for visual speech feature extraction. In D. G. Storck and Hennecke Hennecke, editors, Speechreading by Humans and Machines, volume 150 of NATO ASI Series, Series F: Computer and Systems Sciences. Springer Verlag, Berlin, 1996. [ .ps.gz | .pdf ]
[1465] Juergen Luettin, Michael Vogt, and Christoph Bregler. Machine recognition and applications. In D. G. Storck and Hennecke Hennecke, editors, Speechreading by Humans and Machines, volume 150 of NATO ASI Series, Series F: Computer and Systems Sciences. Springer Verlag, Berlin, 1996.
[1466] Juergen Luettin. Speaker verification experiments on the XM2VTS database. Idiap-RR Idiap-RR-02-1999, IDIAP, 1999. [ .ps.gz | .pdf ]
[1467] Juergen Luettin. Speech reading. In J. Noyes and Martin Cooke, editors, Modern Interface Technology: The Leading Edge. Research Studies Press Ltd., 1999.
[1468] Juergen Luettin. Visual Speech and Speaker Recognition. PhD thesis, University of Sheffield, 1997. [ .ps.gz | .pdf ]
[1469] Raphaelle Luisier, Mehmet Girgin, Matthias P. Lutolf, and Adrian Ranga. Mammary epithelial morphogenesis in 3d combinatorial microenvironments. Scientific Reports, 10(1), 2020. [ http | .pdf ]
[1470] Tomas Lundin, Emile Fiesler, and Perry Moerland. Connectionist quantization functions. In Proceedings of the '96 SIPAR-Workshop on Parallel and Distributed Computing. Scientific and Parallel Computing Group, University of Geneva, 1996. [ .ps.gz | .pdf ]
[1471] Tomas Lundin and Perry Moerland. Quantization and pruning of multilayer perceptrons: Towards compact neural networks. Idiap-Com Idiap-Com-02-1997, IDIAP, 3 1997. [ .ps.gz | .pdf ]
[1472] Jie Luo, Andrzej Pronobis, and Barbara Caputo. Svm-based transfer of visual knowledge across robotic platforms. In International Conference on Computer Vision Systems (ICVS07), Bielefeld, Germany, 3 2007. [ .ps.gz | .pdf ]
[1473] Jie Luo, Barbara Caputo, Alon Zweig, Joerg-Henrik Back, and Joern Anemueller. Object category detection using audio-visual cues. In International Conference on Computer Vision Systems (ICVS08), Santorini, Greece, 5 2008. [ .ps.gz | .pdf ]
[1474] Jie Luo, Andrzej Pronobis, Barbara Caputo, and Patric Jensfelt. Incremental learning for place recognition in dynamic environments. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS07), San Diego, California, 10 2007. [ .ps.gz | .pdf ]
[1475] Jie Luo, Andrzej Pronobis, Barbara Caputo, and Patric Jensfelt. Incremental learning for place recognition in dynamic environments. Idiap-RR Idiap-RR-52-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1476] Jie Luo, Andrzej Pronobis, and Barbara Caputo. Svm-based transfer of visual knowledge across robotic platforms. Idiap-RR Idiap-RR-65-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1477] Jie Luo, Barbara Caputo, Alon Zweig, Joerg-Henrik Back, and Joern Anemueller. Object category detection using audio-visual cues. Idiap-RR Idiap-RR-58-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1478] Ngoc-Quang Luong, Lesly Miculicich, and Andrei Popescu-Belis. Pronoun translation and prediction with or without coreference links. In Proceedings of the Second Workshop on Discourse in Machine Translation (DiscoMT), page 94–100, September 2015. [ .pdf ]
[1479] Ngoc-Quang Luong and Andrei Popescu-Belis. Machine translation of spanish personal and possessive pronouns using anaphora probabilities. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL) [3238]. [ .pdf ]
[1480] Ngoc-Quang Luong and Andrei Popescu-Belis. A contextual language model to improve machine translation of pronouns by re-ranking translation hypotheses. In European Association for Machine Translation, May 2016.
[1481] Ngoc-Quang Luong and Andrei Popescu-Belis. Improving pronoun translation by modeling coreference uncertainty. In Proceedings of the First Conference on Machine Translation (WMT16), August 2016. [ .pdf ]
[1482] Ngoc-Quang Luong and Andrei Popescu-Belis. Pronoun language model and grammatical heuristics for aiding pronoun prediction. In Proceedings of the First Conference on Machine Translation (WMT16). ACL, August 2016. [ .pdf ]
[1483] Jie Luo, Francesco Orabona, and Barbara Caputo. An online framework for learning novel concepts over multiple cues. In Proceeding of The 9th Asian Conference on Computer Vision, 9 2009. [ .pdf ]
[1484] Jie Luo, Francesco Orabona, Marco Fornoni, Barbara Caputo, and Nicolo Cesa-Bianchi. Om-2: An online multi-class multi-kernel learning algorithm. In In Proceeding of CVPR 2010, Online Learning for Computer Vision Workshop [3239]. [ .pdf ]
[1485] Jie Luo, Tatiana Tommasi, and Barbara Caputo. Multiclass transfer learning from unconstrained priors. In Proceedings of the 13th International Conference on Computer Vision [3240]. [ .pdf ]
[1486] Jie Luo, Francesco Orabona, Barbara Caputo, and Vittorio Ferrari. Learning from images with captions using the maximum margin set algorithm. Idiap-RR Idiap-RR-30-2011, Idiap, 8 2011. [ .pdf ]
[1487] Jie Luo and Francesco Orabona. Learning from candidate labeling sets. In Advances in Neural Information Processing Systems 23 (NIPS10) [3241]. [ .pdf ]
[1488] Jie Luo. Open-ended Learning of Visual and Multi-modal Patterns. PhD thesis, Ecole polytechnique fédérale de Lausanne, December 2011. Thèse Ecole polytechnique fédérale de Lausanne EPFL, no 5233 (2011,',','), Programme doctoral en Informatique, Communications et Information, Faculté des sciences et techniques de l'ingénieur STI, Institut de génie électrique et électronique IEL (Laboratoire de l'IDIAP LIDIAP). Dir.: Hervé Bourlard and Barbara Caputo. [ .pdf ]
[1489] Gil Luyet. Low-rank representation for enhanced deep neural network acoustic models. Idiap-RR Idiap-RR-05-2016, Idiap, 3 2016. [ .pdf ]
[1490] Gil Luyet, Pranay Dighe, Afsaneh Asaei, and Hervé Bourlard. Low-rank representation of nearest neighbor phone posterior probabilities to enhance dnn acoustic modeling. In Interspeech [3242]. [ .pdf ]
[1491] Hong Lu, Mashfiqui Rabbi, Gokul Chittaranjan, Denise Frauendorfer, Marianne Schmid Mast, Andrew T. Campbell, Daniel Gatica-Perez, and Tanzeem Choudhury. Stresssense: Detecting stress in unconstrained acoustic environments using smartphones. In Ubicomp'12, September 2012. [ .pdf ]
[1492] Jérémy Maceiras. Planning and control of robot manipulation tasks. Idiap-Com Idiap-Com-01-2022, Idiap, 7 2022. [ .pdf ]
[1493] Anmol Madan, Manuel Cebrian, Sai Moturu, Katayoun Farrahi, and Alex Pentland. Sensing the `health state` of our society. IEEE Pervasive Computing, Special Issue on Large-Scale Opportunistic Sensing, 2011. [ .pdf ]
[1494] Anmol Madan, Katayoun Farrahi, Daniel Gatica-Perez, and Alex Pentland. Pervasive sensing to model political opinions in face-to-face networks. In Pervasive, June 2011. [ .pdf ]
[1495] Srikanth Madikeri and Hervé Bourlard. Kl-hmm based speaker diarization system for meetings. In Proceedings of ICASSP 2015 [3243], pages 4435--4439. [ .pdf ]
[1496] Srikanth Madikeri, Petr Motlicek, and Hervé Bourlard. Combining sgmm speaker vectors and kl-hmm approach for speaker diarization. In Proceedings of ICASSP 2015 [3244], pages 4834--4837. [ .pdf ]
[1497] Srikanth Madikeri, Marc Ferras, Petr Motlicek, and Subhadeep Dey. Intra-class covariance adaptation in plda back-ends for speaker verification. In Proceedings of International Conference on Acoustics, Speech and Signal Processing [3245], pages 5365--5369. [ DOI ]
[1498] Srikanth Madikeri, Subhadeep Dey, and Petr Motlicek. A bayesian approach to inter-task fusion for speaker recognition. In In Proceedings of ICASSP 2019 [3246], pages 5786--5790. [ .pdf ]
[1499] Srikanth Madikeri, Petr Motlicek, Marc Ferras, and Subhadeep Dey. Analysis of posterior estimation approaches to i-vector extraction for speaker recognition. Idiap-RR Idiap-RR-15-2018, Idiap, 10 2018. [ .pdf ]
[1500] Srikanth Madikeri, Seyyed Saeed Sarfjoo, Petr Motlicek, and Sébastien Marcel. Idiap submission to the nist sre 2018 speaker recognition evaluation. Idiap-RR Idiap-RR-17-2019, Idiap, 11 2019. [ .pdf ]
[1501] Srikanth Madikeri, David Imseng, and Hervé Bourlard. Improving real time factor of information bottleneck-based speaker diarization system. Idiap-RR Idiap-RR-18-2015, Idiap, 6 2015. [ .pdf ]
[1502] Srikanth Madikeri, Subhadeep Dey, Petr Motlicek, and Marc Ferras. Implementation of the standard i-vector system for the kaldi speech recognition toolkit. Idiap-RR Idiap-RR-26-2016, Idiap, 10 2016. [ .pdf ]
[1503] Srikanth Madikeri, Subhadeep Dey, Marc Ferras, Petr Motlicek, and Ivan Himawan. Idiap submission to the nist sre 2016 speaker recognition evaluation. Idiap-RR Idiap-RR-32-2016, Idiap, 12 2016. [ .pdf ]
[1504] Srikanth Madikeri, Asha T, and Hema A Murthy. Modified group delay feature based total variability space modelling for speaker recognition. International Journal of Speech Techonology, 18(1):17--23, July 2014. [ DOI ]
[1505] Srikanth Madikeri, Ivan Himawan, Petr Motlicek, and Marc Ferras. Integrating online i-vector extractor with information bottleneck based speaker diarization system. In Proceedings of Interspeech 2015 [3247], pages 3105--3109. [ .pdf ]
[1506] Srikanth Madikeri, Subhadeep Dey, and Petr Motlicek. Analysis of language dependent front-end for speaker recognition. In Proceedings of Interspeech 2018, volume 1-6, pages 1101--1105, 2018. [ DOI ]
[1507] Srikanth Madikeri, Banriskhem Khonglah, Sibo Tong, Petr Motlicek, Hervé Bourlard, and Daniel Povey. Lattice-free maximum mutual information training of multilingual speech recognition system. In In Proceedings of Interspeech 2020 [3248], pages 4746--4750. [ .pdf ]
[1508] Srikanth Madikeri, Petr Motlicek, and Hervé Bourlard. Multitask adaptation with lattice-free mmi for multi-genre speech recognition of low resource languages. In Proceedings of Interspeech 2021, 2021. [ .pdf ]
[1509] Erica Madonna, David Ginsbourger, and Olivia Martius. A poisson regression approach to model monthly hail occurrence in northern switzerland using large-scale environmental variables. Atmospheric Research, 203:261--274, May 2018. [ DOI ]
[1510] Mathew Magimai.-Doss. Speech processing. In Hervé Bourlard and Andrei Popescu-Belis, editors, Interactive Multimodal Information Management, chapter 15, pages 221--245. EPFL Press, 2013.
[1511] Mathew Magimai.-Doss, Guillermo Aradilla, and Hervé Bourlard. On joint modelling of grapheme and phoneme information using kl-hmm for asr. Idiap-RR Idiap-RR-24-2009, Idiap, 9 2009. [ .pdf ]
[1512] Mathew Magimai.-Doss, Ramya Rasipuram, Guillermo Aradilla, and Hervé Bourlard. Grapheme-based automatic speech recognition using kl-hmm. In Proceedings of Interspeech, August 2011. [ .pdf ]
[1513] Mathew Magimai.-Doss and Ramya Rasipuram. On learning grapheme-to-phoneme relationships through the acoustic speech signal. The Phonetician, 109–110:6--23, 2014. [ .pdf ]
[1514] Mathew Magimai.-Doss and Hervé Bourlard. Pronunciation models and their evaluation using confidence measures. Idiap-RR Idiap-RR-29-2001, IDIAP, 2001. [ .ps.gz | .pdf ]
[1515] Mathew Magimai.-Doss, Todd Andrew Stephenson, and Hervé Bourlard. Modelling auxiliary information (pitch frequency) in hybrid HMM/ANN based ASR systems. Idiap-RR Idiap-RR-62-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[1516] Mathew Magimai.-Doss, Todd Andrew Stephenson, Hervé Bourlard, and Samy Bengio. Phoneme-grapheme based speech recognition system. In Proceedings of IEEE ASRU [3249]. IDIAP-RR 03-37. [ .ps.gz | .pdf ]
[1517] Mathew Magimai.-Doss, Todd Andrew Stephenson, and Hervé Bourlard. Using pitch frequency information in speech recognition. In Proceedings of Eurospeech [3250]. IDIAP-RR 03-23. [ .ps.gz | .pdf ]
[1518] Mathew Magimai.-Doss and Hervé Bourlard. On the adequacy of baseform pronunciations and pronunciation variants. Idiap-RR Idiap-RR-27-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[1519] Mathew Magimai.-Doss, John Dines, Hervé Bourlard, and Hynek Hermansky. Phoneme vs grapheme based automatic speech recognition. Idiap-RR Idiap-RR-48-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[1520] Mathew Magimai.-Doss, Samy Bengio, and Hervé Bourlard. Joint decoding for phoneme-grapheme continuous speech recognition. In Proceedings of ICASSP [3251]. IDIAP-RR 03-52. [ .ps.gz | .pdf ]
[1521] Mathew Magimai.-Doss, Todd Andrew Stephenson, Shajith Ikbal, and Hervé Bourlard. Modelling auxiliary features in tandem systems. In Proceedings of ICSLP [3252]. IDIAP-RR 04-21. [ .ps.gz | .pdf ]
[1522] Mathew Magimai.-Doss, John Dines, Hervé Bourlard, and Hynek Hermansky. Improving continuous speech recognition system performance with grapheme modelling. Idiap-RR Idiap-RR-16-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[1523] Parvaz Mahdabi and Andrei Popescu-Belis. Explicit suggestion of query terms for news search using topic models and word embeddings. Idiap-RR Idiap-RR-21-2016, Idiap, 8 2016. [ .pdf ]
[1524] Parvaz Mahdabi and Andrei Popescu-Belis. Comparing two strategies for query expansion in a news monitoring system. In Natural Language Processing and Information Systems: 21st International Conference on Applications of Natural Language to Information Systems, NLDB 2016, volume 9612 of Lecture Notes in Computer Science, pages 267--275. Springer-Verlag, 2016. [ DOI ]
[1525] Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanianzadeh, and Hamid Sheikhzadeh. Single channel speech separation with a frame-based pitch range estimation method in modulation frequency. In Proceedings of 5th International Symposium on Telecommunications, 12 2010. [ .pdf ]
[1526] Azar Mahmoodzadeh, Hamid Reza Abutalebi, Hamid Soltanianzadeh, and Hamid Sheikhzadeh. Determination of pitch range based on onset and offset analysis in modulation frequency domain. In Proceedings of 5th International Symposium on Telecommunications, 12 2010. [ .pdf ]
[1527] Gilbert Maître. Experiments with robust similarity measures for OCR. Idiap-RR Idiap-RR-03-1995, IDIAP, 6 1995.
[1528] Gilbert Maître, Stéphane Brunet, and Gianni Pante. Traitement préliminaire de l'image d'un texte manuscrit en vue de sa reconnaissance: une méthode de sur-segmentation. In 4ème Colloque National sur l'Écrit et le Document (CNED'96), 1996.
[1529] Florian Mai, Lukas Galke, and Ansgar Scherp. Cbow is not all you need: Combining cbow with the compositional matrix space model. In International Conference on Learning Representations [3254]. To appear at ICLR 2019. [ http ]
[1530] Florian Mai and James Henderson. Bag-of-vectors autoencoders for unsupervised conditional text generation. Idiap-RR Idiap-RR-21-2021, Idiap, 12 2021. under review at ICLR 2022.
[1531] Maja Popović. Using posterior probabilities for speech/music discrimination. Idiap-RR Idiap-RR-08-2001, IDIAP, Martigny, Switzerland, 2001. [ .ps.gz | .pdf ]
[1532] Andrii Maksai, Xinchao Wang, Francois Fleuret, and Pascal Fua. Non-markovian globally consistent multi-object tracking. In Proceedings of the IEEE International Conference on Computer Vision, 2017.
[1533] M. S. Malekzadeh, Sylvain Calinon, D. Bruno, and D. G. Caldwell. A skill transfer approach for continuum robots - imitation of octopus reaching motion with the stiff-flop robot. In In Proc. of the AAAI Symp. on Knowledge, Skill, and Behavior Transfer in Autonomous Robots, pages 49--52, November 2014. [ http | .pdf ]
[1534] M. S. Malekzadeh, Sylvain Calinon, D. Bruno, and D. G. Caldwell. Learning by imitation with the stiff-flop surgical robot: A biomimetic approach inspired by octopus movements. Robotics and Biomimetics, 1(13):1--15, October 2014. Special Issue on Medical Robotics. [ http | .pdf ]
[1535] Eric Malmi, Trinh-Minh-Tri Do, and Daniel Gatica-Perez. From foursquare to my square: Learning check-in behavior from multiple sources. In The 7th International AAAI Conference on Weblogs and Social Media, July 2013. [ .pdf ]
[1536] Eric Malmi, Trinh-Minh-Tri Do, and Daniel Gatica-Perez. Checking in or checked in: Comparing large-scale manual and automatic location disclosure patterns. In Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia, December 2012.
[1537] Edoardo Manino, Julia Rozanova, Danilo Carvalho, Andre Freitas, and Lucas Cordeiro. Systematicity, compositionality and transitivity of deep nlp models: a metamorphic testing perspective. In Findings of the ACL, 2022.
[1538] Sébastien Marcel. Approches génératives pour le traitement de séquences d'images: application à la reconnaissance dynamique des gestes de la main. Idiap-RR Idiap-RR-45-2000, IDIAP, 2000. Submitted: VALGO 2001, France, 2001. [ .ps.gz | .pdf ]
[1539] Sébastien Marcel. Gestures for multi-modal interfaces: A review. Idiap-RR Idiap-RR-34-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[1540] Sébastien Marcel. Robust face verification using skin color and neural networks. Idiap-RR Idiap-RR-49-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[1541] Sébastien Marcel. Evaluation protocols and comparative results for the Triesch hand posture database. Idiap-RR Idiap-RR-50-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[1542] Sébastien Marcel, Christine Marcel, and Samy Bengio. A state-of-the-art neural network for robust face verification. In Proceedings of the COST275 Workshop on The Advent of Biometrics on the Internet [3256]. Published in the Proceedings of the COST275 Workshop on The Advent of Biometrics on the Internet, Rome, Italy, 7-8 November, 2002. [ .ps.gz | .pdf ]
[1543] Sébastien Marcel and Samy Bengio. Improving face verification using skin color information. In Proceedings of the 16th International Conference on Pattern Recognition [3257]. Published in the Proceedings of the International Conference on Pattern Recognition, Quebec City, Canada, 2002. [ .ps.gz | .pdf ]
[1544] Sébastien Marcel. A symmetric transformation for lda-based face verification. In Proceedings of the 6th International Conference on Automatic Face and Gesture Recognition [3258]. [ .ps.gz | .pdf ]
[1545] Julien Tiphaigne and Sébastien Marcel. A video package for torch. Idiap-Com Idiap-Com-02-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[1546] Sébastien Marcel, P. Jost, P. Vandergheynst, and Jean-Philippe Thiran. Face authentication using client-specific matching pursuit. Idiap-RR Idiap-RR-78-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[1547] Sébastien Marcel. Face verification using lda and mlp on the banca database. Idiap-RR Idiap-RR-66-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[1548] Sébastien Marcel, Yann Rodriguez, Maël Guillemot, and Andrei Popescu-Belis. Annotation of face detection: description of xml format and files. Idiap-Com Idiap-Com-06-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1549] Sébastien Marcel and José del R. Millán. Person authentication using brainwaves (eeg) and maximum a posteriori model adaptation. In IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Special Issue on Biometrics [3259]. IDIAP-RR 05-81. [ .ps.gz | .pdf ]
[1550] Sébastien Marcel, Yann Rodriguez, and Guillaume Heusch. On the recent use of local binary patterns for face authentication. In International Journal on Image and Video Processing Special Issue on Facial Image Processing [3260]. IDIAP-RR 06-34, accepted for publication but withdrawn because of author charges. [ .ps.gz | .pdf ]
[1551] Sébastien Marcel. Improving face verification using symmetric transformation. Idiap-RR Idiap-RR-68-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[1552] Sébastien Marcel, Johnny Mariéthoz, Yann Rodriguez, and Fabien Cardinaux. Bi-modal face and speech authentication: a biologin demonstration system. In Workshop on Multimodal User Authentication (MMUA) [3261]. IDIAP-RR 06-18. [ .ps.gz | .pdf ]
[1553] Sébastien Marcel, Jean Keomany, and Yann Rodriguez. Robust-to-illumination face localisation using active shape models and local binary patterns. Idiap-RR Idiap-RR-47-2006, IDIAP, 2006. Submitted for publication. [ .ps.gz | .pdf ]
[1554] Sébastien Marcel, Philip Abbet, and Maël Guillemot. Google portrait. Idiap-Com Idiap-Com-07-2007, IDIAP, 2007. [ .pdf ]
[1555] Sébastien Marcel. Joint bi-modal face and speaker authentication using explicit polynomial expansion. Idiap-RR Idiap-RR-14-2007, IDIAP, 2007. Submitted for publication. [ .ps.gz | .pdf ]
[1556] Sébastien Marcel, Chris McCool, Pavel Matejka, Timo Ahonen, and Jan Cernocky. Mobile biometry (mobio) face and speaker verification evaluation. Idiap-RR Idiap-RR-09-2010, Idiap, rue Marconi 19, 5 2010. [ .pdf ]
[1557] Sébastien Marcel, Chris McCool, Pavel Matejka, Timo Ahonen, Jan Cernocky, and al. On the results of the first mobile biometry (mobio) face and speaker verification evaluation. Idiap-RR Idiap-RR-30-2010, Idiap, 8 2010. [ .pdf ]
[1558] Sébastien Marcel, Chris McCool, Cosmin Atanasoaei, Flavio Tarsetti, Jan Pesan, Pavel Matejka, Jan Cernocky, Mika Helistekangas, and Markus Turtinen. Mobio: Mobile biometric face and speaker authentication. Idiap-RR Idiap-RR-31-2010, Idiap, rue Marconi 19, 8 2010. [ .pdf ]
[1559] Alvaro Marcos-Ramiro, Daniel Pizarro-Perez, Marta Marron-Romera, Laurent Son Nguyen, and Daniel Gatica-Perez. Body communicative cue extraction for conversational analysis. In Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, April 2013. [ .pdf ]
[1560] Alvaro Marcos-Ramiro, Daniel Pizarro-Perez, Marta Marron-Romera, and Daniel Gatica-Perez. Automatic blinking detection towards stress discovery. In Proc. ACM Int. Conf. on Multimodal Interaction, pages 307--310. ACM New York, November 2014. [ DOI | .pdf ]
[1561] Alvaro Marcos-Ramiro, Daniel Pizarro-Perez, Marta Marron-Romera, and Daniel Gatica-Perez. Capturing upper body motion in conversation: an appearance quasi-invariant approach. In Proc. ACM Int. Conf. on Multimodal Interaction, pages 327--334. ACM New York, November 2014. [ DOI | .pdf ]
[1562] François Marelli, Bastian Schnell, Hervé Bourlard, T. Dutoit, and Philip N. Garner. An end-to-end network to synthesize intonation using a generalized command response model. In ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) [3262], pages 7040--7044. [ DOI | http | .pdf ]
[1563] François Marelli. Implémentation d'un algorithme de réduction de taille des réseaux de neurones. Idiap-RR Idiap-RR-03-2018, Idiap, 3 2018. [ .pdf ]
[1564] François Marelli. Designing second order recurrent neural networks for prosody modelling. Idiap-RR Idiap-RR-16-2018, Idiap, 11 2018. [ .pdf ]
[1565] François Marelli and Michael Liebling. Optics versus computation: Influence of illumination and reconstruction model accuracy in focal-plane-scanning optical projection tomography. In 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), pages 567--570. IEEE, April 2021. [ DOI | .pdf ]
[1566] Andreas Marfurt and James Henderson. Sentence-level planning for especially abstractive summarization. In Proceedings of the Third Workshop on New Frontiers in Summarization, pages 1--14, Online and in Dominican Republic, November 2021. Association for Computational Linguistics. [ http ]
[1567] Johnny Mariéthoz and Frédéric Bimbot. Adaptation robuste de modèles hmm pour la vérification du locuteur dépendante du texte. In Journee d'Etudes sur la Parole, Aussois [3263]. IDIAP-RR 00-08. [ .ps.gz | .pdf ]
[1568] Olivia Mariani, Alexander Ernst, Nadia Mercader, and Michael Liebling. Reconstruction of image sequences from ungated and scanning-aberrated laser scanning microscopy images of the beating heart. In IEEE Transactions on Computational Imaging [3264], pages 385--395. [ DOI | http | .pdf ]
[1569] Olivia Mariani, François Marelli, Christian Jaques, Alexander Ernst, and Michael Liebling. Unequivocal cardiac phase sorting from alternating ramp- and pulse-illuminated microscopy image sequences. In 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), pages 868--872, April 2021. [ DOI | http | .pdf ]
[1570] Olivia Mariani, Kevin G. Chan, Alexander Ernst, Nadia Mercader, and Michael Liebling. Virtual high-framerate microscopy of the beating heart via sorting of still images. In 2019 IEEE 16th International Symposium on Biomedical Imaging [3265], pages 312--315.
[1571] Olivia Mariani. Computational methods for live heart imaging with speed-constrained microscopes. PhD thesis, EPFL, 2021. [ .pdf ]
[1572] Johnny Mariéthoz, Johan Lindberg, and Frédéric Bimbot. A map approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification. In ICSLP [3266]. IDIAP-RR 00-48. [ .ps.gz | .pdf ]
[1573] F. Porée, Johnny Mariéthoz, Samy Bengio, and Frédéric Bimbot. The BANCA database and experimental protocol for speaker verification. Idiap-RR Idiap-RR-13-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[1574] Johnny Mariéthoz and Samy Bengio. A comparative study of adaptation methods for speaker verification. In International Conference on Spoken Language Processing ICSLP [3267]. IDIAP-RR 01-34. [ .ps.gz | .pdf ]
[1575] Johnny Mariéthoz and Samy Bengio. An alternative to silence removal for text-independent speaker verification. Idiap-RR Idiap-RR-51-2003, IDIAP, 2003. submitted ICASSP 2004. [ .ps.gz | .pdf ]
[1576] Johnny Mariéthoz and Samy Bengio. A new speech recognition baseline system for numbers 95 version 1.3 based on torch. Idiap-RR Idiap-RR-16-2004, IDIAP, 2004. [ .ps.gz | .pdf ]
[1577] Johnny Mariéthoz and Samy Bengio. A unified framework for score normalization techniques applied to text independent speaker verification. In IEEE Signal Processing Letters, Volume 12 [3268]. IDIAP-RR 04-62. [ .ps.gz | .pdf ]
[1578] Johnny Mariéthoz and Samy Bengio. A max kernel for text-independent speaker verification systems. In Second Workshop on Multimodal User Authentication, MMUA [3269]. IDIAP-RR 05-77. [ .ps.gz | .pdf ]
[1579] Johnny Mariéthoz and Samy Bengio. A kernel trick for sequences applied to text-independent speaker verification systems. In Pattern Recognition [3269]. IDIAP-RR 05-77. [ .ps.gz | .pdf ]
[1580] Johnny Mariéthoz and Samy Bengio. Can a professional imitator fool a gmm-based speaker verification system? Idiap-RR Idiap-RR-61-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[1581] Johnny Mariéthoz. Discrmininant models for text-independent speaker verification. Idiap-RR Idiap-RR-70-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1582] Johnny Mariéthoz, Samy Bengio, and Yves Grandvalet. Kernel based text-independnent speaker verification. Idiap-RR Idiap-RR-68-2008, Idiap, 9 2008. [ .pdf ]
[1583] Marios Athineos, Hynek Hermansky, and Daniel P. W. Ellis. Lp-trap: Linear predictive temporal patterns. [3270]. IDIAP RR 04-59. [ .ps.gz | .pdf ]
[1584] Marios Athineos, Hynek Hermansky, and Daniel P. W. Ellis. Plp2: Autoregressive modeling of auditory-like 2-d spectro-temporal patterns. [3271]. IDIAP RR 04-60. [ .ps.gz | .pdf ]
[1585] Patrick Marmaroli, Jean-Marc Odobez, Xavier Falourd, and Hervé Lissek. A bimodal sound source model for vehicle tracking in traffic monitoring. In European Signal Processing Conference, August 2011. [ .pdf ]
[1586] Patrick Marmaroli, M. Carmona, Xavier Falourd, Hervé Lissek, and Jean-Marc Odobez. Observation of vehicle axles through pass-by noise: A strategy of microphone array design. IEEE Trans. on Intelligent Transportation Systems, March 2013. [ .pdf ]
[1587] Sébastien Marmin, Jean Baccou, Frédéric Perales, David Ginsbourger, and Jacques Liandrat. Planification adaptative d'expériences numériques par paquets en contexte non stationnaire pour une étude de fissuration mécanique. In 23ème Congrès Français de Mécanique, 28 août - 1er septembre 2017, Lille, France (FR), AFM, 2017. [ http ]
[1588] Sébastien Marmin, David Ginsbourger, Jean Baccou, and Jacques Liandrat. Warped gaussian processes and derivative-based sequential design for functions with heterogeneous variations. SIAM/ASA Journal on Uncertainty Quantification, 6(3):991--1018, 2018.
[1589] Sébastien Marmin, Clément Chevalier, and David Ginsbourger. Differentiating the multipoint expected improvement for optimal batch design. In Panos Pardalos, Mario Pavone, Giovanni Maria Farinella, and Vincenzo Cutello, editors, Machine Learning, Optimization, and Big Data, volume 9432 of Lecture Notes in Computer Science, pages 37--48. Springer International Publishing, 2015. [ DOI ]
[1590] Sébastien Marmin, Jean Baccou, Jacques Liandrat, and David Ginsbourger. Non-parametric warping via local scale estimation for non-stationary gaussian process modelling. In Wavelets and Sparsity XVII, volume 10394 of Proc. SPIE, page 1039421. International Society for Optics and Photonics, 2017. [ DOI | http ]
[1591] Guy Marshall, Caroline Jay, and Andre Freitas. Structuralist analysis for neural network system diagrams. In Diagrams, 2021.
[1592] Guy Marshall, Caroline Jay, and Andre Freitas. Number and quality of diagrams in scholarly publications is associated with number of citations. Diagrams, 2021.
[1593] Guy Marshall, Caroline Jay, and Andre Freitas. Scholarly ai system diagrams as an access point to mental models. In Diagrams, 2021.
[1594] Guy Marshall, Mokanarangan Thayaparan, Philip Osborne, and Andre Freitas. Switching contexts: Transportability measures for nlp. In 14th International Conference on Computational Semantics, 2021. [ http ]
[1595] Jesus Martinez-Gomez, Ismael Garcia-Varea, Miguel Cazorla, and Barbara Caputo. Overview of the imageclef 2013 robot vision task. In Working Notes, CLEF 2013, 2013. [ .pdf ]
[1596] Jesus Martinez-Gomez and Barbara Caputo. Towards semi-supervised learning of semantic spatial concepts. In IEEE International Conference on Robotics and Automation, 2011. [ .pdf ]
[1597] Jesus Martinez-Gomez and Barbara Caputo. Towards semi-supervised learning of semantic spatial concepts. Idiap-RR Idiap-RR-03-2011, Idiap, 2 2011. [ .pdf ]
[1598] Jesus Martinez-Gomez, Ismael Garcia-Varea, and Barbara Caputo. Overview of the imageclef 2012 robot vision task. In Working Notes of the ImageCLEF 2012 Laboratory, 2012. [ .pdf ]
[1599] Jesus Martinez-Gomez, Ismael Garcia-Varea, and Barbara Caputo. Baseline multimodal place classifier for the 2012 robot vision task. In Working Notes of the ImageCLEF 2012 Laboratory, 2012. [ .pdf ]
[1600] Jesus Martinez-Gomez and Barbara Caputo. Towards semi-supervised learning of semantic spatial concepts for mobile robots. Journal of Physical Agents, 2011. [ .pdf ]
[1601] Angel Martínez-González, Michael Villamizar, Olivier Canévet, and Jean-Marc Odobez. Investigating depth domain adaptation for efficient human pose estimation. In European Conference on Computer Vision - Workshops, 2018. [ .pdf ]
[1602] Angel Martínez-González, Michael Villamizar, and Jean-Marc Odobez. Pose transformers (potr): Human motion prediction with non-autoregressive transformers. In International Conference in Computer Vision - Workshops, 2021. [ .pdf ]
[1603] Angel Martínez-González, Michael Villamizar, Olivier Canévet, and Jean-Marc Odobez. Residual pose: A decoupled approach for depth-based 3d human pose estimation. In IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020. [ .pdf ]
[1604] Angel Martínez-González, Michael Villamizar, Olivier Canévet, and Jean-Marc Odobez. Real-time convolutional networks for depth-based human pose estimation. In IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018. [ .pdf ]
[1605] Angel Martínez-González, Michael Villamizar, Olivier Canévet, and Jean-Marc Odobez. Efficient convolutional neural networks for depth-based multi-person pose estimation. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 30(11):4207--4221, November 2020. [ DOI | http | .pdf ]
[1606] Angel Martínez-González. Efficient Depth-based Deep Learning Methods for Multi-Party Pose Estimation. PhD thesis, École polytechnique fédérale de Lausanne, 2021. [ DOI | .pdf ]
[1607] Martin Thebault, Benjamin Govehovitch, Karine Bouty, Cyril Caliot, Raphaël Compagnon, Gilles Desthieux, Matteo Formolli, Stéphanie Giroux-Julien, Victor Guillot, Ellis Herman, Jérôme Kämpf, Jouri Kanters, Gabriele Lobaccaro, Christophe Ménézo, Giuseppe Peronato, and Arnkell Jonas Petersen. A comparative study of simulation tools to model the solar irradiation on building façades. In Proceedings of SWC 2021: ISES Solar World Congress. ISES, 2021. [ DOI | http | .pdf ]
[1608] C. Mastalli, M. Focchi, I. Havoutis, A. Radulescu, Sylvain Calinon, J. Buchli, D. G. Caldwell, and C. Semini. Trajectory and foothold optimization using low-dimensional models for rough terrain locomotion. In Proc. IEEE Intl Conf. on Robotics and Automation (ICRA), pages 1096--1103. IEEE, May 2017. [ http | .pdf ]
[1609] Lukas Matena, Alejandro Jaimes, and Andrei Popescu-Belis. Graphical representation of meetings on mobile devices. In MobileHCI 2008 (10th International Conference on Human-Computer Interaction with Mobile Devices and Services, Demonstrations Session), 2008. [ .pdf ]
[1610] Mathew Magimai.-Doss. Using Auxiliary Sources of Knowledge for Automatic Speech Recognition. Idiap-rr, École Polytechnique Fédérale de Lausanne, Computer Science Department, Lausanne, Switzerland, 2005. thesis #3263. [ .ps.gz | .pdf ]
[1611] Kyle Matoba and Francois Fleuret. Exact preimages of neural network aircraft collision avoidance systems. In Machine Learning for Engineering Modeling, Simulation, and Design Workshop at Neural Information Processing Systems 2020, November 2020. [ .pdf ]
[1612] Eddy Mayoraz and Ethem Alpaydin. Support vector machine for multiclass classification. Idiap-RR Idiap-RR-06-1998, IDIAP, 1998. Submitted for publication. [ .ps.gz | .pdf ]
[1613] Eddy Mayoraz and Frédéric Aviolat. Constructive training methods for feedforward neural networks with binary weights. International Journal of Neural Systems, 7(2), 5 1996. [ .ps.gz | .pdf ]
[1614] Eddy Mayoraz and Miguel Moreira. On the decomposition of polychotomies into dichotomies. In Proceedings of The Fourteenth International Conference on Machine Learning [3273]. IDIAP-RR 96-08. [ .ps.gz | .pdf ]
[1615] Eddy Mayoraz and Miguel Moreira. Combinatorial approach for data binarization. In Zytkow and Rauch [3274]. IDIAP-RR 99-08. [ .ps.gz | .pdf ]
[1616] Eddy Mayoraz. On the power of democratic networks. SIAM Journal of Discr. Math, 9(02), 5 1996. [ .ps.gz | .pdf ]
[1617] Eddy Mayoraz. Bounds on the degree of high order binary perceptrons. In François Blayo and Michel Verleysen, editors, Proceedings of ESANN'96. D facto, 1996. [ .ps.gz | .pdf ]
[1618] Eddy Mayoraz. On the complexity of the class of regions computable by a two-layered perceptron. Idiap-RR Idiap-RR-03-1996, IDIAP, 1996. [ .ps.gz | .pdf ]
[1619] Endre Boros, Peter L. Hammer, Toshihide Ibaraki, Alexander Kogan, Eddy Mayoraz, and Ilya Muchnik. An implementation of logical analysis of data. Idiap-RR Idiap-RR-05-1996, IDIAP, 1996. [ .ps.gz | .pdf ]
[1620] Eddy Mayoraz. On variations of the convex hull operator. Idiap-RR Idiap-RR-06-1996, IDIAP, 1996. [ .ps.gz | .pdf ]
[1621] Eddy Mayoraz. On the complexity of recognizing iterated differences of polyhedra. In Gerstner et al. [3275]. IDIAP-RR 97-10. [ .ps.gz | .pdf ]
[1622] Eddy Mayoraz. On the complexity of recognizing regions computable by two-layered perceptrons. In Annals Mathematics and Artificial Intelligence [3276]. [ .ps.gz | .pdf ]
[1623] Chris McCool and Sébastien Marcel. Parts-based face verification using local frequency bands. In in Proceedings of IEEE/IAPR International Conference on Biometrics [3277]. Submitted to ICB 2009. [ .pdf ]
[1624] Chris McCool and Sébastien Marcel. Mobio database for the icpr 2010 face and speech competition. Idiap-Com Idiap-Com-02-2009, Idiap, 11 2009. [ .pdf ]
[1625] Chris McCool and Laurent El Shafey. Notes on probabilistic linear discriminant analysis. Idiap-Com Idiap-Com-03-2013, Idiap, 6 2013. [ .pdf ]
[1626] Chris McCool and Sébastien Marcel. Parts-based face verification using local frequency bands. Idiap-RR Idiap-RR-06-2011, Idiap, 3 2011. [ .pdf ]
[1627] Chris McCool, Sébastien Marcel, Abdenour Hadid, Matti Pietikainen, Pavel Matejka, Jan Cernocky, Norman Poh, J. Kittler, Anthony Larcher, Christophe Levy, Driss Matrouf, Jean-François Bonastre, Phil Tresadern, and Timothy Cootes. Bi-modal person recognition on a mobile phone: using mobile phone data. In IEEE ICME Workshop on Hot Topics in Mobile Multimedia [3278]. [ .pdf ]
[1628] Chris McCool, Roy Wallace, Mitchell McLaren, Laurent El Shafey, and Sébastien Marcel. Session variability modelling for face authentication. In IET Biometrics [3279], pages 117--129. [ DOI | .pdf ]
[1629] Chris McCool, Jordi Sanchez-Riera, and Sébastien Marcel. Feature distribution modelling techniques for 3d face recognition. Pattern Recognition Letters, 31:1324--1330, 2010. [ .pdf ]
[1630] Iain A. McCowan and Darren Moore. Small microphone array: Algorithms and hardware. Idiap-Com Idiap-Com-07-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[1631] Iain A. McCowan, Daniel Gatica-Perez, and Samy Bengio. Meeting data collection specifications. Idiap-Com Idiap-Com-10-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[1632] Iain A. McCowan and Hervé Bourlard. Microphone array post-filter for diffuse noise field. In Proceedings of International Conference on Acoustics, Speech and Signal Processing [3280]. IDIAP-RR 01-39. [ .ps.gz | .pdf ]
[1633] Iain A. McCowan and Hervé Bourlard. Microphone array post-filter based on noise field coherence. In IEEE Transactions on Speech and Audio Processing [3281]. IDIAP-RR 01-40. [ .ps.gz | .pdf ]
[1634] Iain A. McCowan, Andrew Morris, and Hervé Bourlard. Robust speech recognition with small microphone arrays using the missing data approach. In Proceedings of International Conference on Speech and Language Processing (ICSLP) [3282]. IDIAP-RR 02-09. [ .ps.gz | .pdf ]
[1635] Iain A. McCowan, Samy Bengio, Daniel Gatica-Perez, Guillaume Lathoud, Florent Monay, Darren Moore, Pierre Wellner, and Hervé Bourlard. Modeling human interaction in meetings. In Proceedings of International Conference on Acoustics, Speech and Signal Processing [3283]. IDIAP-RR 02-59. [ .ps.gz | .pdf ]
[1636] Iain A. McCowan, Daniel Gatica-Perez, Samy Bengio, Guillaume Lathoud, Mark Barnard, and Dong Zhang. Automatic analysis of multimodal group actions in meetings. In IEEE Transactions on Pattern Analysis and Machine Intelligence (to appear) [3284]. To appear. [ .ps.gz | .pdf ]
[1637] Iain A. McCowan, Daniel Gatica-Perez, Samy Bengio, and Hervé Bourlard. Towards computer understanding of human interactions. Idiap-RR Idiap-RR-45-2003, IDIAP, Martigny, Switzerland, 2003. [ .ps.gz | .pdf ]
[1638] Iain A. McCowan, Darren Moore, John Dines, Daniel Gatica-Perez, Mike Flynn, Pierre Wellner, and Hervé Bourlard. On the use of information retrieval measures for speech recognition evaluation. Idiap-RR Idiap-RR-73-2004, IDIAP, Martigny, Switzerland, 2004. [ .ps.gz | .pdf ]
[1639] Michael McCoy, Volkan Cevher, Quoc Tran Dinh, Afsaneh Asaei, and Luca Baldassarre. Convexity in source separation: Models, geometry, and algorithms. IEEE Signal Processing Magazine, Special Issue on Source Separation and Applications, 2013. [ .pdf ]
[1640] Michael McGreevy. Pseudo-syntactic language modeling for disfluent speech recognition. In Proceedings of SST 2004 (10th Australian International Conference on Speech Science & Technology,',','), Sydney, Australia, 2004 [3285]. IDIAP-RR 04-55. [ .ps.gz | .pdf ]
[1641] Jordan Meadows and Andre Freitas. Similarity-based equational inference in physics. Physics Review Research, 2021.
[1642] Luis Emmanuel Medina Rios, Salvador Ruiz-Correa, Darshan Santani, and Daniel Gatica-Perez. Who sees what? examining urban impressions in global south cities. In Human Perception of Visual Information: Psychological and Computational Perspectives. Springer, 2022. [ .pdf ]
[1643] Lakmal Buddika Meegahapola and Daniel Gatica-Perez. Smartphone sensing for the well-being of young adults: A review. IEEE Access, December 2021. [ DOI | http | .pdf ]
[1644] Lakmal Buddika Meegahapola, Wageesha Bangamuarachchi, Anju Chamantha, Salvador Ruiz-Correa, Indika Perera, and Daniel Gatica-Perez. Sensing eating events in context: A smartphone-only approach. IEEE Access, 10, May 2022. [ DOI | http | .pdf ]
[1645] Lakmal Buddika Meegahapola, Florian Labhart, Thanh-Trung Phan, and Daniel Gatica-Perez. Examining the social context of alcohol drinking in young adults with smartphone sensing. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 5(3):26, September 2021. [ DOI | .pdf ]
[1646] Lakmal Buddika Meegahapola, Salvador Ruiz-Correa, Viridiana del Carmen Robledo-Valero, Emilio Ernesto Hernandez-Huerfano, Leonardo Alvarez-Rivera, Ronald Chenu-Abente, and Daniel Gatica-Perez. One more bite? inferring food consumption level of college students using smartphone sensing and self-reports. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 5(1), March 2021. [ .pdf ]
[1647] Lakmal Buddika Meegahapola, Salvador Ruiz-Correa, and Daniel Gatica-Perez. Alone or with others? understanding eating episodes of college students with mobile sensing. In 19th International Conference on Mobile and Ubiquitous Multimedia, MUM 2020, page 162–166, New York, NY, USA, November 2020. ACM, Association for Computing Machinery. [ DOI | http | .pdf ]
[1648] Lakmal Buddika Meegahapola, Salvador Ruiz-Correa, and Daniel Gatica-Perez. Protecting mobile food diaries from getting too personal. In 19th International Conference on Mobile and Ubiquitous Multimedia, MUM 2020, page 212–222, New York, NY, USA, November 2020. Association for Computing Machinery. [ DOI | http | .pdf ]
[1649] Deborah Mendes, Julia Rozanova, Mokanarangan Thayaparan, Marco Valentino, and Andre Freitas. Does my representation capture x? probe-ably. In 59th Annual Meeting of the Association for Computational Linguistics (Demonstration track), Demonstration paper, 2021. [ http ]
[1650] Deborah Mendes, Mokanarangan Thayaparan, Marco Valentino, Julia Rozanova, and Andre Freitas. To be or not to be an integer? encoding variables for mathematical text. In Findings of the ACL, 2022.
[1651] Deborah Mendes and Andre Freitas. Star: Cross-modal statement representation for selecting relevant mathematical premises. In 16th conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021.
[1652] Giangiacomo Mercatali and Andre Freitas. Disentangling generative factors in natural language with discrete variational autoencoders. In The 2021 Conference on Empirical Methods in Natural Language Processing, 2021.
[1653] Bertrand Mesot and David Barber. Switching linear dynamical systems for noise robust speech recognition. Idiap-RR Idiap-RR-08-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1654] Bertrand Mesot and David Barber. A bayesian alternative to gain adaptation in autoregressive hidden markov models. Idiap-RR Idiap-RR-55-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1655] Bertrand Mesot and David Barber. A bayesian switching linear dynamical system for scale-invariant robust speech extraction. Idiap-RR Idiap-RR-52-2007, IDIAP, 2007. [ .pdf ]
[1656] Bertrand Mesot. Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits. Idiap-rr, Ecole Polytechnique Fédérale de Lausanne, 2008. Thèse Ecole polytechnique fédérale de Lausanne EPFL, no 4059 (2008,',','), Faculté des sciences et techniques de l'ingénieur STI, Section de génie électrique et électronique, Institut de génie électrique et électronique IEL (Laboratoire de l'IDIAP LIDIAP). Dir.: Hervé Bourlard. [ .pdf ]
[1657] K. Messer, J. Matas, J. Kittler, Juergen Luettin, and Gilbert Maître. XM2VTSDB: The extended M2VTS database. In Proc. Second International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA'99), 1999.
[1658] Rakesh Metha, Manuel Günther, and Sébastien Marcel. Gender classification by lut based boosting of overlapping block patterns. In Scandinavian Conference on Image Analysis, volume 9127, pages 530--542. Springer International Publishing, 2015. [ DOI | http | .pdf ]
[1659] Thomas Meyer. Disambiguating temporal-contrastive discourse connectives for machine translation. In Proceedings of ACL-HLT 2011 Student Session, pages 46--51. Association for Computational Linguistics, June 2011. [ .pdf ]
[1660] Thomas Meyer, Andrei Popescu-Belis, Najeh Hajlaoui, and Andrea Gesmundo. Machine translation of labeled discourse connectives. In Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), page 10, October 2012. [ .pdf ]
[1661] Thomas Meyer, Charlotte Roze, Bruno Cartoni, Laurence Danlos, Sandrine Zufferey, and Andrei Popescu-Belis. Disambiguating discourse connectives using parallel corpora: senses vs. translations. In Proceedings of Corpus Linguistics Conference, pages 104--105, July 2011. [ .pdf ]
[1662] Thomas Meyer and Lucie Polakova. Machine translation with many manually labeled discourse connectives. In Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), pages 43--50, June 2013. [ .pdf ]
[1663] Thomas Meyer, Cristina Grisot, and Andrei Popescu-Belis. Detecting narrativity to improve english to french translation of simple past verbs. In Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), pages 33--42, June 2013. [ .pdf ]
[1664] Thomas Meyer and Bonnie Webber. Implicitation of discourse connectives in (machine) translation. In Proceedings of the 1st DiscoMT Workshop at ACL 2013 (51st Annual Meeting of the Association for Computational Linguistics), pages 19--26, June 2013. [ .pdf ]
[1665] Thomas Meyer and Andrei Popescu-Belis. Using sense-labeled discourse connectives for statistical machine translation. In Proceedings of the EACL2012 Workshop on Hybrid Approaches to Machine Translation (HyTra), pages 129--138, April 2012. [ .pdf ]
[1666] Thomas Meyer, Andrei Popescu-Belis, Jeevanthi Liyanapathirana, and Bruno Cartoni. A corpus-based contrastive analysis for defining minimal semantics of inter-sentential dependencies for machine translation. In Proceedings of the GSCL2011 Workshop on "Contrastive Analysis - Translation Studies - Machine Translation: What can we learn from each other?", page 5, September 2011. [ .pdf ]
[1667] Thomas Meyer. Translation error spotting from a user's point of view. Idiap-RR Idiap-RR-31-2012, Idiap, 11 2012. EPFL course project paper. [ .pdf ]
[1668] Thomas Meyer, Najeh Hajlaoui, and Andrei Popescu-Belis. Disambiguating discourse connectives for statistical machine translation. IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(7):1184--1197, July 2015. [ DOI | .pdf ]
[1669] Thomas Meyer, Andrei Popescu-Belis, Sandrine Zufferey, and Bruno Cartoni. Multilingual annotation and disambiguation of discourse connectives for machine translation. In Proceedings of 12th SIGdial Meeting on Discourse and Dialogue, pages 194--203. Association for Computational Linguistics, June 2011. [ .pdf ]
[1670] Thomas Meyer. Discourse-level Features for Statistical Machine Translation. PhD thesis, École Polytechnique Fédérale de Lausanne (EPFL), December 2014. [ .pdf ]
[1671] Lesly Miculicich and Andrei Popescu-Belis. Validation of an automatic metric for the accuracy of pronoun translation (apt). In Proceedings of the Third Workshop on Discourse in Machine Translation (DiscoMT) [3287]. [ .pdf ]
[1672] Lesly Miculicich. Towards document-level neural machine translation. Idiap-RR Idiap-RR-25-2017, Idiap, 9 2017. [ .pdf ]
[1673] Lesly Miculicich, Nikolaos Pappas, Dhananjay Ram, and Andrei Popescu-Belis. Self-attentive residual decoder for neural machine translation. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018. [ .pdf ]
[1674] Lesly Miculicich, Dhananjay Ram, Nikolaos Pappas, and James Henderson. Document-level neural machine translation with hierarchical attention networks. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018. [ .pdf ]
[1675] Lesly Miculicich and James Henderson. Partially-supervised mention detection. In Proceedings of the Third Workshop on Computational Models of Reference, Anaphora and Coreference, 2020. [ .pdf ]
[1676] Lesly Miculicich. Discourse Phenomena in Machine Translation. PhD thesis, École polytechnique fédérale de Lausanne, 2020. [ .pdf ]
[1677] Lesly Miculicich, Marc Marone, and Hany Hassan. Selecting, planning, and rewriting: A modular approach for data-to-document generation and translation. In WNGT EMNLP, 2019. [ .pdf ]
[1678] J. Mouriño, Silvia Chiappa, R. Jané, and José del R. Millán. Evolution of the mental states operating a brain-computer interface. In Proceedings of the International Federation for Medical and Biological Engineering, Vienna, Austria, 12 2002. [ .pdf ]
[1679] José del R. Millán. Brain-computer interfaces. In Michael A. Arbib, editor, The Handbook of Brain Theory and Neural Networks: The Second Edition. The MIT Press, 2002. [ .pdf ]
[1680] José del R. Millán. Robot navigation. In Michael A. Arbib, editor, The Handbook of Brain Theory and Neural Networks: The Second Edition. The MIT Press, 2002. [ .pdf ]
[1681] José del R. Millán. Adaptive brain interfaces. Communications of the ACM, 46(3), 2003.
[1682] F. Cincotti, A. Scipione, A. Tiniperi, D. Mattia, M. G. Marciani, José del R. Millán, S. Salinari, L. Bianchi, and F. Babiloni. Comparison of different feature classifiers for brain computer interfaces. In Proceedings of the 1st International IEEE EMBS Conference on Neural Engineering, Capri, Italy, 3 2003.
[1683] R. Grave de Peralta Menendez, S. L. González Andino, José del R. Millán, T. Pun, and C. M. Michel. Direct non-invasive brain computer interfaces. In Proceedings of the 9th International Conference on Functional Mapping of the Human Brain, New York, USA, 6 2003.
[1684] José del R. Millán. Adaptive brain interfaces for communication and control. In Proceedings of the 10th International Conference on Human-Computer Interaction, Crete, Greece, 6 2003. Invited paper. [ .pdf ]
[1685] José del R. Millán and J. Mouriño. Asynchronous BCI and local neural classifiers: An overview of the adaptive Brain interface project. IEEE Trans. on Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interface Technology, 11(2), 2003. [ .pdf ]
[1686] José del R. Millán, F. Renkens, J. Mouriño, and W. Gerstner. Non-invasive brain-actuated control of a mobile robot. In Proceedings of the 18th International Joint Conference on Artificial Intelligence, Acapulco, Mexico, 8 2003. [ .pdf ]
[1687] José del R. Millán, F. Renkens, J. Mouriño, and W. Gerstner. Brain-actuated interaction. Artificial Intelligence, 159(1-2), 2004. [ .pdf ]
[1688] José del R. Millán. Restoring locomotion with a thought controlled mobile robot. In Proceedings of the 4th Forum of European Neuroscience, Lisbon, Portugal, 6 2004. Invited paper.
[1689] José del R. Millán, F. Renkens, J. Mouriño, and W. Gerstner. Non-invasive brain-actuated control of a mobile robot by human EEG. IEEE Trans. on Biomedical Engineering, Special Issue on Brain-Machine Interfaces, 51(6), 2004. [ .pdf ]
[1690] José del R. Millán. On the need for on-line learning in brain-computer interfaces. In Proceedings of the International Joint Conference on Neural Networks [3288]. IDIAP-RR 03-30. [ .pdf ]
[1691] R. Grave de Peralta Menendez, S. L. González Andino, L. Perez, Pierre W. Ferrez, and José del R. Millán. Non-invasive estimation of local field potentials for neuroprosthesis control. Cognitive Processing, Special Issue on Motor Planning in Humans and Neuroprosthesis Control, 6(1), 2005. [ .pdf ]
[1692] José del R. Millán. Interfaces cerebrales. Mente y Cerebro, 13(July), 2005. [ .pdf ]
[1693] L. Kauhanen, T. Palomäki, P. Jylänki, F. Aloise, Marnix Nuttin, and José del R. Millán. Haptic feedback compared with visual feedback for bci. In Proceedings of the 3rd International Brain-Computer Interface Workshop & Training Course 2006, Graz, Austria, 9 2006. [ .pdf ]
[1694] C. Menon, Christina de Negueruela, José del R. Millán, O. Tonet, F. Carpi, M. Broschart, Pierre W. Ferrez, Anna Buttfield, P. Dario, L. Citi, C. Laschi, M. Tombini, F. Sepulveda, R. Poli, R. Palaniappan, F. Tecchio, P. M. Rossini, and D. de Rossi. Prospects on brain-machine interfaces for space system control. In Proceedings of the 57th International Astronautical Conference, Valencia, Spain, 7 2006. [ .pdf ]
[1695] B. Blankertz, K. R. Müller, D. Krusienski, G. Schalk, J. R. Wolpaw, A. Schlögl, Gert Pfurtscheller, José del R. Millán, M. Schroeder, and N. Birbaumer. The BCI competition III: Validating alternative approaches to actual BCI problems. IEEE Trans. on Neural Systems and Rehabilitation Engineering, 14(2), 2006.
[1696] R. Grave de Peralta Menendez, S. L. González Andino, Pierre W. Ferrez, and José del R. Millán. Towards Brain-Computer Interfacing. The MIT Press, 2007.
[1697] Pierre W. Ferrez and José del R. Millán. Error-related eeg potentials in brain-computer interfaces. In G. Dornhege, José del R. Millán, T. Hinterberger, D. McFarland, and K. R. Müller, editors, Towards Brain-Computer Interfacing. The MIT Press, 2007.
[1698] José del R. Millán, Pierre W. Ferrez, and Anna Buttfield. The idiap brain-computer interface: An asynchronous multi-class approach. In G. Dornhege, José del R. Millán, T. Hinterberger, D. McFarland, and K. R. Müller, editors, Towards Brain-Computer Interfacing. The MIT Press, 2007.
[1699] R. Grave de Peralta Menendez, S. L. González Andino, Pierre W. Ferrez, and José del R. Millán. Non-invasive estimates of local field potentials for brain-computer interfaces. In G. Dornhege, José del R. Millán, T. Hinterberger, D. McFarland, and K. R. Müller, editors, Towards Brain-Computer Interfacing. The MIT Press, 2007.
[1700] José del R. Millán, Anna Buttfield, C. Vidaurre, M. Krauledat, A. Schlögl, P. Shenoy, B. Blankertz, R.P.N. Rao, R. Cabeza, Gert Pfurtscheller, and K. R. Müller. Adaptation in brain-computer interfaces. In G. Dornhege, José del R. Millán, T. Hinterberger, D. McFarland, and K. R. Müller, editors, Towards Brain-Computer Interfacing. The MIT Press, 2007.
[1701] S. L. González Andino, R. Grave de Peralta Menendez, G. Thut, José del R. Millán, P. Morier, and T. Landis. Very high frequency oscillations (VHFO) as a predictor of movement intentions. NeuroImage, 32(1), 2006. [ .pdf ]
[1702] José del R. Millán, F. Renkens, J. Mouriño, and W. Gerstner. Non-invasive brain-actuated control of a mobile robot by human EEG. In 2006 IMIA Yearbook of Medical Informatics. Schattauer Verlag, 2006.
[1703] José del R. Millán, Pierre W. Ferrez, Ferran Galán, Eileen Lew, and Ricardo Chavarriaga. Non-invasive brain-actuated interaction. In Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence, Naples, Italy, 10 2007. [ .pdf ]
[1704] F. Cincotti, L. Kauhanen, F. Aloise, T. Palomäki, C. Caporusso, P. Jylänki, D. Mattia, F. Babiloni, G. Vanacker, Marnix Nuttin, M. G. Marciani, and José del R. Millán. Vibrotactile feedback for brain-computer interface operation. Computational Intelligence and Neuroscience, 2007, 2007. [ .pdf ]
[1705] G. Vanacker, José del R. Millán, Eileen Lew, Pierre W. Ferrez, Ferran Galán, Johan Philips, H. Van Brussel, and Marnix Nuttin. Context-based filtering for assisted brain-actuated wheelchair driving. Computational Intelligence and Neuroscience, 2007, 2007. [ .pdf ]
[1706] F. Cincotti, L. Kauhanen, F. Aloise, T. Palomäki, N. Caporusso, P. Jylänki, D. Mattia, F. Babiloni, G. Vanacker, Marnix Nuttin, M. G. Marciani, and José del R. Millán. Vibrotactile feedback in the context of mu-rhythm based bci. In Proceedings of the 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Lyon, France, 8 2007. [ .pdf ]
[1707] F. Aloise, N. Caporusso, D. Mattia, F. Babiloni, L. Kauhanen, José del R. Millán, Marnix Nuttin, M. G. Marciani, and F. Cincotti. Brain-machine interfaces through control of electroencephalographic signals and vibrotactile feedback. In Proceedings of the 12th International Conference on Human-Computer Interaction, Beijing, China, 8 2007. [ .pdf ]
[1708] Johan Philips, José del R. Millán, G. Vanacker, Eileen Lew, Ferran Galán, Pierre W. Ferrez, H. Van Brussel, and Marnix Nuttin. Adaptive shared control of a brain-actuated simulated wheelchair. In Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, Noordwijk, The Netherlands, 6 2007. [ .pdf ]
[1709] M. Broschart, Christina de Negueruela, José del R. Millán, and C. Menon. Augmenting astronaut's capabilities through brain-machine interfaces. In Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, Hyderabad, India, 1 2007. [ .pdf ]
[1710] Ferran Galán, Marnix Nuttin, Eileen Lew, Pierre W. Ferrez, G. Vanacker, Johan Philips, H. Van Brussel, and José del R. Millán. An asynchronous and non-invasive brain-actuated wheelchair. In Proceedings of the 13th International Symposium on Robotics Research, Hiroshima, Japan, 11 2007. [ .pdf ]
[1711] F. Cincotti, D. Mattia, F. Aloise, S. Bufalari, L. Astolfi, F. De Vico Fallani, A. Tocci, L. Bianchi, M. G. Marciani, S. Gao, José del R. Millán, and F. Babiloni. High-resolution eeg techniques for brain-computer interface applications. Journal of Neuroscience Methods, 2007. [ .pdf ]
[1712] José del R. Millán. Tapping the mind or resonating minds? In Paul T. Kidd, editor, European Visions for the Knowledge Age. Cheshire Henbury, 2007.
[1713] A. Nijholt, D. Tan, B. Allison, José del R. Millán, M. Moore, and B. Graimann. Brain-computer interfaces for hci and games. In Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, Florence, Italy, 4 2008. [ .pdf ]
[1714] José del R. Millán. Brain-controlled robots. IEEE Intelligent Systems, 2008. [ .pdf ]
[1715] José del R. Millán, Pierre W. Ferrez, Ferran Galán, Eileen Lew, and Ricardo Chavarriaga. Non-invasive brain-machine interaction. International Journal of Pattern Recognition and Artificial Intelligence, 2008. [ .pdf ]
[1716] Hemant Misra, Hervé Bourlard, and Vivek Tyagi. New entropy based combination rules in HMM/ANN multi-stream ASR. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) [3289]. IDIAP-RR 2002 31. [ .ps.gz | .pdf ]
[1717] Hemant Misra and Andrew Morris. Confusion matrix based entropy correction in multi-stream combination. In Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech) [3290]. IDIAP-RR 2002 53. [ .ps.gz | .pdf ]
[1718] Hemant Misra, Shajith Ikbal, Hervé Bourlard, and Hynek Hermansky. Spectral entropy based feature for robust ASR. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) [3291]. IDIAP-RR 2003 56. [ .ps.gz | .pdf ]
[1719] Hemant Misra, Shajith Ikbal, Sunil Sivadas, and Hervé Bourlard. Multi-resolution spectral entropy based feature for robust ASR. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) [3292]. IDIAP-RR 2004 37. [ .ps.gz | .pdf ]
[1720] Hemant Misra and Hervé Bourlard. Spectral entropy feature in full-combination multi-stream for robust ASR. In Proceedings of ISCA European Conference on Speech Communication and Technology (Eurospeech) [3293]. IDIAP-RR 2005 10. [ .ps.gz | .pdf ]
[1721] Hemant Misra, Jithendra Vepa, and Hervé Bourlard. Multi-stream ASR: An oracle perspective. In Proceedings of ISCA International Conference on Spoken Language Processing (ICSLP) [3294]. IDIAP-RR 2005 62. [ .ps.gz | .pdf ]
[1722] Hemant Misra and Hervé Bourlard. Spectral entropy feature in multi-stream for robust ASR. Idiap-RR Idiap-RR-45-2005, IDIAP, Martigny, Switzerland, 2005. [ .ps.gz | .pdf ]
[1723] Hemant Misra. Multi-stream Processing for Noise Robust Speech Recognition. Idiap-rr, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland, 3 2006. IDIAP-RR 2006 28. [ .ps.gz | .pdf ]
[1724] Mikaela Keller and Samy Bengio. Textual data representation. Idiap-RR Idiap-RR-74-2003, IDIAP, 2003. [ .ps.gz | .pdf ]
[1725] S. Moeller and Hervé Bourlard. Analytic assessment of telephone transmission impact on asr performance using a simulation model. In Speech Communication [3296]. IDIAP-RR 01-17. [ .ps.gz | .pdf ]
[1726] Perry Moerland, Georg Thimm, and Emile Fiesler. Results on the steepness in backpropagation neural networks. In Marc Aguilar, editor, Proceedings of the '94 SIPAR-Workshop on Parallel and Distributed Computing, Institute of Informatics, University Pérolles, Fribourg, Switzerland, 10 1994. SI Group for Parallel Systems.
[1727] Perry Moerland, Emile Fiesler, and Indu Saxena. The effects of optical thresholding in backpropagation neural networks. In F. Fogelman-Soulié and P. Gallinari, editors, Proceedings of the International Conference on Artificial Neural Networks (ICANN'95 and NeuroNîmes'95), volume 2, Paris La Défense, France, 1995. ENNS, EC2 & Cie.
[1728] Perry Moerland and Emile Fiesler. Hardware-friendly learning algorithms for neural networks: An overview. In Proceedings of the Fifth International Conference on Microelectronics for Neural Networks and Fuzzy Systems: MicroNeuro'96, Los Alamitos, CA, 1996. EPFL and CSEM, IEEE Computer Society Press. [ .pdf ]
[1729] Perry Moerland. A review of MicroNeuro'96, February 12-14, 1996, Lausanne, Switzerland. Neurocomputing, 12(04), 8 1996.
[1730] Perry Moerland, Emile Fiesler, and Indu Saxena. Overcoming inaccuracies in optical multilayer perceptrons. In Proceedings of the First International Symposium on Neuro-Fuzzy Systems (AT'96). AATI, 1996.
[1731] Perry Moerland, Emile Fiesler, and Indu Saxena. Incorporation of liquid-crystal light valve non-linearities in optical multilayer neural networks. Applied Optics, 35(26), 1996. [ .pdf ]
[1732] Perry Moerland and Emile Fiesler. Neural network adaptations to hardware implementations. In Emile Fiesler and R. Beale, editors, Handbook of Neural Computation. Institute of Physics Publishing and Oxford University Publishing, New York, 1997. IDIAP-RR 97-17. [ .ps.gz | .pdf ]
[1733] Perry Moerland and Emile Fiesler. Neural network adaptations to hardware implementations. Idiap-RR Idiap-RR-17-1997, IDIAP, 1997. Published in “Handbook of Neural Computation, E1.2:1--13”. [ .ps.gz | .pdf ]
[1734] Perry Moerland, Emile Fiesler, and Indu Saxena. Discrete all-positive multilayer perceptrons for optical implementation. Idiap-RR Idiap-RR-02-1997, IDIAP, 1997. Accepted for publication in Optical Engineering. [ .ps.gz | .pdf ]
[1735] Perry Moerland. Mixtures of experts estimate a posteriori probabilities. In Gerstner et al. [3297]. (IDIAP-RR 97-07). [ .pdf ]
[1736] Perry Moerland. Some methods for training mixtures of experts. Idiap-Com Idiap-Com-05-1997, IDIAP, 11 1997. [ .ps.gz | .pdf ]
[1737] Perry Moerland, Emile Fiesler, and Indu Saxena. Discrete all-positive multilayer perceptrons for optical implementation. Optical Engineering, 37(4), 4 1998. (IDIAP-RR 97-02). [ .ps.gz | .pdf ]
[1738] Perry Moerland. A comparison of mixture models for density estimation. In Proceedings of the International Conference on Artificial Neural Networks (ICANN'99) [3298]. (IDIAP-RR 98-14). [ .ps.gz | .pdf ]
[1739] Perry Moerland. Classification using localized mixtures of experts. In Proceedings of the International Conference on Artificial Neural Networks (ICANN'99) [3298]. (IDIAP-RR 98-14). [ .ps.gz | .pdf ]
[1740] Perry Moerland and Eddy Mayoraz. Dynaboost: Combining boosted hypotheses in a dynamic way. Idiap-RR Idiap-RR-09-1999, IDIAP, 1999. [ .ps.gz | .pdf ]
[1741] Perry Moerland. Mixtures of latent variable models for density estimation and classification. Idiap-RR Idiap-RR-25-2000, IDIAP, 2000. Submitted for publication. [ .ps.gz | .pdf ]
[1742] Perry Moerland. Mixture Models for Unsupervised and Supervised Learning. Idiap-rr, École Polytechnique Fédérale de Lausanne, Computer Science Department, Lausanne, Switzerland, 6 2000. thesis #2189. [ .ps.gz | .pdf ]
[1743] N. Mohajeri, A. Gudmundsson, G. Knuckler, D. Assouline, Jérôme Kämpf, and J. L. Scartezzini. A solar-based sustainable urban design: The effects of city-scale street-canyon geometry on solar access in geneva, switzerland. Applied Energy, 240:173--190, April 2019. [ DOI ]
[1744] Gelareh Mohammadi, antonio origlia, Maurizio Pili, and Alessandro Vinciarelli. From speech to personality: Mapping voice quality and intonation into personality differences. In in Proceedings of ACM Multimedia 2012, 2012. [ .pdf ]
[1745] Gelareh Mohammadi, Sunghyun Park, Kenji Sagae, Alessandro Vinciarelli, and Louis-Philippe Morency. Who is persuasive? the role of perceived personality and communication modality in social multimedia. In International Conference on Multimodal Interaction, 2013.
[1746] Gelareh Mohammadi and Alessandro Vinciarelli. Humans as feature extractors: Combining prosody and personality perception for better speaking style recognition. In Proceeding of IEEE Int Conference on Systems, Man, and Cybernetics - Special Sessions, 2011. [ .pdf ]
[1747] Gelareh Mohammadi and Alessandro Vinciarelli. Automatic attribution of personality traits based on prosodic features. IEEE Transactions on Affective Computing, 2012. [ .pdf ]
[1748] Amir Mohammadi, Sushil Bhattacharjee, and Sébastien Marcel. Deeply vulnerable -- a study of the robustness of face recognition to presentation attacks. IET (The Institution of Engineering and Technology) -- Biometrics, pages 1--13, 2017. Accepted on 29-Sept-2017. [ DOI | .pdf ]
[1749] Amir Mohammadi, Sushil Bhattacharjee, and Sébastien Marcel. Improving cross-dataset performance of face presentation attack detection systems using face recognition datasets. In 45th International Conference on Acoustics, Speech, and Signal Processing. IEEE, 2020. [ http | .pdf ]
[1750] Amir Mohammadi, Sushil Bhattacharjee, and Sébastien Marcel. Domain adaptation for generalization of face presentation attack detection in mobile settings with minimal information. In 45th International Conference on Acoustics, Speech, and Signal Processing. IEEE, 2020. [ http | .pdf ]
[1751] Gelareh Mohammadi. Automatic Personality Perception: Inferring Personality Traits from Nonverbal Vocal Behavior. PhD thesis, Electrical Engineering Department, EPFL, 2013. [ .pdf ]
[1752] Amir Mohammadi. Trustworthy Face Recognition: Improving Generalization of Deep Face Presentation Attack Detection. PhD thesis, École polytechnique fédérale de Lausanne, 2020. [ .pdf ]
[1753] Alireza Mohammadshahi and James Henderson. Syntax-aware graph-to-graph transformer for semantic role labelling. In Arxiv, April 2021. [ .pdf ]
[1754] Alireza Mohammadshahi, Karl Aberer, and Rémi Lebret. Aligning multilingual word embeddings for cross-modal retrieval task. In Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), pages 27--33, Hong Kong, China, November 2019. Association for Computational Linguistics. [ DOI | http ]
[1755] Alireza Mohammadshahi and James Henderson. Graph-to-graph transformer for transition-based dependency parsing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings [3300], page 3278–3289. [ http | .pdf ]
[1756] Alireza Mohammadshahi and James Henderson. Recursive non-autoregressive graph-to-graph transformer for dependency parsing with iterative refinement. In Transactions of the Association for Computational Linguistics [3301]. [ http | .pdf ]
[1757] Alireza Mohammadshahi and James Henderson. Recursive non-autoregressive graph-to-graph transformer for dependency parsing with iterative refinement. In Transactions of the Association for Computational Linguistics (2021) [3301], page 18. [ DOI | http | .pdf ]
[1758] Amlan Mohanty, Debasish Kumar Mallick, Shantipriya Parida, and Satya Ranjan Dash. Semantic behavior analysis of covid-19 patients: A collaborative framework. In Machine Learning for Healthcare Applications. John Wiley & Sons, Inc. USA and Scrivener Publishing LLC, USA, 2021. [ http ]
[1759] Chafic Mokbel and Olivier Collin. Incremental enrollment of speech recognizers. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'99,',','), Phoenix, Arizona, USA, 1999.
[1760] Florent Monay, Pedro Quelhas, Jean-Marc Odobez, and Daniel Gatica-Perez. Integrating co-occurrence and spatial contexts on patch-based scene segmentation. In Beyond Patches Workshop, in conjunction with CVPR [3302]. IDIAP-RR 05-30. [ .ps.gz | .pdf ]
[1761] Florent Monay and Daniel Gatica-Perez. Modeling semantic aspects for cross-media image indexing. In IEEE Transactions on Pattern Analysis and Machine Intelligence [3303]. IDIAP-RR 05-56. [ .ps.gz | .pdf ]
[1762] Florent Monay, Pedro Quelhas, Daniel Gatica-Perez, and Jean-Marc Odobez. Constructing visual models with a latent space approach. In the Springer series of Lecture Notes in Computer Science [3304]. IDIAP-RR 05-14. [ .ps.gz | .pdf ]
[1763] Florent Monay. Learning the structure of image collections with latent aspect models. Idiap-rr, École Polytechnique Fédérale de Lausanne, 2007. PhD Thesis #3729 at the École Polytechnique Fédérale de Lausanne. [ .pdf ]
[1764] Florent Monay and Daniel Gatica-Perez. On image auto-annotation with latent space models. In Proc. ACM Int. Conf. on Multimedia (ACM MM) [3306]. IDIAP-RR 03-31. [ .ps.gz | .pdf ]
[1765] Florent Monay and Daniel Gatica-Perez. Plsa-based image auto-annotation: Constraining the latent space. In Proc. ACM Int. Conf. on Multimedia (ACM MM) [3307]. IDIAP-RR 04-30. [ .ps.gz | .pdf ]
[1766] Florent Monay, Pedro Quelhas, Jean-Marc Odobez, and Daniel Gatica-Perez. Contextual classification of image patches with latent aspect models. EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009. to appear. [ .pdf ]
[1767] R. Montoliu, Jan Blom, and Daniel Gatica-Perez. Discovering places of interest in everyday life from smartphone data. Multimedia Tools and Applications, 2012. [ .pdf ]
[1768] Raul. Montoliu and Daniel Gatica-Perez. Discovering human places of interest from multimodal mobile phone data. In Proceedings of 9th International Conference on on Mobile and Ubiquitous Multimedia (MUM,',','), Limassol, Cyprus, 12 2010. [ .pdf ]
[1769] Darren Moore. The idiap smart meeting room. Idiap-Com Idiap-Com-07-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[1770] Darren Moore. Tode: A decoder for continuous speech recognition. Idiap-Com Idiap-Com-09-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[1771] Darren Moore and Iain A. McCowan. Microphone array speech recognition : Experiments on overlapping speech in meetings. In Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03) [3308]. To appear. [ .ps.gz | .pdf ]
[1772] Darren Moore, John Dines, Mathew Magimai.-Doss, Jithendra Vepa, Octavian Cheng, and Thomas Hain. Juicer: A weighted finite-state transducer speech decoder. In 3rd Joint Workshop on Multimodal Interaction and Related Machine LEarning Algorithms MLMI'06 [3309]. IDIAP-RR 06-21. [ .ps.gz | .pdf ]
[1773] Darren Moore. The juicer lvcsr decoder - user manual for juicer version 0.5.0. Idiap-Com Idiap-Com-03-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[1774] Aythami Morales, Julian Fierrez, Ruben Tolosana, Javier Ortega-Garcia, Javier Galbally, Marta Gomez-Barrero, André Anjos, and Sébastien Marcel. Keystroke biometrics ongoing competition. IEEE Access, 4:7736--7746, November 2016. [ DOI | http | .pdf ]
[1775] Miguel Moreira, Alain Hertz, and Eddy Mayoraz. Data binarization by discriminant elimination. In Bruha and Bohanec [3310]. IDIAP-RR 99-04. [ .ps.gz | .pdf ]
[1776] Miguel Moreira and Eddy Mayoraz. Improved pairwise coupling classification with correcting classifiers. In Nédellec and Rouveirol [3311]. IDIAP-RR 97-09. [ .ps.gz | .pdf ]
[1777] Miguel Moreira and Emile Fiesler. Neural networks with adaptive learning rate and momentum terms. Idiap-RR Idiap-RR-04-1995, IDIAP, Martigny, Switzerland, 10 1995. [ .ps.gz | .pdf ]
[1778] Miguel Moreira, Emile Fiesler, and Gianni Pante. Image classification by neural networks for the quality control of watches. In Soto et al. [3312].
[1779] Miguel Moreira. The use of Boolean concepts in general classification contexts. PhD thesis, Ecole Polytechnique Federale de Lausanne, Lausanne, Switzerland, 12 2000. thesis #2316 (IDIAP-RR 00-46). [ .ps.gz | .pdf ]
[1780] Miguel Moreira. The use of boolean concepts in general classification contexts. Idiap-RR Idiap-RR-46-2000, IDIAP, Martigny, Switzerland, 12 2000. [ .ps.gz | .pdf ]
[1781] Nelson Morgan, Hervé Bourlard, and Hynek Hermansky. Automatic speech recognition: an auditory perspective. In Greenberg et al. [3313]. IDIAP-RR 98-17.
[1782] Andrew Morris. An information theoretic measure of sequence recognition performance. Idiap-Com Idiap-Com-03-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[1783] Andrew Morris. Noise pdf transformation in secondary feature processing. Idiap-RR Idiap-RR-29-2002, IDIAP, 2002. [ .ps.gz | .pdf ]
[1784] Astrid Hagen and Andrew Morris. Recent advances in the multi-stream hmm/ann hybrid approach to noise robust asr. Idiap-RR Idiap-RR-57-2002, IDIAP, 2002. To be published in: Computer, Speech and Language (to appear). [ .ps.gz | .pdf ]
[1785] Andrew Morris, Astrid Hagen, Hervé Glotin, and Hervé Bourlard. Multi-stream adaptive evidence combination for noise robust asr. In Speech Communication [3314].
[1786] Andrew Morris. Latent variable decomposition for posteriors or likelihood based subband asr. Idiap-Com Idiap-Com-04-1999, IDIAP, 1999. [ .ps.gz | .pdf ]
[1787] Zohreh Mostaani, RaviShankar Prasad, Bogdan Vlasenko, and Mathew Magimai.-Doss. Modeling of pre-trained neural network embeddings learned from raw waveform for covid-19 infection detection. In Proceedings of ICASSP, 2022. [ .pdf ]
[1788] Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik, and Mathew Magimai.-Doss. On the relationship between speech-based breathing signal prediction evaluation measures and breathing parameters estimation. In Proc. of ICASSP, 2021. [ .pdf ]
[1789] Zohreh Mostaani, Anjith George, Guillaume Heusch, David Geissenbuhler, and Sébastien Marcel. The high-quality wide multi-channel attack (hq-wmca) database. Idiap-RR Idiap-RR-22-2020, Idiap, 9 2020. [ .pdf ]
[1790] Zohreh Mostaani and Mathew Magimai.-Doss. On breathing pattern information in synthetic speech. In Proceedings of Interspeech, 2022. [ .pdf ]
[1791] Petr Motlicek, Vijay Ullal, and Hynek Hermansky. Wide-band perceptual audio coding based on frequency-domain linear prediction. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) [3315]. IDIAP-RR 06-58. [ .ps.gz | .pdf ]
[1792] Hari Krishna Maganti, Petr Motlicek, and Daniel Gatica-Perez. Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007. IDIAP-RR 06-57. [ .ps.gz | .pdf ]
[1793] Petr Motlicek, Hynek Hermansky, Sriram Ganapathy, and Harinath Garudadri. Frequency domain linear prediction for qmf sub-bands and applications to audio coding. In 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI) [3316]. IDIAP-RR 07-16. [ .ps.gz | .pdf ]
[1794] Petr Motlicek, Sriram Ganapathy, Hynek Hermansky, and Harinath Garudadri. Non-uniform qmf decomposition for wide-band audio coding based on frequency domain linear prediction. Idiap-RR Idiap-RR-43-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1795] Petr Motlicek. Lp-traps in all senses. Idiap-RR Idiap-RR-66-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1796] Petr Motlicek, Hynek Hermansky, Harinath Garudadri, and Naveen Srinivasamurthy. Speech coding based on spectral dynamics. In Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD) [3317]. IDIAP-RR 06-05. [ .ps.gz | .pdf ]
[1797] Petr Motlicek, Hynek Hermansky, Sriram Ganapathy, and Harinath Garudadri. Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes. In Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD) [3318]. IDIAP-RR 06-30. [ .ps.gz | .pdf ]
[1798] Petr Motlicek, Stefan Duffner, Danil Korchagin, Hervé Bourlard, Carl Scheffler, Jean-Marc Odobez, Giovanni Del Galdo, Markus Kallinger, and Oliver Thiergart. Real-time audio-visual analysis for multiperson videoconferencing. Advances in Multimedia, 2013:21, August 2013. Hindawi Publishing Corporation, Article ID 175745. [ DOI | http | .pdf ]
[1799] Dick C. A. Bulterman, Petr Motlicek, Stefan Duffner, and Danil Korchagin. Together Anywhere, Together Anytime, Technologies for Intimate Interactions. Centrum Wiskunde & Informatica, Amsterdam, Holland, dick c.a. bulterman, editor edition, May 2012.
[1800] Petr Motlicek, Sriram Ganapathy, Hynek Hermansky, and Harinath Garudadri. Wide-band audio coding based on frequency domain linear prediction. In EURASIP Journal on Audio Speech and Music Processing [3319]. Special Issue: Scalable Audio-Content Analysis. [ DOI | .html | .pdf ]
[1801] Petr Motlicek, David Imseng, Blaise Potard, Philip N. Garner, and Ivan Himawan. Exploiting foreign resources for dnn-based asr. In EURASIP Journal on Audio, Speech, and Music Processing [3320]. [ DOI | .pdf ]
[1802] Petr Motlicek and Fabio Valente. Application of out-of-language detection to spoken-term detection. In 2010 IEEE International Conference on Acoustics, Speech and Signal Processing [3321]. [ .pdf ]
[1803] Petr Motlicek, Philip N. Garner, Namhoon Kim, and Jeongmi Cho. Accent adaptation using subspace gaussian mixture models. In The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) [3322], pages 7170--7174. [ DOI | .pdf ]
[1804] Petr Motlicek, Daniel Povey, and Martin Karafiat. Feature and score level combination of subspace gaussians in lvcsr task. In The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) [3323], pages 7604--7608. [ DOI | .pdf ]
[1805] Petr Motlicek, Fabio Valente, and Igor Szoke. Improving acoustic based keyword spotting using lvcsr lattices. In Proceedings on IEEE International Conference on Acoustics, Speech and Signal Processing [3324], pages 4413--4416.
[1806] Petr Motlicek, Subhadeep Dey, Srikanth Madikeri, and Lukas Burget. Employment of subspace gaussian mixture models in speaker recognition. In 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing [3325], pages 4445--4449. [ http | .pdf ]
[1807] Petr Motlicek, Laurent El Shafey, Roy Wallace, Chris McCool, and Sébastien Marcel. Bi-modal authentication in mobile environments using session variability modelling. In Proceedings of the 21st International Conference on Pattern Recognition [3326]. [ .pdf ]
[1808] Petr Motlicek, Philip N. Garner, Maël Guillemot, and Vincent Bozzo. Amida/klewel mini-project. Idiap-RR Idiap-RR-03-2010, Idiap, Rue Marconi 19, Martigny, 1 2010. [ .pdf ]
[1809] Petr Motlicek. Automatic out-of-language detection based on confidence measures derived from lvcsr word and phone lattices. Idiap-RR Idiap-RR-06-2009, Idiap, Rue Marconi 19, martigny, Switzerland, 5 2009. [ .pdf ]
[1810] Petr Motlicek, Philip N. Garner, David Imseng, and Fabio Valente. Application of subspace gaussian mixture models in contrastive acoustic scenarios. Idiap-RR Idiap-RR-20-2012, Idiap, Rue Marconi 19, Martigny, Switzerland, 7 2012. [ .pdf ]
[1811] Petr Motlicek, Sriram Ganapathy, and Hynek Hermansky. Entropy coding of quantized spectral components in fdlp audio codec. Idiap-RR Idiap-RR-71-2008, Idiap, 11 2008. [ .pdf ]
[1812] Petr Motlicek. Automatic out-of-language detection based on confidence measures derived fromlvcsr word and phone lattices. In 10thAnnual Conference of the International Speech Communication Association, 2009 ISCA. ISCA, 9 2009. [ .pdf ]
[1813] Petr Motlicek, Sriram Ganapathy, and Hynek Hermansky. Arithmetic coding of sub-band residuals in fdlp speech/audio codec. In 10th Annual Conference of the International Speech Communication Association. ISCA, ISCA 2009, 9 2009. [ .pdf ]
[1814] Petr Motlicek, David Imseng, and Philip N. Garner. Crosslingual tandem-sgmm: Exploiting out-of-language data for acoustic model and feature level adaptation. In Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013) [3329], pages 510--514. [ .pdf ]
[1815] Petr Motlicek, David Imseng, Milos Cernak, and Namhoon Kim. Development of bilingual asr system for mediaparl corpus. In Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech 2014) [3330]. [ .pdf ]
[1816] Zuluaga-Gomez. Juan, Petr Motlicek, Qingran Zhan, Rudolf Braun, and Karel Vesely. Automatic speech recognition benchmark for air-traffic communications. In Proc. Interspeech 2020, pages 2297--2301, October 2020. [ DOI | .pdf ]
[1817] Petr Motlicek, Hynek Hermansky, Srikanth Madikeri, Amrutha Prasad, and Sriram Ganapathy. Am-fm decomposition of speech signal: Applications for speech privacy and diagnosis. In 11th International workshop on Models and Analysis of Vocal Emissions for Biomedical Applications [3331]. [ http | .pdf ]
[1818] Petr Motlicek, Sriram Ganapathy, Hynek Hermansky, Harinath Garudadri, and Marios Athineos. Perceptually motivated sub-band decomposition for fdlp audio coding. In Text, Speech and Dialogue, volume 5246 of Series of Lecture Notes in Artificial Intelligence (LNAI). Springer-Verlag Berlin, Heidelberg, 9 2008. [ .pdf ]
[1819] Kurena Motokura, Masaki Takahashi, Marco Ewerton, and Jan Peters. Plucking motions for tea harvesting robots using probabilistic movement primitives. In IEEE International Conference on Robotics and Automation, 2020.
[1820] K. Moustakas, D. Tzovaras, L. Dybkjaer, N. Bernsen, and Oya Aran. Using modality replacement to facilitate communication between visually and hearing-impaired people. IEEE Multimedia, 18(2):26--37, February 2011. [ DOI ]
[1821] Rémi Moyen and Théophile Gentilhomme. Adaptive ensemble-based optimisation for petrophysical inversion. Mathematical Geosciences, 2020. [ DOI | http ]
[1822] Khalil Mrini, Nikolaos Pappas, and Andrei Popescu-Belis. Cross-lingual transfer for news article labeling: Benchmarking statistical and neural models. Idiap-RR Idiap-RR-26-2017, Idiap, Rue Marconi 19, CH-1920 Martigny, 9 2017. Report of EPFL semester project done by Khalil Mrini (1st year I&C MSc student), supervised by N. Pappas and A. Popescu-Belis. [ .pdf ]
[1823] Hannah Muckenhirn, Mathew Magimai.-Doss, and Sébastien Marcel. Presentation attack detection using long-term spectral statistics for trustworthy speaker verification. In International Conference of the Biometrics Special Interest Group (BIOSIG), September 2016. [ .pdf ]
[1824] Hannah Muckenhirn, Mathew Magimai.-Doss, and Sébastien Marcel. Towards directly modeling raw speech signal for speaker verification using cnns. In IEEE International Conference on Acoustics, Speech and Signal Processing [3332], pages 4884--4888. [ .pdf ]
[1825] Hannah Muckenhirn, Mathew Magimai.-Doss, and Sébastien Marcel. End-to-end convolutional neural network-based voice presentation attack detection. In International Joint Conference on Biometrics, 2017. [ .pdf ]
[1826] Hannah Muckenhirn, Mathew Magimai.-Doss, and Sébastien Marcel. On learning vocal tract system related speaker discriminative information from raw signal using cnns. In Proceedings of Interspeech, pages 1116--1120, September 2018. [ .pdf ]
[1827] Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai.-Doss, and Sébastien Marcel. Understanding and visualizing raw waveform-based cnns. In Proceedings of Interspeech [3333]. [ .pdf ]
[1828] Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai.-Doss, and Sébastien Marcel. Long-term spectral statistics for voice presentation attack detection. In IEEE/ACM Transactions on Audio, Speech and Language Processing [3334], pages 2098--2111. [ .pdf ]
[1829] Hannah Muckenhirn. Trustworthy speaker recognition with minimal prior knowledge using neural networks. PhD thesis, Ecole polytechnique fédérale de Lausanne (EPFL), Switzerland, 2019. [ DOI | .pdf | .pdf ]
[1830] Skanda Muralidhar. On job training: Automated interpersonal behavior assessment & real-time feedback. In Proceedings of the 25th ACM International Conference on Multimedia, 2017, 2017. [ .pdf ]
[1831] Skanda Muralidhar, Laurent Son Nguyen, Denise Frauendorfer, Jean-Marc Odobez, Marianne Schmid Mast, and Daniel Gatica-Perez. Training on the job: Behavioral analysis of job interviews in hospitality. In Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 84--91, November 2016. [ .pdf ]
[1832] Skanda Muralidhar, Marianne Schmid Mast, and Daniel Gatica-Perez. How may i help you? behavior and impressions in hospitality service encounters. In Proceddings of 19th ACM International Conference on Multimodal Interaction, 2017. [ .pdf ]
[1833] Skanda Muralidhar, Emmanuelle Patricia Kleinlogel, Eric Mayor, Adrian Bangerter, Marianne Schmid Mast, and Daniel Gatica-Perez. Understanding applicants' reactions to asynchronous video interviews through self-reports and nonverbal cues. In Proc. ACM Int. Conf. on Multimodal Interaction (ICMI), October 2020. [ .pdf ]
[1834] Skanda Muralidhar, Marianne Schmid Mast, and Daniel Gatica-Perez. A tale of two interactions: Inferring performance in hospitality encounters from cross-situation social sensing. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2(129), 2018. [ .pdf ]
[1835] Skanda Muralidhar and Daniel Gatica-Perez. Examining linguistic content and skill impression structure for job interview analytics in hospitality. In Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia, 2017. [ .pdf ]
[1836] Skanda Muralidhar, Jean M R Costa, Laurent Son Nguyen, and Daniel Gatica-Perez. Dites-moi: Wearable feedback on conversational behavior. In Proceedings of the 15th International Conference on Mobile and Ubiquitous Multimedia, December 2016. [ .pdf ]
[1837] Skanda Muralidhar, Remy Siegfried, Jean-Marc Odobez, and Daniel Gatica-Perez. Facing employers and customers: What do gaze and expressions tell about soft skills? In Proceedings of the 17th International Conference on Mobile and Ubiquitous Multimedia, pages 121--126, New York, November 2018. ASSOC COMPUTING MACHINERY. [ DOI | .pdf ]
[1838] Skanda Muralidhar. SOCIAL SENSING METHODS FOR ANALYSIS OF DYADIC HOSPITALITY ENCOUNTERS. PhD thesis, EPFL, January 2019. [ .pdf ]
[1839] Skanda Muralidhar, Laurent Son Nguyen, and Daniel Gatica-Perez. Words worth: Verbal content and hirability impressions in youtube video resumes. In Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2018. [ .pdf ]
[1840] V. Murino, M. Cristani, and Alessandro Vinciarelli. Socially intelligent surveillance and monitoring: Analysing social dimensions of physical space. In Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, pages 51--58, 2010. [ .pdf ]
[1841] Nora A Murphy, Judith A Hall, Marianne Schmid Mast, Mollie A. Ruben, Denise Frauendorfer, Danielle Blanch-Hartigan, Debra L. Roter, and Laurent Son Nguyen. Reliability and validity of nonverbal thin slices in social interactions. Personality and Social Psychology Bulletin, 41(2):199--213, 2014. [ DOI | .pdf ]
[1842] Emanuele Naboni, Marco Meloni, Chris Makey, and Jérôme Kämpf. The simulation of mean radiant temperature in outdoor conditions: A review of architectural tools calculation assumptions. In Proceedings of Building Simulation 2019: 16th Conference of IBPSA, September 2019.
[1843] A. Naceri, T. Schumacher, Q. Li, Sylvain Calinon, and H. Ritter. Learning optimal impedance control during complex 3d arm movements. IEEE Robotics and Automation Letters (RA-L), 6(2):1248--1255, 2021. [ DOI | http | .pdf ]
[1844] Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar, and Sadao Kurohashi. Overview of the 7th workshop on asian translation. In Proceedings of the 7th Workshop on Asian Translation. Association for Computational Linguistics, 2020. [ .pdf ]
[1845] Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondrej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda, and Sadao Kurohashi. Overview of the 8th workshop on asian translation. In Proceedings of the 8th Workshop on Asian Translation (WAT2021), pages 1--45. Association for Computational Linguistics, August 2021. [ http ]
[1846] Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik, and Mathew Magimai.-Doss. Phoneme based respiratory analysis of read speech. In Proceedings of European Signal Processing Conference (EUSIPCO), 2021. [ .pdf ]
[1847] Venkata Srikanth Nallanthighal, Zohreh Mostaani, Aki Härmä, Helmer Strik, and Mathew Magimai.-Doss. Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings. Neural Networks, 141:211--224, 2021. [ DOI ]
[1848] Alexandre Nanchen and Philip N. Garner. Empirical evaluation and combination of punctuation prediction models applied to broadcast news. In Proceedings of 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing [3335]. [ .pdf ]
[1849] Fabian Nater, Tatiana Tommasi, Luc Van Gool, and Barbara Caputo. Learning to learn new models of human activities in indoor settings1. In Bourlard and Popescu-Belis [3336]. [ .pdf ]
[1850] Xavier Naturel and Jean-Marc Odobez. Detecting queues at vending machines: a statistical layered approach. In Proc. Int. Conf. on Pattern Recognition (ICPR) [3337]. [ .pdf ]
[1851] Xingyu Na and Philip N. Garner. Convolutional pitch target approximation model for speech synthesis. Idiap-RR Idiap-RR-05-2013, Idiap, 3 2013. [ .pdf ]
[1852] B. Nedic and Hervé Bourlard. Recent developments in speaker verification at idiap. Idiap-RR Idiap-RR-26-2000, IDIAP, 2000. [ .ps.gz | .pdf ]
[1853] Radu-Andrei Negoescu and Daniel Gatica-Perez. Topickr: Flickr groups and users reloaded. In MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia [3338].
[1854] Radu-Andrei Negoescu, Brett Adams, Dinh Phung, Svetha Venkatesh, and Daniel Gatica-Perez. Flickr hypergroups. In Proceedings of the 17th ACM International Conference on Multimedia, 10 2009. [ .pdf ]
[1855] Radu-Andrei Negoescu, Alexander Loui, and Daniel Gatica-Perez. Kodak moments and flickr diamonds: How users shape large-scale media. In Proc. of the 18th Intl. Conf. on Multimedia [3339].
[1856] Radu-Andrei Negoescu and Daniel Gatica-Perez. Flickr groups: Multimedia communities for multimedia analysis. In Hua et al. [3340].
[1857] Radu-Andrei Negoescu and Daniel Gatica-Perez. Analyzing flickr groups. In Proc. of the Intl. Conf. on Image and Video Retrieval [3341]. To appear in Proceedings of CIVR'08.
[1858] Radu-Andrei Negoescu and Daniel Gatica-Perez. Modeling and understanding flickr communities through topic-based analysis. Idiap-RR Idiap-RR-19-2010, Idiap, 7 2010. [ .pdf ]
[1859] Radu-Andrei Negoescu and Daniel Gatica-Perez. Modeling and understanding flickr communities through topic-based analysis. IEEE Transactions on Multimedia, 12(5), 8 2010. [ DOI ]
[1860] Radu-Andrei Negoescu. Modeling and understanding communities in online social media using probabilistic methods. PhD thesis, Ecole polytechnique fédérale de Lausanne, June 2011. [ DOI | http | .pdf ]
[1861] Julien Nembrini, Jérôme Kämpf, Michael Pappinutto, and Denis Lalanne. A smart luminaire in an office environment: impact on light distribution, user interactions and comfort. In Journal of Physics: Conference Series, volume 1343. IOP Publishing Ltd, November 2019. [ DOI ]
[1862] James Newling and Francois Fleuret. A sub-quadratic exact medoid algorithm. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017.
[1863] James Newling and Francois Fleuret. Fast k-means with accurate bounds. In Proceedings of the International Conference on Machine Learning (ICML) [3342].
[1864] James Newling and Francois Fleuret. A sub-quadratic exact medoid algorithm. Idiap-RR Idiap-RR-19-2017, Idiap, 7 2017. [ .pdf ]
[1865] James Newling and Francois Fleuret. Nested mini-batch k-means. In Proceedings of NIPS, 2016.
[1866] James Newling and Francois Fleuret. K-medoids for k-means seeding. In Proceedings of the international conference on Neural Information Processing Systems, 2017.
[1867] James Newling. Novel Algorithms for Clustering. PhD thesis, École polytechnique fédérale de Lausanne, January 2018. [ DOI | .pdf ]
[1868] Laurent Son Nguyen, Alvaro Marcos-Ramiro, Marta Marron-Romera, and Daniel Gatica-Perez. Multimodal analysis of body communication cues in employment interviews. In 15th ACM International Conference on Multimodal Interaction Proceedings, 2013. [ .pdf ]
[1869] Laurent Son Nguyen, Jean-Marc Odobez, and Daniel Gatica-Perez. Using self-context for multimodal detection of head nods in face-to-face interactions. In Proceedings of the 14th ACM International Conference on Multimodal Interaction [3343]. [ .pdf ]
[1870] Laurent Son Nguyen and Daniel Gatica-Perez. I would hire you in a minute: Thin slices of nonverbal behavior in job interviews. In Proceedings of the ACM International Conference on Multimodal Interaction (ICMI), pages 51--58, 2015. [ .pdf ]
[1871] Huy H. Nguyen, Junichi Yamagishi, Isao Echizen, and Sébastien Marcel. Generating master faces for use in performingwolf attacks on face recognition systems. In International Join Conference on Biometrics, September 2020. [ .pdf ]
[1872] Hoang H. Nguyen, Mael Fabien, Petr Motlicek, Shantipriya Parida, and Kvetoslav Maly. Roxsd: a simulated dataset of communication in organized crime. In 1st ISCA Symposium on Security and Privacy in Speech Communication, 2021. [ .pdf ]
[1873] Laurent Son Nguyen, Salvador Ruiz-Correa, Marianne Schmid Mast, and Daniel Gatica-Perez. Check out this place: Inferring ambiance from airbnb photos. IEEE transactions on Multimedia, 20(6):1499--1511, June 2018. [ DOI | http | .pdf ]
[1874] Laurent Son Nguyen. Computational Analysis Of Behavior In Employment Interviews And Video Resumes. PhD thesis, École Polytechnique Fédérale de Lausanne, May 2015. [ .pdf ]
[1875] Laurent Son Nguyen, Denise Frauendorfer, Marianne Schmid Mast, and Daniel Gatica-Perez. Hire me: Computational inference of hirability in employment interviews based on nonverbal behavior. IEEE Transactions on Multimedia, 16(4):1018 -- 1031, June 2014. [ DOI | .pdf ]
[1876] Laurent Son Nguyen and Daniel Gatica-Perez. Hirability in the wild: Analysis of online conversational video resumes. IEEE Trans. on Multimedia, 18(7):1422--1437, July 2016. [ .pdf ]
[1877] Olegs Nikisins, Amir Mohammadi, André Anjos, and Sébastien Marcel. On effectiveness of anomaly detection approaches against unseen presentation attacks in face anti-spoofing. In The 11th IAPR International Conference on Biometrics (ICB 2018), February 2018. [ .pdf ]
[1878] Olegs Nikisins, Anjith George, and Sébastien Marcel. Domain adaptation in multi-channel autoencoder based features for robust face anti-spoofing. In International Conference on Biometrics 2019, IEEE, 2019. [ .pdf ]
[1879] Olegs Nikisins, Teodors Eglitis, André Anjos, and Sébastien Marcel. Fast cross-correlation based wrist vein recognition algorithm with rotation and translation compensation. In Sixth International Workshop on Biometrics and Forensics, June 2018. [ .pdf ]
[1880] Maria Elena Nilsback and Barbara Caputo. Cue integration through discriminative accumulation. In International Conference on Computer Vision and Pattern Recognition, 2004. [ .pdf ]
[1881] Jagannadan Varadarajan, Remi Emonet, and Jean-Marc Odobez. A sparsity constraint for topic models - application to temporal activity mining. In NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions [3344]. [ .pdf ]
[1882] Nicoletta Noceti, Barbara Caputo, Claudio Castellini, Luca Baldassarre, Annalisa Barla, Lorenzo Rosasco, Francesca Odone, and Giulio Sandini. Towards a theoretical framework for learning multi-modal patterns for embodied agents. In International Conference on Image Analysis and Processing, 2009. [ .pdf ]
[1883] Norman Poh. Multi-system Biometric Authentication: Optimal Fusion and User-Specific Information. PhD thesis, École Polytechnique Fédérale de Lausanne, 2006. [ .ps.gz | .pdf ]
[1884] Norman Poh and Samy Bengio. Using chimeric users to construct fusion classifiers in biometric authentication tasks: An investigation. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) [3345]. IDIAP-RR 05-59. [ .ps.gz | .pdf ]
[1885] Norman Poh, Samy Bengio, and Arun Ross. Revisiting doddington's zoo: A systematic method to assess user-dependent variabilities. In Multimodal User Authentication (MMUA) [3346]. IDIAP-RR 06-04. [ .ps.gz | .pdf ]
[1886] Norman Poh and Samy Bengio. Using chimeric users to construct fusion classifiers in biometric authentication tasks: An investigation. Idiap-RR Idiap-RR-59-2005, IDIAP, 2005. Published in 2006 IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 1077--1080, 2006, Toulouse. [ .ps.gz | .pdf ]
[1887] Norman Poh, Alvin Martin, and Samy Bengio. Performance generalization in biometric authentication using joint user-specific and sample bootstraps. Idiap-RR Idiap-RR-60-2005, IDIAP, 2005. To appear in IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI). [ .ps.gz | .pdf ]
[1888] Norman Poh and Samy Bengio. Estimating the confidence interval of expected performance curve in biometric authentication using joint bootstrap. Idiap-RR Idiap-RR-25-2006, IDIAP, 2006. Submitted for publication. [ .ps.gz | .pdf ]
[1889] Norman Poh, Alvin Martin, and Samy Bengio. Performance generalization in biometric authentication using joint user-specific and sample bootstraps. In IEEE Pattern Analysis and Machine intelligence [3347]. IDIAP-RR 05-60. [ .ps.gz | .pdf ]
[1890] Nicolas Scaringella. Timbre and rhythmic trap-tandem features for music information retrieval. In "Int. Conf. on Music Information Retrieval (ISMIR)" [3348]. To appear in ISMIR 2008. [ .pdf ]
[1891] Jean-Marc Odobez, Silèye O. Ba, and Daniel Gatica-Perez. An implicit Motion Likelihood for Tracking with Particle Filters. In British Machine Vision Conference (BMVC) [3349]. Similar to RR-03-15. [ .ps.gz | .pdf ]
[1892] Jean-Marc Odobez, Daniel Gatica-Perez, and Maël Guillemot. Video Shot Clustering using Spectral Methods. In 3rd Workshop on Content-Based Multimedia Indexing (CBMI), Rennes, France, 2003. [ .ps.gz | .pdf ]
[1893] Jean-Marc Odobez, Daniel Gatica-Perez, and Maël Guillemot. Spectral Structuring of Home Videos. In International Conference on Image and Video Retrieval (CIVR'03) [3350]. Similar to IDIAP-RR 02-55. [ .ps.gz | .pdf ]
[1894] Jean-Marc Odobez and Silèye O. Ba. Modélisation implicite du mouvement en suivi par filtrage de monte carlo séquentiel. In GRETSI conference, Signal and Image Processing, [3349]. Published in British Machine Vision Conference (BMVC,',','), Norwich, 2003. [ .ps.gz | .pdf ]
[1895] Jean-Marc Odobez and Daniel Gatica-Perez. Embedding motion in model-based stochastic tracking. In 17th Int. Conf. Pattern Recognition (ICPR) [3351]. Similar to RR-03-72. [ .ps.gz | .pdf ]
[1896] Datong Chen, Jean-Marc Odobez, and Jean-Philippe Thiran. Monte Carlo Video Text Segmentation. In International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI) [3352]. IDIAP-RR 03-43. [ .ps.gz | .pdf ]
[1897] Datong Chen and Jean-Marc Odobez. Video Text Recognition using Sequential Monte Carlo and error Voting Methods. In Pattern Recognition Letters [3352]. A shorter version of the paper appeared in the techreport. [ .ps.gz | .pdf ]
[1898] F. Kottelat and Jean-Marc Odobez. Audio-Video Person Clustering in Video Databases. Idiap-RR Idiap-RR-46-2003, IDIAP, Martigny, Switzerland, 2003. [ .ps.gz | .pdf ]
[1899] Nabil Daddaoua, Jean-Marc Odobez, and Alessandro Vinciarelli. Ocr based slide retrieval. Idiap-RR Idiap-RR-11-2005, IDIAP, Martigny, Switzerland, 2005. Submitted for publication. [ .ps.gz | .pdf ]
[1900] Jean-Marc Odobez and Silèye O. Ba. A cognitive and unsupervised map adaptation approach to the recognition of the focus of attention from head pose. In International Conference on Multi-Media & Expo (ICME07) [3353]. IDIAP-RR 07-20. [ .ps.gz | .pdf ]
[1901] Jean-Marc Odobez, Daniel Gatica-Perez, and Silèye O. Ba. Embedding motion in model-based stochastic tracking. In IEEE Transaction on Image Processing [3354]. IDIAP-RR 04-61.
[1902] Jean-Marc Odobez and Oswald Lanz. Sampling techniques for audio-visual tracking and head pose estimation. In Multimodal Signal Processing: Human Interactions in Meetings, chapter 6, pages 84--102. Cambridge University Press, June 2012. [ .pdf ]
[1903] Jean-Marc Odobez, C. Carincotte, Remi Emonet, E. Jouneau, Sofia Zaidenberg, Bertrand Raverra, Francois Bremond, and Andrea Grifoni. Unsupervised activity analysis and monitoring algorithms for effective surveillance systems. In European Conference on Computer Vision, LNCS, October 2012. [ .pdf ]
[1904] Catharine Oertel, Patrik Jonell, Dimosthenis Kontogiorgos, Kenneth Alberto Funes Mora, Jean-Marc Odobez, and Joakim Gustafson. Towards an engagement-aware attentive artificial listener for multi-party interactions. Frontiers in Robotics and AI, 8:189, 2021. [ DOI | http | .pdf ]
[1905] Catharine Oertel, Kenneth Alberto Funes Mora, Samira Sheikhi, Jean-Marc Odobez, and Joakim Gustafson. Who will get the grant ? a multimodal corpus for the analysis of conversational behaviours in group interviews. In International Conference on Multimodal Interaction, Understanding and Modeling Multiparty, Multimodal Interactions Workshop. ACM, November 2014. [ DOI | .pdf ]
[1906] Catharine Oertel, Kenneth Alberto Funes Mora, Joakim Gustafson, and Jean-Marc Odobez. Deciphering the silent participant. on the use of audio-visual cues for the classification of listener categories in group discussions. In Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, ICMI '15, pages 107--114, New York, NY, USA, November 2015. ACM, ACM. [ DOI | .pdf ]
[1907] Catharine Oertel, José David Lopes, Yu Yu, Kenneth Alberto Funes Mora, Joakim Gustafson, Alan Black, and Jean-Marc Odobez. Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens. In Proceedings of the 18th ACM International Conference on Multimodal Interaction, pages 21--28. ACM, November 2016. [ DOI ]
[1908] Oliver Ohneiser, Seyyed Saeed Sarfjoo, Hartmut Helmke, Shruthi Shetty, Petr Motlicek, Matthias Kleinert, heiko Ehr, and Šarunas Murauskas. Robust command recognition for lithuanian air traffic control tower utterances. In Interspeech, 2021. [ .pdf ]
[1909] Shogo Okada, Laurent Son Nguyen, Oya Aran, and Daniel Gatica-Perez. Modeling dyadic and group impressions with inter-modal and inter-person features. ACM Transactions on Multimedia Computing, Communications, and Applications, 15(1), January 2019. [ .pdf ]
[1910] Shogo Okada, Oya Aran, and Daniel Gatica-Perez. Personality trait classification via co-occurrent multiparty multimodal event discovery. In Proceedings of the ACM International Conference on Multimodal Interaction, ICMI '15, pages 15--22. ACM, November 2015. [ DOI | .pdf ]
[1911] Rui Oliveira, Jérôme Kämpf, Romeu Vicente, Ricardo Almeida, and António Figueiredo. Co2 experimental measurements towards the development of a predictive framework using user actions in smart buildings. In Journal of Physics: Conference Series, volume 1343. IOP Publishing Ltd, November 2019. [ DOI ]
[1912] Francesco Orabona, Claudio Castellini, Barbara Caputo, Jie Luo, and Giulio Sandini. Indoor place recognition using online independent support vector machines. In 18th British Machine Vision Conference (BMVC07), Warwick, UK, 9 2007. [ .ps.gz | .pdf ]
[1913] Francesco Orabona, Joseph Keshet, and Barbara Caputo. The projectron: a bounded kernel-based perceptron. In Int. Conf. on Machine Learning [3355]. IDIAP-RR 08-30. [ .ps.gz | .pdf ]
[1914] Francesco Orabona, Claudio Castellini, Barbara Caputo, Jie Luo, and Giulio Sandini. On-line independent support vector machines for cognitive systems. Idiap-RR Idiap-RR-63-2007, IDIAP, 2007. [ .ps.gz | .pdf ]
[1915] Francesco Orabona, Barbara Caputo, Antje Fillbrandt, and Frank Ohl. A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems. In International Conference on Developmental Learning, 2009. [ .pdf ]
[1916] Francesco Orabona and Jie Luo. Ultra-fast optimization algorithm for sparse multi kernel learning. In Proceedings of the 28th International Conference on Machine Learning [3356]. [ .pdf ]
[1917] Francesco Orabona, Claudio Castellini, Barbara Caputo, Angelo Emanuele Fiorilla, and Giulio Sandini. Model adaptation with least-square svm for adaptive hand prosthetics. In IEEE International conference on Robotics and Automation, 2009. [ .pdf ]
[1918] Francesco Orabona, Claudio Castellini, Barbara Caputo, Angelo Emanuele Fiorilla, and Giulio Sandini. Model adaptation with least-squares svm for adaptive hand prosthetics. Idiap-RR Idiap-RR-05-2009, Idiap, 3 2009. Accepted in ICRA09. [ .pdf ]
[1919] Francesco Orabona, Jie Luo, and Barbara Caputo. Online-batch strongly convex multi kernel learning. Idiap-RR Idiap-RR-07-2010, Idiap, 4 2010. [ .pdf ]
[1920] Francesco Orabona, Joseph Keshet, and Barbara Caputo. Bounded kernel-based perceptrons. Journal of Machine Learning Research, Accepted for pub, 2009.
[1921] Francesco Orabona, Claudio Castellini, Barbara Caputo, Jie Luo, and Giulio Sandini. Towards life-long learning for cognitive systems: Online independent support vector machine. Pattern Recognition, Accepted for Pub, 2009.
[1922] Juan Rafael Orozco-Arroyave, Juan Camilo Vasquez-Correa, Jesús Francisco Vargas-Bonilla, Raman Arora, Najim Dehak, Phani Sankar Nidadavolu, Heidi Christensen, Frank Rudzicz, Maria Yancheva, Alyssa Vann, Nikolai Vogler, Tobias Bocklet, Milos Cernak, Julius Hannink, and Elmar Nöth. Neurospeech: An open-source software for parkinson's speech analysis. Digital Signal Processing, 2017. [ DOI ]
[1923] Hatef Otroshi Shahreza and Sébastien Marcel. Deep auto-encoding and biohashing for secure finger vein recognition. In Proceedings of the 2021 IEEE International Conference on Acoustics, Speech, and Signal Processing. IEEE, 2021. [ DOI | http | .pdf ]
[1924] Hatef Otroshi Shahreza and Sébastien Marcel. Towards protecting and enhancing vascular biometric recognition methods via biohashing and deep neural networks. IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021. [ DOI | http | .pdf ]
[1925] Hatef Otroshi Shahreza, Vedrana Krivokuca, and Sébastien Marcel. On the recognition performance of biohashing on state-of-the-art face recognition models. In Proceedings of the 13th IEEE International Workshop on Information Forensics and Security (WIFS). IEEE, December 2021. [ DOI | http | .pdf ]
[1926] Youssef Oualil, Dietrich Klakow, Gyorgy Szaszak, Ajay Srinivasamurthy, Hartmut Helmke, and Petr Motlicek. A context-aware speech recognition and understanding system for air traffic control domain. In Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, December 2017. [ .pdf ]
[1927] Youssef Oualil, Friedrich Faubel, Mathew Magimai.-Doss, and Dietrich Klakow. A tdoa gaussian mixture model for improving acoustic source tracking. In Oualil [3358]. [ .pdf ]
[1928] Youssef Oualil, Mathew Magimai.-Doss, Friedrich Faubel, and Dietrich Klakow. A probabilistic framework for multiple speaker localization. In Oualil and Magimai.-Doss [3359]. Submitted to ICASSP'13. [ .pdf ]
[1929] Youssef Oualil, Friedrich Faubel, and Dietrich Klakow. A multiple hypothesis gaussian mixture filter for acoustic source localization and tracking. In Oualil [3360], pages 233--236. Submitted to IEEE SSP Workshop 2012. [ .pdf ]
[1930] Youssef Oualil, Mathew Magimai.-Doss, Friedrich Faubel, and Dietrich Klakow. Joint detection and localization of multiple speakers using a probabilistic interpretation of the steered response power. In Youssef Oualil, editor, Statistical and Perceptual Audition Workshop, September 2012. [ .pdf ]
[1931] Mert Ozcan, Jie Luo, Vittorio Ferrari, and Barbara Caputo. A large-scale database of images and captions for automatic face naming. In Proceedings of the 22nd British Machine Vision Conference [3361]. [ .pdf ]
[1932] Jean-François Paiement, Douglas Eck, and Samy Bengio. Chord representations for probabilistic models. Idiap-RR Idiap-RR-58-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[1933] Jean-François Paiement, Douglas Eck, and Samy Bengio. A probabilistic model for chord progressions. In Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR) [3362]. IDIAP-RR 05-57. [ .ps.gz | .pdf ]
[1934] Jean-François Paiement, Douglas Eck, Samy Bengio, and David Barber. A graphical model for chord progressions embedded in a psychoacoustic space. In Proceedings of the 22nd International Conference on Machine Learning [3363]. IDIAP-RR 05-33. [ .pdf ]
[1935] Jean-François Paiement, Yves Grandvalet, Samy Bengio, and Douglas Eck. A distance model for rhythms. In 25th International Conference on Machine Learning (ICML) [3364]. IDIAP-RR 08-33. [ .ps.gz | .pdf ]
[1936] Jean-François Paiement, Yves Grandvalet, Samy Bengio, and Douglas Eck. A generative model for rhythms. In NIPS Workshop on Brain, Music and Cognition [3365]. IDIAP-RR 07-70. [ .ps.gz | .pdf ]
[1937] Jean-François Paiement, Samy Bengio, and Douglas Eck. Probabilistic models for melodic prediction. Idiap-RR Idiap-RR-50-2008, IDIAP, 2008. Submitted for publication. [ .ps.gz | .pdf ]
[1938] Jean-François Paiement, Yves Grandvalet, and Samy Bengio. Predictive models for music. Idiap-RR Idiap-RR-51-2008, IDIAP, 2008. Submitted for publication. [ .ps.gz | .pdf ]
[1939] Jean-François Paiement. Probabilistic models for music. PhD thesis, Ecole Polytechnique Fédérale de Lausanne, 2008. Thèse Ecole polytechnique fédérale de Lausanne EPFL, no 4148 (2008,',','), Faculté des sciences et techniques de l'ingénieur STI, Institut de génie électrique et électronique IEL (Laboratoire de l'IDIAP LIDIAP). Dir.: Hervé Bourlard, Samy Bengio. [ http | .pdf ]
[1940] Bruno Pais, Philipp Buluschek, Guillaume DuPasquier, Tobias Nef, Narayan Schütz, Hugo Saner, Daniel Gatica-Perez, and Valérie Santschi. Evaluation of 1-year in-home monitoring technology by home-dwelling older adults, family caregivers, and nurses. Frontiers in Public Health, 8:9, October 2020. [ DOI | http ]
[1941] Dimitri Palaz, Mathew Magimai.-Doss, and Ronan Collobert. Joint phoneme segmentation inference and classification using crfs. In Global Conference on Signal and Information Processing, pages 587 -- 591. IEEE, December 2014. [ DOI | .pdf ]
[1942] Dimitri Palaz, Mathew Magimai.-Doss, and Ronan Collobert. Convolutional neural networks-based continuous speech recognition using raw speech signal. In International Conference on Acoustics, Speech and Signal Procecssing [3366], pages 4295 -- 4299. [ .pdf ]
[1943] Dimitri Palaz, Mathew Magimai.-Doss, and Ronan Collobert. Raw speech signal-based continuous speech recognition using convolutional neural networks. Idiap-RR Idiap-RR-15-2014, Idiap, 10 2014. Submitted to NIPS 2014. [ .pdf ]
[1944] Dimitri Palaz, Mathew Magimai.-Doss, and Ronan Collobert. Learning linearly separable features for speech recognition using convolutional neural networks. Idiap-RR Idiap-RR-24-2015, Idiap, 6 2015. Accepted as a workshop contribution at ICLR 2015. [ http | .pdf ]
[1945] Dimitri Palaz, Ronan Collobert, and Mathew Magimai.-Doss. End-to-end phoneme sequence recognition using convolutional neural networks. Idiap-RR Idiap-RR-40-2013, Idiap, 12 2013. Accepted at NIPS Deep learning Workshop. [ .pdf ]
[1946] Dimitri Palaz, Ronan Collobert, and Mathew Magimai.-Doss. Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks. In Proceedings of Interspeech [3367]. [ .pdf ]
[1947] Dimitri Palaz, Mathew Magimai.-Doss, and Ronan Collobert. Analysis of cnn-based speech recognition system using raw speech as input. In Proceedings of Interspeech [3368], pages 11--15. [ .pdf ]
[1948] Dimitri Palaz, Mathew Magimai.-Doss, and Ronan Collobert. End-to-end acoustic modeling using convolutional neural networks for hmm-based automatic speech recognition. In Speech Communication [3369], pages 15--32. [ DOI | .pdf ]
[1949] Dimitri Palaz. Towards End-to-End Speech Recognition. PhD thesis, Ecole polytechnique Fédérale de Lausanne, 2016. Thèse EPFL n° 7054. [ DOI | .pdf ]
[1950] Danick Panchard, François Marelli, Edouard De Moura Presa, Peter Wellig, and Michael Liebling. Perspectives and limitations of visible-thermal image pair synthesis via generative adversarial networks. In Security + Defence, Target and Background Signatures VII, Proc. of SPIE, volume 11865, pages 1186509--1--1186509--8. SPIE, September 2021. [ DOI | http | .pdf ]
[1951] Debjani Panda, Satya Ranjan Dash, Ratula Ray, and Shantipriya Parida. Predicting the causal effect relationship between copd and cardio vascular diseases. Informatica, 44(4), December 2020. [ DOI | http ]
[1952] Debjani Panda, Divyajyoti Panda, Satya Ranjan Dash, and Shantipriya Parida. Extreme learning machines with feature selection using ga for effective prediction of fetal heart disease: A novel approach. Informatica, 45(3), October 2021. [ DOI | http ]
[1953] Arnaud Pannatier, Ricardo Picatoste, and Francois Fleuret. Efficient wind speed nowcasting with gpu-accelerated nearest neighbors algorithm. In Proceedings of SIAM Data Mining [3370], page 9.
[1954] M. Panteris, S. Manschitz, and Sylvain Calinon. Learning, generating and adapting wave gestures for expressive human-robot interaction. In Proc. ACM/IEEE Intl Conf. on Human-Robot Interaction (HRI), pages 386--388, 2020. [ DOI | http | .pdf ]
[1955] Maja Pantic and Alessandro Vinciarelli. Implicit human centered tagging. IEEE Signal Processing Magazine, 26, 11 2009. [ .pdf ]
[1956] Maja Pantic, R. Cowie, F. D'Errico, Dirk Heylen, M. Mehu, C. Pelachaud, I. Poggi, M. Schroeder, and Alessandro Vinciarelli. Social signal processing: The research agenda. In "Visual Analysis of Humans" by T.B.Moeslund, A.Hilton, V.Krueger and L.Sigal (eds.), pages 511--538. Springer Verlag, 2011.
[1957] Antonio Paolillo, Teguh Santoso Lembono, and Sylvain Calinon. A memory of motion for visual predictive control tasks. In International Conference on Robotics and Automation, 2020. [ .pdf ]
[1958] Nikolaos Pappas and Andrei Popescu-Belis. Combining content with user preferences for ted lecture recommendation. In Proceedings of the 11th International Workshop on Content Based Multimedia Indexing. IEEE, 2013. [ .pdf ]
[1959] Nikolaos Pappas, Georgios Katsimpras, and Efstathios Stamatatos. Distinguishing the popularity between topics: A system for up-to-date opinion retrieval and mining in the web. In Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics. LNCS, ACM, 2013. [ http | .pdf ]
[1960] Nikolaos Pappas and Andrei Popescu-Belis. Explaining the stars: Weighted multiple-instance learning for aspect-based sentiment analysis. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), October 2014. [ .pdf ]
[1961] Nikolaos Pappas and Andrei Popescu-Belis. Adaptive sentiment-aware one-class collaborative filtering. Expert Systems with Applications, 43:23--41, January 2016. [ DOI | http | .pdf ]
[1962] Nikolaos Pappas, Georgios Katsimpras, and Efstathios Stamatatos. Extracting informative textual parts from web pages containing user-generated content. In 12th International Conference on Knowledge Management and Knowledge Technologies, number 8 in i-KNOW '12, pages 4:1--4:8, New York, NY, USA, June 2012. ACM ICPS, ACM. [ http | .pdf ]
[1963] Nikolaos Pappas and James Henderson. Deep residual output layers for neural language generation. In Proceedings of the 36th International Conference on Machine Learning (ICML), 2019. [ .pdf ]
[1964] Nikolaos Pappas, Mercan Topkara, Miriam Redi, Brendan Jou, Tao Chen, Hongyi Liu, and Shih-Fu Chang. Multilingual visual sentiment concept matching. In Proceedings of the International Conference on Multimedia Retrieval (ICMR), 2016. [ .pdf ]
[1965] Nikolaos Pappas, Georgios Katsimpras, and Efstathios Stamatatos. An agent-based focused crawling framework for topic- and genre-related web document discovery. In 24th IEEE International Conference on Tools with Artificial Intelligence. IEEE, August 2012. [ http | .pdf ]
[1966] Nikolaos Pappas and Thomas Meyer. A survey on language modeling using neural networks. Idiap-RR Idiap-RR-32-2012, Idiap, 11 2012. [ .pdf ]
[1967] Nikolaos Pappas and Andrei Popescu-Belis. Multilingual hierarchical attention networks for document classification. In Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP) [3371], pages 1015--1025. [ .pdf ]
[1968] Nikolaos Pappas, Miriam Redi, Mercan Topkara, Hongyi Liu, Brendan Jou, Tao Chen, and Shih-Fu Chang. Multilingual visual sentiment concept clustering and analysis. International Journal of Multimedia Information Retrieval, 2017. [ .pdf ]
[1969] Nikolaos Pappas and Andrei Popescu-Belis. Explicit document modeling through weighted multiple-instance learning. Journal of Artificial Intelligence Research (JAIR), 58:591--626, 2017. [ .pdf ]
[1970] Nikolaos Pappas and Andrei Popescu-Belis. Combining content with user preferences for non-fiction multimedia recommendation: A study on ted lectures. Multimedia Tools and Applications, Special Issue on Content Based Multimedia Indexing, 74(4):1175--1197, February 2015. [ DOI | .pdf ]
[1971] Nikolaos Pappas and Andrei Popescu-Belis. Sentiment analysis of user comments for one-class collaborative filtering over ted talks. In 36th ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 2013. [ .pdf ]
[1972] Nikolaos Pappas and Andrei Popescu-Belis. Human versus machine attention in document classification: A dataset with crowdsourced annotations. In Proceedings of the EMNLP 2016 Workshop on Natural Language Processing for Social Media, 2016. [ .pdf ]
[1973] Nikolaos Pappas and James Henderson. Gile: A generalized input-label embedding for text classification. Transactions of the Association for Computational Linguistics (TACL), 2019. [ .pdf ]
[1974] Nikolaos Pappas. Learning Explainable User Sentiment and Preferences for Information Filtering. PhD thesis, École Polytechnique Fédérale de Lausanne, March 2016. Thèse EPFL, n° 6920. [ DOI | .pdf ]
[1975] Nikolaos Pappas, Lesly Miculicich, and James Henderson. Beyond weight tying: Learning joint input-output embeddings for neural machine translation. In Proceedings of the Third Conference on Machine Translation (WMT), 2018. [ .pdf ]
[1976] Michael Pappinutto, Roberto Boghetti, Moreno Colombo, Chantal Basurto, Kornelius Reutter, Denis Lalanne, Jérôme Kämpf, and Julien Nembrini. Saving energy by maximising daylight and minimising the impact on occupants: an automatic lighting system approach. Energy and Buildings, 2022. [ DOI ]
[1977] Shantipriya Parida, Subhadarshi Panda, Amulya Ratna Dash, Esaú VILLATORO-TELLO, A. Seza Dogruöz, Rosa M. Ortega-Mendoza, Amadeo Hernández, Yashvardhan Sharma, and Petr Motlicek. Open machine translation for low resource south american languages (americasnlp 2021 shared task contribution). In Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas [3372], page 218–223. [ DOI | http ]
[1978] Shantipriya Parida, Satya Ranjan Dash, Ondrej Bojar, Petr Motlicek, Priyanka Pattnaik, and Debasish Kumar Mallick. OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation. In [3373], May 2020. In Proceedings of the LREC 2020 WILDRE5– 5thWorkshop on Indian Language Data:Resources and Evaluation. [ .pdf | .pdf ]
[1979] Shantipriya Parida and Petr Motlicek. Idiap nmt system for wat 2019 multimodal translation task. In Proceedings of the 6th Workshop on Asian Translation, page 175–180. Association for Computational Linguistics, November 2019. [ DOI | .pdf ]
[1980] Shantipriya Parida and Petr Motlicek. Abstract text summarization: A low resource challenge. In In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019), page 5. Association for Computational Linguistics (ACL), November 2019. [ .pdf ]
[1981] Shantipriya Parida, Esaú VILLATORO-TELLO, Sajit Kumar, Mael Fabien, and Petr Motlicek. Detection of similar languages and dialects using deep supervised autoencoders. In Proceedings of the 17th International Conference on Natural Language Processing, 2020. [ .pdf ]
[1982] Shantipriya Parida. Extractive odia text summarization system: An ocr based approach. Idiap-RR Idiap-RR-02-2020, Idiap, 1 2020. [ .pdf ]
[1983] Shantipriya Parida, Esaú VILLATORO-TELLO, and Petr Motlicek. Challenges in broadcast media content categorization. Idiap-RR Idiap-RR-02-2021, Idiap, 4 2021. [ .pdf ]
[1984] Shantipriya Parida and Petr Motlicek. Idiap abstract text summarization system for german text summarization task. Idiap-RR Idiap-RR-03-2020, Idiap, 1 2020. [ .pdf ]
[1985] Shantipriya Parida and Petr Motlicek. Idiap nmt system for wat 2019 multimodal translation task. Idiap-RR Idiap-RR-04-2020, Idiap, 1 2020. [ .pdf ]
[1986] Shantipriya Parida, Petr Motlicek, and Satya Ranjan Dash. German news article classification : A multichannel cnn approach. Idiap-RR Idiap-RR-09-2020, Idiap, 5 2020. In Proceeding 2nd International Conference on Emerging Trends and Advances in Electrical Engineering and Renewable Energy (ETAEERE-2020). [ .pdf ]
[1987] Shantipriya Parida, Satya Prakash Biswal, Biranchi Narayan Nayak, Mael Fabien, Esaú VILLATORO-TELLO, and Petr Motlicek. Bertodia: Bert pre-training for low resource odia language. Idiap-RR Idiap-RR-16-2021, Idiap, 10 2021. Accepted at 2nd International Conference on Biologically Inspired Techniques in Many-Criteria Decision Making (BITMDM-2021). [ .pdf ]
[1988] Rekha Sahu, Satya Ranjan Dash, Lleuvelyn A Cacha, Roman R Poznanski, and Shantipriya Parida. Epileptic seizure detection: a comparative study between deep and traditional machine learning techniques. Journal of Integrative Neuroscience, 19(1):1--9, 2020. [ http ]
[1989] Shantipriya Parida, Subhadarshi Panda, Satya Prakash Biswal, Ketan Kotwal, Arghyadeep Sen, Satya Ranjan Dash, and Petr Motlicek. Multimodal neural machine translation system for english to bengali. In Proceedings of the First Workshop on Multimodal Machine Translation for Low Resource Languages (MMTLRL 2021) [3375], pages 31--39. [ http ]
[1990] Shantipriya Parida, Esaú VILLATORO-TELLO, Sajit Kumar, Petr Motlicek, and Qingran Zhan. Idiap submission to swiss-german language detection shared task. In Proceedings of the 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS) [3376]. [ .pdf ]
[1991] Shantipriya Parida and Petr Motlicek. Idiap abstract text summarization system for german text summarization task. In Proceedings of the 4th edition of the Swiss Text Analytics Conference, October 2019. [ .pdf ]
[1992] Shantipriya Parida, Petr Motlicek, Amulya Ratna Dash, Satya Ranjan Dash, Debasish Kumar Mallick, Satya Prakash Biswal, Priyanka Pattnaik, Biranchi Narayan Nayak, and Ondrej Bojar. Odianlp's participation in wat2020. In Proceedings of the 7th Workshop on Asian Translation. ACL Anthology, 2020. [ .pdf ]
[1993] Shantipriya Parida. Overview of the 6th workshop on asian translation. In Proceedings of the 6th Workshop on Asian Translation, page 1–35. Association for Computational Linguistics, November 2019. [ DOI | .pdf ]
[1994] Sunghyun Park, Gelareh Mohammadi, Ron Artstein, and Louis-Philippe Morency. Crowdsourcing micro-level multimedia annotations: The challenges of evaluation and interface. In Proceedings of International ACM Workshop on Crowdsourcing for Multimedia, 2012.
[1995] Sree Hari Krishnan Parthasarathi, Petr Motlicek, and Hynek Hermansky. Exploiting temporal context for speech/non-speech detection. Idiap-RR Idiap-RR-21-2008, IDIAP, 2008. [ .ps.gz | .pdf ]
[1996] Sree Hari Krishnan Parthasarathi and Hynek Hermansky. A data-driven approach to speech/non-speech detection. Idiap-RR Idiap-RR-23-2008, IDIAP, 2008. [ .ps.gz | .pdf ]
[1997] Sree Hari Krishnan Parthasarathi, Mathew Magimai.-Doss, Daniel Gatica-Perez, and Hervé Bourlard. Speaker change detection with privacy-preserving audio cues. In Proceedings of ICMI-MLMI 2009 [3378]. [ .pdf ]
[1998] Sree Hari Krishnan Parthasarathi, Mathew Magimai.-Doss, Hervé Bourlard, and Daniel Gatica-Perez. Evaluating the robustness of privacy-sensitive audio features for speech detection in personal audio log scenarios. Idiap-RR Idiap-RR-01-2010, Idiap, 1 2010. [ .pdf ]
[1999] Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard, and Mathew Magimai.-Doss. Privacy-sensitive audio features for speech/nonspeech detection. Idiap-RR Idiap-RR-12-2011, Idiap, 5 2011. [ .pdf ]
[2000] Sree Hari Krishnan Parthasarathi, Hervé Bourlard, and Daniel Gatica-Perez. Lp residual features for robust, privacy-sensitive speaker diarization. Idiap-RR Idiap-RR-14-2011, Idiap, 5 2011. [ .pdf ]
[2001] Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan, and Hema A Murthy. Robustness of group delay representations for noisy speech signals. Idiap-RR Idiap-RR-36-2011, Idiap, 12 2011. [ .pdf ]
[2002] Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard, and Mathew Magimai.-Doss. Privacy-sensitive audio features for speech/nonspeech detection. IEEE Transactions on Audio, Speech, and Language Processing, 19(8), November 2011. [ .pdf ]
[2003] Sree Hari Krishnan Parthasarathi, Padmanabhan Rajan, and Hema A Murthy. Robustness of group delay representations for noisy speech signals. IJST (Springer), 14(4), 2011. [ .pdf ]
[2004] Sree Hari Krishnan Parthasarathi, Mathew Magimai.-Doss, Hervé Bourlard, and Daniel Gatica-Perez. Investigating privacy-sensitive features for speech detection in multiparty conversations. In Proceedings of Interspeech 2009 [3379]. [ .pdf ]
[2005] Sree Hari Krishnan Parthasarathi, Hervé Bourlard, and Daniel Gatica-Perez. Lp residual features for robust, privacy-sensitive speaker diarization. In Interspeech, 2011. [ .pdf ]
[2006] Sree Hari Krishnan Parthasarathi, Mathew Magimai.-Doss, Hervé Bourlard, and Daniel Gatica-Perez. Evaluating the robustness of privacy-sensitive audio features for speech detection in personal audio log scenarios. In ICASSP 2010, 2010. [ .pdf ]
[2007] Sree Hari Krishnan Parthasarathi, Hervé Bourlard, and Daniel Gatica-Perez. Wordless sounds: Robust speaker diarization using privacy-preserving audio representations. In IEEE Transactions on Audio, Speech, and Language Processing [3380]. [ .pdf ]
[2008] Sree Hari Krishnan Parthasarathi. Privacy-Sensitive Audio Features for Conversational Speech Processing. PhD thesis, Ecole Polytechnique Fédérale de Lausanne, November 2011. [ .pdf ]
[2009] Sree Hari Krishnan Parthasarathi, Petr Motlicek, and Hynek Hermansky. Exploiting contextual information for speech/non-speech detection. In Text, Speech and Dialogue [3381]. [ .pdf ]
[2010] Amin Parvaneh, Ehsan Abbasnejad, Damien Teney, Reza Haffari, Anton van den Hengel, and Javen Qinfeng Shi. Active learning by feature mixing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
[2011] Jose Patino, Ruiqing Yin, Hector Delgado, Herve Bredin, Alain Komaty, Guillaume Wisniewski, Claude Barras, Nicholas Evans, and Sébastien Marcel. Low-latency speaker spotting with online diarization and detection. In The Speaker and Language Recognition Workshop (Odyssey), June 2018. [ .pdf ]
[2012] Novi Patricia and Barbara Caputo. Learning to learn, from transfer learning to domain adaptation: A unifying perspective. In Proceedings of the Computer Vision and Pattern Recognition, pages 1442--1449. IEEE, June 2014. [ DOI | .pdf ]
[2013] Novi Patricia, Tatiana Tommasi, and Barbara Caputo. Multi-source adaptive learning for fast control of prosthetics hand. In Proceedings of the International Conference on Pattern Recognition, pages 2769 -- 2774, August 2014. [ DOI | .pdf ]
[2014] Hélène Paugam-Moisy, R. Martinez, and Samy Bengio. A supervised learning approach based on STDP and polychronization in spiking neuron networks. In European Symposium on Artificial Neural Networks, ESANN [3382]. IDIAP-RR 06-54. [ .ps.gz | .pdf ]
[2015] Hélène Paugam-Moisy. Spiking neuron networks a survey. Idiap-RR Idiap-RR-11-2006, IDIAP, 2006. [ .ps.gz | .pdf ]
[2016] Katharina Pelzelmayer, Sara Landolt, Jasmine Truong, Florian Labhart, Darshan Santani, Emmanuel Kuntsche, and Daniel Gatica-Perez. Youth nightlife at home: towards a feminist conceptualisation of home. Children's Geographies, 2020. [ DOI | http ]
[2017] Adrian Penate-Sanchez, Francesc Moreno-Noguer, Juan Andrade-Cetto, and Francois Fleuret. Letha: Learning from high quality inputs for 3d pose estimation in low quality images. In Proceedings of the International Conference on 3D vision [3383], page 517–524.
[2018] Hugo Penedones, Ronan Collobert, Francois Fleuret, and David Grangier. Improving object classification using pose information. Idiap-RR Idiap-RR-30-2012, Idiap, 11 2012. [ .pdf ]
[2019] Artem Peregoudov, Alessandro Vinciarelli, and Hervé Bourlard. Towards using slide information to enhance speech transcription of meetings. Idiap-RR Idiap-RR-01-2006, IDIAP, 2006. Submitted for publication. [ .ps.gz | .pdf ]
[2020] Michaela Pernon, Frederic Assal, Ina Kodrasi, and Marina Laganaro. Perceptual classification of motor speech disorders: the role of severity, speech task, and listener's expertise. Journal of Speech, Language, and Hearing Research, 2022.
[2021] Giuseppe Peronato, Roberto Boghetti, and Jérôme Kämpf. A machine-learning model for the prediction of aggregated building heating demand from pan-european land-use maps. In Journal of Physics: Conference Series, volume 2042, 2021. [ DOI | .pdf ]
[2022] Xavier Perrin, Ricardo Chavarriaga, Roland Siegwart, and José del R. Millán. Bayesian controller for a novel semi-autonomous navigation concept. In 3rd European Conference on Mobile Robots (ECMR 2007) [3384]. IDIAP-RR 07-26. [ .ps.gz | .pdf ]
[2023] Mike Perrow and David Barber. Probabilistic tagging of unstructured genealogical records. Idiap-RR Idiap-RR-86-2005, IDIAP, 2005. [ .ps.gz | .pdf ]
[2024] L. Fusco, Riwal Lefort, Kevin C. Smith, F. Benmansour, German Gonzalez, Caterina Barilari, Bernd Rinn, Francois Fleuret, Pascal Fua, and O. Pertz. Computer vision profiling of neurite outgrowth dynamics reveals spatio-temporal modularity of rho gtpase signaling. Journal of Cell Biology, 212(1):91--111, January 2016. [ DOI ]
[2025] A. Pesarin, M. Cristani, V. Murino, and Alessandro Vinciarelli. Conversation analysis at work: Detection of conflict in competitive discussions through automatic turn-organization analysis. Cognitive Processing, 2012.
[2026] Michela Pettinato. Detection of disguised speech in forensic science by humans and automatic systems. PhD thesis, Université de Lausanne Ecole des Sciences Criminelles, July 2020. Master Thesis. [ .pdf ]
[2027] Volha Petukhova, Martin Gropp, Dietrich Klakow, Anna Schmidt, Gregor Eigner, Mario Topf, Stefan Srb, Petr Motlicek, Blaise Potard, John Dines, O. Deroo, Ronny Egeler, Uwe Meinz, and Steffen Liersch. The dbox corpus collection of spoken human-human and human-machine dialogues. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), The LREC 2014 Proceedings, Reykjavik, Iceland, May 2014. European Language Resources Association (ELRA). [ http | .pdf ]
[2028] J-P. Pfister, T. Toyoizumi, David Barber, and W. Gerstner. Optimal Spike-Timing Dependent Plasticity for Precise Action Potential Firing in Supervised Learing. [3385]. Accepted in Neural Computation. [ .ps.gz | .pdf ]
[2029] Philip N. Garner. Silence models in weighted finite-state transducers. In Interspeech [3386]. IDIAP-RR 08-19. [ .ps.gz | .pdf ]
[2030] Philip N. Garner. A weighted finite state transducer tutorial. Idiap-Com Idiap-Com-03-2008, IDIAP, 2008. [ .pdf ]
[2031] Thanh-Trung Phan, Skanda Muralidhar, and Daniel Gatica-Perez. Drinks & crowds: Characterizing alcohol consumption through crowdsensing and social media. Journal and Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT '19), June 2019. [ .pdf ]
[2032] Thanh-Trung Phan, Florian Labhart, and Daniel Gatica-Perez. My own private nightlife: Understanding youth personal spaces from crowdsourced video. Proc. ACM Hum.-Comput. Interact, 3(189), November 2019. [ .pdf ]
[2033] Thanh-Trung Phan, Skanda Muralidhar, and Daniel Gatica-Perez. #drink or #drunk: Multimodal signals and drinking practices on instagram. In Proceedings of the 13th EAI International Conference on Pervasive Computing Technologies for Healthcare, May 2019. [ .pdf ]
[2034] Thanh-Trung Phan, Florian Labhart, Skanda Muralidhar, and Daniel Gatica-Perez. Understanding heavy drinking at night through smartphone sensing and active human engagement. In Proceedings of the 14th EAI International Conference on Pervasive Computing Technologies for Healthcare, October 2020. [ .pdf ]
[2035] Thanh-Trung Phan. Understanding Eating and Drinking in Context from Crowdsourced Data. PhD thesis, EPFL, May 2020. [ .pdf ]
[2036] Andre