Publication list - Idiap Publications

Update cookies preferences

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 87 |

InnerView: Learning Place Ambiance from Social Media Images, Darshan Santani, Rui Hu and Daniel Gatica-Perez, in: Proceedings of the 24th ACM International Conference on Multimedia, ACM, 2016

attachment

[DOI]

The Night is Young: Urban Crowdsourcing of Nightlife Patterns, Darshan Santani, Joan-Isaac Biel, Florian Labhart, Jasmine Truong, Sara Landolt, Emmanuel Kuntsche and Daniel Gatica-Perez, in: Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, ACM, 2016

attachment

[DOI]

Sparse Hidden Markov Models for Exemplar-based Speech Recognition Using Deep Neural Network Posterior Features, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, Idiap-RR-19-2016

attachment

Explicit Suggestion of Query Terms for News Search using Topic Models and Word Embeddings, Parvaz Mahdabi and Andrei Popescu-Belis, Idiap-RR-21-2016

attachment

Feature mapping using far-field microphones for distant speech recognition, Ivan Himawan, Petr Motlicek, David Imseng and Sridha Sridharan, in: Speech Communication, 83:1-9, 2016

attachment

[DOI]
[URL]

Efficient Posterior Exemplar Search Space Hashing Exploiting Class-Specific Sparsity Structures, Afsaneh Asaei, Gil Luyet, Milos Cernak and Hervé Bourlard, in: Interspeech, San Francisco, CA, 2016

attachment

On Structured Sparsity of Phonological Posteriors for Linguistic Parsing, Milos Cernak, Afsaneh Asaei and Hervé Bourlard, in: Speech Communication, 84:36-45, 2016

attachment

[DOI]
[URL]

PAoS Markers: Trajectory Analysis of Selective Phonological Posteriors for Assessment of Progressive Apraxia of Speech, Afsaneh Asaei, Milos Cernak and Marina Laganaro, in: Proceeding on the 7th Workshop on Speech and Language Processing for Assistive Technologies (SLPAT), 2016

attachment

Word Sequence Modeling using Deep Learning: and End-to-end Approach and its Applications, Joël Legrand, EPFL, 2016

[DOI]

Temporally Subsampled Detection for Accurate and Efficient Face Tracking and Diarization, Nam Le, Alexandre Heili, Di Wu and Jean-Marc Odobez, in: International Conference on Pattern Recognition, Cancun, Mexico, IEEE, 2016

attachment

Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media, Nam Le and Jean-Marc Odobez, in: ACM Multimedia, Amsterdam, ACM, 2016

attachment

Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification, Hannah Muckenhirn, Mathew Magimai-Doss and Sébastien Marcel, in: International Conference of the Biometrics Special Interest Group (BIOSIG), 2016

attachment

Transferring Neural Representations for Low-dimensional Indexing of Maya Hieroglyphic Art, Edgar Roman-Rangel, Gulcan Can, Stephane Marchand-Maillet, Rui Hu, Carlos Pallan Gayol, Guido Krempel, Jakub Spotak, Jean-Marc Odobez and Daniel Gatica-Perez, in: Proc. ECCV Workshop on Computer Vision for Art Analysis, Amsterdam, pages 842-855, Springer, 2016

attachment

[DOI]
[URL]

Emphasis Recreation for TTS using Intonation Atoms, Pierre-Edouard Honnet and Philip N. Garner, in: 9th ISCA Speech Synthesis Workshop, pages 14--20, 2016

attachment

[DOI]

Learning Controllers for Reactive and Proactive Behaviors in Human-Robot Collaboration, L. Rozo, J. Silverio, Sylvain Calinon and D. G. Caldwell, in: Frontiers in Robotics and AI, 3(30):1-11, 2016

attachment

[DOI]

Learning Physical Collaborative Robot Behaviors from Human Demonstrations, L. Rozo, Sylvain Calinon, D. G. Caldwell, P. Jimenez and C. Torras, in: IEEE Trans. on Robotics, 32(3):513-527, 2016

attachment

[DOI]
[URL]

Variable Duration Movement Encoding with Minimal Intervention Control, M. Zeestraten, Sylvain Calinon and D. G. Caldwell, in: Proc. of the IEEE Intl Conf. on Robotics and Automation (ICRA), pages 497-503, 2016

attachment

Joint Operation of Voice Biometrics and Presentation Attack Detection, Pavel Korshunov and Sébastien Marcel, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016

attachment

[URL]

Overview of BTAS 2016 Speaker Anti-spoofing Competition, Pavel Korshunov, Sébastien Marcel, Hannah Muckenhirn, A. R. Gonçalves, A. G. Souza Mello, R. P. Velloso Violato, F. O. Simões, M. U. Neto, M. de Assis Angeloni, J. A. Stuchi, H. Dinkel, N. Chen, Y. Qian, D. Paul, G. Saha and Md Sahidullah, in: IEEE International Conference on Biometrics: Theory, Applications and Systems, 2016

attachment

[URL]

Cross-database evaluation of audio-based spoofing detection systems, Pavel Korshunov and Sébastien Marcel, in: Interspeech, San Francisco, USA, 2016

attachment

[URL]

Inter-task System Fusion for Speaker Recognition, Marc Ferras, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Hervé Bourlard, in: Proceeedings of the INTERSPEECH, 2016

attachment

Scalable Metric Learning via Weighted Approximate Rank Component Analysis, Cijo Jose and Francois Fleuret, in: ECCV 2016, 2016

attachment

Fast K-Means with Accurate Bounds, James Newling and Francois Fleuret, in: Proceedings of the International Conference on Machine Learning (ICML), New York, 2016

Phrase Representations for Multiword Expressions, Joël Legrand and Ronan Collobert, in: Proceedings of the 12th Workshop on Multiword Expressions, 2016

attachment

Neural Network-based Word Alignment through Score Aggregation, Joël Legrand, Michael Auli and Ronan Collobert, in: Proceedings of the ACL 1st Conference on Machine Translation, 2016

attachment

Deep Neural Networks for Syntactic Parsing of Morphologically Rich Languages, Joël Legrand and Ronan Collobert, in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

attachment

A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition, Marc Ferras, Srikanth Madikeri, Subhadeep Dey, Petr Motlicek and Hervé Bourlard, in: IEEE Signal Processing Letters, 23(4):527 - 531, 2016

attachment

Building Word Embeddings for Solving Natural Language Processing, Rémi Lebret, École Polytechnique Fédérale de Lausanne, 2016

[DOI]

On ANOVA Decompositions of Kernels and Gaussian Random Field Paths, David Ginsbourger, Olivier Roustant, Dominic Schuhmacher, Nicolas Durrande and Nicolas Lenz, in: Monte Carlo and Quasi-Monte Carlo Methods, pages 315-330, Springer International Publishing, 2016

[DOI]

Design of Computer Experiments Using Competing Distances Between Set-Valued Inputs, David Ginsbourger, Jean Baccou, Clément Chevalier and Frédéric Perales, in: mODa 11 - Advances in Model-Oriented Design and Analysis, pages 123-131, Springer International Publishing, 2016

[DOI]

Low-Rank Representation of Nearest Neighbor Phone Posterior Probabilities to Enhance DNN Acoustic Modeling, Gil Luyet, Pranay Dighe, Afsaneh Asaei and Hervé Bourlard, in: Interspeech, 2016

attachment

End-to-End Acoustic Modeling using Convolutional Neural Networks for Automatic Speech Recognition, Dimitri Palaz, Mathew Magimai-Doss and Ronan Collobert, Idiap-RR-18-2016

attachment

Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection, Dhananjay Ram, Afsaneh Asaei and Hervé Bourlard, in: Interspeech, 2016

attachment

HMM-based Non-native Accent Assessment using Posterior Features, Ramya Rasipuram, Milos Cernak and Mathew Magimai-Doss, in: Proceedings of Interspeech, San Francisco, USA, 2016

attachment

When Naïve Bayes Nearest Neighbors Meet Convolutional Neural Networks, Ilja Kuzborskij, Fabio M. Carlucci and Barbara Caputo, in: Computer Vision and Pattern Recognition (CVPR), IEEE Conference on, 2016

attachment

Pronoun Language Model and Grammatical Heuristics for Aiding Pronoun Prediction, Ngoc-Quang Luong and Andrei Popescu-Belis, in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, ACL, 2016

attachment

Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet and Philip N. Garner, Idiap-RR-22-2016

attachment

Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery, Marzieh Razavi and Mathew Magimai-Doss, in: Proceedings of Interspeech, 2016

attachment

Modeling Unvoiced Sounds In Statistical Parametric Speech Synthesis with a Continuous Vocoder, Tamas Gabor Csapo, Geza Nemeth, Milos Cernak and Philip N. Garner, in: Proc. of EUSIPCO, Budapest, Hungary, 2016

attachment

PhonVoc: A Phonetic and Phonological Vocoding Toolkit, Milos Cernak and Philip N. Garner, in: Interspeech, San Francisco, USA, 2016

attachment

Sound Pattern Matching for Automatic Prosodic Event Detection, Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner and Hervé Bourlard, in: Interspeech, San Francisco, USA, 2016

attachment

Improving Pronoun Translation by Modeling Coreference Uncertainty, Ngoc-Quang Luong and Andrei Popescu-Belis, in: Proceedings of the First Conference on Machine Translation (WMT16), Berlin, Germany, 2016

attachment

The SIWIS database: a multilingual speech database with acted emphasis, Jean-Philippe Goldman, Pierre-Edouard Honnet, Rob Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli and Junichi Yamagishi, in: Proceedings of Interspeech, San Francisco, USA, pages 1532--1535, 2016

attachment

[DOI]

Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, Alexandros Lazaridis, Milos Cernak and Philip N. Garner, in: Proceedings of Interspeech, San Francisco, USA, 2016

attachment

Simultaneous temporal superresolution and denoising for cardiac fluorescence microscopy, Kevin G. Chan, Sebastian J. Streichan, Le A. Trinh and Michael Liebling, in: IEEE Transactions on Computational Imaging, 2016

attachment

[DOI]
[URL]

Predicting the Performance in Decision-Making Tasks: From Individual Cues to Group Interaction, Umut Avci and Oya Aran, in: IEEE Transactions on Multimedia, 18(4):643--658, 2016

attachment

[DOI]
[URL]

High-slope terrain locomotion for torque-controlled quadruped robots, Michele Focchi, Andrea del Prete, I. Havoutis, Roy Featherstone, D. G. Caldwell and Claudio Semini, in: Autonomous Robots, 2016

[DOI]
[URL]

Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences, Carlos Mastalli, I. Havoutis, Michele Focchi, Claudio Semini and D. G. Caldwell, in: Proceedings of the IEEE International Conference of Robotics and Automation, 2016

Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, Maryam Habibi, Parvaz Mahdabi and Andrei Popescu-Belis, in: Data & Knowledge Engineering Journal, 2016

Question Answering in Conversations: Query Refinement Using Contextual and Semantic Information, Maryam Habibi, Parvaz Mahdabi and Andrei Popescu-Belis, Idiap-RR-16-2016

attachment

| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 49 | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 | 58 | 59 | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 | 71 | 72 | 73 | 74 | 75 | 76 | 77 | 78 | 79 | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 87 |

processing time: 0.0004 seconds.