Publications - yannickesteve.com

Articles published in international peer-reviewed journals

[Bouramtane et al., 2025] Tarik Bouramtane, Ismail Mohsine, Nourelhouda Karmouda, Marc Leblanc, Yannick Estève, Ilias Kacimi, Mohamed Hilali, Salima Mdhaffar, Sarah Tweed et Mounia Tahiri (2025). Dimensionality reduction for groundwater forecasting under drought and intensive irrigation with neural networks. Journal of Hydrology: Regional Studies, Vol. 60, pp. 102477.
[Parcollet et al., 2024] Titouan Parcollet, Ha Nguyen, Solène Evain, Marcely Zanon Boito, Adrien Pupier, Salima Mdhaffar, Hang Le, Sina Alisamir, Natalia Tomashenko, Marco Dinarelli, Shucong Zhang, Alexandre Allauzen, Maximin Coavoux, Yannick Estève, Mickael Rouvier, Jerôme Goulian, Benjamin Lecouteux, François Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier (2024). LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech. Computer Speech & Language, Vol. 86, pp. 101622. Academic Press
[Ravanelli et al., 2024] Mirco Ravanelli, Titouan Parcollet, Adel Moumen, Sylvain De Langen, Cem Subakan, Peter Plantinga, Yingzhi Wang, Pooneh Mousavi, Luca Della Libera, Artem Ploujnikov, Francesco Paissan, Davide Borra, Salah Zaiem, Zeyu Zhao, Shucong Zhang, Georgios Karakasidis, Sung-Lin Yeh, Pierre Champion, Aku Rouhe, Rudolf Braun, Florian Mai, Juan Zuluaga-Gomez, Seyed Mahed Mousavi, Andreas Nautsch, Ha Nguyen, Xuechen Liu, Sangeet Sagar, Jarod Duret, Salima Mdhaffar, Gaëlle Laperrière, Mickael Rouvier, Renato De Mori, Yannick Estève (2024). Open-source conversational AI with SpeechBrain 1.0. Journal of Machine Learning Research, Vol. 25(333), pp. 1-11
[Anidjar et al., 2023] Or Haim Anidjar, Yannick Estève, Chen Hajaj, Amit Dvir et Itshak Lapidot (2023). Speech and multilingual natural language framework for speaker change detection and diarization. Expert Systems with Applications, Vol. 213, pp. 119238. Pergamon
[Ghannay et al., 2020] Sahar Ghannay, Yannick Estève et Nathalie Camelin (2020). A study of continuous space word and sentence representations applied to ASR error detection. Speech Communication, Vol. 120, pp. 31-41. North-Holland
[Masmoudi et al., 2018] Abir Masmoudi, Fethi Bougares, Mariem Ellouze, Yannick Estève et Lamia Belguith (2018). Automatic speech recognition system for Tunisian dialect. Language Resources and Evaluation, Vol. 52(1), pp. 249-267. Springer Netherlands Dordrecht
[Ouvry-Vial et al., 2016] Brigitte Ouvry-Vial et Yannick Estève (2016). Archimorphosis. Reconstruction (cultural studies journal), special edition: Archives on Fire, Vol. 16(1)
[Dufour et al., 2014] Richard Dufour, Yannick Estève et Paul Deléglise (2014). Characterizing and detecting spontaneous speech: Application to speaker role recognition. Speech Communication, Vol. 56, pp. 1-18. North-Holland
[Lecouteux et al., 2013] Benjamin Lecouteux, Georges Linares, Yannick Estève et Guillaume Gravier (2013). Dynamic combination of automatic speech recognition systems by driven decoding. IEEE Transactions on Audio, Speech, and Language Processing, Vol. 21(6), pp. 1251-1260.
[Jousse et al., 2009] Vincent Jousse, Sylvain Meignier, Christine Jacquin, Simon Petitrenaud, Yannick Estève et Béatrice Daille (2009). Analyse conjointe du signal sonore et de sa transcription pour l’identification nommée de locuteurs. Revue TAL : Traitement automatique des langues, Vol. 50(1), pp. 201-225
[Bazillon et al., 2008] Thierry Bazillon, Vincent Jousse, Frédéric Béchet, Yannick Estève, Georges Linarès et Daniel Luzzati (2008). La parole spontanée: transcription et traitement [Processing and transcribing spontaneous speech]. Traitement Automatique des Langues, Volume 49, Numéro 3: Recherches actuelles en phonologie et en phonétique: interfaces avec le traitement automatique des langues [Current Research in Phonology and Phonetics: Interfaces with Natural-Language Processing], pp. 47-76
[Bazillon et al., 2008] Thierry Bazillon, Yannick Estève et Daniel Luzzati (2008). Transcription manuelle vs assistée de la parole préparée et spontanée. Revue TAL
[Estève et al., 2004] Yannick Estève, Paul Deléglise et Bruno Jacob (2004). Système de transcription automatique de la parole et logiciels libres. TAL. Traitement automatique des langues, Vol. 45(2), pp. 15-39
[Estève et al., 2003] Yannick Estève, Christian Raymond, Frédéric Béchet et Renato de Mori (2003). On the use of linguistic consistency in automatic speech recognition. IEEE Transactions on Speech and Audio Processing, Vol. 11(6), pp. 746–756

Peer-reviewed articles from international conferences

[Istaiteh et al., 2025] Othman Istaiteh, Salima Mdhaffar et Yannick Estève (2025). Beyond Similarity Scoring: Detecting Entailment and Contradiction in Multilingual and Multimodal Contexts. Proc. Interspeech 2025
[Whetten et al., 2025] Ryan Whetten, Lucas Maison, Titouan Parcollet, Marco Dinarelli et Yannick Estève (2025). Towards Early Prediction of Self-Supervised Speech Model Performance. In Interspeech 2025.
[Kponou et al., 2025] D Fortuné Kponou, Salima Mdhaffar, Fréjus AA Laleye, Eugène C Ezin et Yannick Estève (2025). Extending the Fongbe to French Speech Translation Corpus: resources, models and benchmark. Proc. Interspeech 2025
[Elleuch et al., 2025] Haroun Elleuch, Salima Mdhaffar, Yannick Estève et Fethi Bougares (2025). ADI-20: Arabic Dialect Identification dataset and models. Proceedings of Interspeech 2025
[Duret et al., 2025] Jarod Duret, Salima Mdhaffar, Gaëlle Laperrière, Ryan Whetten, Audrey Galametz, Catherine Kobus, Marion-Cécile Martin, Jo Oleiwan et Yannick Estève (2025). In-Domain SSL Pre-training and Streaming ASR: Application to Air Traffic Control Communications. International Conference on Speech and Computer (SPECOM)
[Abdulmumin et al., 2025] Idris Abdulmumin, Victor Agostinelli, Tanel Alumäe, Antonios Anastasopoulos, Luisa Bentivogli, Ondřej Bojar, Claudia Borg, Fethi Bougares, Roldano Cattoni, Mauro Cettolo, Lizhong Chen, William Chen, Raj Dabre, Yannick Estève, Marcello Federico, Mark Fishel, Marco Gaido, Dávid Javorský, Marek Kasztelnik, Fortuné Kponou, Mateusz Krubiński, Tsz Kin Lam, Danni Liu, Evgeny Matusov, Chandresh Kumar Maurya, John P. McCrae, Salima Mdhaffar, Yasmin Moslem, Kenton Murray, Satoshi Nakamura, Matteo Negri, Jan Niehues, Atul Kr. Ojha, John E. Ortega, Sara Papi, Pavel Pecina, Peter Polák, Piotr Połeć, Ashwin Sankar, Beatrice Savoldi, Nivedita Sethiya, Claytone Sikasote, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Brian Thompson, Marco Turchi, Alex Waibel, Patrick Wilken, Rodolfo Zevallos, Vilém Zouhar, Maike Züfle (2025). Findings of the IWSLT 2025 evaluation campaign. Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025)
[Chaimae Chellaf et al., 2025] Chaimae Chellaf, Haroun Elleuch, Othman Istaiteh, D Fortuné Kponou, Fethi Bougares, Yannick Estève et Salima Mdhaffar (2025). LIA and ELYADATA systems for the IWSLT 2025 low-resource speech translation shared task. Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025)
[Kponou et al., 2025] D Fortuné Kponou, Salima Mdhaffar, Fréjus AA Laleye, Eugène Cokou Ezin et Yannick Estève (2025). FFSTC 2: Extending the Fongbe to French Speech Translation Corpus. Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025)
[Laperrière et al., 2024] Gaëlle Laperrière, Sahar Ghannay, Bassam Jabaian et Yannick Estève (2024). A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding. In Interspeech 2024
[Mdhaffar et al., 2024] Salima Mdhaffar, Fethi Bougares, Renato De Mori, Salah Zaiem, Mirco Ravanelli et Yannick Estève (2024). TARIC-SLU: A Tunisian benchmark dataset for spoken language understanding. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
[Sekkat et al., 2024] Chloé Sekkat, Fanny Leroy, Salima Mdhaffar, Blake Perry Smith, Yannick Estève, Joseph Dureau et Alice Coucke (2024). Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
[Druart et al., 2024] Lucas Druart, Valentin Vielzeuf et Yannick Estève (2024). Investigating Low-Cost LLM Annotation for Spoken Dialogue Understanding Datasets. International Conference on Text, Speech, and Dialogue
[Mdhaffar et al., 2024] Salima Mdhaffar, Haroun Elleuch, Fethi Bougares et Yannick Estève (2024). Performance analysis of speech encoders for low-resource SLU and ASR in Tunisian dialect. In Proceedings of The Second Arabic Natural Language Processing Conference (ArabicNLP)
[Nguyen et al., 2023] Tuan Nguyen, Salima Mdhaffar, Natalia Tomashenko, Jean-François Bonastre et Yannick Estève (2023). Federated learning for ASR based on wav2vec 2.0. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[Maison et al., 2023] Lucas Maison et Yannick Estève (2023). Improving accented speech recognition with multi-domain training. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[Laurent et al., 2023] Antoine Laurent, Souhir Gahbiche, Ha Nguyen, Haroun Elleuch, Fethi Bougares, Antoine Thiol, Hugo Riguidel, Salima Mdhaffar, Gaëlle Laperrière, Lucas Maison, Sameer Khurana, Yannick Estève (2023). ON-TRAC consortium systems for the IWSLT 2023 dialectal and low-resource speech translation tasks. International Conference on Spoken Language Translation (IWSLT)
[Agarwal et al., 2023] Milind Agarwal, Sweta Agarwal, Antonios Anastasopoulos, Luisa Bentivogli, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico (2023). Findings of the IWSLT 2023 evaluation campaign. IWSLT
[Laperrière et al., 2023] Gaëlle Laperrière, Ha Nguyen, Sahar Ghannay, Bassam Jabaian et Yannick Estève (2023). Semantic enrichment towards efficient speech representations. Interspeech
[Mdhaffar et al., 2022] Salima Mdhaffar, Valentin Pelloin, Antoine Caubrière, Gaëlle Laperrière, Sahar Ghannay, Bassam Jabaian, Nathalie Camelin et Yannick Estève (2022). Impact analysis of the use of speech and language models pretrained by self-supervision for spoken language understanding. LREC
[Mdhaffar et al., 2022] Salima Mdhaffar, Jarod Duret, Titouan Parcollet et Yannick Estève (2022). End-to-end model for named entity recognition from speech without paired training data. Interspeech
[Zanon Boito et al., 2022] Marcely Zanon Boito, Laurent Besacier, Natalia Tomashenko et Yannick Estève (2022). A study of gender impact in self-supervised models for speech-to-text systems. Interspeech
[Zanon Boito et al., 2022] Marcely Zanon Boito, Fethi Bougares, Florentin Barbier, Souhir Gahbiche, Loïc Barrault, Mickael Rouvier et Yannick Estève (2022). Speech resources in the Tamasheq language. LREC
[Laperrière et al., 2022] Gaëlle Laperrière, Valentin Pelloin, Antoine Caubrière, Salima Mdhaffar, Nathalie Camelin, Sahar Ghannay, Bassam Jabaian et Yannick Estève (2022). The Spoken Language Understanding MEDIA Benchmark Dataset in the Era of Deep Learning: data updates, training and evaluation tools. Proceedings of the Thirteenth Language Resources and Evaluation Conference. LREC
[Mdhaffar et al., 2022] Salima Mdhaffar, Jean-François Bonastre, Marc Tommasi, Natalia Tomashenko et Yannick Estève (2022). Retrieving speaker information from personalized acoustic models for speech recognition. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[Tomashenko et al., 2022] Natalia Tomashenko, Salima Mdhaffar, Marc Tommasi, Yannick Estève et Jean-François Bonastre (2022). Privacy attacks for automatic speech recognition acoustic models in a federated learning framework. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[Anastasopoulos et al., 2022] Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondřej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Vĕra Kloudová, Surafel Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nǎdejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Pino, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alexander Waibel, Changhan Wang, Shinji Watanabe (2022). Findings of the IWSLT 2022 Evaluation Campaign. Proceedings of the 19th international conference on spoken language translation (IWSLT)
[Zanon Boito et al., 2022] Marcely Zanon Boito, John Ortega, Hugo Riguidel, Antoine Laurent, Loïc Barrault, Fethi Bougares, Firas Chaabani, Ha Nguyen, Florentin Barbier et Souhir Gahbiche (2022). ON-TRAC consortium systems for the IWSLT 2022 dialect and low-resource speech translation tasks. IWSLT
[Pelloin et al., 2021] Valentin Pelloin, Nathalie Camelin, Antoine Laurent, Renato De Mori, Antoine Caubrière, Yannick Estève et Sylvain Meignier (2021). End2end acoustic to semantic transduction. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[Nguyen et al., 2021] Ha Nguyen, Yannick Estève et Laurent Besacier (2021). An empirical study of end-to-end simultaneous speech translation decoding strategies. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[Evain et al., 2021] Solène Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli, Titouan Parcollet, Alexandre Allauzen, Yannick Estève, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier (2021). Lebenchmark: A reproducible framework for assessing self-supervised representation learning from speech. Interspeech
[Evain et al., 2021] Solène Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli et Titouan Parcollet (2021). Task agnostic and task specific self-supervised learning from speech with lebenchmark. Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2) NEURIPS
[Mdhaffar et al., 2021] Salima Mdhaffar, Marc Tommasi et Yannick Estève (2021). Study on acoustic model personalization in a context of collaborative learning constrained by privacy preservation. International Conference on Speech and Computer, SPECOM
[Ha Nguyen et al., 2021] Ha Nguyen, Yannick Estève et Laurent Besacier (2021). Impact of encoding and segmentation strategies on end-to-end simultaneous speech translation. Interspeech
[Ghannay et al., 2021] Sahar Ghannay, Antoine Caubrière, Salima Mdhaffar, Gaëlle Laperrière, Bassam Jabaian et Yannick Estève (2021). Where are we in semantic concept extraction for Spoken Language Understanding? International Conference on Speech and Computer. SPECOM
[Caubrière et al., 2020] Antoine Caubrière, Sahar Ghannay, Natalia Tomashenko, Renato De Mori, Antoine Laurent, Emmanuel Morin et Yannick Estève (2020). Error analysis applied to end-to-end spoken language understanding. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[Tomashenko et al., 2020] Natalia Tomashenko, Christian Raymond, Antoine Caubrière, Renato De Mori et Yannick Estève (2020). Dialogue history integration into end-to-end signal-to-concept spoken language understanding systems. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[Mdhaffar et al., 2020] Salima Mdhaffar, Yannick Estève, Antoine Laurent, Nicolas Hernandez, Richard Dufour, Delphine Charlet, Géraldine Damnati, Solen Quiniou et Nathalie Camelin (2020). A multimodal educational corpus of oral courses: Annotation, analysis and case study. LREC
[Tardy et al., 2020] Paul Tardy, David Janiszek, Yannick Estève et Vincent Nguyen (2020). Align then summarize: Automatic alignment methods for summarization corpus creation. LREC
[Caubrière et al., 2020] Antoine Caubrière, Sophie Rosset, Yannick Estève, Antoine Laurent et Emmanuel Morin (2020). Where are we in named entity recognition from speech? Proceedings of the Twelfth Language Resources and Evaluation Conference, LREC
[Elbayad et al., 2020] Maha Elbayad, Ha Nguyen, Fethi Bougares, Natalia Tomashenko, Antoine Caubrière, Benjamin Lecouteux, Yannick Estève et Laurent Besacier (2020). ON-TRAC consortium for end-to-end and simultaneous speech translation challenge tasks at IWSLT 2020. IWSLT
[Paul Tardy et al., 2020] Paul Tardy, Louis de Seynes, François Hernandez, Vincent Nguyen, David Janiszek et Yannick Estève (2020). Leverage unlabeled data for abstractive speech summarization with self-supervised learning and back-summarization. International Conference on Speech and Computer. SPECOM
[Barhoumi et al., 2020] Amira Barhoumi, Nathalie Camelin, Chafik Aloulou, Yannick Estève et Lamia Hadrich Belguith (2020). Toward qualitative evaluation of embeddings for Arabic sentiment analysis. International Conference on Language Resources and Evaluation (LREC)
[Macary et al., 2020] Manon Macary, Marie Tahon, Yannick Estève et Anthony Rousseau (2020). AlloSat: A new call center french corpus for satisfaction and frustration analysis. Language Resources and Evaluation Conference (LREC)
[Caubrière et al., 2020] Antoine Caubrière, Yannick Estève, Antoine Laurent et Emmanuel Morin (2020). Confidence measure for speech-to-concept end-to-end spoken language understanding. Interspeech 2020
[Tomashenko et al., 2019] Natalia Tomashenko, Antoine Caubriere et Yannick Estève (2019). Investigating adaptation and transfer learning for end-to-end spoken language understanding from speech. Interspeech 2019
[Mdhaffar et al., 2019] Salima Mdhaffar, Yannick Estève, Nicolas Hernandez, Antoine Laurent, Richard Dufour et Solen Quiniou (2019). Qualitative Evaluation of ASR Adaptation in a Lecture Context: Application to the PASTEL Corpus. Interspeech
[Tomashenko et al., 2019] Natalia Tomashenko, Antoine Caubrière, Yannick Estève, Antoine Laurent et Emmanuel Morin (2019). Recent advances in end-to-end spoken language understanding. International Conference on Statistical Language and Speech Processing, SLSP
[Barhoumi et al., 2019] Amira Barhoumi, Nathalie Camelin, Chafik Aloulou, Yannick Estève et Lamia Hadrich Belguith (2019). An empirical evaluation of arabic-specific embeddings for sentiment analysis. International Conference on Arabic Language Processing
[Ha Nguyen et al., 2019] Ha Nguyen, Natalia Tomashenko, Marcely Zanon Boito, Antoine Caubriere, Fethi Bougares, Mickael Rouvier, Laurent Besacier et Yannick Estève (2019). ON-TRAC consortium end-to-end speech translation systems for the IWSLT 2019 shared task. IWSLT
[Camelin et al., 2018] Nathalie Camelin, Géraldine Damnati, Abdessalam Bouchekif, Anaïs Landeau, Delphine Charlet et Yannick Estève (2018). Frnewslink: a corpus linking tv broadcast news segments and press articles. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC)
[Tomashenko et al., 2018] Natalia Tomashenko, Yuri Khokhlov et Yannick Estève (2018). Speaker Adaptive Training and Mixup Regularization for Neural Network Acoustic Models in Automatic Speech Recognition. Interspeech
[Sahar Ghannay et al., 2018] Sahar Ghannay, Antoine Caubriere, Yannick Estève, Antoine Laurent et Emmanuel Morin (2018). End-to-end named entity extraction from speech. Interspeech
[Kévin Vythelingum et al., 2018] Kévin Vythelingum, Yannick Estève et Olivier Rosec (2018). Acoustic-dependent Phonemic Transcription for Text-to-speech Synthesis. Interspeech
[Ghannay et al., 2018] Sahar Ghannay, Yannick Estève et Nathalie Camelin (2018). Task Specific Sentence Embeddings for ASR Error Detection. Interspeech
[Simonnet et al., 2018] Edwin Simonnet, Sahar Ghannay, Nathalie Camelin et Yannick Estève (2018). Simulating ASR errors for training SLU systems. LREC 2018
[Tomashenko et al., 2018] Natalia Tomashenko et Yannick Estève (2018). Evaluation of feature-space speaker adaptation for end-to-end acoustic models. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
[Hernandez et al., 2018] François Hernandez, Vincent Nguyen, Sahar Ghannay, Natalia Tomashenko et Yannick Estève (2018). TED-LIUM 3: Twice as much data and corpus repartition for experiments on speaker adaptation. International conference on speech and computer, SPECOM
[Devillers et al., 2018] Laurence Devillers, Sophie Rosset, Guillaume Dubuisson Duplessis, Lucile Bechade, Yucel Yemez, Bekir B Turker, Metin Sezgin, Engin Erzin, Kevin El Haddad et Stephane Dupont (2018). Multifaceted engagement in social interaction with a machine: The JOKER project. 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition FG2018
[Barhoumi et al., 2018] Amira Barhoumi, Chafik Aloulou, Nathalie Camelin, Yannick Estève et Lamia Belguith (2018). Arabic Sentiment analysis: an empirical study of machine translation’s impact. Language Processing and Knowledge Management International Conference (LPKM)
[Bouchekif et al., 2017] Abdessalam Bouchekif, Delphine Charlet, Géraldine Damnati, Nathalie Camelin et Yannick Estève (2017). Evaluating automatic topic segmentation as a segment retrieval task. Interspeech
[Sahar Ghannay et al., 2017] Sahar Ghannay, Yannick Estève et Nathalie Camelin (2017). Enriching confusion networks for post-processing. International Conference on Statistical Language and Speech Processing. SLSP
[Edwin Simonnet et al., 2017] Edwin Simonnet, Sahar Ghannay, Nathalie Camelin, Yannick Estève et Renato De Mori (2017). ASR error management for improving spoken language understanding. Interspeech
[Amira Barhoumi et al., 2017] Amira Barhoumi, Yannick Estève, Chafik Aloulou et Lamia Belguith (2017). Document embeddings for Arabic sentiment analysis. Conference on Language Processing and Knowledge Management, LPKM
[Ghannay et al., 2016] Sahar Ghannay, Benoit Favre, Yannick Estève et Nathalie Camelin (2016). Word embedding evaluation and combination. Proceedings of the tenth international conference on language resources and evaluation (LREC)
[Bouchekif et al., 2016] Abdessalam Bouchekif, Géraldine Damnati, Delphine Charlet, Nathalie Camelin et Yannick Estève (2016). Title assignment for automatic topic segments in TV broadcast news. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[Ghannay et al., 2016] Sahar Ghannay, Yannick Estève, Nathalie Camelin et Paul Deléglise (2016). Acoustic Word Embeddings for ASR Error Detection. Interspeech
[Tomashenko et al., 2016] Natalia Tomashenko, Yuri Khokhlov et Yannick Estève (2016). On the Use of Gaussian Mixture Model Framework to Improve Speaker Adaptation of Deep Neural Network Acoustic Models. Interspeech
[Lailler et al., 2016] Carole Lailler, Anaïs Landeau, Frédéric Béchet, Yannick Estève et Paul Deléglise (2016). Enhancing the RATP-DECODA corpus with linguistic annotations for performing a large range of NLP tasks. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC)
[Masmoudi et al., 2016] Abir Masmoudi, Mariem Ellouze, Fethi Bougares, Yannick Estève et Lamia Hadrich Belguith (2016). Conditional Random Fields for the Tunisian Dialect Grapheme-to-Phoneme Conversion. Interspeech
[Tomashenko et al., 2016] Natalia Tomashenko, Yuri Khokhlov, Anthony Larcher et Yannick Estève (2016). Exploring GMM-derived features for unsupervised adaptation of deep neural network acoustic models. International Conference on Speech and Computer. SPECOM
[Bouchekif et al., 2015] Abdessalam Bouchekif, Géraldine Damnati, Yannick Estève, Delphine Charlet et Nathalie Camelin (2015). Diachronic semantic cohesion for topic segmentation of TV broadcast news. Interspeech
[Dubuisson-Duplessis et al., 2015] Guillaume Dubuisson-Duplessis, Lucile Béchade, Mohamed Sehili, Agnès Delaborde, Vincent Letard, Anne-Laure Ligozat, Paul Deléglise, Yannick Estève, Sophie Rosset et Laurence Devillers (2015). Nao is doing humour in the CHIST-ERA JOKER project. 16th Interspeech
[Devillers et al., 2015] Laurence Devillers, Sophie Rosset, Guillaume Dubuisson Duplessis, Mohamed A Sehili, Lucile Béchade, Agnes Delaborde, Clément Gossart, Vincent Letard, Fan Yang et Yücel Yemez (2015). Multimodal data collection of human-robot humorous interactions in the joker project. 2015 international conference on affective computing and intelligent interaction (ACII)
[Ghannay et al., 2015] Sahar Ghannay, Yannick Estève et Nathalie Camelin (2015). Word embeddings combination and neural networks for robustness in ASR error detection. 23rd European Signal Processing Conference (EUSIPCO)
[Masmoudi et al., 2015] Abir Masmoudi, Nizar Habash, Mariem Ellouze, Yannick Estève et Lamia Hadrich Belguith (2015). Arabic transliteration of romanized Tunisian dialect text: A preliminary investigation. International conference on intelligent text processing and computational linguistics
[Ghannay et al., 2015] Sahar Ghannay, Yannick Estève, Nathalie Camelin, Camille Dutrey, Fabian Santiago et Martine Adda-Decker (2015). Combining continuous word representation and prosodic features for ASR error prediction. International Conference on Statistical Language and Speech Processing, SLSP
[Masmoudi et al., 2014] Abir Masmoudi, Mariem Ellouze Khmekhem, Yannick Estève, Lamia Hadrich Belguith et Nizar Habash (2014). A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition. LREC
[Rousseau et al., 2014] Anthony Rousseau, Paul Deléglise et Yannick Estève (2014). Enhancing the TED-LIUM corpus with selected data for language modeling and more TED talk. LREC
[Dupuy et al., 2014] Grégor Dupuy, Sylvain Meignier et Yannick Estève (2014). Is incremental cross-show speaker diarization efficient for processing large volumes of data? Interspeech
[Bougares et al., 2013] Fethi Bougares, Mickael Rouvier, Nathalie Camelin, Paul Deléglise et Yannick Estève (2013). An investigation of single-pass ASR system combination for Spoken Language Understanding. International Conference on Statistical Language and Speech Processing, SLSP
[Schmiedeke et al., 2013] Sebastian Schmiedeke, Peng Xu, Isabelle Ferrané, Maria Eskevich, Christoph Kofler, Martha A Larson, Yannick Estève, Lori Lamel, Gareth JF Jones et Thomas Sikora (2013). Blip10000: A social video dataset containing spug content for tagging and retrieval. Proceedings of the 4th ACM Multimedia Systems Conference, pp. 96-101
[Bougares et al., 2013] Fethi Bougares, Paul Deléglise, Yannick Estève et Mickael Rouvier (2013). LIUM ASR system for Etape French evaluation campaign: experiments on system combination using open-source recognizers. International Conference on Text, Speech and Dialogue TSD
[Bougares et al., 2012] Fethi Bougares, Mickael Rouvier, Yannick Estève et Georges Linarès (2012). Low latency combination of parallelized single-pass LVCSR system. Interspeech, pp. 1031-1034
[Dupuy et al., 2012] Grégor Dupuy, Mickael Rouvier, Sylvain Meignier et Yannick Estève (2012). I-vectors and ILP clustering adapted to cross-show speaker diarization. Interspeech
[Rousseau et al., 2012] Anthony Rousseau, Paul Deléglise et Yannick Estève (2012). TED-LIUM: an automatic speech recognition dedicated corpus. LREC
[Fabrice Lefevre et al., 2012] Fabrice Lefevre, Djamel Mostefa, Laurent Besacier, Yannick Estève, Matthieu Quignard, Nathalie Camelin, Benoit Favre, Bassam Jabaian et Lina Maria Rojas Barahona (2012). Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora. The International Conference on Language Resources and Evaluation. LREC
[Dufour et al., 2011] Richard Dufour, Yannick Estève et Paul Deléglise (2011). Investigation of spontaneous speech characterization applied to speaker role recognition. Interspeech 2011
[Rouvier et al., 2010] Mickael Rouvier, Richard Dufour, Georges Linares et Yannick Estève (2010). A language-identification inspired method for spontaneous speech detection. Interspeech
[Dufour et al., 2010] Richard Dufour, Fethi Bougares, Yannick Estève et Paul Deléglise (2010). Unsupervised model adaptation on targeted speech segments for LVCSR system combination. Interspeech, pp. 885-888
[Estève et al., 2010] Yannick Estève, Thierry Bazillon, Jean-Yves Antoine, Frédéric Béchet et Jérôme Farinas (2010). The EPAC corpus: Manual and automatic annotations of conversational speech in french broadcast news. LREC
[Simon Petitrenaud et al., 2010] Simon Petitrenaud, Vincent Jousse, Sylvain Meignier et Yannick Estève (2010). Automatic named identification of speakers using belief functions. Information Processing and Management of Uncertainty (IPMU’10)
[Deléglise et al., 2009] Paul Deléglise, Yannick Estève, Sylvain Meignier et Teva Merlin (2009). Improvements to the LIUM French ASR system based on CMU Sphinx: what helps to significantly reduce the word error rate? Interspeech, pp. 2123-2126
[Laurent et al., 2009] Antoine Laurent, Teva Merlin, Sylvain Meignier, Yannick Estève et Paul Deléglise (2009). Iterative filtering of phonetic transcriptions of proper nouns. 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4265-4268. ICASSP
[Jousse et al., 2009] Vincent Jousse, Simon Petit-Renaud, Sylvain Meignier, Yannick Estève et Christine Jacquin (2009). Automatic named identification of speakers using diarization and ASR systems. 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4557-4560. ICASSP
[Dufour et al., 2009] Richard Dufour, Vincent Jousse, Yannick Estève, Fréderic Béchet et Georges Linarès (2009). Spontaneous speech characterization and detection in large audio database. SPECOM
[Bazillon et al., 2008] Thierry Bazillon, Yannick Estève et Daniel Luzzati (2008). Manual vs Assisted Transcription of Prepared and Spontaneous Speech. LREC
[Schwenk et al., 2008] Holger Schwenk et Yannick Estève (2008). Data selection and smoothing in an open-source system for the 2008 NIST machine translation evaluation. Interspeech
[Laurent et al., 2008] Antoine Laurent, Teva Merlin, Sylvain Meignier, Yannick Estève et Paul Deléglise (2008). Combined systems for automatic phonetic transcription of proper nouns. 6th Language Evaluation and Resources Conference. LREC
[Lecouteux et al., 2008] Benjamin Lecouteux, Georges Linares, Yannick Estève et Guillaume Gravier (2008). Generalized driven decoding for speech recognition system combination. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1549-1552. ICASSP
[Lecouteux et al., 2007] Benjamin Lecouteux, Georges Linares, Yannick Estève et Julie Mauclair (2007). System combination by driven decoding. 2007 IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP
[Estève et al., 2007] Yannick Estève, Sylvain Meignier, Paul Deléglise et Julie Mauclair (2007). Extracting true speaker identities from transcriptions. Interspeech 2007
[Mauclair et al., 2006] Julie Mauclair, Yannick Estève, Simon Petit-Renaud et Paul Deléglise (2006). Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions. LREC
[Deléglise et al., 2005] Paul Deléglise, Yannick Estève, Sylvain Meignier et Teva Merlin (2005). The LIUM speech transcription system: a CMU Sphinx III-based system for French broadcast news. Interspeech
[Christian Raymond et al., 2004] Christian Raymond, Frédéric Béchet, Renato De Mori, Géraldine Damnati et Yannick Estève (2004). Automatic learning of interpretation strategies for spoken dialogue systems. 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, pp. I-425. ICASSP
[Yannick Estève et al., 2003] Yannick Estève, Christian Raymond, Frédéric Béchet et Renato De Mori (2003). Conceptual decoding for spoken dialog systems. Interspeech
[De Mori et al., 2002] Renato De Mori, Yannick Estève et Christian Raymond (2002). On the use of structures in language models for dialogue. Interspeech, pp. 929-932
[Estève et al., 2001] Yannick Estève, Frédéric Béchet, Alexis Nasr et Renato De Mori (2001). Stochastic finite state automata language model triggered by dialogue states. Interspeech
[Estève et al., 2000] Yannick Estève, Frédéric Béchet et Renato De Mori (2000). Dynamic selection of language models in a dialogue system. Interspeech
[Alexis Nasr et al., 1999] Alexis Nasr, Yannick Estève, Frédéric Béchet, Thierry Spriet et Renato De Mori (1999). A language model combining n-grams and stochastic finite state automata. Eurospeech

Peer-reviewed articles from international workshops

[Mdhaffar et al., 2025] Salima Mdhaffar, Haroun Elleuch, Chaimae Chellaf, Ha Nguyen et Yannick Estève (2025). SENSE models: an open source solution for multilingual and multimodal semantic-based tasks. In IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
[Whetten et al., 2024] Ryan Whetten, Titouan Parcollet, Adel Moumen, Marco Dinarelli et Yannick Estève (2024). An Analysis of Linear Complexity Attention Substitutes With BEST-RQ. 2024 IEEE Spoken Language Technology Workshop (SLT)
[Whetten et al., 2024] Ryan Whetten, Titouan Parcollet, Marco Dinarelli et Yannick Estève (2024). Open implementation and study of BEST-RQ for speech processing. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)
[Thibault Gaudier et al., 2024] Thibault Gaudier, Marie Tahon, Anthony Larcher et Yannick Estève (2024). Automatic Voice Identification after Speech Resynthesis using PPG. In Speaker and Language Recognition Workshop-Odyssey
[Duret et al., 2024] Jarod Duret, Mickael Rouvier et Yannick Estève (2024). MSP-Podcast SER Challenge 2024: L’antenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition. In Speaker and Language Recognition Workshop-Odyssey
[Lucas Maison et al., 2023] Lucas Maison et Yannick Estève (2023). Some voices are too common: Building fair speech recognition systems using the Common Voice dataset. Interspeech
[Laperrière et al., 2023] Gaëlle Laperrière, Valentin Pelloin, Mickaël Rouvier, Themos Stafylakis et Yannick Estève (2023). On the use of semantically-aligned speech representations for spoken language understanding. 2022 IEEE Spoken Language Technology Workshop (SLT)
[Laperrière et al., 2023] Gaëlle Laperrière, Ha Nguyen, Sahar Ghannay, Bassam Jabaian et Yannick Estève (2023). Specialized Semantic Enrichment of Speech Representations. 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)
[Duret et al., 2023] Jarod Duret, Benjamin O’Brien, Yannick Estève et Titouan Parcollet (2023). Enhancing expressivity transfer in textless speech-to-speech translation. 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
[Léo Jacqmin et al., 2023] Léo Jacqmin, Lucas Druart, Yannick Estève, Benoît Favre, Lina Maria Rojas-Barahona et Valentin Vielzeuf (2023). OLISIA: a cascade system for spoken dialogue state tracking. Proceedings of The Eleventh Dialog System Technology Challenge (DSTC)
[Macary et al., 2021] Manon Macary, Marie Tahon, Yannick Estève et Anthony Rousseau (2021). On the use of self-supervised pre-trained acoustic and linguistic features for continuous speech emotion recognition. 2021 IEEE Spoken Language Technology Workshop (SLT), pp. 373-380. IEEE
[Hang Le et al., 2021] Hang Le, Florentin Barbier, Ha Nguyen, Natalia Tomashenko, Salima Mdhaffar, Souhir Gahbiche, Fethi Bougares, Benjamin Lecouteux, Didier Schwab et Yannick Estève (2021). ON-TRAC’ systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks. International Conference on Spoken Language Translation (IWSLT)
[Nguyen et al., 2020] Ha Nguyen, Fethi Bougares, Natalia Tomashenko et Yannick Estève (2020). Investigating self-supervised pre-training for end-to-end speech translation. ICML 2020 Workshop on Self-supervision in Audio and Speech
[Ghannay et al., 2018] Sahar Ghannay, Antoine Caubrière, Yannick Estève, Nathalie Camelin, Edwin Simonnet, Antoine Laurent et Emmanuel Morin (2018). End-to-end named entity and semantic concept extraction from speech. 2018 IEEE Spoken Language Technology Workshop (SLT), pp. 692-699. IEEE
[Mdhaffar et al., 2017] Salima Mdhaffar, Fethi Bougares, Yannick Estève et Lamia Hadrich-Belguith (2017). Sentiment analysis of Tunisian dialects: Linguistic ressources and experiments. Third Arabic natural language processing workshop (WANLP), pp. 55-61
[Vythelingum et al., 2017] Kévin Vythelingum, Yannick Estève et Olivier Rosec (2017). Error detection of grapheme-to-phoneme conversion in text-to-speech synthesis using speech signal and lexical context. 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp. 692-697
[Tomashenko et al., 2016] Natalia Tomashenko, Kévin Vythelingum, Anthony Rousseau et Yannick Estève (2016). LIUM ASR systems for the 2016 Multi-Genre Broadcast Arabic challenge. 2016 IEEE Spoken Language Technology Workshop (SLT), pp. 285-291
[Ghannay et al., 2016] Sahar Ghannay, Yannick Estève, Nathalie Camelin et Paul Deléglise (2016). Evaluation of acoustic word embeddings. Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, pp. 62-66
[Simonnet et al., 2015] Edwin Simonnet, Nathalie Camelin, Paul Deléglise et Yannick Estève (2015). Exploring the use of attention-based recurrent neural networks for spoken language understanding. Machine Learning for Spoken Language Understanding and Interaction NIPS 2015 workshop (SLUNIPS 2015)
[Ghannay et al., 2015] Sahar Ghannay, Nathalie Camelin et Yannick Estève (2015). Which ASR errors are hard to detect. Errors by Humans and Machines in Multimedia, Multimodal and Multilingual Data Processing (ERRARE 2015) Workshop, Sinaia, Romania, pp. 11-13
[Garcia-Martinez et al., 2015] Mercedes Garcia-Martinez, Loïc Barrault, Anthony Rousseau, Paul Deléglise et Yannick Estève (2015). The LIUM ASR and SLT systems for IWSLT 2015. 12th International Workshop on Spoken Language Translation (IWSLT 2015)
[Gupta et al., 2015] Vishwa Gupta, Paul Deléglise, Gilles Boulianne, Yannick Estève, Sylvain Meignier et Anthony Rousseau (2015). CRIM and LIUM approaches for multi-genre broadcast media transcription. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp. 681-686.
[Masmoudi et al., 2014] Abir Masmoudi, Yannick Estève, Mariem Ellouze Khmekhem, Fethi Bougares et Lamia Hadrich Belguith (2014). Phonetic tool for the Tunisian Arabic. SLTU, pp. 253-256
[Dupuy et al., 2014] Grégor Dupuy, Sylvain Meignier, Paul Deléglise et Yannick Estève (2014). Recent improvements on ILP-based clustering for broadcast news speaker diarization. Odyssey 2014: The Speaker and Language Recognition Workshop
[Grivolla et al., 2014] Jens Grivolla, Maite Melero, Toni Badia, Cosmin Cabulea, Yannick Estève, Eelco Herder, Jean-Marc Odobez, Susanne Preuß et Raúl Marín (2014). EUMSSI: a Platform for Multimodal Analysis and Recommendation using UIMA. Proceedings of the Workshop on Open Infrastructures and Analysis Frameworks for HLT, pp. 101-109
[Anthony Rousseau et al., 2014] Anthony Rousseau, Loïc Barrault, Paul Deléglise, Yannick Estève, Holger Schwenk, Samir Bennacef, Armando Muscariello et Stephan Vanni (2014). LIUM English-to-French spoken language translation system and the Vecsys/LIUM automatic speech recognition system for Italian language for IWSLT 2014. Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign. IWSLT
[Rousseau et al., 2011] Anthony Rousseau, Fethi Bougares, Paul Deléglise, Holger Schwenk et Yannick Estève (2011). LIUM’s systems for the IWSLT 2011 Speech Translation Tasks. Proceedings of the 8th International Workshop on Spoken Language Translation: Evaluation Campaign. IWSLT
[Bougares et al., 2011] Fethi Bougares, Yannick Estève, Paul Deléglise et Georges Linarès (2011). Bag of n-gram driven decoding for LVCSR system harnessing. 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, pp. 278-282. ASRU
[Estève et al., 2010] Yannick Estève, Paul Deléglise, Sylvain Meignier, Simon Petitrenaud, Holger Schwenk, Loïc Barrault, Fethi Bougares, Richard Dufour, Vincent Jousse et Antoine Laurent (2010). Some recent research work at LIUM based on the use of CMU SPHINX. CMU SPUD Workshop, Dallas (Texas)
[Rousseau et al., 2010] Anthony Rousseau, Loïc Barrault, Paul Deléglise et Yannick Estève (2010). LIUM’s statistical machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation: Evaluation Campaign. IWSLT
[Richard Dufour et al., 2009] Richard Dufour, Yannick Estève, Paul Deléglise et Frédéric Béchet (2009). Local and global models for spontaneous speech segment detection and characterization. 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, pp. 558-561. ASRU
[Holger Schwenk et al., 2009] Holger Schwenk, Loïc Barrault, Yannick Estève et Patrik Lambert (2009). LIUM’s statistical machine translation system for IWSLT 2009. Proceedings of the 6th International Workshop on Spoken Language Translation: Evaluation Campaign. IWSLT
[Richard Dufour et al., 2008] Richard Dufour et Yannick Estève (2008). Correcting ASR outputs: specific solutions to specific errors in French. 2008 IEEE Spoken Language Technology Workshop, pp. 213-216. SLT
[Holger Schwenk et al., 2008] Holger Schwenk, Yannick Estève et Sadaf Abdul-Rauf (2008). The LIUM Arabic/English statistical machine translation system for IWSLT 2008. Proceedings of the 5th International Workshop on Spoken Language Translation: Evaluation Campaign, pp. 63-68. IWSLT
[Julie Mauclair et al., 2006] Julie Mauclair, Sylvain Meignier et Yannick Estève (2006). Speaker diarization: about whom the speaker is talking? 2006 IEEE Odyssey-The Speaker and Language Recognition Workshop. Odyssey
[Christian Raymond et al., 2003] Christian Raymond, Yannick Estève, Frédéric Béchet, Renato De Mori et Géraldine Damnati (2003). Belief confirmation in spoken dialog systems using confidence measures. 2003 IEEE Workshop on Automatic Speech Recognition and Understanding. ASRU
[Estève et al., 2002] Yannick Estève, Christian Raymond et Renato De Mori (2002). On the use of structures in language models for dialogue-specific solutions for specific problems. Proc. IDS 2002, paper 38
[Béchet et al., 2001] Frédéric Béchet, Yannick Estève et Renato De Mori (2001). Tree-based language model dedicated to natural spoken dialog systems. ISCA Tutorial and Research Workshop on Adaptation Methods for Speech Recognition

Articles from peer-reviewed national conferences

[Whetten et al., 2024] Ryan Whetten, Titouan Parcollet, Marco Dinarelli et Yannick Estève (2024). Implémentation ouverte et étude de BEST-RQ pour le traitement de la parole. Actes des 35èmes Journées d’Études sur la Parole, pp. 412-420
[Gaudier et al., 2024] Thibault Gaudier, Marie Tahon, Anthony Larcher et Yannick Estève (2024). Vérification automatique de la voix de locuteurs après resynthèse à l’aide de PPG. 35èmes Journées d’Études sur la Parole (JEP 2024) 31ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN 2024) 26ème Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL 2024), pp. 579-588. ATALA & AFCP
[Duret et al., 2023] Jarod Duret, Titouan Parcollet et Yannick Estève (2023). Learning multilingual expressive speech representation for prosody prediction without parallel data. In Speech Synthesis Workshop (SSW)
[Tomashenko et al., 2022] Natalia Tomashenko, Salima Mdhaffar, Marc Tommasi, Yannick Estève et Jean-François Bonastre (2022). On speaker verification from the neural network footprint of personalized acoustic models. Journées d’Études sur la Parole-JEP2022
[Tomashenko et al., 2022] Natalia Tomashenko, Salima Mdhaffar, Marc Tommasi, Yannick Estève et Jean-François Bonastre (2022). Sur la vérification du locuteur à partir de traces d’exécution de modèles acoustiques personnalisés. Proc. JEP 2022, pp. 861-870
[Mdhaffar et al., 2022] Salima Mdhaffar, Jean-François A Bonastre, Marc Tommasi, Natalia Tomashenko et Yannick Estève (2022). Extraction d’informations liées au locuteur depuis un modèle acoustique personnalisé. JEP 2022
[Laperrière et al., 2022] Gaëlle Laperrière, Valentin Pelloin, Antoine Caubrière, Salima Mdhaffar, Nathalie Camelin, Sahar Ghannay, Bassam Jabaian et Yannick Estève (2022). Le benchmark MEDIA revisité: données, outils et évaluation dans un contexte d’apprentissage profond. XXXIVe Journées d’Études sur la Parole–JEP 2022
[Le et al., 2022] Hang Le, Sina Alisamir, Marco Dinarelli, Fabien Ringeval, Solène Evain, Ha Nguyen, Marcely Zanon Boito, Salima Mdhaffar, Ziyi Tong et Natalia Tomashenko (2022). LeBenchmark, un référentiel d’évaluation pour le français oral. 34e Journées d’étude sur la parole JEP 2022
[Evain et al., 2022] Solène Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli et Titouan Parcollet (2022). Modèles neuronaux pré-appris par auto-supervision sur des enregistrements de parole en français. JEP 2022
[Tahon et al., 2022] Marie Tahon, Manon Macary et Yannick Estève (2022). Continuous emotion prediction from audio signal with acoustic and linguistic representations. 16ème Congrès Français d’Acoustique, CFA2022
[Macary et al., 2020] Manon Macary, Marie Tahon, Yannick Estève et Anthony Rousseau (2020). Prédiction continue de la satisfaction et de la frustration dans des conversations de centre d’appels. 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1: Journées d’Études sur la Parole, pp. 379-387. ATALA; AFCP
[Caubrière et al., 2020] Antoine Caubrière, Sophie Rosset, Yannick Estève, Antoine Laurent et Emmanuel Morin (2020). Où en sommes-nous dans la reconnaissance des entités nommées structurées à partir de la parole? 6e conférence conjointe Journées d’Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1: Journées d’Études sur la Parole, pp. 64-72. ATALA; AFCP
[Caubrière et al., 2019] Antoine Caubrière, Natalia Tomashenko, Yannick Estève, Antoine Laurent et Emmanuel Morin (2019). Curriculum d’apprentissage: reconnaissance d’entités nommées pour l’extraction de concepts sémantiques. 26e conférence sur le Traitement Automatique des Langues Naturelles (TALN)
[Barhoumi et al., 2019] Amira Barhoumi, Nathalie Camelin, Chafik Aloulou, Yannick Estève et Lamia Hadrich Belguith (2019). Plongements lexicaux spécifiques à la langue arabe: application à l’analyse d’opinions (Arabic-specific embeddings: application in Sentiment Analysis). Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Volume II: Articles courts, pp. 381-390
[Barhoumi et al., 2019] Amira Barhoumi, Nathalie Camelin, Chafik Aloulou, Yannick Estève et Lamia Hadrich Belguith (2019). Arabic-specific embeddings: application in Sentiment Analysis. 26e conférence sur le Traitement Automatique des Langues Naturelles (TALN 2019), pp. 381-390. ATALA
[Mdhaffar et al., 2019] Salima Mdhaffar, Yannick Estève, Nicolas Hernandez, Antoine Laurent et Solen Quiniou (2019). Apport de l’adaptation automatique des modèles de langage pour la reconnaissance de la parole: évaluation qualitative extrinsèque dans un contexte de traitement de cours magistraux. 26e Conférence sur le Traitement Automatique des Langues Naturelles, pp. 167-174. ATALA
[Mdhaffar et al., 2018] Salima Mdhaffar, Antoine Laurent et Yannick Estève (2018). Etude de performance des réseaux neuronaux récurrents dans le cadre de la campagne d’évaluation Multi-Genre Broadcast challenge 3 (MGB3). Proc. JEP 2018, pp. 169-177
[Mdhaffar et al., 2018] Salima Mdhaffar, Antoine Laurent et Yannick Estève (2018). Le corpus PASTEL pour le traitement automatique de cours magistraux. 25e conférence sur le Traitement Automatique des Langues Naturelles (TALN 2018)
[Ghannay et al., 2018] Sahar Ghannay, Nathalie Camelin et Yannick Estève (2018). Représentations de phrases dans un espace continu spécifiques à la tâche de détection d’erreurs. Proc. JEP 2018, pp. 630-638
[Tomashenko et al., 2018] Natalia Tomashenko et Yannick Estève (2018). Impact des techniques d’adaptation au locuteur dans l’espace des paramètres pour des modèles acoustiques purement neuronaux. Proc. JEP 2018, pp. 559-567
[Barhoumi, 2018] Amira Barhoumi (2018). Des représentations continues de mots pour l’analyse d’opinions en arabe: une étude qualitative. 25e conférence sur le Traitement Automatique des Langues Naturelles (TALN 2018)
[Vythelingum et al., 2018] Kévin Vythelingum, Yannick Estève et Olivier Rosec (2018). Transcription phonétique automatique pour la synthèse de la parole. XXXIIe Journées d’Etudes sur la Parole (JEP 2018)
[Barhoumi et al., 2018] Amira Barhoumi, Nathalie Camelin et Yannick Estève (2018). Des représentations continues de mots pour l’analyse d’opinions en arabe: une étude qualitative (Word embeddings for Arabic sentiment analysis: a qualitative study). Actes de la Conférence TALN. Volume 1-Articles longs, articles courts de TALN, pp. 215-224
[Mdhaffar et al., 2018] Salima Mdhaffar, Antoine Laurent et Yannick Estève (2018). Le corpus PASTEL pour le traitement automatique de cours magistraux (PASTEL corpus for automatic processing of lectures). Actes de la Conférence TALN. Volume 1-Articles longs, articles courts de TALN, pp. 419-426
[Simonnet et al., 2018] Edwin Simonnet, Sahar Ghannay, Nathalie Camelin et Yannick Estève (2018). Simulation d’erreurs de reconnaissance automatique dans un cadre de compréhension de la parole. XXXIIe Journées d’Etudes sur la Parole (JEP 2018)
[Simonnet et al., 2016] Edwin Simonnet, Paul Deléglise, Nathalie Camelin et Yannick Estève (2016). Des Réseaux de Neurones avec Mécanisme d’Attention pour la Compréhension de la Parole. 31ème Journées d’Études sur la Parole
[Ghannay et al., 2016] Sahar Ghannay, Yannick Estève, Nathalie Camelin, Camille Dutrey, Fabián Santiago et Martine Adda-Decker (2016). Utilisation des représentations continues des mots et des paramètres prosodiques pour la détection d’erreurs dans les transcriptions automatiques de la parole (Combining continuous word representation and prosodic features for ASR error detection). Actes de la conférence conjointe JEP-TALN-RECITAL 2016. volume 1: JEP, pp. 723-731
[Tomashenko et al., 2016] Natalia Tomashenko, Yuri Khokhlov, Anthony Larcher et Yannick Estève (2016). Exploration de paramètres acoustiques dérivés de GMMs pour l’adaptation non supervisée de modèles acoustiques à base de réseaux de neurones profonds. Journées d’Études sur la Parole (JEP’16). AFCP
[Simonnet et al., 2016] Edwin Simonnet, Paul Deléglise, Nathalie Camelin et Yannick Estève (2016). Des réseaux de neurones avec mécanisme d’attention pour la compréhension de la parole (exploring the use of attention-based recurrent neural networks for spoken language understanding). Actes de la conférence conjointe JEP-TALN-RECITAL 2016. volume 1: JEP, pp. 642-650
[Lailler et al., 2015] Carole Lailler, Yannick Estève, Renato De Mori, Mohamed Bouallègue et Mohamed Morchid (2015). Utilisation d’annotations sémantiques pour la validation automatique d’hypothèses dans des conversations téléphoniques. Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles. Articles courts, pp. 185-191
[Abdessalam Bouchekif et al., 2015] Abdessalam Bouchekif, Géraldine Damnati, Nathalie Camelin, Yannick Estève et Delphine Charlet (2015). Segmentation et titrage automatique de journaux télévisés. Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles. Articles courts, pp. 221-227
[Bougares et al., 2014] Fethi Bougares, Anthony Rousseau, Paul Deléglise, Yannick Estève, Loïc Barrault, Holger Schwenk, Sylvie Brunessaux, Khaled Khelif et Mathieu Manta (2014). Développement et Évaluation d’un Système de Traduction Automatique de la Parole en Pashto vers le Français. JEP 2014
[Masmoudi et al., 2014] Abir Masmoudi, Mariem Ellouze Khemakhem, Yannick Estève, Fethi Bougares, Sawssan Dabbar et Lamia Hadrich Belguith (2014). Phonétisation automatique du dialecte tunisien. Proceedings of JEP 2014
[Dupuy et al., 2014] Grégor Dupuy, Sylvain Meignier et Yannick Estève (2014). Segmentation et Regroupement en Locuteur pour le traitement incrémental des collections volumineuses. 30e Journées d’Études sur la Parole (JEP’14), Vol. 1, pp. 433-440
[Bouaziz et al., 2014] Mohamed Bouaziz, Antoine Laurent et Yannick Estève (2014). Décodage hybride dans les SRAP pour l’indexation automatique des documents multimédia. JEP 2014
[Morin et al., 2013] Emmanuel Morin et Yannick Estève (2013). Proceedings of TALN 2013 (Volume 3: System Demonstrations)
[Morin et al., 2013] Emmanuel Morin et Yannick Estève (2013). Proceedings of TALN 2013 (Volume 2: Short Papers)
[Morin et al., 2013] Emmanuel Morin et Yannick Estève (2013). Proceedings of TALN 2013 (Volume 4: Invited Conferences)
[Bougares et al., 2012] Fethi Bougares, Yannick Estève, Paul Deléglise, Mickael Rouvier et Georges Linarès (2012). Avancées dans le domaine de la transcription automatique par décodage guidé. JEP
[Dufour et al., 2012] Richard Dufour, Antoine Laurent et Yannick Estève (2012). Combinaison d’approches pour la reconnaissance du rôle des locuteurs. JEP, Grenoble, France
[Lefèvre et al., 2012] Fabrice Lefèvre, Djamel Mostefa, Laurent Besacier, Yannick Estève, Matthieu Quignard, Nathalie Camelin, Benoit Favre, Bassam Jabaian et Lina M Rojas Barahona (2012). Robustesse et portabilités multilingue et multi-domaines des systèmes de compréhension de la parole: les corpus du projet PortMedia (Robustness and portability of spoken language understanding systems among languages and domains: the PORTMEDIA project)[in French]. Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, volume 1: JEP, pp. 779-786
[Dufour et al., 2012] Richard Dufour, Antoine Laurent et Yannick Estève (2012). Combinaison d’approches pour la reconnaissance du rôle des locuteurs (Combination of approaches for speaker role recognition)[in French]. Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, volume 1: JEP, pp. 827-834
[Dupuy et al., 2012] Grégor Dupuy, Mickael Rouvier, Sylvain Meignier et Yannick Estève (2012). Segmentation et Regroupement en Locuteurs d’une collection de documents audio. 29e Journées d’Études sur la Parole (JEP’12), Vol. 1, pp. 433-440
[Bougares et al., 2012] Fethi Bougares, Yannick Estève, Paul Deléglise, Mickael Rouvier et Georges Linarès (2012). Avancées dans le domaine de la transcription automatique par décodage guidé (Improvements on driven decoding system combination)[in French]. Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, volume 1: JEP, pp. 795-802
[Jousse et al., 2008] Vincent Jousse, Yannick Estève, Frédéric Béchet, Thierry Bazillon et Georges Linarès (2008). Caractérisation et détection de parole spontanée dans de larges collections de documents audio. JEP, Vol. 2008, pp. 9-13
[Dufour et al., 2010] Richard Dufour, Yannick Estève, Paul Deléglise et Frédéric Béchet (2010). Utilisation conjointe de modèles locaux et globaux pour la caractérisation et la détection de segments de parole spontanée. Mons, Belgique, pp. 113. JEP
[Petitrenaud et al., 2010] Simon Petitrenaud, Vincent Jousse, Sylvain Meignier et Yannick Estève (2010). Reconnaissance Automatique de Locuteurs à l’aide de Fonctions de Croyance. 17e congrès francophone Reconnaissance des Formes et Intelligence Artificielle (RFIA’10)
[Lecouteux et al., 2008] Benjamin Lecouteux, Georges Linarès, Yannick Estève et Guillaume Gravier (2008). Combinaison de systèmes par décodage guidé. JEP/TALN/RECITAL 2008
[Laurent et al., 2008] Antoine Laurent, Sylvain Meignier, Yannick Estève et Paul Deléglise (2008). Combinaison de systèmes pour la phonétisation automatique de noms propres. XXVIIe Journées d’étude sur la parole (JEP 2008), pp. 4
[Vincent Jousse et al., 2008] Vincent Jousse, Christine Jacquin, Sylvain Meignier, Yannick Estève et Béatrice Daille (2008). Etude pour l’amélioration d’un système d’identification nommée du locuteur. Journées d’Etude de la Parole, pp. 10
[Mauclair et al., 2006] Julie Mauclair, Sylvain Meignier et Yannick Estève (2006). Indexation en locuteur: utilisation d’informations lexicales. Les Journées d’Étude sur la Parole (JEP) 2006, pp. 5
[Mauclair et al., 2006] Julie Mauclair, Yannick Estève et Paul Deléglise (2006). Probabilité a posteriori: amélioration d’une mesure de confiance en reconnaissance de la parole. JEP, Dinard, France
[Béchet et al., 2001] Frédéric Béchet, Yannick Estève et Renato De Mori (2001). Modèles de langage hiérarchiques pour les applications de dialogue en parole spontanée. Actes de la 8ème conférence sur le Traitement Automatique des Langues Naturelles. Posters, pp. 325-330
[Estève et al., 2000] Yannick Estève, Frédéric Béchet et Renato De Mori (2000). Sélection dynamique de modèles de langage dans une application de dialogue. JEP 2000, pp. 185-188

TALN (Traitement Automatique des Langues Naturelles) and JEP (Journées d’Études sur la Parole) are leading national conferences that structure the French-speaking natural language processing and speech communities.

Book chapters

[Estève et al., 2015] Yannick Estève, Mohamed Bouallegue, Carole Lailler, Mohamed Morchid, Richard Dufour, Georges Linarès, Driss Matrouf et Renato De Mori (2015). Integration of word and semantic features for theme identification in telephone conversations. Natural Language Dialog Systems and Intelligent Assistants, pp. 223-231. Springer International Publishing Cham
[Estève et al., 2012] Yannick Estève et Paul Deléglise (2012). Adaptation and Discriminative Training of Acoustic Models. Techniques for Noise Robustness in Automatic Speech Recognition, pp. 283-310. John Wiley & Sons, Ltd Chichester, UK

Other publications and communications

[Pinquier et al., 2025] Julien Pinquier, Corinne Fredouille, Solène Evain, Jean-François Bonastre, Julie Mauclair, Jérôme Farinas, Yannick Estève, Laurent Nicolas et Laurent Boidron (2025). Le Petit Camion : L’intelligence artificielle au service des opérateurs pour optimiser la prise en charge des appels d’urgence. Workshop interdisciplinaire sur la sécurité globale (WISG 2025)
[Jarod Duret et al., 2024] Jarod Duret, Yannick Estève et Titouan Parcollet (2024). Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation. arXiv preprint arXiv:2407.18332
[Estève et al., 2024] Yannick Estève, Patrick Seminor et Jarod Duret (2024). De l’intelligence artificielle au théâtre? Journées d’informatique théâtrale
[Macary et al., 2023] Manon Macary, Marie Tahon, Yannick Estève et Daniel Luzzati (2023). Acoustic and linguistic representations for speech continuous emotion recognition in call center conversations. arXiv preprint arXiv:2310.04481
[Druart et al., 2023] Lucas Druart, Valentin Vielzeuf et Yannick Estève (2023). Is one brick enough to break the wall of spoken dialogue state tracking? arXiv preprint arXiv:2311.04923
[Maison et al., 2022] Lucas Maison, Marcely Zanon Boito et Yannick Estève (2022). Promises and Limitations of Self-supervised Learning for Automatic Speech Processing. Conference on Artificial Intelligence for Defense
[Larcher et al., 2022] Anthony Larcher, Yannick Estève, Mickael Rouvier, Natalia Tomashenko, Jarod Duret, Gaëlle Laperrière, Santosh Kesiraju, Marek Sarvas, Renata Kohlova et Henry Li (2022). Multi-lingual Speech to Speech Translation for Under-Resourced Languages. JSALT 2022, Baltimore, MD, USA
[Estève et al., 2016] Yannick Estève, Sahar Ghannay et Nathalie Camelin (2016). Recent Improvements on Error Detection for Automatic Speech Recognition. MMDA@ECAI, pp. 28-32
[Grivolla et al., 2016] Jens Grivolla, Yannick Estève, Eelco Herder, Nam Le, Kay Macquarrie, Raúl Marín, Sylvain Meignier, Maite Melero, Jean-Marc Odobez et Susanne Preuß (2016). The EUMSSI Project-Event Understanding through Multimodal Social Stream Interpretation. MMDA@ECAI, pp. 8-12
[Tomashenko et al., 2016] NA Tomashenko, YY Khokhlov, A Larcher, Y Estève et YN Matveev (2016). Gaussian mixture models for adaptation of deep neural network acoustic models in automatic speech recognition systems. Nauchno-Tekhnicheskii Vestnik Informatsionnykh Tekhnologii, Mekhaniki i Optiki, Vol. 16(6), pp. 1063. St. Petersburg National Research University of Information Technologies
[Adda-Decker and Estève, 2014] Martine Adda-Decker et Yannick Estève (2014). Reconnaissance automatique de la parole. L’Information Grammaticale, Vol. 141, pp. 23-30
[Estève, 2009] Yannick Estève (2009). Traitement automatique de la parole: contributions. Habilitation à diriger des recherches. Université du Mans, 2009
[Paul Deléglise et al., 2005] Paul Deléglise, Yannick Estève, Bruno Jacob, Teva Merlin et Sylvain Meignier (2005). Campagne d’évaluation ESTER 2005: Transcription et segmentation en locuteurs. Proc. of the ESTER Phase II workshop
[Estève, 2002] Yannick Estève (2002). Intégration de sources de connaissances pour la modélisation stochastique du langage appliquée à la parole continue dans un contexte de dialogue oral homme-machine. PhD thesis, Université d’Avignon, 2002
[Estève, 2002] Yannick Estève (2002). Parler à son ordinateur. Tangente Hors-série, Num. 12, pp. 26-27. Editions Pôle Paris. ISBN: 2-90973-785-3