CONF Bros_ICWSM'25_2025/IDIAP The Suisse Romande Local News Dataset Bros, Victor Gatica-Perez, Daniel EXTERNAL https://publications.idiap.ch/attachments/papers/2025/Bros_ICWSM25_2025.pdf PUBLIC https://publications.idiap.ch/index.php/publications/showcite/Bros_Idiap-Com-03-2023 Related documents Proceedings of the Nineteenth International AAAI Conference on Web and Social Media 2025 This paper introduces a comprehensive dataset of news articles sourced from ESH Médias, a prominent local press agency in Romandy, the French-speaking region of Switzerland. The dataset encompasses all articles published on their digital platforms from January 2015 through June 2022. With over 130,000 articles written in French, this dataset offers a rich insight into local news from the French-speaking cantons of Switzerland. The articles cover a diverse range of topics and provide valuable material for Natural Language Processing and media studies. To respect privacy and legal considerations, journalists' names have been anonymized, and the dataset is made available for research purposes under a specific agreement with ESH Médias. The dataset adheres to the FAIR principles, and a detailed datasheet is provided to facilitate its use. The dataset is accessible via a DOI link. REPORT Bros_Idiap-Com-03-2023/IDIAP The Suisse Romande Local News Dataset Bros, Victor Gatica-Perez, Daniel EXTERNAL https://publications.idiap.ch/attachments/reports/2023/Bros_Idiap-Com-03-2023.pdf PUBLIC Idiap-Com-03-2023 2023 Idiap November 2023 This report introduces a comprehensive database of news articles sourced from ESH Médias, a prominent local press agency in Romandy, the French-speaking region of Switzerland. The database encompasses all articles published on their digital platforms from January 2015 through June 2022. Given the popularity of ESH Médias’ titles, this database offers a rich and unique insight into local news from the French-speaking cantons of Switzerland. With a total of over 130 000 articles, this database presents a significant opportunity for extensive Natural Language Processing (NLP) analysis.