CONF
Bros_ICWSM'25_2025/IDIAP
The Suisse Romande Local News Dataset
Bros, Victor
Gatica-Perez, Daniel
EXTERNAL
https://publications.idiap.ch/attachments/papers/2025/Bros_ICWSM25_2025.pdf
PUBLIC
https://publications.idiap.ch/index.php/publications/showcite/Bros_Idiap-Com-03-2023
Related documents
Proceedings of the Nineteenth International AAAI Conference on Web and Social Media
2025
This paper introduces a comprehensive dataset of news articles sourced from ESH Médias, a prominent local press agency in Romandy, the French-speaking region of Switzerland. The dataset encompasses all articles published on their digital platforms from January 2015 through June 2022. With over 130,000 articles written in French, this dataset offers a rich insight into local news from the French-speaking cantons of Switzerland. The articles cover a diverse range of topics and provide valuable material for Natural Language Processing and media studies. To respect privacy and legal considerations, journalists' names have been anonymized, and the dataset is made available for research purposes under a specific agreement with ESH Médias. The dataset adheres to the FAIR principles, and a detailed datasheet is provided to facilitate its use. The dataset is accessible via a DOI link.
REPORT
Bros_Idiap-Com-03-2023/IDIAP
The Suisse Romande Local News Dataset
Bros, Victor
Gatica-Perez, Daniel
EXTERNAL
https://publications.idiap.ch/attachments/reports/2023/Bros_Idiap-Com-03-2023.pdf
PUBLIC
Idiap-Com-03-2023
2023
Idiap
November 2023
This report introduces a comprehensive database of news articles sourced from ESH Médias, a prominent local press agency in Romandy, the French-speaking region of Switzerland. The database encompasses all articles published on their digital platforms from January 2015 through June 2022. Given the popularity of ESH Médias’ titles, this database offers a rich and unique insight into local news from the French-speaking cantons of Switzerland. With a total of over 130 000 articles, this database presents a significant opportunity for extensive Natural Language Processing (NLP) analysis.