Idiap Research Institute
Skill Extraction from Resumes and Job Offers across Six Languages
Type of publication: Conference paper
Citation: Vasquez-Rodriguez_SWISSTEXT2026_2026
Publication status: Accepted
Booktitle: Proceedings of the 11th edition of the Swiss Text Analytics Conference
Year: 2026
Month: June
Abstract: We comprehensively evaluate multiple skill extraction approaches, including rule-based, semantic, and supervised methods, using resumes and job offers in English, French, German, Italian, Spanish, and Portuguese. Due to inherent privacy concerns in Human Resources (HR) data and the high cost of manual annotation, research on identifying relevant skills for the job market remains limited: it is often restricted to specific domains, datasets, and entity types, and covers only a few languages. In the context of an industrial project, we have annotated 1,200 job offers and resumes across diverse domains and six languages, through a multidisciplinary collaboration among HR researchers, NLP researchers, and HR tech professionals. Our evaluation assesses the effectiveness of these systems in a multilingual, multidomain setting, capturing both standardized job offers and highly variable resumes. The results show that supervised models achieve F1 scores of up to 0.6, while rule-based methods offer better interpretability. Furthermore, we find large differences in how skills are formulated in job offers and in resumes, and resumes remain understudied in academic research.
Keywords:
Projects: Idiap
SEM24
Authors: Vásquez-Rodríguez, Laura
Audrin, Bertrand
Michel, Samuel
Galli, Samuele
Rogenhofer, Julneth
Negro Cusa, Jacopo
van der Plas, Lonneke
Attachments
  • Vasquez-Rodriguez_SWISSTEXT2026_2026.pdf