CONF glickman:conll:2006/IDIAP Investigating Lexical Substitution Scoring for Subtitle Generation Glickman, Oren Dagan, Ido Keller, Mikaela Bengio, Samy Daelemans, Walter EXTERNAL https://publications.idiap.ch/attachments/papers/2006/glickman-conll-2006.pdf PUBLIC https://publications.idiap.ch/index.php/publications/showcite/glickman:rr06-36 Related documents Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL). 2006 IDIAP-RR 06-36 This paper investigates an isolated setting of the lexical substitution task of replacing words with their synonyms. In particular, we examine this problem in the setting of subtitle generation and evaluate state of the art scoring methods that predict the validity of a given substitution. The paper evaluates two context independent models and two contextual models. The major findings suggest that distributional similarity provides a useful complementary estimate for the likelihood that two Wordnet synonyms are indeed substitutable, while proper modeling of contextual constraints is still a challenging task for future research. REPORT glickman:rr06-36/IDIAP Investigating Lexical Substitution Scoring for Subtitle Generation Glickman, Oren Dagan, Ido Keller, Mikaela Bengio, Samy Daelemans, Walter EXTERNAL https://publications.idiap.ch/attachments/reports/2006/glickman-idiap-rr-06-36.pdf PUBLIC Idiap-RR-36-2006 2006 IDIAP To appear in Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL-2006). This paper investigates an isolated setting of the lexical substitution task of replacing words with their synonyms. In particular, we examine this problem in the setting of subtitle generation and evaluate state of the art scoring methods that predict the validity of a given substitution. The paper evaluates two context independent models and two contextual models. The major findings suggest that distributional similarity provides a useful complementary estimate for the likelihood that two Wordnet synonyms are indeed substitutable, while proper modeling of contextual constraints is still a challenging task for future research.