Automatic Identification of Discourse Markers in Multiparty Dialogues: An In-Depth Study of Like and Well

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Journal paper
Citation:	Popescu-Belis_CSL_2011
Publication status:	Published
Journal:	Computer Speech and Language
Volume:	25
Number:	3
Year:	2011
Pages:	499-518
DOI:	10.1016/j.csl.2010.12.001
Abstract:	The lexical items 'like' and 'well' can serve as discourse markers (DMs), but can also play numerous other roles, such as verb or adverb. Identifying the occurrences that function as DMs is an important step for language understanding by computers. In this study, automatic classifiers using lexical, prosodic/positional and sociolinguistic features are trained over transcribed dialogues, manually annotated with DM information. The resulting classifiers improve state-of-the-art performance, at about 90% recall and 79% precision for like (84.5% accuracy, kappa = 0.69), and 99% recall and 98% precision for well (97.5% accuracy, kappa = 0.88). Automatic feature analysis shows that lexical collocations are the most reliable indicators, followed by prosodic/positional features, while sociolinguistic features are marginally useful for the identification of DM like. The differentiated processing of each type of DM improves classification accuracy, suggesting that these types should be treated individually.
Keywords:	discourse marker identification, lexical features, statistical classifiers
Projects	Idiap IM2
Authors	Popescu-Belis, Andrei Zufferey, Sandrine
Added by:	[UNK]
Total mark:	0
Attachments
Popescu-Belis_CSL_2011.pdf
Notes

processing time: 0.0003 seconds.