Twitter Sentiment Analysis (Almost) from Scratch

We use cookies

This website uses cookies and other tracking technologies to improve your browsing experience for the following purposes: to enable basic functionality of the website, to provide a better experience on the website, to measure your interest in our products and services and to personalize marketing interactions, to deliver ads that are more relevant to you.

[BibTeX] [Marc21]

Type of publication:	Idiap-RR
Citation:	Lebret_Idiap-RR-15-2016
Number:	Idiap-RR-15-2016
Year:	2016
Month:	5
Institution:	Idiap
Abstract:	A popular application in Natural Language Processing (NLP) is the Sentiment Analysis (SA), i.e., the task of extracting contextual polarity from a given text. The social network Twitter provides an immense amount of text (called tweets) generated by users with a maximum number of 140 characters. In this project, we plan to learn a tweet representation from publicly provided data from Tweets in order to infer sentiment from them. One challenge on this task is the fact that tweets are generated from very different users, making the data very heterogeneous (different from regular data which is written in proper English). Another challenge is, clearly, the large scale of the problem. We propose a deep learning sentence representation (called tweet representation) from user generated data to infer sentiment from tweets. This representation is learned from scratch (directly from the words in tweet) over a large unlabeled corpus of tweets. We demonstrate that we achieve state-of-the-art results for SA on tweets.
Keywords:
Projects	Idiap
Authors	Lebret, Rémi Pinheiro, Pedro H. O. Collobert, Ronan
Added by:	[ADM]
Total mark:	0
Attachments
Lebret_Idiap-RR-15-2016.pdf (MD5: 97e9435ff3f298001b751da90cb77478)
Notes

processing time: 0.0004 seconds.