Better Sentiment Analysis Through Forecasting

PI: Steven Skiena

Project Summary

The emerging field of sentiment analysis employs algorithmic methods to identify and summarize opinions expressed in text. Both machine learning and ad hoc approaches lie at the foundations of contemporary sentiment analysis systems, but progress on improving both precision and recall has been slowed by the expense and complexity of obtaining sufficiently broad, general sentiment training/validation data.

Recent work has established that fundamental economic variables can be successfully forecast by applying sentiment analysis methods to news-oriented text streams. This project turns that relation on its head, using such forecasting approaches to improve both the precision and recall of general entity-oriented sentiment analysis methods. It comprises a three-pronged research effort into entity-level sentiment analysis, focusing on improved assessment and algorithms, with applications to the social sciences and forecasting. In particular:

Intellectual Merit: The proposed research promises to substantially improve both the precision and recall of sentiment detection methods by focusing on the weakest link: rigorous yet domain-, source-, and language-independent assessment of sentiment. Beyond improvements in natural language processing (NLP), this extends to other issues in opinion mining, including article clustering and duplicate detection, entity-domain context, and combining opinions from large numbers of distinct sources.

Broader Impact: The sentiment analysis methods and data developed under this research project will be directly applicable to research in a broad range of social sciences, including sociology, economics, political science, and media and communication studies. They will serve as both an educational and scholarly resource in these fields, empowering students and researchers to conduct their own primary studies of historical trends and social forces. Results will be disseminated to the community through our website http://www.textmap.org/III.

Activities and Findings

The major activities for this project revolved around a new approach to natural language processing and sentiment analysis which naturally generalizes to all the world's major languages. Word embeddings assign each word in a language a unique point in (say) 50-dimensional space; two words have similar meanings/roles if their points lie close to each other in this space.
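As a concrete illustration of this geometry, the following minimal sketch ranks words by cosine similarity to a query word. The 4-dimensional vectors are made-up toy values standing in for real embeddings:

    import numpy as np

    # Toy 4-dimensional vectors standing in for real word embeddings.
    embeddings = {
        "king":  np.array([0.8, 0.3, 0.1, 0.6]),
        "queen": np.array([0.7, 0.4, 0.2, 0.6]),
        "apple": np.array([0.1, 0.9, 0.8, 0.0]),
    }

    def cosine(u, v):
        # Cosine similarity: close to 1.0 when two vectors point the same way.
        return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

    def nearest(word):
        # Rank every other word by similarity to the query word's vector.
        sims = [(other, cosine(embeddings[word], vec))
                for other, vec in embeddings.items() if other != word]
        return sorted(sims, key=lambda pair: pair[1], reverse=True)

    print(nearest("king"))  # "queen" ranks well above "apple"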

Recently re-introduced techniques in unsupervised feature learning make this possible by acquiring common features for a specific language's vocabulary from unlabeled text. These features, also known as distributed word representations (embeddings), have been used by us and other groups to build a unified NLP architecture that solves multiple tasks: part-of-speech (POS) tagging, named entity recognition (NER), semantic role labeling, and chunking.

We have built word embeddings for one hundred of the world's most frequently spoken languages (Al-Rfou et al., 2013), using neural networks (auto-encoders) trained on each language's Wikipedia in an unsupervised setting, and shown that they capture surprisingly subtle features of language usage such as sentiment, plurality, and even nation of origin (Chen et al., 2013). We have made these word embeddings freely available to the research community, with well over 1,000 downloads to date, and employ them in our work on sentiment analysis. Further, in work presented at KDD 2014, we (Perozzi, Al-Rfou, and Skiena, 2014) developed DeepWalk, an extension of the ideas behind word embeddings to identifying features in graphs.
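The following is a minimal sketch of the core DeepWalk idea, using a toy adjacency-list graph rather than any real network: truncated random walks over the graph play the role of sentences, so a skip-gram trainer can then learn one vector per node exactly as it would for words.

    import random

    # A toy undirected graph given as adjacency lists.
    graph = {
        "a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b", "d"], "d": ["c"],
    }

    def random_walk(start, length):
        # Take a truncated random walk, stepping to a uniformly chosen neighbor.
        walk = [start]
        for _ in range(length - 1):
            walk.append(random.choice(graph[walk[-1]]))
        return walk

    # Generate a corpus of walks; each walk plays the role of a sentence,
    # and each node plays the role of a word.
    corpus = [random_walk(node, length=10)
              for node in graph for _ in range(5)]

    # These "sentences" would then be fed to any skip-gram trainer
    # (e.g., gensim's Word2Vec) to produce one embedding vector per node.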

We have quantitatively demonstrated the utility of our word embeddings by using them as the sole features for training a part-of-speech tagger for a subset of these languages. We find their performance to be competitive with near-state-of-the-art methods in English, Danish, and Swedish.
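To illustrate this experimental setup (the embeddings, window size, and tag set below are toy placeholders, not our trained resources), each token can be represented by the concatenated embeddings of a small window around it and fed to an off-the-shelf classifier:

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    DIM = 4
    rng = np.random.default_rng(0)
    # Random toy embeddings; in practice these come from unsupervised training.
    emb = {w: rng.normal(size=DIM) for w in ["the", "cat", "sat", "<pad>"]}

    def features(tokens, i, window=1):
        # Concatenate embeddings for tokens[i-window .. i+window], padding edges.
        ctx = ["<pad>"] * window + tokens + ["<pad>"] * window
        return np.concatenate([emb[w] for w in ctx[i:i + 2 * window + 1]])

    sentence, tags = ["the", "cat", "sat"], ["DET", "NOUN", "VERB"]
    X = np.array([features(sentence, i) for i in range(len(sentence))])

    clf = LogisticRegression(max_iter=1000).fit(X, tags)
    print(clf.predict(X))  # should recover the training tags on this toy example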

In particular, these word embeddings point to a way to build sentiment analysis systems for all the world's languages in an elegant, consistent, non-ad hoc approach, by training on the Wikipedia edition of each language. Our work (Chen and Skiena, 2014) was presented at ACL 2014, where we introduced high-quality sentiment lexicons for 136 major languages, built by integrating a variety of linguistic resources into an immense knowledge graph. Our lexicons have a polarity agreement of 95.7% with published lexicons while achieving an overall coverage of 45.2%. Further, we demonstrated the performance of our lexicons in an extrinsic analysis of 2,000 distinct historical figures in Wikipedia articles from 30 languages. Despite cultural differences and the intended neutrality of Wikipedia, our lexicons show an average sentiment correlation of 0.28 across all language pairs.
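As a sketch of how such lexicons are applied downstream (the tiny lexicon here is an illustrative stand-in for the released resources, not the actual published entries), a text can be scored by the net polarity of its lexicon hits:

    # Toy polarity lexicon: +1 for positive words, -1 for negative ones.
    lexicon = {"good": +1, "great": +1, "bad": -1, "terrible": -1}

    def polarity(text):
        # Net sentiment normalized by the number of lexicon hits in the text.
        hits = [lexicon[w] for w in text.lower().split() if w in lexicon]
        return sum(hits) / len(hits) if hits else 0.0

    print(polarity("A great film with a good cast"))    # 1.0
    print(polarity("A terrible plot but good acting"))  # 0.0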

This paper (and the release of our lexicons) marked the successful completion of our major goal of building sentiment detection systems for foreign-language text streams.

Press Coverage

Primary References

R. Al-Rfou, B. Perozzi, and S. Skiena. Polyglot: Distributed Word Representations for Multilingual NLP. Proceedings of the Conference on Computational Natural Language Learning (CoNLL), 2013.

Y. Chen, B. Perozzi, R. Al-Rfou, and S. Skiena. The Expressive Power of Word Embeddings. arXiv:1301.3226, 2013.

Y. Chen and S. Skiena. Building Sentiment Lexicons for All Major Languages. Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2014.

B. Perozzi, R. Al-Rfou, and S. Skiena. DeepWalk: Online Learning of Social Representations. Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2014.

Collaborators

Students

Acknowledgement

This material is based upon work supported by the National Science Foundation under Grant No. 1017181.

Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Last Update

September 9, 2014.