Research Interests | Yejin Choi |
Statistical approaches and computational models for Natural Language Processing (NLP) and Computational Linguistics. Interdisciplinary research connecting Natural Language Processing with Computer Vision, Psychology, and Cognitive Science. Here are a few themes that intertwine my research so far:
- Words and Pictures
With Prof. Tamara Berg and Alex Berg, we are investigating how to compose natural language descriptions from images [KPDLCBB11, LKBBC11].
- Stylometric Analysis (Authorship, Gender, Deception Detection)
Language is a window into people's minds, and stylometric analysis can be used as a tool for understanding the intent of individual writers, as well as societal characteristics reflected in human language. So far we have looked at gender attribution [SGC11], deceptive opinion spam detection [OCCH11], and Wikipedia vandalism detection [HHSJC11].
- Opinion & Sentiment Analysis
(1) Lexicon Induction and Adaptation for Sentiment Analysis: Lexical resources are one of the key ingredients to many approaches for sentiment analysis. However, as noted by many researchers, word meanings in a specific domain often do not align well with dictionary senses. My work in [CC09] casts the lexicon adaptation problem as a constraint optimization problem, and converts a general-purpose polarity lexicon into a domain-specific one. I also investigate Connotation Lexicon exploiting the selectional preference of Connotative Predicates [FBC11].
(2) Analysis in light of Compositional Semantics: A lot of problems in natural language understanding have compositional nature, which motivates the need to incorporate theories from compositional semantics into statistical models. As two gentle attempts, I explored compositional inference rules for sentiment analysis [CC08], and the use of compositional semantic vectors for extractive summarization to improve sponsored search [CFGJMP10].
(3) Semantic Negators: Semantic negators (e.g., "prevent", "fail") negate the polarity of their arguments (e.g., "prevent cancers"), constituting one of the key challenges in fine-grained opinion analysis. I have investigated the discovery of semantic negators [CC09], and the polarity inference rules dealing with negators [CC08].
(4) Structure-Aware Approaches for Fine-Grained Opinion Analysis This is the umbrella theme of my thesis [C10], spanning over my work for fine-grained opinion analysis [CC10, CC09, CC08, CBC06] and coreference resolution [CC07]. Other work in opinion analysis include [YCC10, BCC07, CCRP05]. Some of above has been employed by Appinions (appinions.com), a start-up company co-founded by Prof. Claire Cardie.
- Stylometric Analysis (Authorship, Gender, Deception Detection)