UMass Amherst
UMass Amherst | Library | Umail | Spire | People Finder 

Search

Match case Regex

Categories

Creative Commons License
This weblog is licensed under a Creative Commons License.

« Elliott Moreton Colloquium | Main | Evidentials/Acquistion Lab Meeting»

UMass Amherst Linguistics Sentiment Corpora

Earlier this year, Noah Constant, Chris Davis, Chris Potts, and Florian Schwarz released the UMass Amherst Linguistics Sentiment Corpora:

The UMass Amherst Linguistics Sentiment Corpora consist of n-gram counts extracted from over 700,000 online product reviews in Chinese, English, German, and Japanese. The files are UTF-8 encoded text. They are formatted to be read in as R data frames, but they can easily be manipulated with other tools.

This data collection effort and research that makes use of it were supported by an NSF grant and by a UMass Amherst College of Humanities and Fine Arts Visioning Grant.