SCARE - The Sentiment Corpus of App Reviews with Fine-grained Annotations in German

The SCARE corpus consists of fine-grained annotations for mobile application reviews from the Google Play Store. For each user review, the application aspects it mentions (e.g., design or usability) are annotated, along with the subjective phrases that evaluate these aspects. In addition, each subjective phrase is labeled with its polarity (positive, negative, or neutral), and each aspect with its relationship to the app under discussion: aspects that refer to another app, or to an aspect of another app, are marked as “foreign”; all other aspects are “related”. In total, the corpus consists of 1,760 German application reviews with 2,487 aspects and 3,959 subjective phrases.
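The annotation scheme described above can be pictured as a small data model. The following Python sketch is illustrative only: the class and field names, the character-offset representation, and the optional link from a subjective phrase to an aspect are assumptions, not the corpus's actual file format, and the example sentence is made up rather than taken from the corpus.

```python
from collections import Counter
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Polarity(Enum):
    POSITIVE = "positive"
    NEGATIVE = "negative"
    NEUTRAL = "neutral"


class Relation(Enum):
    RELATED = "related"  # aspect of the app under discussion
    FOREIGN = "foreign"  # another app, or an aspect of another app


@dataclass
class Aspect:
    start: int  # character offsets into the review text (assumed representation)
    end: int
    text: str
    relation: Relation


@dataclass
class SubjectivePhrase:
    start: int
    end: int
    text: str
    polarity: Polarity
    target: Optional[Aspect] = None  # assumed link to the evaluated aspect


@dataclass
class Review:
    text: str
    aspects: List[Aspect] = field(default_factory=list)
    phrases: List[SubjectivePhrase] = field(default_factory=list)


def offsets(text: str, needle: str) -> tuple:
    """Return (start, end) of the first occurrence of needle in text."""
    start = text.index(needle)
    return start, start + len(needle)


# Made-up example sentence, NOT taken from the corpus.
text = "Das Design ist schön, aber die Bedienung bei WhatsApp ist besser."

design = Aspect(*offsets(text, "Design"), "Design", Relation.RELATED)
usability = Aspect(*offsets(text, "Bedienung"), "Bedienung", Relation.FOREIGN)

review = Review(
    text=text,
    aspects=[design, usability],
    phrases=[
        SubjectivePhrase(*offsets(text, "schön"), "schön", Polarity.POSITIVE, design),
        SubjectivePhrase(*offsets(text, "besser"), "besser", Polarity.POSITIVE, usability),
    ],
)

# Count subjective phrases per polarity for this review.
counts = Counter(phrase.polarity for phrase in review.phrases)
foreign_aspects = [a.text for a in review.aspects if a.relation is Relation.FOREIGN]
```

Keeping the relation on the aspect and the polarity on the subjective phrase mirrors the description above, where polarity is recorded per phrase and the “foreign”/“related” distinction per aspect.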

If you use these resources, please cite:

  • Mario Sänger, Ulf Leser, Steffen Kemmerer, Peter Adolphs, and Roman Klinger. SCARE – The Sentiment Corpus of App Reviews with Fine-grained Annotations in German. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16), Portorož, Slovenia, May 2016. European Language Resources Association (ELRA). [ bib ]

The poster presented at LREC is here.

During the creation of the corpus, we also collected a data set of over 800,000 German user reviews from 11 application categories. We cannot make this resource publicly available. However, if you are interested in this data, please send an email to scare@romanklinger.de, tell us what you would like to use the data for, and clearly state that you will not distribute it. We will then send you the username and password for the corresponding link below. The same applies to the link below that contains the review texts in addition to the annotations.

Downloads: