Home

Awesome

RuSentiFrames 2.0

📓 Update 01 October 2023: this lexicon is now available in arekit-ss in text processing for most collections of the sentiment attitude extraction task (in Russian) with just single script into JSONL/CSV/SqLite` including (optional) language transferring 🔥 [Learn more ...]

Release notes: Statistics update.

Represents a lexicon which describes sentiments and connotations conveyed with a predicate in a verbal or nominal form. Checkout reference section for details in related paper.

Statistics

Lexical Unit Typev-2.0Diff
Frame Entries311+12%
Verbs3239+16%
Nouns986+20%
Phrases2551+6%
Other12removed
Unique Entries6788+12%
Total Entries7034+10%
PolarityPolarityv-2.0Diff
A0→A1POS2558+14%
A0→A1NEG3289+17%
author→A0POS170removed
author→A0NEG1578-
author→A1POS92removed
author→A1NEG249removed

NOTE: Diff corresponds to comparison with the v-1.0

Structure

The structure of the frames includes:

  1. Role designation: A0 is an argument exhibiting features of a Prototypical Agent, and A1 is a Theme (as in PropBank)

  2. Dimentions:

Format Description

The lexicon presented in JSON format.

Below is an example of the frame выйграть (to win):

"0_4": {
    "title": [
        "выиграть",
        "получить приз" ],
    "variants": [
        "выиграть",
        "выигрывать" ],
    "comment": "comment",
    "roles": {
        "a0": "победитель",
        "a1": "побежденный",
        "a2": "приз",
        "a3": "область, сфера, в которой одержана победа" },
    "frames": {
        "polarity": [
            [ "a1", "a0", "neg", 1.0 ],
            [ "a0", "a2", "pos", 1.0 ],
            [ "a1", "a2", "neg", 0.7 ],
            [ "a0", "a3", "pos", 1.0 ],
            [ "a1", "a3", "pos", 1.0 ] ],
        "value": [
            [ "a0", "a2", 1.0 ],
            [ "a1", "a2", 1.0 ] ],
        "effect": [
            [ "a0", "+", 1.0 ],
            [ "a1", "-", 1.0 ] ],
        "state": [
            [ "a0", "pos", 1.0 ],
            [ "a1", "neg", 1.0 ] ]
    }
}

Where keys denotes as follows:

title -- list of possible frame titles.

variants -- list of possible variants appeared in text.

roles -- is a dictonary of participants (keys) with the related description.

frames -- is a dictionary of parameters, which describes a frame in following directions:

Collection Reader

Folder reader contains a collection reader (source file parsers), written in Python-3.6.

Please refer to read.py, as it provides an example of how this collection could be parsed/readed.

Prior Releases

RuSentiFrames-1.0

References

@article{loukachevitch2020sentiment,
  title={Sentiment Frames for Attitude Extraction in Russian},
  author={Loukachevitch, Natalia and Rusnachenko, Nicolay},
  booktitle={Proceedings of International Conference on 
             Computational Linguistics and Intellectual 
             Technologies Dialogue-2020},
  year={2019}
}