Research Output
A localization toolkit for sentic net
  SenticNet is a popular resource for concept-level sentiment analysis. Because SenticNet was created specifically for opinion mining in the English language, however, localizing it can be very laborious. This work proposes a toolkit for creating non-English versions of SenticNet in a time- and cost-effective way by exploiting online facilities such as Web dictionaries and translation engines. Three challenges arise. First, when a Web lexicon is used, one English sentiment concept can usually be mapped to multiple concepts in the local language. We develop a concept disambiguation algorithm that discovers context within texts in the target language. Second, the polarity of some concepts in the local language may differ from that of their English counterparts; we refer to these as language-dependent sentiment concepts. An algorithm is developed to detect such sentiment conflicts using sentiment-annotated corpora in the two languages. Third, some sentiment concepts of the local language are still not covered after dictionary consultation and online translation. We develop a tool to extract these concepts from a sentiment dictionary in the local language. Our practice and evaluation in constructing the Chinese version of SenticNet indicate that the proposed algorithms form an effective toolkit for localizing SenticNet.
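
  The sentiment-conflict step described in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes each corpus is a list of (text, label) pairs with labels +1 or -1, estimates a concept's polarity as the mean label of the texts that contain it, and flags a mapped pair whose two estimates have opposite signs. The function names, the `margin` threshold, and the toy sentences are all hypothetical.

```python
def estimate_polarity(concept, corpus):
    """Mean label (+1/-1) of the annotated texts containing `concept`.

    Returns None when the concept never occurs in the corpus.
    Substring matching is a simplifying assumption, not the paper's method.
    """
    labels = [label for text, label in corpus if concept in text]
    if not labels:
        return None
    return sum(labels) / len(labels)


def find_conflicts(mapping, source_corpus, target_corpus, margin=0.5):
    """Return mapped concept pairs whose estimated polarities disagree in sign.

    `mapping` maps an English concept to its local-language candidate.
    `margin` (hypothetical) drops pairs where either estimate is weak.
    """
    conflicts = []
    for source_concept, target_concept in mapping.items():
        source_score = estimate_polarity(source_concept, source_corpus)
        target_score = estimate_polarity(target_concept, target_corpus)
        if source_score is None or target_score is None:
            continue  # no evidence in one of the corpora
        opposite_sign = source_score * target_score < 0
        strong_enough = min(abs(source_score), abs(target_score)) >= margin
        if opposite_sign and strong_enough:
            conflicts.append(
                (source_concept, target_concept, source_score, target_score)
            )
    return conflicts


# Toy, hand-made data for illustration only.
english_corpus = [
    ("the dragon burned the village", -1),
    ("a dragon attacked us", -1),
    ("lovely birthday cake", 1),
]
chinese_corpus = [
    ("他是人中之龙", 1),
    ("龙年大吉", 1),
    ("生日蛋糕很好吃", 1),
]
mapping = {"dragon": "龙", "birthday cake": "生日蛋糕"}

conflicts = find_conflicts(mapping, english_corpus, chinese_corpus)
```

  On this toy data only the "dragon" pair is flagged, since its estimated polarity is negative in the English sample and positive in the Chinese one. Real corpora would call for proper tokenization and a significance check before trusting such estimates.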

  • Date:

    29 January 2015

  • Publication Status:

    Published

  • DOI:

    10.1109/ICDMW.2014.179

  • Funders:

    Historic Funder (pre-Worktribe)

Citation

Xia, Y., Li, X., Cambria, E., & Hussain, A. (2015). A localization toolkit for sentic net. In 2014 IEEE International Conference on Data Mining Workshop, (403-408). https://doi.org/10.1109/ICDMW.2014.179

Keywords

Sentiment analysis, Sentic Net, common sense, localization
