Guangli Zhu, Wenting Liu, Shunxiang Zhang*, Xiang Chen, Chang Yin
Computer Systems Science and Engineering, Vol.35, No.3, pp. 223-232, 2020, DOI:10.32604/csse.2020.35.223
Abstract Current methods for extracting new login sentiment words ignore the diversity of patterns formed by new multi-character words (where the number
of words is greater than two) and also disregard the influence of other new words that co-occur with a new word connoting sentiment. To solve this
problem, this paper proposes a method for extracting new login sentiment words from Chinese micro-blogs based on improved mutual information. First,
micro-blog data are preprocessed, taking into account meaningless symbols such as web links and punctuation. From the preprocessed data, the
candidate strings are obtained by…