Table of Content

Open Access iconOpen Access

ARTICLE

crossmark

A Discovery Method for New Words From Mobile Product Comments

Hai Yang Zhu1,∗,†, Xiaobo Yin1,∗,‡, Shunxiang Zhang1,∗,§, Zhongliang Wei1,¶, Guangli Zhu1,II, Meng-Yen Hsieh2,*,**

1 School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan, China
2 Department of Computer Science and Information Engineering, Providence University, Taichung 43301, Taiwan
† 838412840@qq.com(H.Z.)
‡ xbyin@aust.edu.cn(XY.)
§ sxzhang@aust.edu.cn(S.Z.)
¶ zhlwei@aust.edu.cn(Z.W.)
II glzhu@aust.edu.cn(G.Z.)
∗∗ mengyen@pu.edu.tw(M.H.)

* Corresponding Authors: email; email; email; Tel.: +88-642-6328-001(M.H.).

Computer Systems Science and Engineering 2020, 35(6), 399-410. https://doi.org/10.32604/csse.2020.35.399

Abstract

A large number of new words in product reviews generated by mobile terminals are valuable indicators of the privacy preferences of customers. By clustering these privacy preferences, sufficient information can be collected to characterize users and provide a data basis for the research issues of privacy protection. The widespread use of mobile clients shortens the string length of the comment corpus generated by product reviews, resulting in a high repetition rate. Therefore, the effective and accurate recognition of new words is a problem that requires an urgent solution. Hence, in this paper, we propose a method for discovering new words from product comments based on Mutual Information and improved Branch Entropy. Firstly, by calculating the Co-occurrence Frequency and Mutual Information between words and adjacent words, the character strings of words after pre-processing and word segmentation are expanded left and right respectively to discover the potential word set. The candidate set of new words is obtained by means of an improved support filtering algorithm. Finally, a new word set is built by applying an improved Branch Entropy filtering algorithm and removing old words. The experimental results show that this method can accurately and effectively identify new words in product comments.

Keywords


Cite This Article

H. Yang Zhu, X. Yin, S. Zhang, Z. Wei, G. Zhu et al., "A discovery method for new words from mobile product comments," Computer Systems Science and Engineering, vol. 35, no.6, pp. 399–410, 2020.

Citations




cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 2067

    View

  • 1259

    Download

  • 1

    Like

Share Link