Vol.63, No.1, 2020, pp.163-181, doi:10.32604/cmc.2020.06992
Predicting Simplified Thematic Progression Pattern for Discourse Analysis
  • Xuefeng Xi1, Victor S. Sheng1, 2, *, Shuhui Yang3, Baochuan Fu1, Zhiming Cui1
1 School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou, 215009, China.
2 Department of Computer Science, University of Central Arkansas, Conway, AR 72035, USA.
3 Department of Mathematics, Statistics, and Computer Science, Purdue University Northwest, Hammond, IN 46323, USA.
* Corresponding Author: Victor S. Sheng. Email: ssheng@uca.edu.
Received 22 April 2019; Accepted 29 April 2019; Issue published 30 March 2020
The pattern of thematic progression, reflecting the semantic relationships between contextual two sentences, is an important subject in discourse analysis. We introduce a new corpus of Chinese news discourses annotated with thematic progression information and explore some computational methods to automatically extracting the discourse structural features of simplified thematic progression pattern (STPP) between contextual sentences in a text. Furthermore, these features are used in a hybrid approach to a major discourse analysis task, Chinese coreference resolution. This novel approach is built up via heuristic sieves and a machine learning method that comprehensively utilizes both the top-down STPP features and the bottom-up semantic features. Experimental results on the intersection of the CoNLL-2012 task shared dataset and the CDTC corpus demonstrate the effectiveness of our proposed approach.
Discourse topic, theme-rheme theory, thematic progression.
Cite This Article
Xi, X., Sheng, V. S., Yang, S., Fu, B., Cui, Z. (2020). Predicting Simplified Thematic Progression Pattern for Discourse Analysis. CMC-Computers, Materials & Continua, 63(1), 163–181.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.