Open Access
ARTICLE
An Automated System to Predict Popular Cybersecurity News Using Document Embeddings
1 National University of Sciences and Technology, Islamabad, Pakistan
2 Beriut Arab University, Beirut, Lebanon
3 Department of Computer Science and Engineering, Soonchunhyang University, Asan, Korea
4 Hitec University, Taxila, Pakistan
* Corresponding Authors: Yunyoung Nam. Email: ; Muhammad Attique Khan. Email:
Computer Modeling in Engineering & Sciences 2021, 127(2), 533-547. https://doi.org/10.32604/cmes.2021.014355
Received 19 September 2020; Accepted 03 February 2021; Issue published 19 April 2021
Abstract
The substantial competition among the news industries puts editors under the pressure of posting news articles which are likely to gain more user attention. Anticipating the popularity of news articles can help the editorial teams in making decisions about posting a news article. Article similarity extracted from the articles posted within a small period of time is found to be a useful feature in existing popularity prediction approaches. This work proposes a new approach to estimate the popularity of news articles by adding semantics in the article similarity based approach of popularity estimation. A semantically enriched model is proposed which estimates news popularity by measuring cosine similarity between document embeddings of the news articles. Word2vec model has been used to generate distributed representations of the news content. In this work, we define popularity as the number of times a news article is posted on different websites. We collect data from different websites that post news concerning the domain of cybersecurity and estimate the popularity of cybersecurity news. The proposed approach is compared with different models and it is shown that it outperforms the other models.Keywords
Cite This Article
Citations
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.