GTK: A Hybrid-Search Algorithm of Top-Rank-k Frequent Patterns Based on Greedy Strategy

Long, Yuhang; Tang, Wensheng; Yang, Bo; Wang, Xinyu; Ma, Hua; Shi, Hang; Cheng, Xueyu

doi:10.32604/cmc.2020.09944



1 Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, Hunan Normal University, Changsha, 410081, China.
2 College of Information Science and Engineering, Hunan University, Changsha, 410082, China.
3 Clayton State University, Morrow, GA 30260, USA.

*Corresponding Authors: Wensheng Tang; Bo Yang.

Computers, Materials & Continua 2020, 63(3), 1445-1469. https://doi.org/10.32604/cmc.2020.09944

Received 30 January 2020; Accepted 26 February 2020; Issue published 30 April 2020


Abstract

Top-rank-k mining is widely used to find frequent patterns whose rank does not exceed k. In existing algorithms, a level-wise search can mine all target patterns, but it usually delays the generation of high-rank patterns, so the support threshold grows slowly and mining efficiency suffers. To address this problem, this paper proposes GTK, a greedy-strategy-based hybrid mining algorithm for top-rank-k frequent patterns. In GTK, top-rank-k patterns are stored in a static doubly linked list called RSL, and patterns are divided into short patterns and long patterns. Short patterns are generated by a rank-first search, which always joins the two highest-ranked patterns in RSL that have not yet been joined. Long patterns are then extracted by a level-wise search from the short patterns that satisfy specific conditions. To reduce redundancy, GTK improves the generation method of the subsume index and introduces new pruning strategies for candidates, which reduce the amount of computation and improve speed. Experiments on real and synthetic datasets evaluate the proposed algorithm, and the results show that GTK has clear advantages in both time efficiency and space efficiency.
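To make the mining target concrete, the following is a minimal brute-force sketch of what "frequent patterns with a rank not exceeding k" means. It is not the GTK algorithm: it uses no RSL, no rank-first search, and no pruning. It assumes the usual convention that a pattern's rank is the position of its support among the distinct support values sorted in descending order. The function name top_rank_k and the toy transactions are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def top_rank_k(transactions, k):
    """Brute-force reference: map each rank 1..k to its (pattern, support) pairs.

    Assumption: rank = position of a pattern's support among the distinct
    support values, sorted in descending order.
    """
    items = sorted({item for t in transactions for item in t})
    supports = {}
    # Enumerate every non-empty itemset and count the transactions containing it.
    for size in range(1, len(items) + 1):
        for pattern in combinations(items, size):
            count = sum(1 for t in transactions if set(pattern) <= set(t))
            if count > 0:
                supports[pattern] = count
    # Keep only the k largest distinct support values and number them 1..k.
    distinct = sorted(set(supports.values()), reverse=True)[:k]
    rank_of = {s: r for r, s in enumerate(distinct, start=1)}
    result = {r: [] for r in rank_of.values()}
    for pattern, s in sorted(supports.items()):
        if s in rank_of:
            result[rank_of[s]].append((pattern, s))
    return result


# Hypothetical toy database of five transactions.
transactions = [
    {"a", "b", "c"},
    {"a", "b"},
    {"a", "c"},
    {"b", "c"},
    {"a", "b", "c"},
]
result = top_rank_k(transactions, 2)
for rank, patterns in result.items():
    print(rank, patterns)
```

Because this sketch enumerates every itemset, its cost is exponential in the number of items. That cost is what threshold-raising and candidate-pruning approaches such as the one described in the abstract aim to avoid.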

Keywords

Top-rank-k frequent patterns, greedy strategy, hybrid-search.

Cite This Article

APA Style

Long, Y., Tang, W., Yang, B., Wang, X., Ma, H. et al. (2020). GTK: A hybrid-search algorithm of top-rank-k frequent patterns based on greedy strategy. Computers, Materials & Continua, 63(3), 1445-1469. https://doi.org/10.32604/cmc.2020.09944

Vancouver Style

Long Y, Tang W, Yang B, Wang X, Ma H, Shi H, et al. GTK: A hybrid-search algorithm of top-rank-k frequent patterns based on greedy strategy. Comput Mater Contin. 2020;63(3):1445-1469. https://doi.org/10.32604/cmc.2020.09944

IEEE Style

Y. Long et al., “GTK: A Hybrid-Search Algorithm of Top-Rank-k Frequent Patterns Based on Greedy Strategy,” Comput. Mater. Contin., vol. 63, no. 3, pp. 1445-1469, 2020. https://doi.org/10.32604/cmc.2020.09944


Copyright © 2020 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
