Open Access iconOpen Access

ARTICLE

crossmark

Fast Access and Retrieval of Big Data Based on Unique Identification

by Wenshun Sheng1,*, Aiping Xu2, Shengli Wu3

1 Pujiang Institute, Nanjing Tech University, Nanjing, 211200, China
2 School of Computer, Wuhan University, Wuhan, 430072, China
3 School of Computing, Ulster University, Belfast, BT370QB, United Kingdom

* Corresponding Author: Wenshun Sheng. Email: email

Intelligent Automation & Soft Computing 2022, 32(3), 1781-1795. https://doi.org/10.32604/iasc.2022.022571

Abstract

In big data applications, the data are usually stored in data files, whose data file structures, field structures, data types and lengths are not uniform. Therefore, if these data are stored in the traditional relational database, it is difficult to meet the requirements of fast storage and access. To solve this problem, we propose the mapping model between the source data file and the target HBase file. Our method solves the heterogeneity of the file object and the universality of the storage conversion. Firstly, based on the mapping model, we design “RowKey”, generation rules and algorithm. Then according to the mapping rules of data file fields with the HBase table column, the data in the data file are transformed into HBase. Finally, the retrieved keywords in “RowKey” are stored and used to achieve fast data retrieval by prefix matching or keyword matching method. Our method has been applied to different projects, which shows these results can be applied to the data conversion from regular row store data file to HBase distributed large data storage and has strong commonality. The method can be widely used in HBase big data storage applications.

Keywords


Cite This Article

APA Style
Sheng, W., Xu, A., Wu, S. (2022). Fast access and retrieval of big data based on unique identification. Intelligent Automation & Soft Computing, 32(3), 1781-1795. https://doi.org/10.32604/iasc.2022.022571
Vancouver Style
Sheng W, Xu A, Wu S. Fast access and retrieval of big data based on unique identification. Intell Automat Soft Comput . 2022;32(3):1781-1795 https://doi.org/10.32604/iasc.2022.022571
IEEE Style
W. Sheng, A. Xu, and S. Wu, “Fast Access and Retrieval of Big Data Based on Unique Identification,” Intell. Automat. Soft Comput. , vol. 32, no. 3, pp. 1781-1795, 2022. https://doi.org/10.32604/iasc.2022.022571



cc Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1635

    View

  • 882

    Download

  • 0

    Like

Share Link