Open Access
ARTICLE
Research on Interpolation Method for Missing Electricity Consumption Data
1 Department of Electronic Commerce, Xiangtan University, Xiangtan, 411105, China
2 School of Information Engineering, Yancheng Teachers University, Yancheng, 224000, China
3 Department of Information and Electrical Engineering, Ningde Normal University, Ningde, 352100, China
4 College of Engineering, Southern University of Science and Technology, Shenzhen, 518005, China
5 School of Informatics, Xiamen University, Xiamen, 361005, China
* Corresponding Author: Yaser A. Nanehkaran. Email:
(This article belongs to the Special Issue: Industrial Big Data and Artificial Intelligence-Driven Intelligent Perception, Maintenance, and Decision Optimization in Industrial Systems)
Computers, Materials & Continua 2024, 78(2), 2575-2591. https://doi.org/10.32604/cmc.2024.048522
Received 10 December 2023; Accepted 17 January 2024; Issue published 27 February 2024
Abstract
Missing value is one of the main factors that cause dirty data. Without high-quality data, there will be no reliable analysis results and precise decision-making. Therefore, the data warehouse needs to integrate high-quality data consistently. In the power system, the electricity consumption data of some large users cannot be normally collected resulting in missing data, which affects the calculation of power supply and eventually leads to a large error in the daily power line loss rate. For the problem of missing electricity consumption data, this study proposes a group method of data handling (GMDH) based data interpolation method in distribution power networks and applies it in the analysis of actually collected electricity data. First, the dependent and independent variables are defined from the original data, and the upper and lower limits of missing values are determined according to prior knowledge or existing data information. All missing data are randomly interpolated within the upper and lower limits. Then, the GMDH network is established to obtain the optimal complexity model, which is used to predict the missing data to replace the last imputed electricity consumption data. At last, this process is implemented iteratively until the missing values do not change. Under a relatively small noise level (α = 0.25), the proposed approach achieves a maximum error of no more than 0.605%. Experimental findings demonstrate the efficacy and feasibility of the proposed approach, which realizes the transformation from incomplete data to complete data. Also, this proposed data interpolation approach provides a strong basis for the electricity theft diagnosis and metering fault analysis of electricity enterprises.Keywords
Cite This Article
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.