Open Access iconOpen Access

ARTICLE

crossmark

KSKV: Key-Strategy for Key-Value Data Collection with Local Differential Privacy

by Dan Zhao1, Yang You2, Chuanwen Luo3,*, Ting Chen4,*, Yang Liu5

1 Artificial Intelligence Development Research Center, Institute of Scientific and Technical Information of China, Beijing, 100038, China
2 Industry Development Department, NSFOCUS Inc., Beijing, China
3 School of Information Science and Technology, Beijing Forestry University, Beijing, 100083, China
4 School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 610054, China
5 Institute of Computing Technology, China Academy of Railway Sciences Corporation Limited, Beijing, 10081, China

* Corresponding Authors: Chuanwen Luo. Email: email; Ting Chen. Email: email

(This article belongs to the Special Issue: Privacy-Preserving Technologies for Large-scale Artificial Intelligence)

Computer Modeling in Engineering & Sciences 2024, 139(3), 3063-3083. https://doi.org/10.32604/cmes.2023.045400

Abstract

In recent years, the research field of data collection under local differential privacy (LDP) has expanded its focus from elementary data types to include more complex structural data, such as set-value and graph data. However, our comprehensive review of existing literature reveals that there needs to be more studies that engage with key-value data collection. Such studies would simultaneously collect the frequencies of keys and the mean of values associated with each key. Additionally, the allocation of the privacy budget between the frequencies of keys and the means of values for each key does not yield an optimal utility tradeoff. Recognizing the importance of obtaining accurate key frequencies and mean estimations for key-value data collection, this paper presents a novel framework: the Key-Strategy Framework for Key-Value Data Collection under LDP. Initially, the Key-Strategy Unary Encoding (KS-UE) strategy is proposed within non-interactive frameworks for the purpose of privacy budget allocation to achieve precise key frequencies; subsequently, the Key-Strategy Generalized Randomized Response (KS-GRR) strategy is introduced for interactive frameworks to enhance the efficiency of collecting frequent keys through group-and-iteration methods. Both strategies are adapted for scenarios in which users possess either a single or multiple key-value pairs. Theoretically, we demonstrate that the variance of KS-UE is lower than that of existing methods. These claims are substantiated through extensive experimental evaluation on real-world datasets, confirming the effectiveness and efficiency of the KS-UE and KS-GRR strategies.

Keywords


Cite This Article

APA Style
Zhao, D., You, Y., Luo, C., Chen, T., Liu, Y. (2024). KSKV: key-strategy for key-value data collection with local differential privacy. Computer Modeling in Engineering & Sciences, 139(3), 3063-3083. https://doi.org/10.32604/cmes.2023.045400
Vancouver Style
Zhao D, You Y, Luo C, Chen T, Liu Y. KSKV: key-strategy for key-value data collection with local differential privacy. Comput Model Eng Sci. 2024;139(3):3063-3083 https://doi.org/10.32604/cmes.2023.045400
IEEE Style
D. Zhao, Y. You, C. Luo, T. Chen, and Y. Liu, “KSKV: Key-Strategy for Key-Value Data Collection with Local Differential Privacy,” Comput. Model. Eng. Sci., vol. 139, no. 3, pp. 3063-3083, 2024. https://doi.org/10.32604/cmes.2023.045400



cc Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 656

    View

  • 292

    Download

  • 0

    Like

Share Link