Reinforcement Learning Based Quantization Strategy Optimal Assignment Algorithm for Mixed Precision

Yuejiao Wang; Zhong Ma; Chaojie Yang; Yu Yang; Lu Wei

doi:10.32604/cmc.2024.047108

Open Access icon Open Access

ARTICLE

Reinforcement Learning Based Quantization Strategy Optimal Assignment Algorithm for Mixed Precision

Yuejiao Wang, Zhong Ma^*, Chaojie Yang, Yu Yang, Lu Wei

Xi’an Microelectronics Technology Institute, Xi’an, 710065, China

* Corresponding Author: Zhong Ma. Email: email

(This article belongs to the Special Issue: Development and Industrial Application of AI Technologies)

Computers, Materials & Continua 2024, 79(1), 819-836. https://doi.org/10.32604/cmc.2024.047108

Received 25 October 2023; Accepted 27 February 2024; Issue published 25 April 2024

Abstract

The quantization algorithm compresses the original network by reducing the numerical bit width of the model, which improves the computation speed. Because different layers have different redundancy and sensitivity to data bit width. Reducing the data bit width will result in a loss of accuracy. Therefore, it is difficult to determine the optimal bit width for different parts of the network with guaranteed accuracy. Mixed precision quantization can effectively reduce the amount of computation while keeping the model accuracy basically unchanged. In this paper, a hardware-aware mixed precision quantization strategy optimal assignment algorithm adapted to low bit width is proposed, and reinforcement learning is used to automatically predict the mixed precision that meets the constraints of hardware resources. In the state-space design, the standard deviation of weights is used to measure the distribution difference of data, the execution speed feedback of simulated neural network accelerator inference is used as the environment to limit the action space of the agent, and the accuracy of the quantization model after retraining is used as the reward function to guide the agent to carry out deep reinforcement learning training. The experimental results show that the proposed method obtains a suitable model layer-by-layer quantization strategy under the condition that the computational resources are satisfied, and the model accuracy is effectively improved. The proposed method has strong intelligence and certain universality and has strong application potential in the field of mixed precision quantization and embedded neural network model deployment.

Keywords

Mixed precision quantization; quantization strategy optimal assignment; reinforcement learning; neural network model deployment

Cite This Article

APA Style

Wang, Y., Ma, Z., Yang, C., Yang, Y., Wei, L. (2024). Reinforcement Learning Based Quantization Strategy Optimal Assignment Algorithm for Mixed Precision. Computers, Materials & Continua, 79(1), 819–836. https://doi.org/10.32604/cmc.2024.047108

Vancouver Style

Wang Y, Ma Z, Yang C, Yang Y, Wei L. Reinforcement Learning Based Quantization Strategy Optimal Assignment Algorithm for Mixed Precision. Comput Mater Contin. 2024;79(1):819–836. https://doi.org/10.32604/cmc.2024.047108

IEEE Style

Y. Wang, Z. Ma, C. Yang, Y. Yang, and L. Wei, “Reinforcement Learning Based Quantization Strategy Optimal Assignment Algorithm for Mixed Precision,” Comput. Mater. Contin., vol. 79, no. 1, pp. 819–836, 2024. https://doi.org/10.32604/cmc.2024.047108

BibTex EndNote RIS

Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Reinforcement Learning Based Quantization Strategy Optimal Assignment Algorithm for Mixed Precision

Abstract

Keywords

Cite This Article

1516

807

1

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link