Open Access

ARTICLE


A Novel Mixed Precision Distributed TPU GAN for Accelerated Learning Curve

Aswathy Ravikumar, Harini Sriraman*

School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, 600127, India

* Corresponding Author: Harini Sriraman.

Computer Systems Science and Engineering 2023, 46(1), 563-578. https://doi.org/10.32604/csse.2023.034710

Abstract

Deep neural networks are gaining importance and popularity in applications and services. Because of their enormous number of learnable parameters and the size of their training datasets, training neural networks is computationally costly, and parallel and distributed computation strategies are used to accelerate the process. Generative Adversarial Networks (GANs) are a recent achievement in deep learning. These generative models are computationally expensive because a GAN comprises two neural networks and trains on enormous datasets; typically, a GAN is trained on a single server. Conventional deep learning accelerator designs are further challenged by the unique properties of GANs, such as their many computation stages with non-traditional convolution layers. This work addresses the problem of distributing GANs so that they can train on datasets spread across many Tensor Processing Units (TPUs); distributed training accelerates learning and decreases computation time. In this paper, a GAN is accelerated on a distributed multi-core TPU in a synchronous, data-parallel configuration. For adequate acceleration of the GAN, data-parallel Stochastic Gradient Descent (SGD) is implemented on the multi-core TPU using distributed TensorFlow with mixed precision, bfloat16, and XLA (Accelerated Linear Algebra). The study was conducted on the MNIST dataset for batch sizes varying from 64 to 512 over 30 epochs, using distributed SGD on a TPU v3 with a 128 × 128 systolic array. A large-batch technique is implemented in bfloat16 to decrease storage cost and speed up floating-point computation. Accelerated learning curves are obtained for the generator and discriminator networks, and training time is reduced by 79% by varying the batch size from 64 to 512 on the multi-core TPU.
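The training configuration the abstract describes can be sketched in a few lines of TensorFlow. The following is a minimal, illustrative sketch, not the authors' code: the network shapes, the SGD learning rate of 0.01, and the 100-dimensional noise vector are assumptions chosen for MNIST, while the TPU distribution strategy, the mixed_bfloat16 policy, the global batch size of 512, and the per-replica loss averaging follow the setup named in the abstract. On TPU, the replicated step function is compiled through XLA automatically, so no explicit XLA flag is needed.

```python
# Hedged sketch of synchronous data-parallel GAN training on a multi-core TPU
# with bfloat16 mixed precision. Helper names and hyperparameters are
# illustrative assumptions, not taken from the paper.
import tensorflow as tf
from tensorflow.keras import layers, mixed_precision

# Attach to the TPU and build a synchronous data-parallel strategy:
# each TPU v3 core receives one shard of every global batch.
resolver = tf.distribute.cluster_resolver.TPUClusterResolver(tpu="")
tf.config.experimental_connect_to_cluster(resolver)
tf.tpu.experimental.initialize_tpu_system(resolver)
strategy = tf.distribute.TPUStrategy(resolver)

# bfloat16 mixed precision: bfloat16 compute, float32 variables.
# Unlike float16, bfloat16 needs no loss scaling.
mixed_precision.set_global_policy("mixed_bfloat16")

GLOBAL_BATCH = 512  # the paper sweeps batch sizes 64 ... 512

def make_generator():  # assumed architecture for 28x28 MNIST images
    return tf.keras.Sequential([
        layers.Dense(7 * 7 * 128, input_shape=(100,)),
        layers.Reshape((7, 7, 128)),
        layers.Conv2DTranspose(64, 5, strides=2, padding="same", activation="relu"),
        # Keep the final output in float32 for numeric stability.
        layers.Conv2DTranspose(1, 5, strides=2, padding="same",
                               activation="tanh", dtype="float32"),
    ])

def make_discriminator():  # assumed architecture
    return tf.keras.Sequential([
        layers.Conv2D(64, 5, strides=2, padding="same", input_shape=(28, 28, 1)),
        layers.LeakyReLU(),
        layers.Conv2D(128, 5, strides=2, padding="same"),
        layers.LeakyReLU(),
        layers.Flatten(),
        layers.Dense(1, dtype="float32"),  # logits in float32
    ])

with strategy.scope():  # variables are created once per replica, kept in sync
    generator = make_generator()
    discriminator = make_discriminator()
    g_opt = tf.keras.optimizers.SGD(0.01)  # assumed learning rate
    d_opt = tf.keras.optimizers.SGD(0.01)
    bce = tf.keras.losses.BinaryCrossentropy(
        from_logits=True, reduction=tf.keras.losses.Reduction.NONE)

@tf.function
def train_step(real_images):
    noise = tf.random.normal([tf.shape(real_images)[0], 100])
    with tf.GradientTape() as g_tape, tf.GradientTape() as d_tape:
        fake = generator(noise, training=True)
        real_logits = discriminator(real_images, training=True)
        fake_logits = discriminator(fake, training=True)
        # Average per-example losses over the *global* batch so gradients
        # aggregate correctly across replicas in synchronous SGD.
        d_loss = tf.nn.compute_average_loss(
            bce(tf.ones_like(real_logits), real_logits)
            + bce(tf.zeros_like(fake_logits), fake_logits),
            global_batch_size=GLOBAL_BATCH)
        g_loss = tf.nn.compute_average_loss(
            bce(tf.ones_like(fake_logits), fake_logits),
            global_batch_size=GLOBAL_BATCH)
    g_opt.apply_gradients(zip(
        g_tape.gradient(g_loss, generator.trainable_variables),
        generator.trainable_variables))
    d_opt.apply_gradients(zip(
        d_tape.gradient(d_loss, discriminator.trainable_variables),
        discriminator.trainable_variables))

# Shard MNIST across the replicas and run the replicated step.
(x_train, _), _ = tf.keras.datasets.mnist.load_data()
x_train = (x_train.astype("float32") / 127.5 - 1.0)[..., None]
ds = tf.data.Dataset.from_tensor_slices(x_train).shuffle(60000).batch(
    GLOBAL_BATCH, drop_remainder=True)
for batch in strategy.experimental_distribute_dataset(ds):
    strategy.run(train_step, args=(batch,))
```

Scaling the per-example losses by the global batch size, rather than letting each replica average over its local shard, is what makes the all-reduced gradient equivalent to single-device SGD on the full batch.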

Keywords


Cite This Article

A. Ravikumar and H. Sriraman, "A novel mixed precision distributed TPU GAN for accelerated learning curve," Computer Systems Science and Engineering, vol. 46, no. 1, pp. 563–578, 2023.



This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.