Jargalsaikhan Narantuya1,*, Jun-Sik Shin2, Sun Park2, JongWon Kim2
CMC-Computers, Materials & Continua, Vol.72, No.3, pp. 4375-4395, 2022, DOI:10.32604/cmc.2022.023318
- 21 April 2022
Abstract As the complexity of deep learning (DL) networks and training data grows enormously, methods that scale with computation are becoming the future of artificial intelligence (AI) development. In this regard, the interplay between machine learning (ML) and high-performance computing (HPC) is an innovative paradigm to speed up the efficiency of AI research and development. However, building and operating an HPC/AI converged system require broad knowledge to leverage the latest computing, networking, and storage technologies. Moreover, an HPC-based AI computing environment needs an appropriate resource allocation and monitoring strategy to efficiently utilize the system resources. In… More >