A novel method of heterogeneous parallel machine learning by CPU–TPU for molecular dynamics.

Saved in:
Bibliographic Details
Title: A novel method of heterogeneous parallel machine learning by CPU–TPU for molecular dynamics.
Authors: Zhang, Yujia1 (AUTHOR), Zhang, Xin2 (AUTHOR) zhangxin2302@hnu.edu.cn, Zheng, Gang1 (AUTHOR), Mo, Pinghui2 (AUTHOR), Zhao, Zhuoying1 (AUTHOR), Li, Chenyang2 (AUTHOR), Tang, Kai1 (AUTHOR), Liu, Jie2 (AUTHOR)
Source: Neural Computing & Applications. Sep2025, Vol. 37 Issue 26, p21949-21967. 19p.
Subjects: Molecular dynamics, Parallel computers, Load balancing (Computer networks), Central processing units
Abstract: In this paper, a heterogeneous parallel machine learning molecular dynamics (MLMD) calculation method based on both central processing unit (CPU) and SOPHON BM1684X tensor processing unit (TPU) is proposed. The method aims to offer a new hardware deployment approach for advanced MLMD algorithms, alleviating the constraints imposed by the severe "memory wall" and "power wall" bottlenecks caused by the separation of storage units and computing units inherent in von Neumann architecture-based machines at the hardware level. By decomposing complex MD simulation tasks into subtasks that can be processed in parallel on both CPU and TPU, this method enhances computational efficiency while maintaining high precision. Specifically, the potential energy surface fitting task in MD simulation is deployed on the TPU, leveraging its parallel processing capabilities to accelerate computations. Meanwhile, load balancing between the CPU and TPU is achieved by executing other computational tasks on the CPU. Experimental results demonstrate significant improvements in computational speed, energy efficiency, and the size of computable systems compared to the non-heterogeneous CPU-only system, indicating that heterogeneous parallel computing is an effective method for accelerating MD simulations. [ABSTRACT FROM AUTHOR]
Copyright of Neural Computing & Applications is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
Description
Abstract:In this paper, a heterogeneous parallel machine learning molecular dynamics (MLMD) calculation method based on both central processing unit (CPU) and SOPHON BM1684X tensor processing unit (TPU) is proposed. The method aims to offer a new hardware deployment approach for advanced MLMD algorithms, alleviating the constraints imposed by the severe "memory wall" and "power wall" bottlenecks caused by the separation of storage units and computing units inherent in von Neumann architecture-based machines at the hardware level. By decomposing complex MD simulation tasks into subtasks that can be processed in parallel on both CPU and TPU, this method enhances computational efficiency while maintaining high precision. Specifically, the potential energy surface fitting task in MD simulation is deployed on the TPU, leveraging its parallel processing capabilities to accelerate computations. Meanwhile, load balancing between the CPU and TPU is achieved by executing other computational tasks on the CPU. Experimental results demonstrate significant improvements in computational speed, energy efficiency, and the size of computable systems compared to the non-heterogeneous CPU-only system, indicating that heterogeneous parallel computing is an effective method for accelerating MD simulations. [ABSTRACT FROM AUTHOR]
ISSN:09410643
DOI:10.1007/s00521-025-11498-7