Author
Listed:
- Chenxiang Xiao
(School of Aeronautics and Astronautics, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, P. R. China)
- Chenchen Zhang
(School of Mathematical Sciences, Peking University, Beijing 100871, P. R. China)
- Bin Zhang
(School of Aeronautics and Astronautics, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, P. R. China†Sichuan Research Institute, Shanghai Jiao Tong University, Chengdu 610213, P. R. China)
- Hui Xu
(School of Aeronautics and Astronautics, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, P. R. China)
- Hong Liu
(School of Aeronautics and Astronautics, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, P. R. China)
Abstract
The Direct Simulation Monte Carlo (DSMC) algorithm is widely employed for simulating rarefied gas flows and is increasingly applied in near-continuum regimes for research and engineering purposes. However, its computational demands, notably load imbalance and extended simulation time, hinder widespread adoption. Addressing these challenges, this paper introduces the Two-Level parallel load balancing strategy. This novel approach combines thread-level and multi-process parallelism to enhance load balancing and reduce simulation time. Key features include a thread-level load-decoupling strategy implemented via OpenMP and a multi-process load balancing mechanism employing distributed memory via MPI. Building upon our previous PartPlusColl [L. Li, W. Ren and B. Zhang, J. Aeronaut. Astronaut. Aviat. Ser. A 46, 88 (2014)] approach, the load balancing mechanism utilizes Stop At Risk (SAR) criteria for repartitioning with METIS. Additionally, a specialized data transmission mechanism utilizing MPI nonblocking communication minimizes global communication between processes. Validation and evaluation are performed using four hypersonic flow cases around a cylinder and sphere, demonstrating significant improvements. Notably, the proposed strategy achieves 30% enhancement over the PartPlusColl strategy under 512 CPU cores compared to 16 CPU cores, and reduces between-process communication time with 33.57%. These advancements contribute to enhancing the effectiveness of the DSMC algorithm in near-continuum aerodynamic simulations.
Suggested Citation
Chenxiang Xiao & Chenchen Zhang & Bin Zhang & Hui Xu & Hong Liu, 2025.
"Two-level parallel load balancing strategy for accelerating DSMC simulations in near-continuum gases,"
International Journal of Modern Physics C (IJMPC), World Scientific Publishing Co. Pte. Ltd., vol. 36(03), pages 1-17, March.
Handle:
RePEc:wsi:ijmpcx:v:36:y:2025:i:03:n:s0129183124501985
DOI: 10.1142/S0129183124501985
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:ijmpcx:v:36:y:2025:i:03:n:s0129183124501985. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/ijmpc/ijmpc.shtml .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.