首页 | 本学科首页   官方微博 | 高级检索  
     检索      


A GPU accelerated Barnes–Hut tree code for FLASH4
Institution:1. Biological Physics Group, School of Physics and Astronomy, University of Manchester, Manchester M13 9PL, UK;2. Department of Physiology and Pharmacology and Cardiovascular Research Laboratories, School of Medical Sciences, University Walk, Bristol BS8 1TD, UK;3. Manchester Heart Centre, Manchester Royal Infirmary, Manchester M13 9PL, UK;1. Institute of Quantum Science, College of Science and Technology, Nihon University, Chiyoda-ku, Tokyo 101-8308, Japan;2. Junior College Funabashi Campus, Nihon University, Narashinodai, Funabashi, Chiba 274-8501, Japan
Abstract:We present a GPU accelerated CUDA-C implementation of the Barnes Hut (BH) tree code for calculating the gravitational potential on octree adaptive meshes. The tree code algorithm is implemented within the FLASH4 adaptive mesh refinement (AMR) code framework and therefore fully MPI parallel. We describe the algorithm and present test results that demonstrate its accuracy and performance in comparison to the algorithms available in the current FLASH4 version. We use a MacLaurin spheroid to test the accuracy of our new implementation and use spherical, collapsing cloud cores with effective AMR to carry out performance tests also in comparison with previous gravity solvers. Depending on the setup and the GPU/CPU ratio, we find a speedup for the gravity unit of at least a factor of 3 and up to 60 in comparison to the gravity solvers implemented in the FLASH4 code. We find an overall speedup factor for full simulations of at least factor 1.6 up to a factor of 10.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号