• Acta Physica Sinica
  • Vol. 68, Issue 12, 120202-1 (2019)
Fu-Zhou Chen1, Chen Cheng1、2, and Hong-Gang Luo1、2、*
Author Affiliations
  • 1School of Physical Science and Technology, Lanzhou University, Lanzhou 730000, China
  • 2Beijing Computational Science Research Center, Beijing 100084, China
  • show less
    DOI: 10.7498/aps.68.20190586 Cite this Article
    Fu-Zhou Chen, Chen Cheng, Hong-Gang Luo. Hybrid parallel optimization of density matrix renormalization group method[J]. Acta Physica Sinica, 2019, 68(12): 120202-1 Copy Citation Text show less
    4 Sub-blocks of super-block超块中的四个子块
    Fig. 1. 4 Sub-blocks of super-block超块中的四个子块
    Performance of acting the Hamiltonian on the wave function in CPU: (a) The matrix multiplication performance; (b) the performance of acting the Hamiltonian on the wave function, and the maximum matrix size of the matrix multiplications.CPU中作用哈密顿量在波函数上的性能 (a)矩阵乘法的浮点性能; (b)作用哈密顿量于波函数的性能, 及矩阵乘法中的最大矩阵尺寸
    Fig. 2. Performance of acting the Hamiltonian on the wave function in CPU: (a) The matrix multiplication performance; (b) the performance of acting the Hamiltonian on the wave function, and the maximum matrix size of the matrix multiplications.CPU中作用哈密顿量在波函数上的性能 (a)矩阵乘法的浮点性能; (b)作用哈密顿量于波函数的性能, 及矩阵乘法中的最大矩阵尺寸
    Time ratio of diagonalization of the Hamiltonian and acting the Hamiltonian on the wave function to the total time cost.对角化哈密顿量和作用哈密顿量到波函数操作占总计算时间的比例
    Fig. 3. Time ratio of diagonalization of the Hamiltonian and acting the Hamiltonian on the wave function to the total time cost.对角化哈密顿量和作用哈密顿量到波函数操作占总计算时间的比例
    The GPU memory cost of temporary data and sub-block operators.存储临时数据, 子块算符需要的GPU显存
    Fig. 4. The GPU memory cost of temporary data and sub-block operators.存储临时数据, 子块算符需要的GPU显存
    Performance of hybrid parallel strategy: (a) The speedup; (b) the GPU memory cost of vectors in Davidson; (c) the performance of 异构并行的性能 (a)加速比; (b) Davidson方法中的向量占用GPU显存; (c)作用哈密顿量到波函数部分的性能
    Fig. 5. Performance of hybrid parallel strategy: (a) The speedup; (b) the GPU memory cost of vectors in Davidson; (c) the performance of 异构并行的性能 (a)加速比; (b) Davidson方法中的向量占用GPU显存; (c)作用哈密顿量到波函数部分的性能
    Groundstate energy as a function of truncation error. The straight line gives a linear extrapolation of the ground energy until 0 truncation-error.基态能量关于截断误差的函数(直线表示对基态能量的线性外推, 直至截断误差为0)
    Fig. 6. Groundstate energy as a function of truncation error. The straight line gives a linear extrapolation of the ground energy until 0 truncation-error.基态能量关于截断误差的函数(直线表示对基态能量的线性外推, 直至截断误差为0)
    Ground state density profile for the 16 × 4 Hubbard ladder with U = 8.0. Charge density stripes can be clearly observed.对于16 × 4 Hubbard模型, U = 8.0时的基态电荷密度分布(可以观察到明显的电荷密度条纹)
    Fig. 7. Ground state density profile for the 16 × 4 Hubbard ladder with U = 8.0. Charge density stripes can be clearly observed. 对于16 × 4 Hubbard模型, U = 8.0时的基态电荷密度分布(可以观察到明显的电荷密度条纹)
    Fu-Zhou Chen, Chen Cheng, Hong-Gang Luo. Hybrid parallel optimization of density matrix renormalization group method[J]. Acta Physica Sinica, 2019, 68(12): 120202-1
    Download Citation