Marvin Alberts, Teodoro Laino
ACS Fall 2025
In this paper, we present a novel, highly efficient, and massively parallel implementation of the sparse matrix-matrix multiplication algorithm inspired by the midpoint method that is suitable for matrices with decay. Compared with the state of the art in sparse matrix-matrix multiplications, the new algorithm heavily exploits data locality, yielding better performance and scalability, approaching a perfect linear scaling up to a process box size equal to a characteristic length that is intrinsic to the matrices. Moreover, the method is able to scale linearly with system size reaching constant time with proportional resources, also regarding memory consumption. We demonstrate how the proposed method can be effectively used for the construction of the density matrix in electronic structure theory, such as Hartree-Fock, density functional theory, and semiempirical Hamiltonians. We present the details of the implementation together with a performance analysis up to 185 193 processes, employing a Hamiltonian matrix generated from a semiempirical NDDO scheme.
Marvin Alberts, Teodoro Laino
ACS Fall 2025
Matteo Manica, Loic Kwate Dassi, et al.
ISGC 2022
Guojing Cong, Talia Ben Naim, et al.
ICDM 2022
Indra Priyadarsini S, Seiji Takeda, et al.
MRS Fall Meeting 2024