zbMATH — the first resource for mathematics

PVFMM: A parallel kernel independent FMM for particle and volume potentials. (English) Zbl 1388.65169
Summary: We describe our implementation of a parallel fast multipole method for evaluating potentials for discrete and continuous source distributions. The first requires summation over the source points and the second requiring integration over a continuous source density. Both problems require \(\mathcal O(N^2)\) complexity when computed directly; however, can be accelerated to \(\mathcal O(N)\) time using FMM. In our PVFMM software library, we use kernel independent FMM and this allows us to compute potentials for a wide range of elliptic kernels. Our method is high order, adaptive and scalable. In this paper, we discuss several algorithmic improvements and performance optimizations including cache locality, vectorization, shared memory parallelism and use of coprocessors. Our distributed memory implementation uses space-filling curve for partitioning data and a hypercube communication scheme. We present convergence results for Laplace, Stokes and Helmholtz (low wavenumber) kernels for both particle and volume FMM. We measure efficiency of our method in terms of CPU cycles per unknown for different accuracies and different kernels. We also demonstrate scalability of our implementation up to several thousand processor cores on the Stampede platform at the Texas Advanced Computing Center.

65N38 Boundary element methods for boundary value problems involving PDEs
35J05 Laplace operator, Helmholtz equation (reduced wave equation), Poisson equation
65R10 Numerical methods for integral transforms
65Y05 Parallel numerical computation
65Y20 Complexity and performance of numerical algorithms
Full Text: DOI