I'm trying to figure out which ffts are fastest, best, etc.
First, it is difficult to get fftw3 + pyfftw installed into python.
For pyfftw to install, you need to enable floats and every specific type when compiling fftw3 (thanks to linux toolkits blog). Apparently you can't have --enable-long-double and --enable-float simultaneously, but pyfftw requires both, so you have to install twice or thrice? Nope, four times!! Also, my own post indicates that I should install to a different path to avoid weird conflicts with other libraries.
./configure --enable-threads --enable-openmp --enable-mpi --enable-shared \ --enable-fortran --enable-avx \ CFLAGS="-O3 -fno-common -fomit-frame-pointer -fstrict-aliasing" make -j 4 sudo make install ./configure --enable-float --enable-threads --enable-openmp --enable-mpi --enable-shared \ --enable-fortran --enable-avx \ CFLAGS="-O3 -fno-common -fomit-frame-pointer -fstrict-aliasing" make -j 4 sudo make install # quad precision is not supported in mpi ./configure --enable-quad-precision --enable-threads --enable-openmp --enable-shared \ --enable-fortran --enable-avx \ CFLAGS="-O3 -fno-common -fomit-frame-pointer -fstrict-aliasing" make -j 4 sudo make install ./configure --enable-long-double --enable-threads --enable-openmp --enable-mpi --enable-shared \ --enable-fortran --enable-avx \ CFLAGS="-O3 -fno-common -fomit-frame-pointer -fstrict-aliasing" make -j 4 sudo make install
Onto speed test comparisons:
If you want to use fftw3 with astropy's convolve_fft, use this example.
Onto some comparisons from astropy issue 4374:
data:image/s3,"s3://crabby-images/2ec3a/2ec3ac5b53c4ed56f1700d06e1bf6cac122bba3d" alt=""
data:image/s3,"s3://crabby-images/7f50e/7f50eacea41265d4c705cfdeda5ce22e466af6f8" alt=""
data:image/s3,"s3://crabby-images/92977/9297752e43762597b11d8634a98daeec8a925d49" alt=""
data:image/s3,"s3://crabby-images/f40f7/f40f7f9168d8155e0bd60d4828ed896c858fd559" alt=""
data:image/s3,"s3://crabby-images/9fc5a/9fc5a10d26317b4dd52a709e6f6726f321d74cda" alt=""