On Wed, Jul 22, 2020 at 1:11 AM mreineck wrote:

I noticed that the dask FFT wrapper internally uses numpy's FFT. Unless this is done to avoid a scipy dependency, I'd suggest switching to `scipy.fft` (not `scipy.fftpack`, which is now legacy) instead.

The advantage of `scipy.fft` is that it is much faster than `numpy.fft` when transforming multi-dimensional arrays (even if only one axis is transformed), because it uses vector instructions where available. In addition to standard FFTs it also provides DCTs, DSTs and Hartley transforms. Optional shared-memory parallelization is available, and there are dedicated functions for single-precision transforms.
It looks like there is some attempt to use `scipy.fftpack` if scipy is installed, but not otherwise.

I think this is actually unused code. If scipy is available, then `fft_wrap` checks if it has been invoked with specific `scipy.fftpack` functions, but later on in the file it is only ever called with numpy functions.
I will have a look through the file and see if I can provide a patch by myself. If I manage to run unit tests locally, that should be feasible.
OK, that was easier than I thought ...
If you have any FFT benchmarks (especially ones with multidimensional transforms), it might be interesting to do a comparison.
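As a rough illustration of the kind of comparison meant here (the array shape and repeat count below are arbitrary choices, and timings are entirely machine- and build-dependent), something like:

```python
# Rough benchmark sketch: compare numpy vs scipy transforming one axis
# of a multidimensional array. Timings are machine-dependent.
import timeit

import numpy as np
import scipy.fft

x = np.random.rand(256, 256)

t_np = timeit.timeit(lambda: np.fft.fft(x, axis=0), number=50)
t_sp = timeit.timeit(lambda: scipy.fft.fft(x, axis=0), number=50)

print(f"numpy.fft: {t_np:.4f}s")
print(f"scipy.fft: {t_sp:.4f}s")
```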
I'm not sure how to address the discrepancy in the data type of the returned arrays when the input is `float32` or `complex64`; this is something you will have to decide.
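For reference, the discrepancy looks like this: `numpy.fft` always promotes to double precision, while `scipy.fft` has dedicated single-precision code paths.

```python
import numpy as np
import scipy.fft

x = np.ones(8, dtype=np.float32)

# numpy.fft promotes float32 input to double precision:
print(np.fft.fft(x).dtype)     # complex128

# scipy.fft preserves single precision:
print(scipy.fft.fft(x).dtype)  # complex64
```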
One issue which may come up is that calling into `scipy.fft.fft` triggers an `asarray` call:

https://github.com/scipy/scipy/blob/0765482e9a7cd7a7975f6161d2ab91994762ac8e/scipy/fft/_pocketfft/helper.py#L89-L92

Doing this would prevent other array libraries like CuPy from using the dask FFT wrapper.
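A small sketch of why the `asarray` call is a problem; the `DuckArray` class here is a made-up stand-in for a non-numpy array type such as a CuPy array:

```python
import numpy as np

class DuckArray:
    """Made-up stand-in for a non-numpy array type (e.g. a CuPy array)."""

    def __init__(self, data):
        self._data = np.asarray(data)

    def __array__(self, dtype=None):
        # np.asarray falls back to this, coercing us to a plain ndarray.
        return np.asarray(self._data, dtype=dtype)

d = DuckArray([1.0, 2.0, 3.0])

# Any code path that calls np.asarray on its input silently loses the
# original array type, so __array_function__ dispatch can no longer happen:
print(type(np.asarray(d)))  # <class 'numpy.ndarray'>
```

(Real CuPy arrays cannot be converted this way at all, since that would require an implicit device-to-host copy.)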
Interesting. But on the other hand, what does numpy do when promoting the input array to double precision? Won't this be something equivalent to `asarray` too?

I must admit that I don't really understand the implications of this, but if this is a fundamental problem, it should probably be addressed in scipy.

BTW, CuPy has its own `scipy.fft` backend ... no idea if this makes this problem better or worse.
It's important to note that SciPy does *not* implement NEP-18, nor does it intend to. The change in #6443 will prevent `__array_function__` dispatching of FFTs when SciPy is available, meaning `np.fft*(dask_cupy_array)` would not work anymore. IMO, the solution would be to provide a way for the user to choose explicitly to use SciPy through some Dask configuration, while maintaining NumPy as the default even if SciPy is available.

EDIT: I had mistakenly written `dask.array.fft*(cupy_array)` but the correct is `np.fft*(dask_cupy_array)` (which I changed now), where `dask_cupy_array` is a Dask array backed by CuPy.
Maybe we create our own per-chunk function that uses scipy if available and
the input is a numpy array or doesn't implement the `__array_function__`
protocol, and uses numpy otherwise?
> Maybe we create our own per-chunk function that uses scipy if available and the input is a numpy array or doesn't implement the `__array_function__` protocol, and uses numpy otherwise?

Yes, that's a good idea, I don't see why it couldn't work.
I know @rgommers was looking into dispatching between FFTs before. Maybe he can share more about the status of that work?

Independent of that, wrapping FFTs is quite simple, so it should be possible to roll your own as needed.

HTH 🙂
> I know @rgommers was looking into dispatching between FFTs before. Maybe he can share more about the status of that work?

`scipy.fft` is basically complete. Let me Cc @peterbell10 here; he wrote `scipy.fft` and the CuPy and PyFFTW backends for it, so he may have a suggestion on how to proceed here.
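For context, `scipy.fft`'s backend mechanism is built on `uarray`: a backend supplies `__ua_domain__` and `__ua_function__`, and is activated with `scipy.fft.set_backend`. The `NumpyBackend` below is a toy example that just delegates to `np.fft`, not anything shipped by SciPy:

```python
import numpy as np
import scipy.fft

class NumpyBackend:
    """Toy uarray backend: delegate every scipy.fft call to np.fft."""
    __ua_domain__ = "numpy.scipy.fft"

    @staticmethod
    def __ua_function__(method, args, kwargs):
        # method is the scipy.fft multimethod being called (e.g. fft);
        # look up the same-named function in np.fft and delegate to it.
        fn = getattr(np.fft, method.__name__, None)
        if fn is None:
            return NotImplemented
        return fn(*args, **kwargs)

# Within the context, scipy.fft calls are routed through our backend:
with scipy.fft.set_backend(NumpyBackend):
    y = scipy.fft.fft(np.ones(4))
print(y)
```

The CuPy and PyFFTW backends follow the same pattern, dispatching on the array type of `x`.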
> Maybe we create our own per-chunk function that uses scipy if available and the input is a numpy array or doesn't implement the `__array_function__` protocol, and uses numpy otherwise?

That sounds like the best solution to me. The CuPy backend for `scipy.fft` calls exactly the same underlying functions as the numpy interface, so there should be no downside to going through `np.fft` instead.
> The CuPy backend for `scipy.fft` calls exactly the same underlying functions as the numpy interface, so there should be no downside to going through `np.fft` instead.

Very cool! 😄
Was curious about this, so I took it for a spin. Ran into a small issue (scipy/scipy#12652). Though please feel free to correct me if I'm just misunderstanding 🙂
> Was curious about this, so I took it for a spin. Ran into a small issue (scipy/scipy#12652). Though please feel free to correct me if I'm just misunderstanding 🙂

Turns out I was missing a key piece 😅 Thanks Peter!
One option might be to use `Dispatch`, as is done with `sizeof`. Though it might be even simpler than that. 😉