dask: Var calculation fails with non-pandas Dataframe
When using dask-cudf, the new parallel variance calculation (PR https://github.com/dask/dask/pull/4865/) fails. This is probably due to two main causes:
- __array_interface__ isn't fully implemented: np.X (var/nansum) may not be supported, but can be added; see https://github.com/rapidsai/cudf/issues/1728
- Tight coupling to pandas signatures: pd.Series -> Series(), pd.concat -> Concat
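For context, here is a minimal sketch in plain Python of how a variance can be assembled from per-partition results. It is not the code from the PR: the function names (`partial_moments`, `combine`, `parallel_var`) are hypothetical, the pairwise merge formula is the standard one for combining counts, means, and sums of squared deviations, and every partition is assumed to be non-empty. It only illustrates that such a reduction needs nothing beyond sums and elementwise arithmetic, so it does not inherently require pandas-specific constructors.

```python
from functools import reduce


def partial_moments(chunk):
    """Return (count, mean, M2) for one partition.

    M2 is the sum of squared deviations from the partition mean.
    Assumes the partition is non-empty.
    """
    n = len(chunk)
    mean = sum(chunk) / n
    m2 = sum((x - mean) ** 2 for x in chunk)
    return n, mean, m2


def combine(a, b):
    """Merge the (count, mean, M2) summaries of two partitions."""
    n_a, mean_a, m2_a = a
    n_b, mean_b, m2_b = b
    n = n_a + n_b
    delta = mean_b - mean_a
    mean = mean_a + delta * n_b / n
    m2 = m2_a + m2_b + delta ** 2 * n_a * n_b / n
    return n, mean, m2


def parallel_var(partitions, ddof=1):
    """Variance over all partitions, computed from per-partition summaries."""
    n, _, m2 = reduce(combine, map(partial_moments, partitions))
    return m2 / (n - ddof)


# Hypothetical data split into three partitions.
partitions = [[1.0, 2.0, 3.0], [4.0, 5.0], [6.0, 7.0, 8.0, 9.0]]
result = parallel_var(partitions)
print(result)
```

Each partition is summarized independently, so only the small `(count, mean, M2)` tuples need to be brought together in the final step.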
About this issue
- State: closed
- Created 5 years ago
- Comments: 17 (17 by maintainers)
I believe we can close this now as a lot of work has gone into cudf to support __array_function__ calls.
https://github.com/cupy/cupy/pull/2252 has just been merged; let me know if you encounter any issues with that, @quasiben.