Pivot_table doesnt work same as pandas - Dask DataFrame

link管理

链接快照平台

输入网页链接，自动生成快照
标签化管理网页链接

相关文章推荐

留胡子的杨桃 · Rights and ...· 4 月前 ·

刚毅的皮带 · 系统对后台任务的限制 | ...· 2 月前 ·

叛逆的太阳 · ICMJE | ...· 1 月前 ·

风流倜傥的麦片 · Flameshot Clipboard ...· 3 周前 ·

发呆的水煮肉 · RunCellpose Plugin ...· 1 周前 ·

玉树临风的野马 · 如何在PL/SQL上显示添加连字符的字符串- ...· 1 年前 ·

成熟的饭盒 · 杭州高新区（滨江）：打造AIGC产业先行区· 1 年前 ·

安静的西装 · 如何在Linux中获取文件的最后修改日期？_ ...· 1 年前 ·

阳刚的牛排 · TypeScript对象类型_typescr ...· 1 年前 ·

近视的镜子 · tf.layers.dense实现输出层，如 ...· 1 年前 ·

"A": ["foo", "foo", "foo", "foo", "foo", "bar", "bar", "bar", "bar"], "B": ["one", "one", "one", "two", "two", "one", "one", "two", "two"], "C": [1, 2, 2, 3, 3, 4, 5, 6, 7],

running the following:

df.pivot_table(values="C", index=["A", "B"],aggfunc=np.median)
results:
Which is the require result. However, when running this with dask dataframe it doesn’t go through:
ddf = dd.from_pandas(df, npartitions=3)
ddf.pivot_table(values="C", index=["A", "B"],aggfunc=np.median)
results:

ValueError: 'index' must be the name of an existing column
seems like the DD implementation is rather limited to scalars (dask.dataframe.reshape.pivot_table — Dask documentation)

Is there another way to achieve this?
              Hi @jadeidev,
Not exactly the same as you’ll get a Series instead of a DataFrame, but you can still get the same results with:
res = ddf.groupby(["A", "B"]).C.median()
# Optional, depends on what you want to do
pd_series = res.compute()
Does that help?

推荐文章

留胡子的杨桃 · Rights and Protections for Temporary Workers - English

4 月前

刚毅的皮带 · 系统对后台任务的限制 | Background work | Android Developers

2 月前

叛逆的太阳 · ICMJE | Recommendations | Defining the Role of Authors and Contributors

1 月前

风流倜傥的麦片 · Flameshot Clipboard not Work / Applications & Desktop Environments / Arch Linux Forums

3 周前

发呆的水煮肉 · RunCellpose Plugin Installation - Usage & Issues - Image.sc Forum

1 周前

玉树临风的野马 · 如何在PL/SQL上显示添加连字符的字符串-腾讯云开发者社区-腾讯云

1 年前

成熟的饭盒 · 杭州高新区（滨江）：打造AIGC产业先行区

1 年前

安静的西装 · 如何在Linux中获取文件的最后修改日期？_51CTO博客_linux查看文件最后修改时间

1 年前

阳刚的牛排 · TypeScript对象类型_typescript 匿名对象与区别-CSDN博客

1 年前

近视的镜子 · tf.layers.dense实现输出层，如何在loss中加入l2正则?_dense层的l2-CSDN博客

1 年前