Problem

You are selecting columns from a DataFrame and you get an error message.

ERROR: AttributeError: 'function' object has no attribute '_get_object_id' in job

Cause

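The DataFrame API contains a small number of protected keywords. These are names that the DataFrame class already uses for its own attributes and methods.

If a column in your DataFrame uses a protected keyword as its name and you select it with dot notation, you get the error message.

For example, summary is a protected keyword because DataFrame defines a summary() method. With dot notation, df1.summary returns that method instead of the column, which is why the error refers to a 'function' object.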
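The lookup order behind this can be illustrated without a cluster. The following is a minimal sketch in plain Python, not PySpark's actual implementation: ToyFrame and its string return values are hypothetical stand-ins. It assumes only that column access by dot notation is a fallback (`__getattr__`) that Python consults after normal attribute lookup has failed, so a real method with the same name always wins.

```python
class ToyFrame:
    """Hypothetical stand-in for a DataFrame; not PySpark's real code."""

    def __init__(self, columns):
        self.columns = list(columns)

    def summary(self):
        # Stands in for a built-in DataFrame method.
        return "summary statistics"

    def __getattr__(self, name):
        # Python calls __getattr__ only when normal attribute lookup fails,
        # so it is never reached for names that are real methods.
        if name in self.columns:
            return f"Column<{name}>"
        raise AttributeError(name)

    def __getitem__(self, name):
        # Bracket access always treats the name as a column.
        if name in self.columns:
            return f"Column<{name}>"
        raise KeyError(name)


toy = ToyFrame(["id", "summary"])

dot_id = toy.id                   # falls through to __getattr__: a column
dot_summary = toy.summary         # normal lookup finds the method first
bracket_summary = toy["summary"]  # bracket access reaches the column
```

In this sketch, dot access on id reaches the column, dot access on summary returns the method, and bracket access on summary reaches the column.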

This sample code uses summary as a column name and generates the error message when run.

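%python
from pyspark.sql.types import StructType, StructField, IntegerType

# DataFrame with a single integer column named "id".
df = spark.createDataFrame([1, 2], "int").toDF("id")
df.show()

# DataFrame whose only column is named "summary", a protected keyword.
df1 = spark.createDataFrame(
  [(10,), (11,), (13,)],
  StructType([StructField("summary", IntegerType(), True)]))
df1.show()

# df1.summary resolves to the DataFrame method, not the column, so this line raises the AttributeError.
ResultDf = df1.join(df, df1.summary == df.id, "inner").select(df.id, df1.summary)
ResultDf.show()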

Solution

Avoid using DataFrame API protected keywords as column names.

If you must use a protected keyword as a column name, use bracket-based column access when selecting that column from a DataFrame. Do not use dot notation for columns whose names are protected keywords.

%python
# Bracket access returns the "summary" column, so the join and select succeed.
ResultDf = df1.join(df, df1["summary"] == df.id, "inner").select(df.id, df1["summary"])
ResultDf.show()
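
To spot affected columns before they cause an error, one option is to compare the column names against the attributes of the DataFrame class. The following is a minimal sketch: conflicting_names is a hypothetical helper, not part of the DataFrame API, and ToyFrame is a stand-in class so the example runs without a cluster.

```python
def conflicting_names(cls, names):
    """Return the names that are already attributes of cls (hypothetical helper)."""
    return [name for name in names if hasattr(cls, name)]


class ToyFrame:
    """Stand-in for a DataFrame class with two built-in methods."""

    def summary(self):
        return "summary statistics"

    def show(self):
        return "rows"


# "summary" collides with a method; "id" and "amount" do not.
conflicts = conflicting_names(ToyFrame, ["id", "summary", "amount"])
```

In a notebook, the same helper could be called as conflicting_names(type(df1), df1.columns). Any name it returns should be selected with bracket notation.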
