CSV export and modification — Dataiku Community

I have a Python recipe that exports a dataset to a CSV file.

However, I need to modify the first row of this CSV and change the file extension.

I am trying to do this after exporting the dataset, by reading the CSV back and writing a new first line, but I can't read it with the csv library.

Could you help me?

# Recipe outputs
managed_folder_id = "pourImport"
output_folder = dataiku.Folder(managed_folder_id)
output_folder.upload_data(filename, analyses_df.to_csv(index=False, header=True, sep=";").encode("utf-8"))
with output_folder.get_download_stream(filename) as stream:
    with open(stream, 'r', newline='') as csvfile:
        # read csv
        reader = csv.reader('/'+csvfile, delimiter=';')
        lignes = list(reader)
    # first row
    new_first_row = ['2','toto']
    lignes[0] = new_first_row
    # write csv
    with open(stream, 'w', newline='') as csvfile:
        writer = csv.writer(csvfile, delimiter=';')
        for ligne in lignes:
            writer.writerow(ligne)

It fails with the following error:

TypeError: expected str, bytes or os.PathLike object, not HTTPResponse
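
The `TypeError` arises because the built-in `open()` expects a file path, whereas `get_download_stream` returns an already-open binary stream (an `HTTPResponse` here, according to the error message). A minimal sketch of reading the rows straight from such a stream is given below. `read_rows_from_stream` is a hypothetical helper name, and the UTF-8 encoding and `;` delimiter are assumptions taken from the upload call in the question.

```python
import csv
import io


def read_rows_from_stream(stream, delimiter=";", encoding="utf-8"):
    """Parse CSV rows from an already-open binary stream (anything with .read())."""
    # Do not call open() on the stream: it is not a path. Read its bytes instead.
    text = stream.read().decode(encoding)
    # newline="" lets the csv module handle line endings itself.
    return list(csv.reader(io.StringIO(text, newline=""), delimiter=delimiter))


if __name__ == "__main__":
    # io.BytesIO stands in for the stream returned by get_download_stream.
    fake_stream = io.BytesIO("a;b;c\n1;2;3\n".encode("utf-8"))
    rows = read_rows_from_stream(fake_stream)
    rows[0] = ["2", "toto"]  # replace the first row, as in the question
    print(rows)
```

In the recipe, the helper would presumably be called inside `with output_folder.get_download_stream(filename) as stream:`. The download stream is for reading only, so the modified rows would have to be sent back with `upload_data` instead of reopening the stream in `'w'` mode.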

Tagged:

Python

0 ·

Answers

This makes no sense to me. Why would you write the file if you are going to modify it later? Why not write it correctly in the first place? What exactly are you trying to achieve? What is the problem with the file that requires you to write it twice? Thanks

0 ·

The new first row has only 2 features, while the following rows have 30 features.

The pandas .to_csv function adds trailing separators to the first row.

I need a first row like this

"2";"tot"

and not like this

"2";"tot";;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;

0 ·

Again, you have not explained what your requirement is, only how you think it should be done. I can't think of a reason why you would want a custom-format CSV file. It is normal for empty columns to show up like that; otherwise, how would you know which column holds which piece of data? In your sample the first two columns are filled and the rest are empty, but what if the empty columns were in the middle of the row?

0 ·

It is a specific format intended to be integrated into other software.

I don't know why the format is like this.

0 ·

Then scrap what you've got and write the file correctly in the first place. Here is how you can iterate over the rows of a dataframe:

for index, row in df.iterrows():
    print(row['c1'], row['c2'])
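
Building on the row-iteration idea, one possible way to write the file correctly in the first place is to let `csv.writer` emit each row on its own, so the short first row is never padded to the width of the others. The sketch below uses only the standard library. `rows_to_csv_text` is a hypothetical helper, and quoting every field (`csv.QUOTE_ALL`) is an assumption based on the `"2";"tot"` sample earlier in the thread; pass `csv.QUOTE_MINIMAL` if the target software does not expect quotes.

```python
import csv
import io


def rows_to_csv_text(first_row, rows, delimiter=";", quoting=csv.QUOTE_ALL):
    """Return CSV text whose first line is first_row, followed by rows."""
    buffer = io.StringIO(newline="")
    writer = csv.writer(
        buffer, delimiter=delimiter, quoting=quoting, lineterminator="\n"
    )
    # Each row is written with its own length, so no empty fields are appended.
    writer.writerow(first_row)
    for row in rows:
        writer.writerow(row)
    return buffer.getvalue()


if __name__ == "__main__":
    data_rows = [("x1", 10, 0.5), ("x2", 20, 1.5)]  # made-up sample data
    print(rows_to_csv_text(["2", "toto"], data_rows))
```

With a dataframe, the rows could be supplied as `df.itertuples(index=False, name=None)`, which yields plain tuples, instead of the made-up `data_rows` list.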

0 ·

Thank you for your help.

I don't know how to write a CSV directly to the recipe's output folder without using df.to_csv().

Could you help me again?
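
One option, sketched here under assumptions, is to build the whole file content in memory and pass the encoded bytes to `upload_data`, the same call the original recipe already uses. `upload_with_first_line` is a hypothetical helper, `FakeFolder` exists only so the sketch runs outside DSS, and `export.txt` is a made-up file name. It is also assumed that the custom row replaces the header, as `lignes[0] = new_first_row` in the original attempt suggests.

```python
class FakeFolder:
    """Stand-in for dataiku.Folder so the sketch runs outside DSS (assumption)."""

    def __init__(self):
        self.files = {}

    def upload_data(self, path, data):
        # Mirrors the upload_data(filename, bytes) call used in the recipe.
        self.files[path] = data


def upload_with_first_line(folder, filename, first_line, csv_body, encoding="utf-8"):
    """Prepend first_line to csv_body and upload the result as one file."""
    content = first_line.rstrip("\r\n") + "\n" + csv_body
    folder.upload_data(filename, content.encode(encoding))


if __name__ == "__main__":
    folder = FakeFolder()
    body = "a;1;x\nb;2;y\n"  # made-up rows, e.g. produced with header=False
    upload_with_first_line(folder, "export.txt", '"2";"tot"', body)
    print(folder.files["export.txt"].decode("utf-8"))
```

In the recipe, `folder` would be `dataiku.Folder(managed_folder_id)` and `csv_body` could come from `analyses_df.to_csv(index=False, header=False, sep=";")`. Because the file name is chosen at upload time, the extension can be changed in the same step.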

0 ·

What exactly is the format of the file you want? What should be in the first row, second row, third row, etc.?

0 ·
