CCF | Data Deduplication
Sample code
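import csv

import pandas as pd

# Load the raw records.
csv_file = "data.csv"
df = pd.read_csv(csv_file)

# Keep only the first row for each (client_ip, device) pair.
# The names in `subset` must match the CSV header exactly, including the
# leading space in " device".
df_no_duplicated = df.drop_duplicates(
    subset=["client_ip", " device"],
    keep="first",
)

# Write the result without the index column and without quoting any field.
# With csv.QUOTE_NONE the writer needs an escapechar for fields that contain
# the delimiter; a space is used here.
df_no_duplicated.to_csv(
    "data_m2_t1_s1.csv",
    index=False,
    escapechar=" ",
    quoting=csv.QUOTE_NONE,
    quotechar="",
)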
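To see what the `subset` and `keep` arguments do, here is a minimal sketch on a small in-memory table. The rows and the `visits` column are made up for illustration and are not taken from `data.csv`; only the two key column names mirror the code above.

```python
import pandas as pd

# Hypothetical sample rows (assumption: not the real contents of data.csv).
df = pd.DataFrame(
    {
        "client_ip": ["10.0.0.1", "10.0.0.1", "10.0.0.2", "10.0.0.1"],
        " device": ["phone", "phone", "phone", "laptop"],
        "visits": [1, 2, 3, 4],
    }
)

key_columns = ["client_ip", " device"]

# keep="first": for each repeated key, the earliest row survives.
kept_first = df.drop_duplicates(subset=key_columns, keep="first")["visits"].tolist()

# keep="last": for each repeated key, the latest row survives.
kept_last = df.drop_duplicates(subset=key_columns, keep="last")["visits"].tolist()

# keep=False: every row whose key occurs more than once is dropped.
kept_unique = df.drop_duplicates(subset=key_columns, keep=False)["visits"].tolist()

print(kept_first)   # [1, 3, 4]
print(kept_last)    # [2, 3, 4]
print(kept_unique)  # [3, 4]
```

Rows are compared only on the columns listed in `subset`, so the two rows for `10.0.0.1` / `phone` count as duplicates even though their `visits` values differ.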
This is an original article, licensed under CC BY-NC-ND 4.0. When reposting it in full, please credit Summer.